Project

bio-jaspar

0.0
No commit activity in last 3 years
No release in over 3 years
Basic tools for parsing, searching, and comparing JASPAR motifs; Based on Bio.motifs module in Biopython
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
 Dependencies

Development

>= 1.0.21
~> 2.0.1
~> 0.9.3
~> 3.12

Runtime

>= 1.4.2
~> 0.3.19
 Project Readme

bio-jaspar

Tools for JASPAR motif analysis

Build Status

This gem provides methods for:

  1. Reading and writing sequence motifs in JASPAR format
  2. Accessing a JASPAR5 formatted database
  3. Comparing, searching, and analyzing motifs in sequences

* Note: The JASPAR motif analysis tools consist of several modules that are directly imported from the Bio.motifs package in BioPython. Namely, those modules/submodules are: Bio.motifs, Bio.motifs.matrix, Bio.motifs.thresholds, Bio.motifs.jaspar. The functionality of this gem will be identical to the aforementioned modules/submodules.

Installation

gem install bio-jaspar

Usage

Loading the gem

require 'bio-jaspar'

Loading a motif/motifs from a JASPAR database

A connection to the JASPAR database is made by creating a JASPAR5 instance.

# Substitute the database credentials!
db = Bio::Jaspar::JASPAR5.new(
	:host => <db_host.org>,
	:name => <db_name>,
	:user => <db_user>,
	:password => <db_password>
)

Now, a motif can be retrieved by the matrix_id

m = db.fetch_motif_by_id("MA0049")
puts m.to_s

Or multiple motifs can be retrieved by various criteria

motifs = db.fetch_motifs(
	:collection => "CORE",
	:tax_group => ["fungi", "vertebrate"],
	:tf_class => "Helix-Turn-Helix",
	:min_ic => 2
)
motifs.each { |m| # do something with a motif }

Motif analysis

Many methods are available for motif analysis. Here are some examples:

m = db.fetch_motif_by_id("MA0049")

# Consensus sequence
m.consensus 					# BioRuby Sequence object
puts m.consensus

# Anticonsensus sequence
m.anticonsensus				# BioRuby Sequence object
puts m.anticonsensus

# Reverse complement motif
m.reverse_complement	# Bio::Motif::Motifs object

# Pseudocounts
m.pseudocounts

# Background
m.background

# Position weight matrix
m.pwm

# Position specific scoring matrix
m.pssm

Matrix methods are also available. Here are some examples:

m = db.fetch_motif_by_id("MA0049")

# Maximum possible score for the given motif
m.pssm.max

# Minimum possible score for the given motif
m.pssm.min

# Expected value of the motif score
m.pssm.mean

# Standard deviation of the given motif score
m.pssm.std

# Find hits with the PWM score above given threshold
m.pssm.search(Bio::Sequence.auto("ACCTGCCTAAAAAA"), threshold = 0.5)

Read/write Jaspar file

Already downloaded pfm, jaspar, sites files can be loaded/written using the Jaspar module

# Read a pfm file
f = File.open("test.pfm", "r")
Bio::Jaspar.read(f, "pfm")
f.close

# Write motifs into a jaspar file
motifs = db.fetch_motifs(
	:collection => "CORE",
	:tax_group => ["fungi", "vertebrate"],
	:tf_class => "Helix-Turn-Helix",
	:min_ic => 2
)
File.open("test.jaspar", "w") do |f|
	Bio::Jaspar.write(f, "jaspar")
end

Please refer to the rdoc for full information on all available methods & classes.

Project home page

Information on the source tree, documentation, examples, issues and how to contribute, see

http://github.com/wassermanlab/bioruby-jaspar

The BioRuby community is on IRC server: irc.freenode.org, channel: #bioruby.

Cite

If you use this software, please cite one of

Biogems.info

This Biogem is published at (http://biogems.info/index.html#bio-jaspar)

Copyright

Copyright (c) 2015 Jessica Lee. See LICENSE.txt for further details.