Project

dirseq

0.0
No commit activity in last 3 years
No release in over 3 years
There's a lot of open issues
FPKG (gene expression metric) calculator for metatranscriptomics
2005
2006
2007
2008
2009
2010
2011
2012
2013
2014
2015
2016
2017
2018
2019
2020
2021
2022
2023
2024
2025
 Dependencies

Development

~> 2.1
~> 2.4, >= 2.4.9
~> 0.10
~> 3.0

Runtime

~> 1.4, >= 1.4.2
 Project Readme

dirseq

Build Status

DirSeq work out whether RNAseq reads from metatranscriptomes are generally in the same direction as the ORF predicted, and provide gene-wise coverages using DNAseq mappings.

Note: this software is under active development!

Installation

Install some prerequisites via conda, and then dirseq itself:

conda create -c bioconda -n dirseq -y ruby samtools bedtools'>'2.24
conda activate dirseq
gem install dirseq

The following dependencies are installed above, but for completeness of documentation, dirseq requires these dependencies, on top of the Ruby ones:

  • samtools (tested with 0.1.19 and 1.0+)
  • bedtools (tested with 2.24.0) - old versions won't work.
  • Ruby (tested with 2.1.1)

Usage

Example usage:

Download the example data:

git clone https://github.com/wwood/dirseq
cd dirseq

Then run dirseq:

dirseq --bam spec/data/eg.bam --gff spec/data/eg.gff --measure-type count

Full usage help:

$ dirseq -h

    Usage: dirseq <arguments>

    Reports the coverage of a mapping in against each gene given in a GFF file

        --bam FILE                   path to mapping file [required]
        --gff FILE                   path to GFF3 file [required]

Optional parameters:

        --forward-read-only          consider only forward reads (i.e. read1) and ignore reverse reads. [default false]
        --ignore-directions          ignore directionality, give overall coverage [default: false i.e. differentiate between directions]
        --measure-type TYPE          what to count for each gene [options: count, coverage][default: coverage]
        --accepted-feature-types TYPE
                                     Print only features of these type(s) [default CDS]
        --comment-fields             Print elements from the comments in the GFF file [default ID]
        --sam-filter-flags           Apply these samtools filters [default: -F0x100 -F0x800]
        
Verbosity:

    -q, --quiet                      Run quietly, set logging to ERROR level [default INFO]
        --logger filename            Log to file [default stderr]
        --trace options              Set log level [default INFO]. e.g. '--trace debug' to set logging level to DEBUG

Running on EnrichM output, the output columns are changed relative to PROKKA-generated GFF files:

dirseq --bam spec/data/eg.bam --gff spec/data/eg.gff --measure-type count --comment-fields seq_id,annotations

Project home page

Information on the source tree, documentation, examples, issues and how to contribute, see

http://github.com/wwood/dirseq

The BioRuby community is on IRC server: irc.freenode.org, channel: #bioruby.

Cite

If you use this software, please cite

Woodcroft, B.J., Singleton, C.M., Boyd, J.A. et al. Genome-centric view of carbon processing in thawing permafrost. Nature 560, 49–54 (2018). https://doi.org/10.1038/s41586-018-0338-1

Copyright

Copyright (c) 2014-2021 Ben J. Woodcroft. See LICENSE.txt for further details.