awesome-pipeline by pditommaso

A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin

created at July 19, 2014, 3:48 p.m.

Unknown languages

234 +0

5,895 +13

615 +0

GitHub
csvkit by wireservice

A suite of utilities for converting to and working with CSV, the king of tabular file formats.

created at April 1, 2011, 3 a.m.

Python

131 +0

5,808 +7

600 -1

GitHub
bioinformatics by ossu

microscope Path to a free self-taught education in Bioinformatics!

created at June 21, 2016, 11:11 p.m.

Unknown languages

356 +0

4,992 +18

842 +4

GitHub
biopython by biopython

Official git repository for Biopython (originally converted from CVS)

created at March 15, 2009, 9:09 p.m.

Python

171 +0

4,148 +17

1,709 +3

GitHub
deepvariant by google

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

created at Nov. 23, 2017, 1:56 a.m.

Python

160 +0

3,070 +9

695 -3

GitHub
nextflow by nextflow-io

A DSL for data-driven computational pipelines

created at March 27, 2013, 11:17 a.m.

Groovy

87 +0

2,536 +11

584 +1

GitHub
oneliners by stephenturner

Useful bash one-liners for bioinformatics.

created at Oct. 19, 2013, 11:33 a.m.

Unknown languages

158 +0

1,794 +0

510 +1

GitHub
samtools by samtools

Tools (written in C using htslib) for manipulating next-generation sequencing data

created at March 9, 2012, 2:49 a.m.

C

101 +0

1,543 +4

569 +3

GitHub
rust-bio by rust-bio

This library provides implementations of many algorithms and data structures that are useful for bioinformatics. All provided implementations are rigorously tested via continuous integration.

created at Jan. 25, 2015, 4:40 p.m.

Rust

70 +0

1,488 +2

197 +0

GitHub
bwa by lh3

Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)

created at Jan. 14, 2011, 1:36 a.m.

C

106 +0

1,441 +8

540 +2

GitHub
common-workflow-language by common-workflow-language

Repository for the CWL standards. Use https://cwl.discourse.group/ for support 😊

created at Sept. 25, 2014, 11:04 a.m.

Common Workflow Language

112 +0

1,439 +3

199 +0

GitHub
rnaseq_tutorial by griffithlab

Informatics for RNA-seq: A web resource for analysis on the cloud. Educational tutorials and working pipelines for RNA-seq analysis including an introduction to: cloud computing, critical file formats, reference genomes, gene annotation, expression, differential expression, alternative splicing, data visualization, and interpretation.

created at July 31, 2014, 6:58 p.m.

R

185 +0

1,309 +2

615 +0

GitHub
seqtk by lh3

Toolkit for processing sequences in FASTA/Q formats

created at March 23, 2012, 11:24 p.m.

C

62 +0

1,308 +2

306 +0

GitHub
MMseqs2 by soedinglab

MMseqs2: ultra fast and sensitive search and clustering suite

created at July 20, 2016, 6:17 a.m.

C

32 +2

1,241 +2

177 +0

GitHub
seqkit by shenwei356

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

created at Feb. 28, 2016, 10:04 a.m.

Go

27 +0

1,194 +6

154 +0

GitHub
MultiQC by MultiQC

Aggregate results from bioinformatics analyses across many samples into a single report.

created at Aug. 4, 2015, 1:47 p.m.

JavaScript

36 +1

1,161 +3

576 +1

GitHub
scipipe by scipipe

Robust, flexible and resource-efficient pipelines using Go and the commandline

created at March 7, 2015, 9:47 p.m.

Go

38 +0

1,052 +0

72 +0

GitHub
bcbio-nextgen by bcbio

Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis

created at Feb. 6, 2013, 11:14 a.m.

Python

87 +0

973 +1

357 +0

GitHub
cromwell by broadinstitute

Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments

created at April 17, 2015, 7:39 p.m.

Scala

113 +0

953 +3

348 +1

GitHub
diamond by bbuchfink

Accelerated BLAST compatible local sequence aligner.

created at March 10, 2015, 11:19 p.m.

C++

37 +0

951 -1

169 -3

GitHub