awesome-pipeline by pditommaso

A curated list of awesome pipeline toolkits inspired by Awesome Sysadmin

created at July 19, 2014, 3:48 p.m.

Unknown languages

231 +0

6,223 +6

626 +1

GitHub
csvkit by wireservice

A suite of utilities for converting to and working with CSV, the king of tabular file formats.

created at April 1, 2011, 3 a.m.

Python

131 +0

6,036 +5

604 +1

GitHub
bioinformatics by ossu

microscope Path to a free self-taught education in Bioinformatics!

created at June 21, 2016, 11:11 p.m.

Unknown languages

368 +1

5,570 +25

930 +3

GitHub
biopython by biopython

Official git repository for Biopython (originally converted from CVS)

created at March 15, 2009, 9:09 p.m.

Python

170 +0

4,430 +9

1,767 +1

GitHub
deepvariant by google

DeepVariant is an analysis pipeline that uses a deep neural network to call genetic variants from next-generation DNA sequencing data.

created at Nov. 23, 2017, 1:56 a.m.

Python

156 +0

3,269 +5

728 +0

GitHub
nextflow by nextflow-io

A DSL for data-driven computational pipelines

created at March 27, 2013, 11:17 a.m.

Groovy

89 +0

2,790 +2

638 +3

GitHub
oneliners by stephenturner

Useful bash one-liners for bioinformatics.

created at Oct. 19, 2013, 11:33 a.m.

Unknown languages

155 +0

1,881 +5

519 +0

GitHub
samtools by samtools

Tools (written in C using htslib) for manipulating next-generation sequencing data

created at March 9, 2012, 2:49 a.m.

C

100 +0

1,641 +2

580 +0

GitHub
rust-bio by rust-bio

This library provides implementations of many algorithms and data structures that are useful for bioinformatics. All provided implementations are rigorously tested via continuous integration.

created at Jan. 25, 2015, 4:40 p.m.

Rust

71 +0

1,618 +4

207 +0

GitHub
bwa by lh3

Burrow-Wheeler Aligner for short-read alignment (see minimap2 for long-read alignment)

created at Jan. 14, 2011, 1:36 a.m.

C

107 +0

1,549 +3

555 -1

GitHub
MMseqs2 by soedinglab

MMseqs2: ultra fast and sensitive search and clustering suite

created at July 20, 2016, 6:17 a.m.

C

31 +0

1,489 +8

200 +1

GitHub
common-workflow-language by common-workflow-language

Repository for the CWL standards. Use https://cwl.discourse.group/ for support 😊

created at Sept. 25, 2014, 11:04 a.m.

Common Workflow Language

108 +0

1,456 +1

196 +0

GitHub
seqtk by lh3

Toolkit for processing sequences in FASTA/Q formats

created at March 23, 2012, 11:24 p.m.

C

62 +0

1,407 +2

307 +0

GitHub
rnaseq_tutorial by griffithlab

Informatics for RNA-seq: A web resource for analysis on the cloud. Educational tutorials and working pipelines for RNA-seq analysis including an introduction to: cloud computing, critical file formats, reference genomes, gene annotation, expression, differential expression, alternative splicing, data visualization, and interpretation.

created at July 31, 2014, 6:58 p.m.

R

182 +0

1,345 +1

620 +1

GitHub
seqkit by shenwei356

A cross-platform and ultrafast toolkit for FASTA/Q file manipulation

created at Feb. 28, 2016, 10:04 a.m.

Go

29 +2

1,333 +5

159 +0

GitHub
MultiQC by MultiQC

Aggregate results from bioinformatics analyses across many samples into a single report.

created at Aug. 4, 2015, 1:47 p.m.

JavaScript

35 +0

1,242 +3

606 +1

GitHub
scipipe by scipipe

Robust, flexible and resource-efficient pipelines using Go and the commandline

created at March 7, 2015, 9:47 p.m.

Go

38 +0

1,081 +3

71 +0

GitHub
diamond by bbuchfink

Accelerated BLAST compatible local sequence aligner.

created at March 10, 2015, 11:19 p.m.

C++

38 +0

1,078 +5

181 -1

GitHub
csvtk by shenwei356

A cross-platform, efficient and practical CSV/TSV toolkit in Golang

created at April 3, 2016, 2:31 p.m.

Go

23 +0

1,026 +3

85 +0

GitHub
cromwell by broadinstitute

Scientific workflow engine designed for simplicity & scalability. Trivially transition between one off use cases to massive scale production environments

created at April 17, 2015, 7:39 p.m.

Scala

111 +1

1,003 +2

359 +0

GitHub