snakePipes

_images/snakePipes_small.png

snakePipes are pipelines built using snakemake and python for the analysis of epigenomic datasets.

Below is the list of pipelines available in snakePipes

Pipeline

Description

createIndices

Create indices for an organism for further use within snakePipes

DNA-mapping

Basic DNA mapping using bowtie2, filter mapped files, QC and create coverage plots

ChIP-seq

Use the DNA mapping output and run ChIP/Input normalization and peak calling

ATAC-seq

Use the DNA mapping output and detect open chromatin regions for ATAC-seq data

HiC

Hi-C analysis workflow, from mapping to TAD calling

RNA-seq

RNA-Seq workflow : From mapping to differential expression using DEseq2

scRNA-seq

Single-cell RNA-Seq (CEL-Seq2) workflow : From mapping to differential expression

WGBS

Whole-genome Bisulfite-Seq analysis workflow, from mapping to DMR calling and differential methylation analysis

preprocessing

Merging technical replicates (e.g., across lanes), removing optical duplicates, running FastQC

Quick start

  • Assuming you have python3 with conda, install snakePipes with:

conda create -n snakePipes -c mpi-ie -c bioconda -c conda-forge snakePipes
  • Download genome fasta and annotations for an your organism, and build indexes, Check in createIndices

  • Download example fastq files for the human genome here

  • Execute the DNA-mapping pipeline using the example command.sh in the test data directory.

Running your own analysis

For a detail introduction to setting up snakePipes from scratch, please visit Setting up snakePipes

For each organism of interest, snakePipes requires fasta files, genome indexes and annotation files. Paths to these files are specified in the organism/<name>.yaml files. After installation, the location of these files could be revealed by the following command:

snakePipes info

You could either modify the existing files (add your own paths), or add a new file there. See more detail in Running snakePipes

snakePipes could either be executed locally, or on any snakemake-supported cluster infrastructure. See details for setting up the cluster command in Running snakePipes

Citation

If you adopt/run snakePipes for your analysis, cite it as follows :

Bhardwaj V, Heyne S, Sikora K, Rabbani L, Rauer M, Kilpert F, Richter AS, Ryan DP, Manke T. snakePipes: facilitating flexible, scalable and integrative epigenomic analysis. Bioinformatics. 2019 May 27. pii: btz436. doi:

10.1093/bioinformatics/btz436. [Epub ahead of print] PubMed PMID: 31134269. https://www.ncbi.nlm.nih.gov/pubmed/31134269

_images/logo_mpi-ie.jpg

This tool suite is developed by the Bioinformatics Unit at the Max Planck Institute for Immunobiology and Epigenetics, Freiburg.

Help and Support

For query/questions regarding snakePipes, please write on biostars with the tag #snakePipes

For feature requests or bug reports, please open an issue on our GitHub Repository.

Contents: