Skip to content

Different strategies for Downsampling: Downsampling is not just throwing a read

License

Notifications You must be signed in to change notification settings

SherineAwad/Downsampling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Snakemake License

Author: Sherine Awad

A snakemake pipeline for Downsampling Strategies

Dropping reads is not as simple as discarding lines in a SAM file at random using unix commands. As in the paired short-read sequencing technology, reads come in pairs. If we drop reads line by line, we risk dropping a read and keeping its mate, and vice versa. In addition to the issues of supplementary and secondary alignments. Here are different strategies of downsampling.

Change the config.yaml file appropriately according to your data. Update samples.tsv to include your samples. You can edit config file to change this name.

Then run: snakemake -jnumber_of_cores, for example for 5 cores use:

snakemake -j5 

and for a dry run use:

snakemake -j1 -n 

and to print the commands in a dry run use:

snakemake -j1 -n -p 

For the sake reproducibility, use conda to pull same versions of tools. Snakemake and conda have to be installed in your system:

snakemake --cores --use-conda

About

Different strategies for Downsampling: Downsampling is not just throwing a read

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages