
Reprocessing of public data

cziegenhain edited this page Aug 9, 2019 · 1 revision

Note that when reprocessing public data, the read ID lines that fastq-dump generates from GEO/SRA records can cause problems. Please ensure that read IDs are formatted as an Illumina sequencer would generate them, e.g. @NB502120:138:HG7YJBGXB:1:11101:8150:1026
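One way to spot-check a FASTQ header before reprocessing is a small pattern test. The sketch below is an illustration, not part of the pipeline: the function name `looks_like_illumina_id` and the regular expression are assumptions modelled on the seven colon-separated fields of the example ID above (instrument, run, flowcell, lane, tile, x, y), with an optional description after whitespace.

```python
import re

# Assumed pattern: '@' followed by seven colon-separated fields, as in the
# example ID above; anything after the first whitespace is tolerated.
ILLUMINA_ID = re.compile(
    r"^@[A-Za-z0-9_-]+:\d+:[A-Za-z0-9-]+:\d+:\d+:\d+:\d+(\s.*)?$"
)


def looks_like_illumina_id(header: str) -> bool:
    """Return True if a FASTQ header line resembles an Illumina read ID."""
    return ILLUMINA_ID.match(header.rstrip("\n")) is not None


if __name__ == "__main__":
    # The example ID from this page.
    print(looks_like_illumina_id("@NB502120:138:HG7YJBGXB:1:11101:8150:1026"))
    # A made-up SRA-style header (hypothetical accession) for comparison.
    print(looks_like_illumina_id("@SRR0000000.1 1 length=50"))
```

Checking only the first few headers of each file is usually enough, since fastq-dump applies the same header format to every read.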

To enforce this, you can pass the --origfmt and --defline-qual '+' flags to fastq-dump.
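As a minimal sketch of how these flags might be applied in a scripted download, the helper below assembles the fastq-dump argument list. The function name `build_fastq_dump_cmd` and the accession `SRR0000000` are hypothetical placeholders; only the two flags named above come from this page, and any further options are left to the caller.

```python
import subprocess


def build_fastq_dump_cmd(accession, extra_args=()):
    """Assemble a fastq-dump call that keeps the original read names.

    --origfmt keeps only the original sequence name in the defline, and
    --defline-qual '+' writes a bare '+' on the quality header line.
    """
    return ["fastq-dump", "--origfmt", "--defline-qual", "+", *extra_args, accession]


def run_fastq_dump(accession, extra_args=()):
    """Run fastq-dump; requires the SRA Toolkit to be installed and on PATH."""
    subprocess.run(build_fastq_dump_cmd(accession, extra_args), check=True)


if __name__ == "__main__":
    # Hypothetical accession; replace it with the run you want to reprocess.
    print(" ".join(build_fastq_dump_cmd("SRR0000000")))
```

Passing the arguments as a list avoids shell quoting issues with the '+' value.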
