rna-data-products

A Python and CWL pipeline that concatenates HuBMAP RNA-seq [Salmon] data into one data product per organ and one combined RNA-seq [Salmon] data product across all organs.

Pipeline steps

Create a UUIDs TSV file listing the UUIDs and HuBMAP IDs of all public processed datasets to include in the run.
Using the UUIDs TSV, create a data directory containing all H5AD files needed for the run.
Create an AWS access key ID and secret access key so the files can be uploaded to the S3 bucket.
Annotate and concatenate the data into a raw data product and a processed data product.

Requirements

The required Python packages are listed in docker/requirements.txt.
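
As a convenience, the sketch below reports which packages from a requirements file are not installed in the current environment. It is not part of this repository; the helper names `requirement_names` and `missing_packages` are hypothetical, and the parsing assumes simple `name==version` style lines (URL or VCS requirements are not handled).

```python
import re
from importlib import metadata
from pathlib import Path


def requirement_names(text):
    """Extract bare package names from requirements-style text."""
    names = []
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop trailing comments
        if not line or line.startswith("-"):  # skip blanks and pip options such as -r
            continue
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.append(match.group(0))
    return names


def missing_packages(names):
    """Return the names that are not installed in the current environment."""
    missing = []
    for name in names:
        try:
            metadata.version(name)
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    path = Path("docker/requirements.txt")
    if path.exists():
        print(missing_packages(requirement_names(path.read_text())))
```

Run it from the repository root; an empty list means every listed package was found.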

How to run

Step 1

python3 make_uuids_tsv.py [tissue_type]
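
To make the output of this step concrete, here is a minimal sketch of writing and reading a tab-separated file of UUIDs and HuBMAP IDs. The column names `uuid` and `hubmap_id` and the sample values are assumptions for illustration; the file actually produced by make_uuids_tsv.py may use different headers or extra columns.

```python
import csv
import io

# Hypothetical column names; the real file written by make_uuids_tsv.py may differ.
FIELDS = ["uuid", "hubmap_id"]


def write_uuids_tsv(rows, handle):
    """Write one row per dataset as tab-separated values with a header line."""
    writer = csv.DictWriter(
        handle, fieldnames=FIELDS, delimiter="\t", lineterminator="\n"
    )
    writer.writeheader()
    writer.writerows(rows)


def read_uuids_tsv(handle):
    """Read the TSV back into a list of dictionaries keyed by column name."""
    return list(csv.DictReader(handle, delimiter="\t"))


rows = [
    {"uuid": "0123abcd", "hubmap_id": "HBM000.AAAA.000"},  # placeholder values
    {"uuid": "4567ef01", "hubmap_id": "HBM000.BBBB.111"},
]

# An in-memory buffer stands in for the file on disk.
buffer = io.StringIO()
write_uuids_tsv(rows, buffer)
buffer.seek(0)
loaded = read_uuids_tsv(buffer)
```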

Step 2

python3 make_directory.py /hive/hubmap/data/ [uuids_file] [tissue_type]
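
The idea behind this step can be sketched as follows: for each UUID, find the H5AD files under a base directory and copy them into a run-specific data directory. This is an illustration only, not the logic of make_directory.py; the layout `base_dir/<uuid>/` and the function name `collect_h5ads` are assumptions.

```python
import shutil
import tempfile
from pathlib import Path


def collect_h5ads(base_dir, uuids, out_dir):
    """Copy every .h5ad found under base_dir/<uuid>/ into out_dir/<uuid>/.

    Assumes one subdirectory per UUID; UUIDs without a directory are skipped.
    """
    base_dir, out_dir = Path(base_dir), Path(out_dir)
    copied = []
    for uuid in uuids:
        source = base_dir / uuid
        if not source.is_dir():
            continue
        for h5ad in sorted(source.rglob("*.h5ad")):
            target_dir = out_dir / uuid
            target_dir.mkdir(parents=True, exist_ok=True)
            copied.append(Path(shutil.copy2(h5ad, target_dir / h5ad.name)))
    return copied


# Small demonstration on a temporary directory with an empty stand-in file.
with tempfile.TemporaryDirectory() as tmp:
    base = Path(tmp) / "data"
    (base / "uuid-a").mkdir(parents=True)
    (base / "uuid-a" / "expr.h5ad").write_bytes(b"")
    copied = collect_h5ads(base, ["uuid-a", "uuid-missing"], Path(tmp) / "out")
    names = [p.relative_to(Path(tmp) / "out").as_posix() for p in copied]
```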

Step 3

cwltool pipeline.cwl --[data_directory] --[uuids_file] --[tissue_type] --[access_key_id] --[secret_access_key]
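
A minimal sketch of assembling this command from Python, reading the AWS credentials from the standard `AWS_ACCESS_KEY_ID` and `AWS_SECRET_ACCESS_KEY` environment variables instead of typing them by hand. The input names are taken from the placeholders above and are assumed to match the inputs declared in pipeline.cwl; the function name `build_cwltool_command` and the example values are hypothetical.

```python
import os


def build_cwltool_command(data_directory, uuids_file, tissue_type, env=None):
    """Return the cwltool argument list; credentials come from the environment."""
    env = os.environ if env is None else env
    try:
        access_key_id = env["AWS_ACCESS_KEY_ID"]
        secret_access_key = env["AWS_SECRET_ACCESS_KEY"]
    except KeyError as missing:
        raise RuntimeError(f"Missing environment variable: {missing}") from None
    # Input names below are assumed from the README placeholders.
    return [
        "cwltool", "pipeline.cwl",
        "--data_directory", str(data_directory),
        "--uuids_file", str(uuids_file),
        "--tissue_type", tissue_type,
        "--access_key_id", access_key_id,
        "--secret_access_key", secret_access_key,
    ]


# Example with placeholder credentials passed explicitly instead of os.environ.
command = build_cwltool_command(
    "data",
    "uuids.tsv",
    "heart",  # placeholder tissue type
    env={"AWS_ACCESS_KEY_ID": "AKIAEXAMPLE", "AWS_SECRET_ACCESS_KEY": "example-secret"},
)
# To launch the pipeline: subprocess.run(command, check=True)
```

Passing an argument list to `subprocess.run` avoids shell quoting issues. Note that command-line arguments are visible to other users in the process list, so avoid logging or printing the assembled command.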
