Skip to content

A little QC tool that attempts to predict the genotype of mouse samples used, based on prior knowledge of high-quality SNPs from the Mouse Genomes Project

License

Notifications You must be signed in to change notification settings

FelixKrueger/reStrainingOrder

Repository files navigation

Build Status

reStrainingOrder - why do we need one?

reStrainingOrder is intended as QC tool that attempts to identify the genotype of pure strain or hybrid mouse samples. It can be be used to check public data as well as provide useful insight into mouse strains commonly used in your own lab.

To do this, reStrainingOrder harnesses single-nucleotide polymorphism (SNP) information collected by the Mouse Genomes Project, and constructs a fully N-masked genome similar to the approach of SNPsplit. The project has been updated to work with the latest release of the Mouse Genomes Project, which means that it will now assume the GRCm39 mouse genome build by default, and use the latest SNP annotation file (v8: mgp_REL2021_snps.vcf.gz).

reStrainingOrder is intended to work with most common types of Illumina sequencing - including RNA-seq, ChIP-seq, ATAC-seq or any kind of Bisulfite-seq. Supported aligners include Bowtie2, HISAT2, STAR, and Bismark (Oxford comma, anyone?).

Pure Strain example report

Here is an example of a pure C57BL/6 strain (Black6) strain. If the single-strain compatibility score is > 99%, one doesn't really need to look any further. The allele-ratio also indicated that the sample is purely Black6.

Hybrid example report

Here is an example of a 129S1/CAST hybrid strain (public data taken from this GEO entry, aligned with Bismark/Bowtie2). The Hybrid Strain Compatibility Scores indicate that a 129S1_SvImJ/CAST_EiJ hybrid is > 99% compatible with the data. The allele-ratios between 129 and CAST are almost 1:1.

Installation

Most of reStrainingOrder is written in Perl and is executed from the command line. To install reStrainingOrder simply download the latest release of the code from the Releases page and extract the files into a reStrainingOrder installation folder. The plotly.js JavaScript library comes packaged together with reStrainingOrder and does not need to be installed separately.

reStrainingOrder requires the following tools installed and ideally available in the PATH:

Documentation

The reStrainingOrder documentation can be found here: reStrainingOrder User Guide

Licences

reStrainingOrder itself is free software, reStrainingReport produces HTML graphs powered by Plot.ly which are also free to use and look at!

Credits

This project was started as part of the 2018 Cambridge area bioinformatics hackathon; reStrainingOrder was written by Felix Krueger, part of Babraham Bioinformatics, now part of Altos Bioinformatics.

About

A little QC tool that attempts to predict the genotype of mouse samples used, based on prior knowledge of high-quality SNPs from the Mouse Genomes Project

Resources

License

Stars

Watchers

Forks

Packages

No packages published