Start here: genome.py
This project applies techniques from reverse engineering to understand any DNA genome. The goal here is simply to build an understanding of DNA protein sequesnce(genome) using first principles.
Biological systems are fundamentally information processing systems. While not a perfect analogy, software provides a useful framework for thinking about biology. The table below provides a rough outline of this analogy
GenBank is the NIH genetic sequence database, an annotated collection of all publicly available DNA and RNA sequences.
translate.py
contains a function translate
that converts an RNA sequence to a chain of amino acids.
The translate
function is used in genome.py
to identify and annotate functions for all proteins encoded by the genome.
The OpenMM toolkit is used for molecular simulation of protein folding in fold.py
.