Skip to content
/ reboot Public

REBOOT: RE-interpreting BOOTstrap for phylogenetics

License

Notifications You must be signed in to change notification settings

bqminh/reboot

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

REBOOT: RE-interpreting BOOTstrap supports for phylogenetics

Introduction

This project aims to evaluate the interpretation of support values generated by Felsenstein's bootstrap (FBP; Felsenstein 1985), transfer bootstrap expectation (TBE; Lemoine et al. 2018) and other bootstrap methods (e.g. Minh et al. 2013). To do so we rely on simulations where the true trees are known. The main idea is to look at the correlation between branch supports inferred by each method and the probability that the branch is true. While FBP is well known to be conservative (i.e., FBP support of >=70% typically means that the branch is true with a probability of >=0.95) (Hillis and Bull 1993), none is known about TBE. Thus, we want to assess if the suggested threshold of 70% applies well to TBE or not.

Simulations

Birth-death simulations

To replicate the results follow these steps:

  1. Run generate_datasets.R located in scripts/ folder (this relies on the functions.R script). This generates datasets and runs IQ-TREE on them.

  2. Run calculate_tbes.R. This uses BOOSTER to calculate TBE and FBP for each dataset).

  3. Run bootstrap_data.R. This generates the output file simulated.csv.

PANDIT simulations

  1. Download the simulated alignments based on the PANDIT database from DOI.

  2. Go to dna folder and for each sub-folder $i, run IQ-TREE:

     iqtree -s data.$i -m `cat model.$i` -b 100
    
  3. In each sub-folder, create a symbolic links:

     ln -s data.$i alignment.fasta
     ln -s tree.$i phylogram.phy
    
  4. Run calculate_tbes.R.

  5. Run bootstrap_data.R. This generates the output file pandit.csv.

Making plots from birth-death and PANDIT simulations

  1. Run bootstrap_plots.R. This combines the output from PANDIT analysis (pandit.csv) with the output from birth-death analysis (simulated.csv) and makes the plot.

References

About

REBOOT: RE-interpreting BOOTstrap for phylogenetics

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published