Gathering, updating, and calibrating phylogenies for The Open Tree of Life
Emily Jane McTavish
Heidelberg Institute for Theoretical Studies
The Open Tree of Life project is a collaborative effort to synthesize, share, and update a comprehensive phylogeny of all 2.3 million named species. We have completed a draft synthesis of a single tree from hundreds of phylogenetic estimates using taxonomy as a scaffold. This synthesis is not static but rather will be continually revised as new data become available. This undertaking requires development of both novel infrastructure and analysis tools. I will discuss three components of this project: Phylesystem, an open database and web application for community curation of phylogenies using a git-based datastore, PhyScraper, a pipeline to continually update phylogenetic estimates as new data is generated, and FastDate, an algorithm to rapidly generate maximum a posteriori estimates of time-calibrated trees, even for phylogenies with hundreds or thousands of tips. Together, these developments reduce impediments to accessing, analyzing and reusing the phylogenetic information which is essential to biological research today.