Skip to article frontmatterSkip to article content

Phylo2Vec Workshop

Abstract

Phylogenetic trees are diagrams that illustrate species’ shared evolutionary history. Represented as bifurcating binary trees, these trees also represent similar relationships such as tracing the similarities between languages.

Given the importance of this data structure, a plethora of corresponding software currently exists. However, the current standard for representing a phylogenetic tree is a string via the Newick format. Conversely, phylo2vec represents phylogenetic trees as integer vectors. This representation requires substantially less storage, and thus enables more efficient tree operations.

Our latest contributions have completely rewritten the core component of phylo2vec in Rust, boosting both performance and memory efficiency—while still maintaining the Python and R APIs. We’ve also implemented several optimizations to reduce time complexity.

The session will last 1 hour 30 minutes. Please bring a laptop so you can follow along and try things out yourself!

The workshop is co-organised by the University of Copenhagen, the eScience institute, and the Statistics Section of the Department of Mathematics at Imperial College London.

Learning Goals

Workshop Logistical Information

Preliminary Agenda

Note: Subject to change

TimeTopic/Activity
11:00-11:15Welcome and Introductions
11:15-11:30Overview of Phylo2Vec and its Applications
11:30-12:00Tutorial
11:00-12:30Exercise

List of Exercises

Running the Exercises

The quickest way to get started without manual installation is to use GitHub Codespaces - “a development environment that’s hosted in the cloud”. This allows you to run the workshop exercises directly in your browser without needing to set up a local environment. In order to access the Codespace, you need to register an account with GitHub.

Open in GitHub Codespaces

☝️ Click the button above to go to options window to launch a GitHub Codespace.