On the Approximation of Phylogenetic Distance Functions by Artificial Neural Networks
By: Benjamin K. Rosenzweig, Matthew W. Hahn
Potential Business Impact:
Helps scientists build family trees for living things.
Inferring the phylogenetic relationships among a sample of organisms is a fundamental problem in modern biology. While distance-based hierarchical clustering algorithms achieved early success on this task, these have been supplanted by Bayesian and maximum likelihood search procedures based on complex models of molecular evolution. In this work we describe minimal neural network architectures that can approximate classic phylogenetic distance functions and the properties required to learn distances under a variety of molecular evolutionary models. In contrast to model-based inference (and recently proposed model-free convolutional and transformer networks), these architectures have a small computational footprint and are scalable to large numbers of taxa and molecular characters. The learned distance functions generalize well and, given an appropriate training dataset, achieve results comparable to state-of-the art inference methods.
Similar Papers
The Evolution of Learning Algorithms for Artificial Neural Networks
Neural and Evolutionary Computing
Evolves computer brains to learn like us.
Walking on the Fiber: A Simple Geometric Approximation for Bayesian Neural Networks
Machine Learning (CS)
Lets computers learn with less guessing.
Hierarchical geometric deep learning enables scalable analysis of molecular dynamics
Machine Learning (CS)
Analyzes huge protein movements on one computer.