A Standardized Benchmark for Multilabel Antimicrobial Peptide Classification
By: Sebastian Ojeda , Rafael Velasquez , Nicolás Aparicio and more
Potential Business Impact:
Finds new germ-fighting peptides faster.
Antimicrobial peptides have emerged as promising molecules to combat antimicrobial resistance. However, fragmented datasets, inconsistent annotations, and the lack of standardized benchmarks hinder computational approaches and slow down the discovery of new candidates. To address these challenges, we present the Expanded Standardized Collection for Antimicrobial Peptide Evaluation (ESCAPE), an experimental framework integrating over 80.000 peptides from 27 validated repositories. Our dataset separates antimicrobial peptides from negative sequences and incorporates their functional annotations into a biologically coherent multilabel hierarchy, capturing activities across antibacterial, antifungal, antiviral, and antiparasitic classes. Building on ESCAPE, we propose a transformer-based model that leverages sequence and structural information to predict multiple functional activities of peptides. Our method achieves up to a 2.56% relative average improvement in mean Average Precision over the second-best method adapted for this task, establishing a new state-of-the-art multilabel peptide classification. ESCAPE provides a comprehensive and reproducible evaluation framework to advance AI-driven antimicrobial peptide research.
Similar Papers
Semi-supervised Latent Bayesian Optimization for Designing Antimicrobial Peptides
Machine Learning (CS)
Finds new germ-fighting medicines faster.
Improvement of AMPs Identification with Generative Adversarial Network and Ensemble Classification
Machine Learning (CS)
Finds new germ-fighting helpers for medicine.
PepEVOLVE: Position-Aware Dynamic Peptide Optimization via Group-Relative Advantage
Machine Learning (CS)
Designs better medicines faster.