AVP-Fusion: Adaptive Multi-Modal Fusion and Contrastive Learning for Two-Stage Antiviral Peptide Identification
By: Xinru Wen, Weizhong Lin, Xuan Xiao
Potential Business Impact:
Finds new medicines to fight viruses faster.
Accurate identification of antiviral peptides (AVPs) is critical for accelerating novel drug development. However, current computational methods struggle to capture intricate sequence dependencies and effectively handle ambiguous, hard-to-classify samples. To address these challenges, we propose AVP-Fusion, a novel two-stage deep learning framework integrating adaptive feature fusion and contrastive learning. Unlike traditional static feature concatenation, we construct a panoramic feature space using 10 distinct descriptors and introduce an Adaptive Gating Mechanism.This mechanism dynamically regulates the weights of local motifs extracted by CNNs and global dependencies captured by BiLSTMs based on sequence context. Furthermore, to address data distribution challenges, we employ a contrastive learning strategy driven by Online Hard Example Mining (OHEM) and BLOSUM62-based data augmentation, which significantly sharpens the model's decision boundaries. Experimental results on the benchmark Set 1 dataset demonstrate that AVP-Fusion achieves an accuracy of 0.9531 and an MCC of 0.9064, significantly outperforming state-of-the-art methods. In the second stage, leveraging transfer learning, the model enables precise subclass prediction for six viral families and eight specific viruses, even under limited sample sizes. In summary, AVP-Fusion serves as a robust and interpretable tool for high-throughput antiviral drug screening.
Similar Papers
AdaFusion: Prompt-Guided Inference with Adaptive Fusion of Pathology Foundation Models
CV and Pattern Recognition
Combines AI to better understand disease images.
AdaFusion: Prompt-Guided Inference with Adaptive Fusion of Pathology Foundation Models
CV and Pattern Recognition
Combines AI to better understand disease images.
PepEVOLVE: Position-Aware Dynamic Peptide Optimization via Group-Relative Advantage
Machine Learning (CS)
Designs better medicines faster.