Score: 1

Sequence-to-Image Transformation for Sequence Classification Using Rips Complex Construction and Chaos Game Representation

Published: December 10, 2025 | arXiv ID: 2512.10141v1

By: Sarwan Ali, Taslim Murad, Imdadullah Khan

Potential Business Impact:

Turns DNA code into pictures for better cancer fighting.

Business Areas:
Image Recognition Data and Analytics, Software

Traditional feature engineering approaches for molecular sequence classification suffer from sparsity issues and computational complexity, while deep learning models often underperform on tabular biological data. This paper introduces a novel topological approach that transforms molecular sequences into images by combining Chaos Game Representation (CGR) with Rips complex construction from algebraic topology. Our method maps sequence elements to 2D coordinates via CGR, computes pairwise distances, and constructs Rips complexes to capture both local structural and global topological features. We provide formal guarantees on representation uniqueness, topological stability, and information preservation. Extensive experiments on anticancer peptide datasets demonstrate superior performance over vector-based, sequence language models, and existing image-based methods, achieving 86.8\% and 94.5\% accuracy on breast and lung cancer datasets, respectively. The topological representation preserves critical sequence information while enabling effective utilization of vision-based deep learning architectures for molecular sequence analysis.

Country of Origin
πŸ‡΅πŸ‡° πŸ‡ΊπŸ‡Έ United States, Pakistan

Page Count
12 pages

Category
Computer Science:
Machine Learning (CS)