Score: 0

Robust Gene Prioritization via Fast-mRMR Feature Selection in high-dimensional omics data

Published: November 26, 2025 | arXiv ID: 2511.21211v1

By: Rubén Fernández-Farelo , Jorge Paz-Ruza , Bertha Guijarro-Berdiñas and more

Potential Business Impact:

Finds important genes for health research faster.

Business Areas:
Bioinformatics Biotechnology, Data and Analytics, Science and Engineering

Gene prioritization (identifying genes potentially associated with a biological process) is increasingly tackled with Artificial Intelligence. However, existing methods struggle with the high dimensionality and incomplete labelling of biomedical data. This work proposes a more robust and efficient pipeline that leverages Fast-mRMR feature selection to retain only relevant, non-redundant features for classifiers. This enables us to build simpler and more effective models, as well as to combine different biological feature sets. Experiments on Dietary Restriction datasets show significant improvements over existing methods, proving that feature selection can be critical for reliable gene prioritization.

Page Count
6 pages

Category
Computer Science:
Machine Learning (CS)