Score: 0

Approximations for the Weighted Reversal, Transposition, and Indel Distance Problem with Intergenic Region Information

Published: December 31, 2025 | arXiv ID: 2512.25016v1

By: Gabriel Siqueira, Alexsandro Oliveira Alexandrino, Zanoni Dias

Genome rearrangement distances are an established method in genome comparison. Works in this area may include various rearrangement operations representing large-scale mutations, gene orientation information, the number of nucleotides in intergenic regions, and weights reflecting the expected frequency of each operation. In this article, we model genomes containing at most one copy of each gene by considering gene sequences, with orientations, and representing intergenic regions according to their nucleotide lengths. We looked at a problem called Weighted Reversal, Transposition, and Indel Distance, which seeks the minimal cost sequence composed by the rearrangement operations of reversals, transposition, and indels, capable of transforming one genome into another. We leverage a structure called Labeled Intergenic Breakpoint Graph to show an algorithm for that problem with guaranteed approximations considering some sets of weights for the operations.

Category
Computer Science:
Data Structures and Algorithms