Score: 1

Towards Multi-Level Transcript Segmentation: LoRA Fine-Tuning for Table-of-Contents Generation

Published: January 5, 2026 | arXiv ID: 2601.02128v1

By: Steffen Freisinger , Philipp Seeberger , Thomas Ranzenberger and more

Potential Business Impact:

Organizes spoken words into chapters for easier understanding.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Segmenting speech transcripts into thematic sections benefits both downstream processing and users who depend on written text for accessibility. We introduce a novel approach to hierarchical topic segmentation in transcripts, generating multi-level tables of contents that capture both topic and subtopic boundaries. We compare zero-shot prompting and LoRA fine-tuning on large language models, while also exploring the integration of high-level speech pause features. Evaluations on English meeting recordings and multilingual lecture transcripts (Portuguese, German) show significant improvements over established topic segmentation baselines. Additionally, we adapt a common evaluation measure for multi-level segmentation, taking into account all hierarchical levels within one metric.

Repos / Data Links

Page Count
5 pages

Category
Computer Science:
Computation and Language