Score: 2

BP-Seg: A graphical model approach to unsupervised and non-contiguous text segmentation using belief propagation

Published: May 22, 2025 | arXiv ID: 2505.16965v1

By: Fengyi Li , Kayhan Behdin , Natesh Pillai and more

BigTech Affiliations: LinkedIn

Potential Business Impact:

Splits long texts into meaningful parts.

Business Areas:

Text Analytics Data and Analytics, Software

Text segmentation based on the semantic meaning of sentences is a fundamental task with broad utility in many downstream applications. In this paper, we propose a graphical model-based unsupervised learning approach, named BP-Seg for efficient text segmentation. Our method not only considers local coherence, capturing the intuition that adjacent sentences are often more related, but also effectively groups sentences that are distant in the text yet semantically similar. This is achieved through belief propagation on the carefully constructed graphical models. Experimental results on both an illustrative example and a dataset with long-form documents demonstrate that our method performs favorably compared to competing approaches.

SegNSP: Revisiting Next Sentence Prediction for Linear Text Segmentation

Computation and Language

Helps computers understand where one topic ends.

7 Jan 2026 1

85%

Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation

CV and Pattern Recognition

Helps doctors see inside bodies without all the words.

15 Jul 2025 2

85%

Unsupervised Speech Segmentation: A General Approach Using Speech Language Models

Computation and Language

**Splits talking into meaningful parts automatically.**

7 Jan 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Repos / Data Links

github.com

Page Count

10 pages

BP-Seg: A graphical model approach to unsupervised and non-contiguous text segmentation using belief propagation

Splits long texts into meaningful parts.

Technical Abstract

SegNSP: Revisiting Next Sentence Prediction for Linear Text Segmentation

Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation

Unsupervised Speech Segmentation: A General Approach Using Speech Language Models