Score: 0

Modeling Authorial Style in Urdu Novels Using Character Interaction Graphs and Graph Neural Networks

Published: December 14, 2025 | arXiv ID: 2512.12654v1

By: Hassan Mujtaba, Hamza Naveed, Hanzlah Munir

Potential Business Impact:

Finds authors by how characters talk to each other.

Business Areas:
Natural Language Processing Artificial Intelligence, Data and Analytics, Software

Authorship analysis has traditionally focused on lexical and stylistic cues within text, while higher-level narrative structure remains underexplored, particularly for low-resource languages such as Urdu. This work proposes a graph-based framework that models Urdu novels as character interaction networks to examine whether authorial style can be inferred from narrative structure alone. Each novel is represented as a graph where nodes correspond to characters and edges denote their co-occurrence within narrative proximity. We systematically compare multiple graph representations, including global structural features, node-level semantic summaries, unsupervised graph embeddings, and supervised graph neural networks. Experiments on a dataset of 52 Urdu novels written by seven authors show that learned graph representations substantially outperform hand-crafted and unsupervised baselines, achieving up to 0.857 accuracy under a strict author-aware evaluation protocol.

Country of Origin
🇵🇰 Pakistan

Page Count
6 pages

Category
Computer Science:
Computation and Language