Hierarchical Ranking Neural Network for Long Document Readability Assessment
By: Yurui Zheng, Yijun Chen, Shaohong Zhang
Potential Business Impact:
Helps computers judge how hard a text is to read.
Readability assessment aims to evaluate the reading difficulty of a text. In recent years, while deep learning technology has been gradually applied to readability assessment, most approaches fail to consider either the length of the text or the ordinal relationship of readability labels. This paper proposes a bidirectional readability assessment mechanism that captures contextual information to identify regions with rich semantic information in the text, thereby predicting the readability level of individual sentences. These sentence-level labels are then used to assist in predicting the overall readability level of the document. Additionally, a pairwise sorting algorithm is introduced to model the ordinal relationship between readability levels through label subtraction. Experimental results on Chinese and English datasets demonstrate that the proposed model achieves competitive performance and outperforms other baseline models.
Similar Papers
Readability Reconsidered: A Cross-Dataset Analysis of Reference-Free Metrics
Computation and Language
Makes computers understand writing like people do.
Readability $\ne$ Learnability: Rethinking the Role of Simplicity in Training Small Language Models
Computation and Language
Simple words don't always make computers learn better.
LiteraryQA: Towards Effective Evaluation of Long-document Narrative QA
Computation and Language
Helps computers understand stories better.