Towards Computational Chinese Paleography
By: Yiran Rex Ma
Chinese paleography, the study of ancient Chinese writing, is undergoing a computational turn powered by artificial intelligence. This position paper charts the trajectory of this emerging field, arguing that it is evolving from automating isolated visual tasks to creating integrated digital ecosystems for scholarly research. We first map the landscape of digital resources, analyzing critical datasets for oracle bone, bronze, and bamboo slip scripts. The core of our analysis follows the field's methodological pipeline: from foundational visual processing (image restoration, character recognition), through contextual analysis (artifact rejoining, dating), to the advanced reasoning required for automated decipherment and human-AI collaboration. We examine the technological shift from classical computer vision to modern deep learning paradigms, including transformers and large multimodal models. Finally, we synthesize the field's core challenges -- notably data scarcity and a disconnect between current AI capabilities and the holistic nature of humanistic inquiry -- and advocate for a future research agenda focused on creating multimodal, few-shot, and human-centric systems to augment scholarly expertise.
Similar Papers
AncientBench: Towards Comprehensive Evaluation on Excavated and Transmitted Chinese Corpora
Computation and Language
Helps computers understand old Chinese writings.
The AI-Augmented Research Process: A Historian's Perspective
Computers and Society
AI helps historians analyze huge old texts quickly
Translation via Annotation: A Computational Study of Translating Classical Chinese into Japanese
Computation and Language
Helps computers translate old texts by learning from ancient notes.