Score: 0

Creating a Hybrid Rule and Neural Network Based Semantic Tagger using Silver Standard Data: the PyMUSAS framework for Multilingual Semantic Annotation

Published: January 14, 2026 | arXiv ID: 2601.09648v1

By: Andrew Moore , Paul Rayson , Dawn Archer and more

Word Sense Disambiguation (WSD) has been widely evaluated using the semantic frameworks of WordNet, BabelNet, and the Oxford Dictionary of English. However, for the UCREL Semantic Analysis System (USAS) framework, no open extensive evaluation has been performed beyond lexical coverage or single language evaluation. In this work, we perform the largest semantic tagging evaluation of the rule based system that uses the lexical resources in the USAS framework covering five different languages using four existing datasets and one novel Chinese dataset. We create a new silver labelled English dataset, to overcome the lack of manually tagged training data, that we train and evaluate various mono and multilingual neural models in both mono and cross-lingual evaluation setups with comparisons to their rule based counterparts, and show how a rule based system can be enhanced with a neural network model. The resulting neural network models, including the data they were trained on, the Chinese evaluation dataset, and all of the code have been released as open resources.

Integrating Symbolic Natural Language Understanding and Language Models for Word Sense Disambiguation

Computation and Language

Helps computers understand words with many meanings.

20 Nov 2025 0

86%

NASP-T: A Fuzzy Neuro-Symbolic Transformer for Logic-Constrained Aviation Safety Report Classification

Artificial Intelligence

Makes AI understand safety rules for planes.

6 Oct 2025 0

86%

The NTNU System at the S&I Challenge 2025 SLA Open Track

Computation and Language

Tests speaking skills better by combining sound and words.

5 Jun 2025 0

View PDF Login to Bookmark

Creating a Hybrid Rule and Neural Network Based Semantic Tagger using Silver Standard Data: the PyMUSAS framework for Multilingual Semantic Annotation

Technical Abstract

Integrating Symbolic Natural Language Understanding and Language Models for Word Sense Disambiguation

NASP-T: A Fuzzy Neuro-Symbolic Transformer for Logic-Constrained Aviation Safety Report Classification

The NTNU System at the S&I Challenge 2025 SLA Open Track