Four Quadrants of Difficulty: A Simple Categorisation and its Limits
By: Vanessa Toborek, Sebastian Müller, Christian Bauckhage
Potential Business Impact:
Teaches computers better by showing easy lessons first.
Curriculum Learning (CL) aims to improve the outcome of model training by estimating the difficulty of samples and scheduling them accordingly. In NLP, difficulty is commonly approximated using task-agnostic linguistic heuristics or human intuition, implicitly assuming that these signals correlate with what neural models find difficult to learn. We propose a four-quadrant categorisation of difficulty signals -- human vs. model and task-agnostic vs. task-dependent -- and systematically analyse their interactions on a natural language understanding dataset. We find that task-agnostic features behave largely independently and that only task-dependent features align. These findings challenge common CL intuitions and highlight the need for lightweight, task-dependent difficulty estimators that better reflect model learning behaviour.
Similar Papers
Beyond Shallow Heuristics: Leveraging Human Intuition for Curriculum Learning
Computation and Language
Teaches computers by showing them easy words first.
A Shared Geometry of Difficulty in Multilingual Language Models
Computation and Language
Helps computers understand how hard problems are.
Revisiting Generalization Across Difficulty Levels: It's Not So Easy
Computation and Language
Teaches computers to learn from easy and hard lessons.