Towards Measurement Theory for Artificial Intelligence
By: Elija Perrier
Potential Business Impact:
Measures AI smartness fairly and reliably.
We motivate and outline a programme for a formal theory of measurement of artificial intelligence. We argue that formalising measurement for AI will allow researchers, practitioners, and regulators to: (i) make comparisons between systems and the evaluation methods applied to them; (ii) connect frontier AI evaluations with established quantitative risk analysis techniques drawn from engineering and safety science; and (iii) foreground how what counts as AI capability is contingent upon the measurement operations and scales we elect to use. We sketch a layered measurement stack, distinguish direct from indirect observables, and signpost how these ingredients provide a pathway toward a unified, calibratable taxonomy of AI phenomena.
Similar Papers
Safety by Measurement: A Systematic Literature Review of AI Safety Evaluation Methods
Artificial Intelligence
Tests AI for dangerous tricks and hidden goals.
Measurement to Meaning: A Validity-Centered Framework for AI Evaluation
Computers and Society
Helps check if AI truly understands, not just memorizes.
Position: Evaluating Generative AI Systems Is a Social Science Measurement Challenge
Computers and Society
Makes AI tests more fair and accurate.