A Data Annotation Requirements Representation and Specification (DARS)
By: Yi Peng , Hina Saeeda , Hans-Martin Heyn and more
With the rise of AI-enabled cyber-physical systems, data annotation has become a critical yet often overlooked process in the development of these intelligent information systems. Existing work in requirements engineering (RE) has explored how requirements for AI systems and their data can be represented. However, related interviews with industry professionals show that data annotations and their related requirements introduce distinct challenges, indicating a need for annotation-specific requirement representations. We propose the Data Annotation Requirements Representation and Specification (DARS), including an Annotation Negotiation Card to align stakeholders on objectives and constraints, and a Scenario-Based Annotation Specification to express atomic and verifiable data annotation requirements. We evaluate DARS with an automotive perception case related to an ongoing project, and a mapping against 18 real-world data annotation error types. The results suggest that DARS mitigates root causes of completeness, accuracy, and consistency annotation errors. By integrating DARS into RE, this work improves the reliability of safety-critical systems using data annotations and demonstrates how engineering frameworks must evolve for data-dependent components of today's intelligent information systems.
Similar Papers
RE for AI in Practice: Managing Data Annotation Requirements for AI Autonomous Driving Systems
Software Engineering
Makes self-driving cars safer by improving how AI learns.
Data Annotation Quality Problems in AI-Enabled Perception System Development
Software Engineering
Finds mistakes in AI car driving data.
Towards Human-AI Synergy in Requirements Engineering: A Framework and Preliminary Study
Software Engineering
Helps computers and people build better software together.