Fifteen Years of Child-Centered Long-Form Recordings: Promises, Resources, and Remaining Challenges to Validity
By: Loann Peurey , Marvin Lavechin , Tarek Kunze and more
Potential Business Impact:
Helps scientists understand how kids learn to talk.
Audio-recordings collected with a child-worn device are a fundamental tool in child language research. Long-form recordings collected over whole days promise to capture children's input and production with minimal observer bias, and therefore high validity. The sheer volume of resulting data necessitates automated analysis to extract relevant metrics for researchers and clinicians. This paper summarizes collective knowledge on this technique, providing entry points to existing resources. We also highlight various sources of error that threaten the accuracy of automated annotations and the interpretation of resulting metrics. To address this, we propose potential troubleshooting metrics to help users assess data quality. While a fully automated quality control system is not feasible, we outline practical strategies for researchers to improve data collection and contextualize their analyses.
Similar Papers
Enabling automatic transcription of child-centered audio recordings from real-world environments
Sound
Lets computers understand kids' talking from recordings.
Automated Analysis of Naturalistic Recordings in Early Childhood: Applications, Challenges, and Opportunities
Audio and Speech Processing
Helps computers understand babies' first words.
Longitudinal and Multimodal Recording System to Capture Real-World Patient-Clinician Conversations for AI and Encounter Research: Protocol
Computers and Society
Helps doctors understand patients better through recordings.