The Voice Timbre Attribute Detection 2025 Challenge Evaluation Plan
By: Zhengyan Sheng, Jinghao He, Liping Chen and more
Potential Business Impact:
Helps computers describe voices like humans do.
Voice timbre is the unique quality or character of a person's voice that distinguishes it from other voices, as perceived by human hearing. The Voice Timbre Attribute Detection (VtaD) 2025 challenge focuses on explaining voice timbre attributes in a comparative manner. In this challenge, the human impression of voice timbre is verbalized with a set of sensory descriptors such as bright, coarse, soft, and magnetic. Timbre is then explained by comparing the intensity of two voices within a specific descriptor dimension. The VtaD 2025 challenge starts in May and culminates in a special proposal at the NCMMSC2025 conference in October 2025 in Zhenjiang, China.
Similar Papers
The First Voice Timbre Attribute Detection Challenge
Sound
Helps computers understand how voices sound different.
Introducing voice timbre attribute detection
Sound
Helps computers tell voices apart by sound.
QvTAD: Differential Relative Attribute Learning for Voice Timbre Attribute Detection
Sound
Makes computer voices sound more like real people.