Stream DiscStream Disc

Research

Publications, whitepapers, and technical research on voice identity, AI security, and creator protection across streams and physical media.

ResearchAugust 16, 2025

Finding the Human Voice in AI: Insights on the Perception of AI-Voice Clones from Naturalness and Similarity Ratings

Linda Bakkouche, Charles McGhee, Emily Lau, Stephanie Cooper, Xinbing Luo, Madeleine Rees, Kai Alter, Brechtje Post, Julia Schwarz

AI-generated voice clones are important tools in language learning, audiobooks, and assistive technology, but often struggle to replicate key prosodic features such as dynamic F0 variation. This study evaluates listeners' ratings of naturalness and similarity for human speech, three AI voice clones (ElevenLabs, StyleTTS-2, XTTS-v2), and controlled prosodic manipulations.

Interspeech 2025, Rotterdam, The Netherlands·2025