Multilingual Dataset Integration Strategies for Robust Audio Deepfake Detection: A SAFE Challenge System

August 29, 2025

arXiv:2508.20983v1 Announce Type: cross
Abstract: The SAFE Challenge evaluates synthetic speech detection across three tasks: unmodified audio, processed audio with compression artifacts, and laundered audio designed to evade detection. We systematically explore self-supervised learning (SSL) front-ends, training data compositions, and audio length configurations for robust deepfake detection. Our AASIST-based approach uses a WavLM Large front-end with RawBoost augmentation and is trained on a multilingual dataset of 256,600 samples spanning 9 languages and over 70 TTS systems, drawn from CodecFake, MLAAD v5, SpoofCeleb, Famous Figures, and MAILABS. After extensive experimentation with different SSL front-ends, three versions of the training data, and two audio lengths, our system placed second in both Task 1 (unmodified audio detection) and Task 3 (laundered audio detection), demonstrating strong generalization and robustness.
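The abstract lists audio length as one of the configurations explored but does not give the two lengths or say how clips are brought to a fixed size. As a minimal sketch of one common way to do this in raw-waveform detectors, the Python below tiles short clips and crops long ones. The function name `fix_length`, the 16 kHz sample rate, and the 4-second target are illustrative assumptions, not details taken from the paper.

```python
import random
from typing import List, Optional


def fix_length(samples: List[float], target_len: int,
               rng: Optional[random.Random] = None) -> List[float]:
    """Return exactly target_len samples from a mono waveform.

    Hypothetical helper, not the authors' code: clips shorter than the
    target are repeated (tiled) until they are long enough, and longer
    clips are cropped. With rng=None the crop starts at sample 0, which
    is a deterministic choice suited to evaluation; passing a
    random.Random gives a random crop, as is often done during training.
    """
    if target_len <= 0:
        raise ValueError("target_len must be positive")
    if not samples:
        raise ValueError("samples must not be empty")

    n = len(samples)
    if n < target_len:
        repeats = -(-target_len // n)  # ceiling division
        return (samples * repeats)[:target_len]

    start = 0 if rng is None else rng.randint(0, n - target_len)
    return samples[start:start + target_len]


# Assumed values for illustration only: 16 kHz audio, 4-second windows.
SAMPLE_RATE = 16_000
TARGET_LEN = 4 * SAMPLE_RATE

short_clip = [0.0] * (SAMPLE_RATE // 2)   # 0.5 s of silence
long_clip = [0.0] * (10 * SAMPLE_RATE)    # 10 s of silence
fixed_short = fix_length(short_clip, TARGET_LEN)
fixed_long = fix_length(long_clip, TARGET_LEN, rng=random.Random(0))
```

Both outputs have the same number of samples, so they can be batched together. Comparing two settings of the target length would then amount to rerunning training with a different `TARGET_LEN`.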
