Exploring Response Uncertainty in MLLMs: An Empirical Evaluation under Misleading Scenarios

September 3, 2025

[Submitted on 5 Nov 2024 (v1), last revised 2 Sep 2025 (this version, v2)]

Authors: Yunkai Dang, Mengxi Gao, Yibo Yan, Xin Zou, Yanggan Gu, Jungang Li, Jingyu Wang, Peijie Jiang, Aiwei Liu, Jia Liu, Xuming Hu

Abstract: Multimodal large language models (MLLMs) have recently achieved state-of-the-art performance on tasks ranging from visual question answering to video understanding. However, existing studies have concentrated mainly on visual-textual misalignment, leaving largely unexplored MLLMs’ ability to preserve an originally correct answer when confronted with misleading information. We reveal a response uncertainty phenomenon: across nine standard datasets, twelve state-of-the-art open-source MLLMs overturn a previously correct answer in 65% of cases after receiving a single deceptive cue. To systematically quantify this vulnerability, we propose a two-stage evaluation pipeline: (1) elicit each model’s original response on unperturbed inputs; (2) inject explicit (false-answer hints) and implicit (contextual contradictions) misleading instructions, and compute the misleading rate, i.e., the fraction of correct-to-incorrect flips. Leveraging the most susceptible examples, we curate the Multimodal Uncertainty Benchmark (MUB), a collection of image-question pairs stratified into low, medium, and high difficulty based on how many of twelve state-of-the-art MLLMs they mislead. Extensive evaluation on twelve open-source and five closed-source models reveals high uncertainty: average misleading rates exceed 86%, with explicit cues over 67.19% and implicit cues over 80.67%. To reduce the misleading rate, we then fine-tune all open-source MLLMs on a compact 2000-sample mixed-instruction dataset, reducing misleading rates to 6.97% (explicit) and 32.77% (implicit), boosting consistency by nearly 29.37% on highly deceptive inputs, and slightly improving accuracy on standard benchmarks. Our code is available at this https URL
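To make the misleading-rate metric concrete, here is a minimal Python sketch of the two-stage computation described in the abstract. It is not the authors' implementation: the function name `misleading_rate`, the exact-match comparison of answers, and the choice of denominator (only the answers that were correct before the misleading cue) are assumptions made for illustration.

```python
from typing import Sequence


def misleading_rate(
    answers_before: Sequence[str],
    answers_after: Sequence[str],
    ground_truth: Sequence[str],
) -> float:
    """Fraction of initially correct answers that become incorrect after the cue.

    Hypothetical helper: answers are compared by exact string match, and the
    denominator is assumed to be the number of initially correct answers.
    """
    if not (len(answers_before) == len(answers_after) == len(ground_truth)):
        raise ValueError("all three sequences must have the same length")

    initially_correct = 0
    flipped = 0
    for before, after, truth in zip(answers_before, answers_after, ground_truth):
        if before != truth:
            continue  # only an originally correct answer can be misled
        initially_correct += 1
        if after != truth:
            flipped += 1  # correct-to-incorrect flip

    # Assumption: report 0.0 when there were no correct answers to flip.
    return flipped / initially_correct if initially_correct else 0.0


# Toy data (not from the paper): stage 1 answers, then stage 2 answers
# collected after a misleading instruction was injected.
truth = ["A", "B", "C", "D"]
before = ["A", "B", "C", "A"]  # three of four initially correct
after = ["A", "C", "D", "A"]   # two of those three flip to a wrong answer
rate = misleading_rate(before, after, truth)
```

In this toy case the rate is two flips out of three initially correct answers. Items the model already answered incorrectly are excluded, so the metric isolates susceptibility to the cue from baseline accuracy.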

Submission history

From: Yunkai Dang
[v1] Tue, 5 Nov 2024 01:11:28 UTC (25,670 KB)
[v2] Tue, 2 Sep 2025 08:24:41 UTC (17,646 KB)
