[Submitted on 23 May 2025 (v1), last revised 17 Sep 2025 (this version, v3)]

Understanding and Mitigating Overrefusal in LLMs from an Unveiling Perspective of Safety Decision Boundary, by Licheng Pan and 5 other authors

Abstract: Large language models (LLMs) have demonstrated remarkable capabilities across a wide range of tasks, …