表現ベースの広範な幻覚検出器は、分布外では汎化に失敗する

なぜ重要か: 企業や社会への影響が見込まれ、一般メディアにも波及する可能性があります。

ソースを読む（export.arxiv.org）

arXiv:2509.19372v1（発表種別：クロス）

要旨：本稿では、現状最先端（SOTA）の幻覚検出の有効性について批判的に評価し、その性能がRAGTruthデータセットにおいてデータとの見せかけの相関に大きく依存していることを明らかにする。この効果を制御すると、最先端の手法は、データセット間で広範なハイパーパラメータ調整を必要とする一方で、教師あり線形プローブよりも優れた性能を示さない。分布外汎化は現状では不可能であり、分析された全ての手法がランダムに近い性能を示す。本稿では、幻覚検出とその評価のためのガイドラインを提案する。

原文（英語）を表示

Title (EN): Representation-based Broad Hallucination Detectors Fail to Generalize Out of Distribution

arXiv:2509.19372v1 Announce Type: cross
Abstract: We critically assess the efficacy of the current SOTA in hallucination detection and find that its performance on the RAGTruth dataset is largely driven by a spurious correlation with data. Controlling for this effect, state-of-the-art performs no better than supervised linear probes, while requiring extensive hyperparameter tuning across datasets. Out-of-distribution generalization is currently out of reach, with all of the analyzed methods performing close to random. We propose a set of guidelines for hallucination detection and its evaluation.

Published: 2025-09-24 19:00 UTC

表現ベースの広範な幻覚検出器は、分布外では汎化に失敗する

コメントする コメントをキャンセル

コメントするコメントをキャンセル