歩行者安全のための巨大言語モデル：非信号交差点における運転手の譲り合い行動予測への応用

なぜ重要か: 企業や社会への影響が見込まれ、一般メディアにも波及する可能性があります。

ソースを読む（export.arxiv.org）

arXiv:2509.19657v1発表タイプ：クロス

要約：歩行者安全は都市モビリティの重要な要素であり、横断歩道における歩行者の意思決定と運転手の譲り合い行動の相互作用によって大きく影響されます。交差点における運転手と歩行者の相互作用をモデル化するには、これらの行動の複雑さを正確に捉える必要があります。従来の機械学習モデルは、固定された特徴表現と解釈可能性の低さのために、これらの多要素の相互作用に必要なニュアンスのある状況依存の推論を捉えるのに苦労することがよくあります。対照的に、大規模言語モデル（LLM）は異種交通データからパターンを抽出するのに適しており、運転手と歩行者の相互作用の正確なモデル化を可能にします。そのため、本論文では、ドメイン固有の知識、構造化された推論、および少ショットプロンプティングを取り入れた新規のプロンプト設計を通じて、マルチモーダルLLMを活用し、運転手の譲り合い行動の解釈可能で状況認識のある推論を可能にします。これは、歩行者と運転手の相互作用のモデリングの応用例です。最先端のLLMを従来の分類器とベンチマーク比較した結果、GPT-4oが常に最高の精度と再現率を達成し、Deepseek-V3が精度に優れていることがわかりました。これらの知見は、モデルのパフォーマンスと計算効率の間の重要なトレードオフを浮き彫りにし、実世界の歩行者安全システムへのLLMの展開に関する実践的なガイダンスを提供します。

原文（英語）を表示

Title (EN): Large Language Models for Pedestrian Safety: An Application to Predicting Driver Yielding Behavior at Unsignalized Intersections

arXiv:2509.19657v1 Announce Type: cross
Abstract: Pedestrian safety is a critical component of urban mobility and is strongly influenced by the interactions between pedestrian decision-making and driver yielding behavior at crosswalks. Modeling driver–pedestrian interactions at intersections requires accurately capturing the complexity of these behaviors. Traditional machine learning models often struggle to capture the nuanced and context-dependent reasoning required for these multifactorial interactions, due to their reliance on fixed feature representations and limited interpretability. In contrast, large language models (LLMs) are suited for extracting patterns from heterogeneous traffic data, enabling accurate modeling of driver-pedestrian interactions. Therefore, this paper leverages multimodal LLMs through a novel prompt design that incorporates domain-specific knowledge, structured reasoning, and few-shot prompting, enabling interpretable and context-aware inference of driver yielding behavior, as an example application of modeling pedestrian–driver interaction. We benchmarked state-of-the-art LLMs against traditional classifiers, finding that GPT-4o consistently achieves the highest accuracy and recall, while Deepseek-V3 excels in precision. These findings highlight the critical trade-offs between model performance and computational efficiency, offering practical guidance for deploying LLMs in real-world pedestrian safety systems.

Published: 2025-09-24 19:00 UTC

歩行者安全のための巨大言語モデル：非信号交差点における運転手の譲り合い行動予測への応用

コメントする コメントをキャンセル

コメントするコメントをキャンセル