LLM Reference
Multimodal

Roundabout-TAU

About

Visual language model benchmark for traffic anomaly understanding from surveillance video, using question-answering format to evaluate visual reasoning.

Resources

arXiv Paper