Multimodal
Roundabout-TAU
About
Visual language model benchmark for traffic anomaly understanding from surveillance video, using question-answering format to evaluate visual reasoning.
Visual language model benchmark for traffic anomaly understanding from surveillance video, using question-answering format to evaluate visual reasoning.