Skip to main content
King Abdullah University of Science and Technology
KAUST
Main navigation
Home
VLMs
Towards Scalable and Structured Understanding in Visual LLMs
Mohamed Elhoseiny, Associate Professor, Computer Science
Feb 23, 12:00
-
13:00
B9 L2 R2325
LLM
Visual Language Models
VLMs
visual computing
In this talk, we explore a suite of recent advances toward scalable, structured video comprehension using Large Vision Language Models (Video LLMs).