Skip to main content
Scientific Computing and Machine Learning
SCML
Optimization and Machine Learning
Main navigation
Home
People
All Profiles
Principal Investigators
Research Scientists
Postdoctoral Fellows
Students
Alumni
Former Members
Events
All Events
Events Calendar
News
Software
Projects
Topics
Courses
Theses
VLMs
Towards Scalable and Structured Understanding in Visual LLMs
Mohamed Elhoseiny, Associate Professor, Computer Science
Feb 23, 12:00
-
13:00
B9 L2 R2325
LLM
Visual Language Models
VLMs
visual computing
In this talk, we explore a suite of recent advances toward scalable, structured video comprehension using Large Vision Language Models (Video LLMs).