Do Vision-Language Pretrained Models Learn Primitive Concepts for Recognition and Reasoning?

Talk

Chen Sun

Brown University

Time:

02.08.2023 14:00 to 15:00

Location:

IRB 4105

URL:

https://talks.cs.umd.edu/talks/3402

Vision-language models pretrained on web-scale data have revolutionized deep learning in the last few years. They have demonstrated strong transfer learning performance on a wide range of tasks, even in the zero-shot setting, where text prompts, rather than collected labeled data, serve as a natural interface for humans to specify a task. These models are trained on composite data, such as visual scenes containing multiple objects, or sentences that describe spatiotemporal events. However, it is not clear whether they handle such data by learning to reason over the lower-level spatiotemporal primitive concepts that humans naturally use to characterize composite ones, such as colors, shapes, or verbs that describe short actions. If they do, this has important implications for the capacity of models to support compositional generalization, and for the ability of humans to interpret the reasoning procedures that models undertake.

In this talk, I will present our recent attempts to answer this question. We study several representative vision-language (VL) models trained on images (e.g., CLIP) and videos (e.g., VideoBERT), and design corresponding “probing” frameworks to understand whether VL pretraining (1) improves lexical grounding, (2) encodes verb meaning, and (3) learns visually grounded primitive concepts. I will also discuss our ongoing work on using the concept binding that emerges inside a pretrained neural network for visual reasoning tasks.
