Many existing approaches to learning semantic parsers require annotated parse trees or are entirely unsupervised. In contrast, children acquire language by interacting with the world around them. We aim to learn language in a manner more similar to children: through distant supervision from captioned videos. Our model is a joint vision-language system that learns the meanings of words by observing the actions a sentence describes depicted in a video. This approach yields more robust parsing that incorporates perception.