James Glass is a Senior Research Scientist at the Massachusetts Institute of Technology, where he leads the Spoken Language Systems Group in the Computer Science and Artificial Intelligence Laboratory. He is also a member of the Harvard University Program in Speech and Hearing Bioscience and Technology. Since obtaining his S.M. and Ph.D. degrees at MIT in Electrical Engineering and Computer Science, his research has focused on automatic speech recognition, unsupervised speech processing, and spoken language understanding using machine learning. He is an IEEE Fellow and a Fellow of the International Speech Communication Association, and is currently an Associate Editor for the IEEE Transactions on Pattern Analysis and Machine Intelligence.
One of the challenges of processing real-world spoken content, such as in automatic speech recognition, is the potential presence of different languages and dialects. Language and dialect identification is a useful capability for determining which language or dialect is being spoken in a recording.
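As a rough illustration of how the task can be framed (not the group's actual system), language identification can be cast as utterance-level classification over acoustic features. The sketch below, assuming librosa and scikit-learn, averages MFCC frames into one fixed-size vector per recording and trains a logistic-regression classifier; the file names and labels are hypothetical placeholders.

```python
# Minimal language-identification sketch: utterance-level classification
# over averaged MFCC features. Illustrative only; file names are invented.
import numpy as np
import librosa
from sklearn.linear_model import LogisticRegression

def utterance_embedding(path, sr=16000, n_mfcc=13):
    """Average MFCC frames into one fixed-size vector per recording."""
    signal, _ = librosa.load(path, sr=sr)
    mfcc = librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=n_mfcc)  # (n_mfcc, T)
    return mfcc.mean(axis=1)

# Hypothetical training data: paths to recordings and language labels.
train_paths = ["arabic_01.wav", "english_01.wav", "spanish_01.wav"]
train_labels = ["ara", "eng", "spa"]

X = np.stack([utterance_embedding(p) for p in train_paths])
clf = LogisticRegression(max_iter=1000).fit(X, train_labels)

# Predict the language of a new recording.
print(clf.predict(utterance_embedding("unknown.wav").reshape(1, -1)))
```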
All humans process vast quantities of unannotated speech and manage to learn phonetic inventories, word boundaries, etc., and can use these abilities to acquire new words. Why can't ASR technology have similar capabilities? Our goal in this research project is to build speech technology using unannotated speech corpora.
Automatic speech recognition (ASR) has been a grand challenge machine learning problem for decades. Our ongoing research in this area examines the use of deep learning models for distant and noisy recording conditions, as well as for multilingual and low-resource scenarios.
The Arabic language is spoken by hundreds of millions of people around the world, and it presents a variety of challenges for speech and language processing technologies. In our group, we have several research topics examining Arabic, including dialect identification, speech recognition, machine translation, and language processing.
Our main goal is to automatically search for relevant answers among the many responses provided for a given question (Answer Selection), and to search for relevant questions whose existing answers can be reused (Question Retrieval).
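One minimal way to make the answer-selection task concrete is to rank candidate answers by their textual similarity to the question. The baseline sketch below uses TF-IDF vectors and cosine similarity; the question and candidate answers are invented examples, and actual systems in this line of work rely on far richer, learned representations.

```python
# Baseline answer selection: rank candidate answers for a question by
# cosine similarity of TF-IDF vectors. Texts are hypothetical examples.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

question = "How do I reset my router password?"
candidates = [
    "Hold the reset button for ten seconds, then log in with the defaults.",
    "I had the same problem last week.",
    "The router password can be reset from the admin page at 192.168.1.1.",
]

vectorizer = TfidfVectorizer()
vectors = vectorizer.fit_transform([question] + candidates)
scores = cosine_similarity(vectors[0], vectors[1:]).ravel()

# Highest-scoring candidates are treated as the most relevant answers.
for score, answer in sorted(zip(scores, candidates), reverse=True):
    print(f"{score:.2f}  {answer}")
```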
Our main goal is to develop fact-checking algorithms that can assess the credibility of claims made in textual statements and provide interpretable, valid evidence that explains why a given claim is considered factually true or false.
Generation of sequential data involves multiple factors operating at different temporal scales. Take natural speech, for example: the speaker identity tends to be consistent within an utterance, while the phonetic content changes from frame to frame. By explicitly modeling such a hierarchical generative process under a probabilistic framework, we proposed a model that learns to factorize sequence-level and sub-sequence-level factors into different sets of representations without any supervision.
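To make the hierarchy concrete, the toy sketch below performs ancestral sampling from such a two-level generative process: a sequence-level latent variable (e.g., speaker identity) is drawn once per utterance, while segment-level latent variables (e.g., phonetic content) are drawn independently for each segment. The linear decoder and all dimensions are illustrative assumptions, not the actual neural model.

```python
# Schematic of a two-level hierarchical generative process, written as
# ancestral sampling with toy Gaussian distributions. The linear decoder
# is an illustrative stand-in for the model's neural decoder.
import numpy as np

rng = np.random.default_rng(0)
D_SEQ, D_SEG, D_OBS, N_SEGMENTS = 16, 32, 40, 10

def generate_sequence(W_seq, W_seg):
    # The sequence-level factor (e.g., speaker identity) is sampled once
    # and shared by every segment in the utterance.
    z_seq = rng.normal(size=D_SEQ)
    frames = []
    for _ in range(N_SEGMENTS):
        # Segment-level factors (e.g., phonetic content) are sampled
        # independently per segment, so they vary within the utterance
        # while z_seq stays fixed.
        z_seg = rng.normal(size=D_SEG)
        mean = W_seq @ z_seq + W_seg @ z_seg
        frames.append(rng.normal(loc=mean, scale=0.1))
    return np.stack(frames)

W_seq = rng.normal(size=(D_OBS, D_SEQ))
W_seg = rng.normal(size=(D_OBS, D_SEG))
utterance = generate_sequence(W_seq, W_seg)  # (N_SEGMENTS, D_OBS)
print(utterance.shape)
```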
Our goal is to explore language representations in computational models. We develop new models for representing natural language and investigate how existing models learn language, focusing on neural network models in key tasks like machine translation and speech recognition.
Neural networks, which learn to perform computational tasks by analyzing huge sets of training data, have been responsible for the most impressive recent advances in artificial intelligence, including speech-recognition and automatic-translation systems.
The butt of jokes as little as 10 years ago, automatic speech recognition is now on the verge of becoming people’s chief means of interacting with their principal computing devices. In anticipation of the age of voice-controlled electronics, MIT researchers have built a low-power chip specialized for automatic speech recognition. Whereas a cellphone running speech-recognition software might require about 1 watt of power, the new chip requires between 0.2 and 10 milliwatts, depending on the number of words it has to recognize.
Speech recognition systems, such as those that convert speech to text on cellphones, are generally the result of machine learning. A computer pores through thousands or even millions of audio files and their transcriptions, and learns which acoustic features correspond to which typed words. But transcribing recordings is costly, time-consuming work, which has limited speech recognition to a small subset of languages spoken in wealthy nations.
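A toy version of this supervised pairing, assuming PyTorch, is sketched below: a small network maps acoustic feature frames to character probabilities and is trained with CTC loss against a transcription. The random tensors stand in for real audio features and encoded text.

```python
# Toy supervised ASR training step: acoustic frames -> character
# probabilities, trained with CTC loss. Tensors are random placeholders.
import torch
import torch.nn as nn

T, N, C, FEAT = 100, 1, 29, 40  # frames, batch, characters (incl. blank), feature dim

model = nn.Sequential(nn.Linear(FEAT, 128), nn.ReLU(), nn.Linear(128, C))
ctc = nn.CTCLoss(blank=0)
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

features = torch.randn(T, N, FEAT)         # stand-in for MFCC frames
transcript = torch.randint(1, C, (N, 12))  # stand-in for encoded text

log_probs = model(features).log_softmax(dim=-1)  # (T, N, C)
loss = ctc(log_probs, transcript,
           input_lengths=torch.full((N,), T, dtype=torch.long),
           target_lengths=torch.full((N,), 12, dtype=torch.long))
loss.backward()
optimizer.step()
print(float(loss))
```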
For people struggling with obesity, logging calorie counts and other nutritional information at every meal is a proven way to lose weight. The technique does require consistency and accuracy, however, and when it fails, it’s usually because people don’t have the time to find and record all the information they need. A few years ago, a team of nutritionists from Tufts University who had been experimenting with mobile-phone apps for recording caloric intake approached members of the Spoken Language Systems Group at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) with the idea of a spoken-language application that would make meal logging even easier.
Every language has its own collection of phonemes, or the basic phonetic units from which spoken words are composed. Depending on how you count, English has somewhere between 35 and 45. Knowing a language’s phonemes can make it much easier for automated systems to learn to interpret speech. In the 2015 volume of Transactions of the Association for Computational Linguistics, CSAIL researchers describe a new machine-learning system that, like several systems before it, can learn to distinguish spoken words. But unlike its predecessors, it can also learn to distinguish lower-level phonetic units, such as syllables and phonemes.
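As a simplified stand-in for this kind of unsupervised unit discovery (a classical baseline, not the TACL system itself), one can cluster acoustic frames so that recurring clusters act as phoneme-like units, as sketched below; the audio file and cluster count are assumptions.

```python
# Baseline unsupervised unit discovery: cluster MFCC frames so that
# recurring clusters play the role of phoneme-like units. The audio
# file name is hypothetical.
import librosa
from sklearn.cluster import KMeans

signal, sr = librosa.load("speech.wav", sr=16000)
mfcc = librosa.feature.mfcc(y=signal, sr=sr, n_mfcc=13).T  # (frames, 13)

# With ~40 clusters (roughly the size of an English phoneme inventory),
# each frame receives a discrete pseudo-phone label.
kmeans = KMeans(n_clusters=40, n_init=10).fit(mfcc)
units = kmeans.labels_

# Collapse consecutive repeats to obtain a phone-like symbol sequence.
sequence = [u for i, u in enumerate(units) if i == 0 or u != units[i - 1]]
print(sequence[:20])
```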
CSAIL’s Spoken Language Systems Group has unveiled a new technique for automatically tracking speakers in audio recordings. The technique tackles speaker diarization: computationally determining who is speaking when, and how many speakers are present, in a recording.
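A schematic version of a diarization pipeline, under strong simplifying assumptions, is sketched below: short windows of audio are embedded and clustered, with each cluster treated as one speaker. Real systems use learned speaker embeddings and much more careful segmentation; here averaged MFCCs, the file name, and the distance threshold are all illustrative.

```python
# Schematic diarization: embed one-second windows and cluster them,
# estimating the number of speakers from a distance threshold.
# The file name and threshold are illustrative assumptions.
import numpy as np
import librosa
from sklearn.cluster import AgglomerativeClustering

signal, sr = librosa.load("meeting.wav", sr=16000)
win = sr  # one-second analysis windows

embeddings = []
for start in range(0, len(signal) - win, win):
    window = signal[start:start + win]
    mfcc = librosa.feature.mfcc(y=window, sr=sr, n_mfcc=20)
    embeddings.append(mfcc.mean(axis=1))  # crude per-window embedding

# With n_clusters=None and a distance threshold, the number of
# speakers is estimated from the data rather than fixed in advance.
labels = AgglomerativeClustering(
    n_clusters=None, distance_threshold=50.0
).fit_predict(np.stack(embeddings))

for i, spk in enumerate(labels):
    print(f"{i}s-{i + 1}s: speaker {spk}")
```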