Multimodal Intelligence Lab (MILab)

The Multimodal Intelligence Lab (MILab) is a University of Washington research group based at UW Tacoma. We develop multimodal AI systems that connect perception, language, memory, reasoning, and action. Our work spans foundation models, embodied agents, robotics, mobility intelligence, public safety, and human-centered decision support.

What We Study Research directions Meet the Team People at MILab

Multimodal Robotics

From perception to action

We connect visual evidence, language, and control for reliable manipulation and embodied decision-making in physical workspaces.

HA-VLN 2.0 benchmark and leaderboard figure

Human-Aware Intelligence

Reasoning with people, space, and goals

We model social context and spatial evidence so AI agents can act safely around people in shared spaces.

Language-conditioned world modeling for visual navigation figure

World Models

Learning environments from multimodal evidence

We build predictive models that connect instructions, observations, and possible future states for embodied planning.

Unified world model navigation results figure

Planning and Foresight

Predictive models for embodied decisions

We use memory and simulation to help agents compare possible futures before acting in dynamic scenes.

Agentic world modeling four regimes figure

Trustworthy AI Systems

From models to safe deployment

We design AI systems for mobility, public safety, and secure decision support under operational constraints and risks.

Research Directions

Multimodal models, embodied agents, and deployable systems for public-interest AI.

MILab connects model foundations, embodied reasoning, and deployable systems for mobility, public safety, and human-centered decision support.

Direction 01

Foundation Models

Multimodal models for perception, reasoning, learning, and generation across language, vision, video, audio, and structured signals.

Direction 02

Embodied Agents

Embodied agents for navigation, interaction, planning, memory, and control in human-aware physical and simulated environments.

Direction 03

Deployable Systems

Deployable systems for mobility, safety, sensing, monitoring, and decision-making with secure and responsible operation.

Lab Updates

Join MILab research at UW.

MILab welcomes prospective Ph.D. students, postdoctoral researchers, UW students across Seattle, Tacoma, and Bothell, and collaborators interested in multimodal AI, embodied intelligence, and deployable systems.

Review Pathways Join MILab

Multimodal Intelligence Lab (MILab)

From perception to action

Agents that understand people and places

Reasoning with people, space, and goals

Learning environments from multimodal evidence

Predictive models for embodied decisions

From models to safe deployment

Multimodal models, embodied agents, and deployable systems for public-interest AI.

Foundation Models

Embodied Agents

Deployable Systems

Recent news and selected publications

Recent News

2026 GSFEI Top Scholar Award

ACL 2026 paper acceptances

ICLR 2026 oral presentation

NeurIPS 2025 oral presentation

Carwein-Andrews Ph.D. Fellowship

CVPR Anti-UAV Best Paper

Recent Publications

HA-VLN 2.0: Human-Aware Navigation Benchmark

Language-Conditioned World Modeling for Visual Navigation

Human-Aware Vision-and-Language Navigation

Emotion-LLaMA: Multimodal Emotion Reasoning

Lossless Hierarchical Speculative Decoding

MaxSup: Representation Collapse in Label Smoothing

Join MILab research at UW.