Vision and Language & Multi-modal learning: Zero/few-shot learning, representation learning, continual learning.
Visual question answering, cross-modal retrieval, multi-hop reasoning.
Synthetic data generation for compositionality and privacy protection: Simulated environments to provide a safe, controlled setting where agents can learn.
Virtual playgrounds that allow systems to experience and interact within 3D space.
Dynamic evaluations and real-world applications: Data distribution and mitigation of spurious correlations.
Assessing the performance and effectiveness of models under varying conditions.
Fall 2025 lab dinner at Domo Sushi (December 5, 2025).
Keynote at the NYC Computer Vision Day (April 27, 2026).
Dinner after the NYC Computer Vision Day (April 27, 2026).
Lunch during CVPR 2026 (June 4, 2026).
CVPR 2026 Group Photo (June 5, 2026).