Date: October 19–20, 2025
Location: ICCV 2025, Honolulu, Hawaii
From Segment Anything to Generalized Visual Grounding — In this tutorial, Meta AI and its academic partners will overview frontier research on visual grounding. We will cover each building block necessary to move toward future general-purpose visual grounding systems, including universal image and video encoding, multimodal language understanding, semantic instance segmentation and tracking, and the latest in 3D reconstruction methods. We will provide practical guidance on using SAM open source models, resources, and tooling to tackle the field’s biggest open research problems.
Email us at: awestbury@meta.com