
Sindhu B. Hegde
Date : 11/04/2025
Abstract:
Humans gesture when they speak -- gesturing is an integral part of non-verbal communication. Yet, large-scale understanding of co-speech gestures remains relatively underexplored. In this talk, I will delve into different approaches for learning co-speech gesture representations, highlight key challenges, and outline promising directions to advance gesture understanding in real-world, multimodal settings.
Bio:
Sindhu Hegde is a second-year PhD student in the Visual Geometry Group (VGG) at the University of Oxford, supervised by Prof. Andrew Zisserman. Her research is in Computer Vision, particularly in multimodal learning, video understanding, and self-supervised learning. Prior to joining Oxford, she worked as a Lead Data Scientist @ Verisk Analytics. Before that, she pursued a Master’s by Research (MS) at Centre for Visual Information Technology (CVIT), IIIT Hyderabad, supervised by Prof. C V Jawahar (IIIT-H) and Prof. Vinay Namboodiri (University of Bath, UK). Her Master’s research focused on exploiting the redundancies in vision and speech modalities for cross-modal generation.