List of Projects
Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting
People Involved : Sravya Vardhani Shivapuja, Mansi Pradeep Khamkar, Divij Bajaj, Ganesh Ramakrishnan, Ravi Kiran Sarvadevabhatla
Not the paper the crowd counting community seems to want, but one it needs right now! To address serious issues with training and evaluation of deep...
MeronymNet: A Hierarchical Model for Unified and Controllable Multi-Category Object Generation
People Involved : Rishabh Baghel, Abhishek Trivedi, Tejas Ravichandran, and Ravi Kiran Sarvadevabhatla
We introduce MeronymNet, a novel hierarchical approach for controllable, part-based generation of multi-category objects using a single unified model. We adopt a guided coarse-to-fine strategy involving semantically conditioned generation of bounding box layouts, pixel-level part layouts and, ultimately, the object depictions themselves.
People Involved : Abhishek Trivedi, Ravi Kiran Sarvadevabhatla
A novel resizing-free approach for high-precision semi-automatic layout annotation.
People Involved : S P Sharan, Sowmya Aitha, Amandeep Kumar, Abhishek Trivedi, Aaron Augustine, Ravi Kiran Sarvadevabhatla
Introducing (1) Indiscapes2, a handwritten manuscript layout dataset 150% larger than its predecessor Indiscapes, and (2) PALMIRA, a novel deep network ...
Syntactically Guided Generative Embeddings for Zero Shot Skeleton Action Recognition
People Involved : Pranay Gupta, Divyanshu Sharma, Ravi Kiran Sarvadevabhatla
We propose a language-guided approach that enables state-of-the-art performance on the challenging problem of zero-shot recognition of human actions.
DocVisor: A Multi-purpose Web-based Interactive Visualizer for Document Image Analytics
People Involved : Khadiravana Belagavi, Pranav Tadimeti, Ravi Kiran Sarvadevabhatla
DocVisor is an open-source visualization tool for document layout analysis. It can visualize data from three prominent document analysis tasks: Full Document Analysis, OCR and Box-Supervised Region Parsing. Its features include ground-truth and intermediate output visualization, sorting of data by key metrics, and simultaneous comparison of outputs from multiple models.
People Involved : Akshay Praveen Deshpande, Vaishnav Rao Potlapalli, Ravi Kiran Sarvadevabhatla
Quo Vadis, Skeleton Action Recognition ?
People Involved : Pranay Gupta, Anirudh Thatipelli, Aditya Aggarwal, Shubh Maheshwari, Neel Trivedi, Sourav Das, Ravi Kiran Sarvadevabhatla
In this paper, we study current and upcoming frontiers across the landscape of skeleton-based human action recognition.
An OCR for Classical Indic Documents Containing Arbitrarily Long Words
People Involved : Agam Dwivedi, Rohit Saluja, Ravi Kiran Sarvadevabhatla
Datasets (real, synthetic) and a CNN-LSTM Attention OCR for printed classical Indic documents containing very long words.
Topological Mapping for Manhattan-like Repetitive Environments
People Involved : Sai Shubodh Puligilla, Satyajit Tourani, Tushar Vaidya, Udit Singh Parihar, Ravi Kiran Sarvadevabhatla, K. Madhava Krishna
This paper explores the role of topological understanding and the benefits it brings to the robot SLAM framework.
Indiscapes: Instance Segmentation Networks for Layout Parsing of Historical Indic Manuscripts
People Involved : Abhishek Prusty, Aitha Sowmya, Abhishek Trivedi, Ravi Kiran Sarvadevabhatla
We introduce Indiscapes, the largest publicly available layout-annotated dataset of historical Indic manuscript images.