List of Projects

Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting

People Involved : Sravya Vardhani Shivapuja, Mansi Pradeep Khamkar, Divij Bajaj, Ganesh Ramakrishnan, Ravi Kiran Sarvadevabhatla

Not the paper the crowd counting community seems to want, but the one it needs right now! To address serious issues with training and evaluation of deep...

MeronymNet: A Hierarchical Model for Unified and Controllable Multi-Category Object Generation

People Involved : Rishabh Baghel, Abhishek Trivedi, Tejas Ravichandran, and Ravi Kiran Sarvadevabhatla

We introduce MeronymNet, a novel hierarchical approach for controllable, part-based generation of multi-category objects using a single unified model. We adopt a guided coarse-to-fine strategy involving semantically conditioned generation of bounding box layouts, pixel-level part layouts and ultimately, the object depictions themselves.

BoundaryNet - An Attentive Deep Network with Fast Marching Distance Maps for Semi-automatic Layout Annotation

People Involved : Abhishek Trivedi, Ravi Kiran Sarvadevabhatla

A novel resizing-free approach for high-precision semi-automatic layout annotation.

PALMIRA: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts

People Involved : S P Sharan, Sowmya Aitha, Amandeep Kumar, Abhishek Trivedi, Aaron Augustine, Ravi Kiran Sarvadevabhatla

Introducing (1) Indiscapes2, a handwritten manuscript layout dataset 150% larger than its predecessor Indiscapes, and (2) PALMIRA - a novel deep network ..

Syntactically Guided Generative Embeddings for Zero Shot Skeleton Action Recognition

People Involved : Pranay Gupta, Divyanshu Sharma, Ravi Kiran Sarvadevabhatla

We propose a language-guided approach that achieves state-of-the-art performance on the challenging problem of zero-shot recognition of human actions.

DocVisor: A Multi-purpose Web-based Interactive Visualizer for Document Image Analytics

People Involved : Khadiravana Belagavi, Pranav Tadimeti, Ravi Kiran Sarvadevabhatla

DocVisor is an open-source visualization tool for document layout analysis. It can visualize data from three prominent document analysis tasks: Full Document Analysis, OCR and Box-Supervised Region Parsing. Its features include ground-truth and intermediate output visualization, sorting data by key metrics, and side-by-side comparison of outputs from multiple models.

MediTables: A New Dataset and Deep Network for Multi-category Table Localization in Medical Documents

People Involved : Akshay Praveen Deshpande, Vaishnav Rao Potlapalli, Ravi Kiran Sarvadevabhatla

Quo Vadis, Skeleton Action Recognition?

People Involved : Pranay Gupta, Anirudh Thatipelli, Aditya Aggarwal, Shubh Maheshwari, Neel Trivedi, Sourav Das, Ravi Kiran Sarvadevabhatla

In this paper, we study current and upcoming frontiers across the landscape of skeleton-based human action recognition.

An OCR for Classical Indic Documents Containing Arbitrarily Long Words

People Involved : Agam Dwivedi, Rohit Saluja, Ravi Kiran Sarvadevabhatla

Datasets (real, synthetic) and a CNN-LSTM Attention OCR for printed classical Indic documents containing very long words.

Topological Mapping for Manhattan-like Repetitive Environments

People Involved : Sai Shubodh Puligilla, Satyajit Tourani, Tushar Vaidya, Udit Singh Parihar, Ravi Kiran Sarvadevabhatla, K. Madhava Krishna

This paper explores the role of topological understanding and how such an understanding benefits the robot SLAM framework.

Indiscapes: Instance segmentation networks for layout parsing of historical Indic manuscripts

People Involved : Abhishek Prusty, Aitha Sowmya, Abhishek Trivedi, Ravi Kiran Sarvadevabhatla

We introduce Indiscapes - the largest publicly available layout-annotated dataset of historical Indic manuscript images.