List of Projects
People Involved : Nikhil Bansal, Kartik Gupta, Kiruthika Kannan, Sivani Pentapati, Ravi Kiran Sarvadevabhatla
Pictionary, the popular sketch-based game forbids drawer from writing text(atypical content) on canvas. Intervention of such rule violations is impractical and not scalable in web-based online setting of this game involving large number of multiple concurrent sessions. Apart from malicious game play,...
PSUMNet: Unified Modality Part Streams are All You Need for Efficient Pose-based Action Recognition
People Involved : Neel Trived, Ravi Kiran Sarvadevabhatla
Pose-based action recognition is predominantly tackled by approaches which treat the input skeleton in a monolithic fashion, i.e. joints in the pose tree are processed as a whole. However, such approaches ignore the fact that action categories are often characterized by localized action dynamics involving only small subsets of part joint groups involving hands ...
UAV-based Visual Remote Sensing for Automated Building Inspection (UVRSABI)
People Involved : Kushagra Srivastava , Dhruv Patel , Aditya Kumar Jha , Mohit Kumar Jha, Jaskirat Singh, Ravi Kiran Sarvadevabhatla, Harikumar Kandath, Pradeep Kumar Ramancharla, K. Madhava Krishna,
We automate the inspection of buildings through UAV-based image data collection and a post-processing module to infer and quantify the details which helps in avoiding manual inspection, reducing the time and cost.
Wisdom of (Binned) Crowds: A Bayesian Stratification Paradigm for Crowd Counting
People Involved : Sravya Vardhani Shivapuja, Mansi Pradeep Khamkar, Divij Bajaj, Ganesh Ramakrishnan, Ravi Kiran Sarvadevabhatla
Not the paper crowd counting community seems to want, but one it needs right now ! To address serious issues with training and evaluation of deep...
MeronymNet: A Hierarchical Model for Unified and Controllable Multi-Category Object Generation
People Involved : Rishabh Baghel, Abhishek Trivedi, Tejas Ravichandran, and Ravi Kiran Sarvadevabhatla
We introduce MeronymNet, a novel hierarchical approach for controllable, part-based generation of multi-category objects using a single unified model. We adopt a guided coarse-to-fine strategy involving semantically conditioned generation of bounding box layouts, pixel-level part layouts and ultimately, the object depictions themselves.
People Involved : Abhishek Trivedi, Ravi Kiran Sarvadevabhatla
A novel resizing-free approach for high-precision semi-automatic layout annotation.
People Involved : Sharan, S P and Aitha, Sowmya and Amandeep, Kumar and Trivedi, Abhishek and Augustine, Aaron, Ravi Kiran Sarvadevabhatla
Introducing (1) Indiscapes2 handwritten manuscript layout dataset - 150% larger than its predecessor Indiscapes (2) PALMIRA - a novel deep network ..
Syntactically Guided Generative Embeddings for Zero Shot Skeleton Action Recognition
People Involved : Pranay Gupta, Divyanshu Sharma, Ravi Kiran Sarvadevabhatla
We propose a language-guided approach to enable state of the art performance for the challenging problem of Zero Shot Recognition of human actions.
DocVisor: A Multi-purpose Web-based Interactive Visualizer for Document Image Analytics
People Involved :Khadiravana Belagavi, Pranav Tadimeti, Ravi Kiran Sarvadevabhatla
DocVisor is an open-source visualization tool for document layout analysis. With DocVisor, it is possible to visualize data from three prominent document analysis tasks: Full Document Analysis, OCR and Box-Supervised Region Parsing. DocVisor offers various features such as ground-truth and intermediate output visualization, sorting data by key metrics as well as comparison of outputs from various other models simultaneously.
People Involved : Akshay Praveen Deshpande, Vaishnav Rao Potlapalli, Ravi Kiran Sarvadevabhatla
DocVisor is an open-source visualization tool for document layout analysis. With DocVisor, it is possible to visualize data from three prominent document analysis tasks: Full Document Analysis, OCR and Box-Supervised Region Parsing. DocVisor offers various features such as ground-truth and intermediate output visualization, sorting data by key metrics as well as comparison of outputs from various other models simultaneously.
Quo Vadis, Skeleton Action Recognition ?
People Involved : Pranay Gupta, Anirudh Thatipelli, Aditya Aggarwal, Shubh Maheshwari, Neel Trivedi, Sourav Das, Ravi Kiran Sarvadevabhatla
In this paper, we study current and upcoming frontiers across the landscape of skeleton-based human action recognition.
An OCR for Classical Indic Documents Containing Arbitrarily Long Words
People Involved : Agam Dwivedi, Rohit Saluja, Ravi Kiran Sarvadevabhatla
Datasets (real, synthetic) and a CNN-LSTM Attention OCR for printed classical Indic documents containing very long words.
Topological Mapping for Manhattan-like Repetitive Environments
People Involved : Sai Shubodh Puligilla, Satyajit Tourani, Tushar Vaidya, Udit Singh Parihar, Ravi Kiran Sarvadevabhatla, K. Madhava Krishna
This paper explores the role of topological understanding and benefits of such an understanding to the robot SLAM framework.
Indiscapes: Instance segmentation networks for layout parsing of historical indic manuscripts
People Involved : Abhishek Prusty, Aitha Sowmya, Abhishek Trivedi, Ravi Kiran Sarvadevabhatla
We introduce Indiscapes - the largest publicly available layout annotated dataset of historical Indic manuscript images.