CVIT Home CVIT Home
  • Home
  • People
    • Faculty
    • Staff
    • PhD Students
    • MS Students
    • Alumni
    • Post-doctoral
    • Honours Student
  • Research
    • Publications
    • Thesis
    • Projects
    • Resources
  • Events
    • Talks and Visits
    • Major Events
    • Visitors
    • Summer Schools
  • Gallery
  • News & Updates
    • News
    • Blog
    • Newsletter
    • Banners
  • Contact Us
  • Login

AKPujithaA K Pujitha

Areas of Interest: Medical Image Processing
 

Email: This email address is being protected from spambots. You need JavaScript enabled to view it.

 
Address: CVIT, IIIT-H
 

Phone:

 
Personal Home Page: http://researchweb.iiit.ac.in/~pujitha.ak/

Publications

  • Pujitha AK and Jayanthi Sivaswamy - Retinal Image Synthesis for CAD development ICIAR 2018, Portugal [PDF]

  • Pujitha AK and Jayanthi Sivaswamy - Crowdsourced annotations as an additional form of data augmentation for CAD development ACPR, Nanjing [PDF]

  • Pujitha Appan K, Jahnavi Gamalapati S and Jayanthi Sivaswamy - Detection of neovascularization in retinal images using semi-supervised learning Biomedical Imaging (ISBI 2017), 2017 IEEE 14th International Symposium on. IEEE, 2017. [PDF]


Projects

Logo Medical Image Processing

People Involved : Gopal Datt Joshi, Mayank Chawla, Arunava Chakravarty, Akhilesh Bontala, Shashank Mujjumdar, Rohit Gautam, Subbu, Sushma

Digital medical images are widely used for diagnostic purposes. Our goal is to develop algorithms for medical image analysis focusing on enhancement, segmentation, multi-modal registration and classification.

 

 

AjeetKumarSinghAjeet Kumar Singh

Areas of Interest: Computer Vision, Machine Learning, Document Analysis
 
Email: This email address is being protected from spambots. You need JavaScript enabled to view it.
 
Address: CVIT, IIIT-Hyderabad
 
Phone: 
 
Personal Home Page: https://in.linkedin.com/in/ajeetsingh0712

Publications

  • Minesh Mathew, Ajeet Kumar Singh and C V Jawahar - Multilingual OCR for Indic Scripts - Proceedings of 12th IAPR International Workshop on Document Analysis Systems (DAS'16), 11-14 April, 2016, Santorini, Greece. [PDF]

  • Ajeet Kumar Singh, Anand Mishra, Pranav Dabral and C V Jawahar - A Simple and Effective Solution for Script Identification in the Wild - Proceedings of 12th IAPR International Workshop on Document Analysis Systems (DAS'16), 11-14 April, 2016, Santorini, Greece. [PDF]

  • Ajeet Kumar Singh, C. V. Jawahar - Can RNNs Reliably Separate Script and Language at Word and Line Level Proceedings of the 13th IAPR International Conference on Document Analysis and Recognition, 23-26 Aug 2015 Nancy, France. [PDF]

  • Ajeet Kumar Singh and C.V. Jawahar - Can RNNs reliably separate script and language at word and line level? Document Analysis and Recognition (ICDAR), 2015 13th International Conference on. IEEE, 2015. [PDF]

  • Praveen Krishnan, Naveen Sankaran, Ajeet Kumar Singh and C. V. Jawahar - Towards a Robust OCR System for Indic Scripts Proceedings of the 11th IAPR International Workshop on Document Analysis Systems, 7-10 April 2014, Tours-Loire Valley, France. [PDF]


Projects

DigvijaySinghDigvijay Singh

Areas of Interest: Computer Vision, Machine Learning
 
Email: This email address is being protected from spambots. You need JavaScript enabled to view it.
 
Address: CVIT, IIIT-H
 
Phone:
 
Personal Home Page: http://researchweb.iiit.ac.in/~digvijay.singh/

Publications

  • G Nagendar, Digvijay Singh and C. V. Jawahar -  NeuroIoU: Learning a Surrogate Loss for Semantic Segmentation Proceedings of the British Machine Vision Conference, 03-06 Sep 2018, Northumbria[PDF]

  • Digvijay Singh, Vineeth Balasubramanian, C. V. Jawahar - Fine-Tuning Human Pose Estimations in Videos Proceedings of the IEEE Winter Conference on Applications of Computer Vision(WACV), 2016. [PDF]

  • Digvijay Singh, Ayush Minocha, Nataraj Jammalamadaka and C. V. Jawahar - Real-time Face Detection, Pose Estimation and Landmark Localization Proceedings of the IEEE National Conference on Computer Vision, Pattern Recognition, Image Processing and Graphics, 18-21 Dec. 2013, Jodhpur, India. [PDF]

  • Nataraj Jammalamadaka, Ayush Minocha, Digvijay Singh and C V Jawahar - Parsing Clothes in Unrestricted Images Proceedings of the 24th British Machine Vision Conference, 09-13 Sep. 2013, Bristol, UK. [PDF]

  • Digvijay Singh, Ayush Minosha, Nataraj Jammalamadaka and C.V. Jawahar - Near real-time face parsing Computer Vision, Pattern Recognition, Image Processing and Graphics (NCVPRIPG), 2013 Fourth National Conference on. IEEE, 2013. [PDF]


Projects

thumbnlFine-Tuning Human Pose Estimation in Videos

People Involved :Digvijay Singh, Vineeth Balasubramanian, C. V. Jawahar

A semi-supervised self-training method for fine-tuning human pose estimations in videos that provides accurate estimations even for complex sequences.

 

 

 

SuriyaSinghSuriya Singh

Areas of Interest: Computer Vision, Image Processing, Machine Learning and Pattern Recognition
 
Email: This email address is being protected from spambots. You need JavaScript enabled to view it.
 
Address: CVIT, IIIT-H
 
Phone:
 
Personal Home Page: http://researchweb.iiit.ac.in/~suriya.singh/

Publications

  • Anil Batra, Suriya Singh, Guan Pang, Saikat Basu, C. V. Jawahar and Manohar Paluri -  Improved Road Connectivity by Joint Learning of Orientation and Segmentation, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019, 15 - 21 June 2019, Long Beach, California, United States.[PDF]

  • Suriya Singh, Anil Batra, Guan Pang, Lorenzo Torresani, Saikat Basu, Manohar Paluri, and C. V. Jawahar - Self-Supervised Feature Learning for Semantic Segmentation of Overhead Imagery in 29th British Machine Vision Conference (BMVC), 2018, Newcastle, UK. [PDF]

  • Anurag Ghosh, Suriya Singh and C.V. Jawahar -  Towards Structured Analysis of Broadcast Badminton Videos IEEE Winter Conference on Applications of Computer Vision (WACV 2018), Lake Tahoe, CA, USA, 2018 [PDF]

  • Bharat Lal Bhatnagar, Suriya Singh, Chetan Arora and C.V. Jawahar - Unsupervised Learning of Deep Feature Representation for Clustering Egocentric Actions Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence (IJCAI-17) [PDF]

  • Suriya Singh, Chetan Arora, C.V. Jawahar - First Person Action Recognition Using Deep Learned Descriptors Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition(CVPR 16), 26 June - 01 July, 2016, Las Vegas, USA. [PDF]

  • Suriya Singh, C.V. Jawahar - Generic Action Recognition from Egocentric Videos Proceedings of the Fifth National Conference on Computer Vision Pattern Recognition, Image Processing and Graphics (NCVPRIPG 2015), 16-19 Dec 2015, Patna, India. [PDF]

  • Mohak Sukhwani, Suriya Singh, Anirudh Goyal, Aseem Behl, Pritish Mohapatra, Brijendra Kumar Bharti, C.V. Jawahar - Monocular Vision based Road Marking Recognition for Driver Assistance and Safety Proceedings of the IEEE Conference on Vehicular Electronics and Safety,16-17 Dec 2014, Hyderabad, India. [PDF]

  • Suriya Singh, Shushman Choudhury, Kumar Vishal and C.V. Jawahar - Currency Recognition on Mobile Phones Proceedings of the 22nd International Conference on Pattern Recognition, 24-28 Aug 2014, Stockholm, Sweden. [PDF]


Projects

relativeattributesTowards Structured Analysis of Broadcast Badminton Videos

People Involved :Anurag Ghosh, Suriya Singh and C. V. Jawahar

Sports video data is recorded for nearly every major tournament but remains archived and inaccessible to large scale data mining and analytics. It can only be viewed sequentially or manually tagged with higher-level labels which is time consuming and prone to errors. In this work, we propose an end-to-end framework for automatic attributes tagging and analysis of sport videos.

 

 
 
 

FirstPersonActionRecognitionUsingDeepLearnedDescriptorsFirst Person Action Recognition

People Involved : Suriya Singh, Chetan Arora, C. V. Jawahar

 

 

 

 

 

usecaseCurrency Recognition on Mobile Phones

People Involved : Suriya Singh, Shushman Choudhury, Kumar Vishal and C.V. Jawahar

In this project, we present an application for recognizing currency bills using computer vision techniques, that can run on a low-end smartphone. The application runs on the device without the need for any remote server. It is intended for robust, practical use by the visually impaired.

MohakSukhwaniMohak Sukhwani

Areas of Interest: Computer Vision, Machine Learning
 
Email: This email address is being protected from spambots. You need JavaScript enabled to view it.
 
Address: CVIT, IIIT-H
 
Phone:
 
Personal Home Page: http://researchweb.iiit.ac.in/~mohak.sukhwani/web/

Publications

  • Mohak Sukhwani and C.V. Jawahar - Frame level annotations for tennis videos Pattern Recognition (ICPR), 2016 23rd International Conference on. IEEE, 2016. [PDF]

  • Anurag Ghosh, Yash Patel, Mohak Sukhwani and C.V. Jawahar - Dynamic Narratives for Heritage Tour 3rd Workshop on Computer Vision for Art Analysis (VisART), European Conference on Computer Vision (ECCV), 2016 [PDF]

  • Mohak Sukhwani, C. V. Jawahar - Tennis Vid2Text : Fine-Grained Descriptions for Domain Specific Videos Proceedings of the 26th British Machine Vision Conference, 07-10 Sep 2015, Swansea, UK. [PDF]

  • Mohak Sukhwani, Suriya Singh, Anirudh Goyal, Aseem Behl, Pritish Mohapatra, Brijendra Kumar Bharti, C.V. Jawahar - Monocular Vision based Road Marking Recognition for Driver Assistance and Safety Proceedings of the IEEE Conference on Vehicular Electronics and Safety,16-17 Dec 2014, Hyderabad, India. [PDF]


Projects

capVSdesFine-Grained Descriptions for Domain Specific Videos

People Involved :Mohak Kumar Sukhwani, C. V. Jawahar

Generation of human like natural descriptions for multimedia content pose an interesting challenge for vision community. In our current work we tackle the challenge of generating descriptions for the videos. The proposed method demonstrates considerable success in generating syntactically and pragmatically correct text for lawn tennis videos and is notably effective in capturing majority of the video content. Unlike any previous work our method focuses on generating exhaustive and richer human like descriptions. We aim to provide reliable descriptions that facilitate the task of video analysis and help understand the ongoing events in the video. Large volumes of text data are used to compute associated text statistics which is thereafter used along with computer vision algorithms to produce relevant descriptions

 

More Articles …

  1. Gaurav Mittal
  2. Devendra Kumar Sahu
  3. Akhilesh BSR
  4. Subho Banerjee
  • Start
  • Prev
  • 7
  • 8
  • 9
  • 10
  • 11
  • 12
  • 13
  • 14
  • 15
  • 16
  • Next
  • End
  1. You are here:  
  2. Home
  3. People
  4. MS Students
  5. MS Students
Bootstrap is a front-end framework of Twitter, Inc. Code licensed under MIT License. Font Awesome font licensed under SIL OFL 1.1.