![]()
|
CVIT is building tools for document understating tasks with special emphasis on Indian languages. These include extensible multi-lingual optical character reader (OCR) systems, form processing systems, document database systems for scanned documents, reading aid for printed text, and on-line character recognition systems. Advanced prototypes of some of these systems have been demonstrated at the Institute's open house and other forums. The centre believes that the rich, multi-lingual scenario that exists in the country provides special challenges to document understanding research. Specific activities in this area include:
Content-based retrieval of relevant items from a large collection of images and video is a challenging problem. We are building a system that addresses some aspects of this problem. Issues related to the adaptation, advanced and natural querying schemes, feature selection, content representation, etc., are being explored at the centre. Domain specific systems for News Videos, Human faces, Medical Images, Logo Images, etc., are quite possible today and we are building them. Specific activities in this area include:
Understanding of dynamic and static objects from its multiple images is an important issue in computer vision systems. Geometric constraints on multiple views of the same scene has been an active area of research in Computer Vision recently. We have developed a new framework for recognition of some classes of objects and demonstrated its applications on real-life images. We are currently exploring the problem of tracking moving objects in multiple views and compression of the multiview video data for virtual presence. Specific activities in this area include:
Computer Graphics and Virtual Reality are areas with wide applications in training, engineering, etc. The centre is active in research into the visualization and navigation of large virtual environments, representation and visualization of molecular data, developing client-server models for inexpensive Virtual Reality, image-based rendering, and the study of software architectures for next-generation VR systems. Specific activities in this area include:
The centre works on processing of medical images, especially images of retina of the eye. Specific activities in this area include:
The areas of Pattern Recognition and Machine Learning encompasses a wide variety of topics that are essential for most visual information processing problems. Our primary areas of interest include classifier design and systems that learn from large collections of data. Specific activities in this area include:
The area of Dynamic Scene Analysis includes the processing, classification, understanding and summarisation of video streams from both single and multiple cameras. Specific activities in this area include:
Biometrics deals with the identification of people based on their physiological or behavioural characteristics, such as fingerprints, face, hand geometry, handwriting, speech, etc. The primary focus of research are on improving the performance of popular biometric modalities and to develop computer vision based solutions to biometric problems. Specific activities in this area include:
In addition to the above, our centre works on a variety of problems in various topics. Some of them are listed below:
|
| Center for Visual Information Technology, IIIT, Hyderabad |
| Last Modified: Fri Jan 13 17:40:05 IST 2011 |