The IIIT Scene Text Retrieval (STR) Dataset
[Project Page]
About
The IIIT STR dataset is harvested from Google and Flickr image search. Query words like coffee shop, motel, post office, high school, department were used to collect the images. Additionally, query words like sky, building were used in Flickr to collect some random distractors (images not containg text). The dataset contains 10,000 images in all. The images are manually annotated to say whether they contain a query word or not. Annotation for all the 50 query words used in our paper is available. Each query word appears 10-50 times in the dataset.
Downloads
Publications
Anand Mishra, Karteek Alahari and C. V. Jawahar.
Image Retrieval using Textual Cues
ICCV 2013 [PDF]
Bibtex
If you use this dataset, please cite:
@InProceedings{MishraICCV13, author = "Mishra, A. and Alahari, K. and Jawahar, C.~V.", title = "Image Retrieval using Textual Cues", booktitle = "ICCV", year = "2013", }
Related datasets
Contact
For any queries about the dataset feel free to contact Anand Mishra. Email: