The IIIT Scene Text Retrieval (STR) Dataset

[Project Page]

strfig


About

The IIIT STR dataset is harvested from Google and Flickr image search. Query words like coffee shop, motel, post office, high school, department were used to collect the images. Additionally, query words like sky, building were used in Flickr to collect some random distractors (images not containg text). The dataset contains 10,000 images in all. The images are manually annotated to say whether they contain a query word or not. Annotation for all the 50 query words used in our paper is available. Each query word appears 10-50 times in the dataset.


Downloads

IIIT STR (758 MB)
README


Publications

Anand Mishra, Karteek Alahari and C. V. Jawahar.
Image Retrieval using Textual Cues
ICCV 2013 [PDF]


Bibtex

If you use this dataset, please cite:

@InProceedings{MishraICCV13,
  author    = "Mishra, A. and Alahari, K. and Jawahar, C.~V.",
  title     = "Image Retrieval using Textual Cues",
  booktitle = "ICCV",
  year      = "2013",
}

Related datasets


Contact

For any queries about the dataset feel free to contact Anand Mishra. Email:This email address is being protected from spambots. You need JavaScript enabled to view it.