Sports-10K and TV Series-1M Video Datasets
About
We introduce two large video datasets namely Sports-10K and TV series-1M to demonstrate scene text retrieval in the context of video sequences. The first one is from sports video clips, containing many advertisement signboards, and the second is from four popular TV series: Friends, Buffy, Mr. Bean, and Open All Hours. The TV series-1M contains more than 1 million frames. Words such as central, perk, pickles, news, SLW27R (a car number) frequently appear in the TV series-1M dataset. All the image frames extracted from this dataset are manually annotated with the query text they may contain. Annotations are done by a team of three people for about 150 man-hours. We use 10 and 20 query words to demonstrate the retrieval performance on the Sports-10K and the TV series-1M datasets respectively.
Downloads
Please mail us at
Publications
Anand Mishra, Karteek Alahari and C. V. Jawahar.
Image Retrieval using Textual Cues
ICCV 2013 [PDF]
Bibtex
If you use this dataset, please cite:
@InProceedings{MishraICCV13, author = "Mishra, A. and Alahari, K. and Jawahar, C.~V.", title = "Image Retrieval using Textual Cues", booktitle = "ICCV", year = "2013", }
Related datasets
Contact
For any queries about the dataset feel free to contact Anand Mishra. Email: