Workshop on
Scaling-up Document Image Understanding

24th August, 2023 at ICDAR 2023, San Jose, California, USA

Thank you to all ScalDoc participants! A repository of ScalDoc workshop photos is available here: Gallery

Scope and Motivation

Document Analysis has long suffered from both a fragmented landscape of task-specific datasets and a strong focus on narrow-focused information extraction and document conversion tasks.

The Scaling-up document Image understanding workshop aims to open the discussion on possible ways for the community to align data preparation efforts and define large-scale (grand) challenges that drive progress in the field.

Through the workshop, we aspire to contribute to the definition of grand challenges for the community. This is meant to be one of a series of such events to be organized on our scientific forums in the near future.

The Deep Document Workshop at ICFHR 2022, was our first successful event in this series. We would like to set the seed for an initiative to create our own community’s document-oriented “ImageNet”, over which multiple long-term challenges can be defined.

The workshop is meant to discuss this possibility and help us define a way forward. Come, join us to explore this together.

Program Overview

The workshop includes invited talks from eminent researchers and practitioners and a panel discussion.

Date: 24thAugust, 2023

Venue: Adobe World Headquarters , 345 Park Avenue , San Jose, California, USA

Time - PDT Event Item
09:00 M - 09:05 AM Welcome & Introduction - Seiichi Uchida
09:05 AM - 09:50 AM Invited Talk - Yasuhisa Fujii ,
What are grand challenges in DAR? Do we already know?
09:50 AM - 10:10 AM Coffee Break
10:10 AM - 10:40 AM Pitch Session
10:40 AM - 11:25 AM Invited Talk - J. Stephen Downie ,
Beyond OCR: Non-Textual Opportunities and Challenges at the HathiTrust Research Center
11:25 AM - 12:25 PM Panel Discussion: Challenges and Datasets in DAR
12:25 AM - 12:30 PM Closing note


C. V. Jawahar

IIIT Hyderabad

Dimosthenis Karatzas

CVC, Universitat Autonoma de Barcelona

Anand Mishra

IIT Jodhpur

Andreas Fischer

HES-SO Switzerland

Seiichi Uchida

Kyushu University


J. Stephen Downie

University of Illinois Urbana-Champaign

Yasuhisa Fujii

Google Research

Pitch Session

Christopher Kermorvant

Founder, CEO - TEKLIA

Ravi Kiran S

IIIT Hyderabad

Shangbang Long

Google Research

Shubhi Asthana

IBM Almaden Research Center , San Jose

Štěpán Šimsa

Rossum AI labs

Vincent Christlein

Pattern Recognition Lab

Panel Discussion

Anand Mishra

IIT Jodhpur

Joseph Chazalon


Mickael Coustaty

La Rochelle Université

J. Stephen Downie

University of Illinois Urbana-Champaign

Yasuhisa Fujii

Google Research