Workshop on
Scaling-up Document Image Understanding

26th August, 2023 at ICDAR 2023, San Jose, California, USA

Scope and Motivation

Document Analysis has long suffered from both a fragmented landscape of task-specific datasets and a strong focus on narrow information extraction and document conversion tasks.

The Scaling-up Document Image Understanding workshop aims to open the discussion on possible ways for the community to align data preparation efforts and define large-scale (grand) challenges that drive progress in the field.

Through the workshop, we aspire to contribute to the definition of grand challenges for the community. It is meant to be one of a series of such events to be organized at our scientific forums in the near future.

The Deep Document Workshop at ICFHR 2022 was our first successful event in this series. We would like to plant the seed for an initiative to create our own community’s document-oriented “ImageNet”, over which multiple long-term challenges can be defined.

The workshop is meant to discuss this possibility and help us define a way forward. Come and join us to explore this together.

Program Overview

The workshop includes invited talks from eminent researchers and practitioners and a panel discussion.

Date: 26th August, 2023

Venue: Adobe World Headquarters, 345 Park Avenue, San Jose, California, USA

Time (PDT) | Event Item
01:30 PM - 01:40 PM | Welcome & Introduction
01:40 PM - 03:10 PM | Invited Talks
03:10 PM - 03:40 PM | Pitch
03:40 PM - 04:40 PM | Panel Discussion: DAR: Challenges & Datasets


C. V. Jawahar

IIIT Hyderabad

Dimosthenis Karatzas

CVC, Universitat Autonoma de Barcelona

Anand Mishra

IIT Jodhpur

Andreas Fischer

HES-SO Switzerland

Seiichi Uchida

Kyushu University


The workshop will feature invited speakers from academia and industry, with expertise spanning the major themes of the workshop, including text recognition, graphical document analysis, document intelligence, table detection, computer vision, and NLP.

Panel Discussion