DALI @ MICCAI 2023, October 12, Vancouver, Canada

DALI: The 3rd MICCAI Workshop on Data Augmentation, Labeling, and Imperfections

The rapid expansion of data-intensive methods for supervised learning has led to an unprecedented demand for large quantities of annotated data. However, obtaining extensive collections of medical images is exceptionally challenging, as it necessitates rare and costly expertise for annotation. Furthermore, medical data are often noisy and imperfect due to missing entries and sensing heterogeneity. A forum for discussing contemporary and practical approaches for dealing with these challenges is urgently needed.

The MICCAI Workshop on Data Augmentation, Labeling, and Imperfections (DALI) aims to provide a venue for researchers to present and discuss their experiences related to these crucial topics, fostering a collaborative environment to tackle these challenges.

Important Dates

  • Paper Submission Opens: May 9, 2023
  • Paper Submission Deadline: July 10, 2023 (extended from June 25)
  • Notification to Authors: July 31, 2023 (previously July 16)
  • Camera Ready Deadline: August 17, 2023 (previously July 30)
  • Workshop Day: October 12, 2023, Vancouver, Canada

Call For Papers

Training machine learning systems in the areas of image recognition, object detection, and image segmentation often demands an immense volume of expert-annotated data to achieve high accuracy. A larger number of labeled images enhances the performance of machine learning models by promoting better generalization and reducing overfitting. This necessity is even more critical for advanced learning architectures, such as vision transformers. Consequently, the most renowned benchmark datasets for general image recognition tasks comprise tens of thousands to millions of images.

Regrettably, acquiring such vast quantities of labeled data presents significant challenges in the medical imaging domain due to the high cost of annotation by domain experts and the scarcity of high-quality anonymized data stemming from privacy concerns. Additionally, there are distinct challenges associated with collecting annotated medical datasets. For example, while difficult to procure, instances of rare pathological conditions are crucial for accurately representing the data distribution. Furthermore, variations often exist among experts who provide labels, particularly for conditions that cause confusion among human experts and require the most assistance.

The goal of this workshop is to bring together and create a discussion forum for researchers in the MICCAI community, including those:

  i. interested in the rigorous study of medical data as it relates to machine learning systems,
  ii. developing and promoting novel directions of research in such techniques,
  iii. contributing benchmark datasets, open challenges, and tasks that enable fair comparisons among existing and new techniques, and
  iv. applying such techniques to improve the performance of medical image computing systems.

The workshop will feature invited speakers presenting popular and emerging data augmentation techniques and contemporary approaches for learning from small and noisy medical data. The workshop welcomes submissions that present new ideas, new results, and new datasets, as well as discussion and evaluation of existing approaches. The topics of interest include but are not limited to:

  • Training and evaluation with noisy or uncertain labels
  • Data annotation tools and practices
  • Synthetic data for medical image analysis
  • Data-related foundation models
  • Multi-modal learning
  • One-shot/few-shot learning
  • Active learning
  • Semi-, weakly-, self-supervised learning
  • Deep learning for small, noisy and imperfect data
  • Domain adaptation/generalization
  • Erroneous label detection
  • Data curation
  • Principles and/or case studies of annotated datasets and benchmarks
  • Anonymization, PHI detection
  • Other related topics

Submissions to our workshop will be managed through Microsoft CMT, the same platform used by the main MICCAI conference. The workshop paper submission website is: https://cmt3.research.microsoft.com/DALI2023

The DALI workshop will employ the same reviewing standards as the main conference. DALI workshop paper submissions should be anonymized to accommodate a double-blind review. Papers should be formatted using the LaTeX or MS Word templates for Lecture Notes in Computer Science (LNCS). Manuscripts should be up to 8 pages (text, figures, and tables) plus up to 2 pages of references. In submitting a paper, authors implicitly acknowledge that no paper of substantially similar content has been or will be submitted to another conference or workshop until our workshop has made its decisions. Supplemental material is optional and may include:

  • Videos of results that cannot be included in the main paper
  • Anonymized related submissions to other conferences and journals
  • Appendices or technical reports containing extended proofs and mathematical derivations that are not essential for the understanding of the paper

Contents of the supplemental material should be referred to appropriately in the paper, and reviewers are not obliged to look at it.

Camera Ready Submission Guidelines

Please carefully address the feedback provided by the reviewers. Submit the revised materials to the DALI CMT site as a single zip archive, named in the format dali23_id-X.zip, with “X” being replaced by your unique paper ID.

Your submission should include:

  1. Manuscript: Maximum of 8.5 pages, inclusive of text, figures, and tables, with an additional allowance of up to 2 pages for references. The file should be named manuscript.pdf.
  2. Supplementary Material (Optional): Name the file supplementary_material.pdf. Note that source files for supplementary materials aren’t mandatory.
  3. Changes Document: A detailed list of modifications made post-review. Name the file changes_after_review.pdf.
  4. Copyright Form: Download and fill out the copyright form. The form should be signed by the corresponding author. Digital signatures will not be accepted. Save this document as copyright.pdf.
  5. Source Files: Include a folder named src/, which houses the source files for your manuscript (e.g., .tex, .bib, .docx).
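
The packaging steps above can be sketched as a short Python script. This is only an illustration: the function name `build_camera_ready_zip` and the directory layout are assumptions, and the guidelines do not state whether the files must sit at the top level of the archive (this sketch assumes they do).

```python
import zipfile
from pathlib import Path

# File names taken from the camera-ready guidelines.
REQUIRED = ["manuscript.pdf", "changes_after_review.pdf", "copyright.pdf"]
OPTIONAL = ["supplementary_material.pdf"]


def build_camera_ready_zip(paper_id, source_dir, out_dir="."):
    """Hypothetical helper: bundle the camera-ready files into dali23_id-X.zip."""
    source_dir = Path(source_dir)

    # Check that every required file and the src/ folder are present.
    missing = [name for name in REQUIRED if not (source_dir / name).is_file()]
    if not (source_dir / "src").is_dir():
        missing.append("src/")
    if missing:
        raise FileNotFoundError(f"missing required items: {missing}")

    archive = Path(out_dir) / f"dali23_id-{paper_id}.zip"
    with zipfile.ZipFile(archive, "w", zipfile.ZIP_DEFLATED) as zf:
        # Top-level PDFs; the optional one is added only if it exists.
        for name in REQUIRED + OPTIONAL:
            path = source_dir / name
            if path.is_file():
                zf.write(path, arcname=name)
        # Manuscript sources (.tex, .bib, .docx, ...) stay under src/.
        for path in sorted((source_dir / "src").rglob("*")):
            if path.is_file():
                zf.write(path, arcname=path.relative_to(source_dir).as_posix())
    return archive
```

For example, `build_camera_ready_zip(42, "my_paper")` would write `dali23_id-42.zip` to the current directory, provided `my_paper/` contains the required PDFs and a `src/` folder.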

To ensure your paper is presented at MICCAI DALI 2023, at least one author of the paper must register to attend the second workshop day, October 12. As a general rule, this should be an “in-person” registration. The camera-ready submission portal will prompt you to provide the registration number of the author who will be presenting your work.



  • Workshop: Meeting Room 14, Vancouver Convention Center East Building Level 1
  • Coffee Break/Poster Session: The Poster Hall at Ground Level Exhibition B-C
  • Virtual Attendance: ConFLUX platform

Keynote Speakers


  • 1:30 - 1:40 PM PDT Opening, welcome and introduction
  • 1:40 - 1:55 PM PDT Oral: Masked Conditional Diffusion Models for Image Analysis with Application to Radiographic Diagnosis of Infant Abuse, Andy Tsai (Boston Children’s Hospital and Harvard Medical School)
  • 1:55 - 2:10 PM PDT Oral: Self-Supervised Single-Image Deconvolution with Siamese Neural Networks, Mikhail Papkov (University of Tartu)
  • 2:10 - 2:50 PM PDT Keynote: The devil is in the details: On the importance of professionalizing the whole image analysis pipeline, Lena Maier-Hein (German Cancer Research Center); [Details]
  • 2:50 - 3:30 PM PDT Keynote: Genetic Mutation and Biological Pathway Prediction from Whole Slide Images using Deep Learning for Cancer Detection and Diagnosis, Dimitris N. Metaxas (Rutgers University); [Details]
  • 3:30 - 4:00 PM PDT Coffee Break/Poster Session (Ground Level Exhibition B-C)
  • 4:00 - 4:30 PM PDT Poster Session (Ground Level Exhibition B)
  • 4:30 - 5:10 PM PDT Keynote: AI and Data Efficient Deep Learning to Accelerate Large-Scale Studies of Brain Diseases, Paul M. Thompson (University of Southern California); [Details]
  • 5:10 - 5:25 PM PDT Oral: Knowledge Graph Embeddings for Multi-Lingual Structured Representations of Radiology Reports, Tom J van Sonsbeek (University of Amsterdam)
  • 5:25 - 5:40 PM PDT Oral: Data Augmentation Based on DiscrimDiff for Histopathology Image Classification, Xianchao Guan (Harbin Institute of Technology, Shenzhen)
  • 5:40 - 5:55 PM PDT Oral: URL: Combating Label Noise for Lung Nodule Malignancy Grading, Xianze Ai (Northwestern Polytechnical University)
  • 5:55 - 6:10 PM PDT Oral: A Realistic Collimated X-Ray Image Simulation Pipeline, Benjamin El-Zein (Friedrich-Alexander-University Erlangen-Nuremberg)
  • 6:10 - 6:30 PM PDT Closing and award announcement

Poster List (Ground Level Exhibition B)

  • 01 URL: Combating Label Noise for Lung Nodule Malignancy Grading
  • 02 Zero-shot Learning of Individualized Task Contrast Prediction from Resting-state Functional Connectomes
  • 03 Microscopy Image Segmentation via Point and Shape Regularized Data Synthesis
  • 04 A Unified Approach to Learning with Label Noise and Unsupervised Confidence Approximation
  • 05 Transesophageal Echocardiography Generation using Anatomical Models
  • 06 Data Augmentation Based on DiscrimDiff for Histopathology Image Classification
  • 07 Clinically Focussed Evaluation of Anomaly Detection and Localisation Methods Using Inpatient CT Head Data
  • 08 LesionMix: A Lesion-Level Data Augmentation Method for Medical Image Segmentation
  • 09 Knowledge Graph Embeddings for Multi-Lingual Structured Representations of Radiology Reports
  • 10 Modular, Label-Efficient Dataset Generation for Instrument Detection for Robotic Scrub Nurses
  • 11 Adaptive Semi-Supervised Segmentation of Brain Vessels with Ambiguous Labels
  • 12 Proportion Estimation by Masked Learning from Label Proportion
  • 13 Active Learning Strategies on a Real-World Thyroid Ultrasound Dataset
  • 14 A Realistic Collimated X-Ray Image Simulation Pipeline
  • 15 Masked Conditional Diffusion Models for Image Analysis with Application to Radiographic Diagnosis of Infant Abuse
  • 16 Self-Supervised Single-Image Deconvolution with Siamese Neural Networks

Proceedings and CMIG Special Issue

Accepted DALI workshop papers will be published with the MICCAI 2023 Proceedings in the Springer Lecture Notes in Computer Science (LNCS) series. Authors of a selection of the best papers will also be invited to submit revised and extended versions of their work to a Special Issue in CMIG (Computerized Medical Imaging and Graphics, IF: 5.7). The pre-selection of these papers will be based on editorial chair recommendations during the review process, with final decisions made by the program chairs. Extended papers should emphasize the application of DALI-related methods to address specific medical problems. Please note that even extended papers may be rejected during the CMIG peer-review process. The submission portal is open now. More details can be found here.

Awards and Sponsors

We are pleased to announce that two awards will be presented at the DALI workshop, and we are grateful to our generous sponsors for making these prizes possible. The Best Paper Award and the People’s Choice Award, each worth $500, are sponsored by two leading companies in the industry: SenseTime and InferVision.

We are thrilled to have SenseTime and InferVision as sponsors for the DALI workshop and are grateful for their support in recognizing outstanding contributions in medical AI research. We encourage all attendees to take the opportunity to learn more about these companies and their groundbreaking work.


Program Chairs

Editorial Chairs

  • Lianrui Zuo, Johns Hopkins University, USA
  • Yihao Liu, Johns Hopkins University, USA

Advisory Board

  • Sharon Xiaolei Huang, The Pennsylvania State University, USA
  • Hien V. Nguyen, University of Houston, USA
  • Nicholas Heller, University of Minnesota, USA
  • Stephen Wong, Houston Methodist Hospital, USA
  • Daniel Rueckert, Technische Universität München, Germany
  • Jerry Prince, Johns Hopkins University, USA
  • Dimitris N. Metaxas, Rutgers University, USA
  • Ehsan Adeli, Stanford University, USA

Program Committee

  • Amogh Subbakrishna Adishesha, The Pennsylvania State University
  • Cheng Ouyang, Imperial College London
  • Chenyu You, Yale University
  • Christoph M. Friedrich, University of Applied Sciences and Arts Dortmund
  • Edward Kim, Drexel University
  • Fan Wang, Stony Brook University
  • Gilbert Lim, SingHealth
  • Haomiao Ni, The Pennsylvania State University
  • Jiachen Liu, The Pennsylvania State University
  • Kexin Ding, University of North Carolina at Charlotte
  • Luyang Luo, The Chinese University of Hong Kong
  • Michael Goetz, University Hospital Ulm
  • Muchao Ye, The Pennsylvania State University
  • Nicha Dvornek, Yale University
  • Nicholas Heller, University of Minnesota
  • Peiyu Duan, Yale University
  • Peng Jin, The Pennsylvania State University
  • Ruochen Wang, AbleTo, Inc.
  • Samira Zare, University of Houston
  • Samuel Remedios, Johns Hopkins University
  • Saumya Gupta, Stony Brook University
  • Sharon Xiaolei Huang, The Pennsylvania State University
  • Weidong Cai, University of Sydney
  • Weimin Lyu, Stony Brook University
  • Yubo Fan, Vanderbilt University
  • Yuli Wang, Johns Hopkins University
  • Zeju Li, Imperial College London
  • Zhangxing Bian, Johns Hopkins University
  • Zuhui Wang, Stony Brook University

Latest Posts

The DALI 2023 Workshop Program Has Been Posted

The DALI 2023 workshop will be held in Meeting Room 14, Vancouver Convention Center East Building Level 1, from 1:30 to 6:30 PM PDT on October 12. Virtual participants will be able to access our workshop’s Zoom room through the ConFLUX platform. Don’t forget to add the DALI workshop to your agenda! Virtual presenters and participants must be registered for the October 12 workshop day in order to see the DALI workshop on ConFLUX and thus access the Zoom room. We look forward to meeting you in Vancouver soon!

Introducing: DALI 2023

Welcome to the homepage for the 3rd MICCAI Workshop on Data Augmentation, Labeling, and Imperfections (DALI). The previous edition can be found at MICCAI DALI 2022.