DALI: The 3rd MICCAI Workshop on Data Augmentation, Labeling, and Imperfections

The rapid expansion of data-intensive methods for supervised learning has led to an unprecedented demand for large quantities of annotated data. However, obtaining extensive collections of medical images is exceptionally challenging, as it necessitates rare and costly expertise for annotation. Furthermore, medical data are often noisy and imperfect due to missing entries and sensing heterogeneity. A forum for discussing contemporary and practical approaches for dealing with these challenges is urgently needed.

The MICCAI Workshop on Data Augmentation, Labeling, and Imperfections (DALI) aims to provide a venue for researchers to present and discuss their experiences related to these crucial topics, fostering a collaborative environment to tackle these challenges.

Important Dates

Paper Submission Opens: May 9, 2023
Paper Submission Deadline: ~~June 25~~ July 10, 2023
Notification to Authors: ~~July 16~~ July 31, 2023
Camera Ready Deadline: ~~July 30~~ August 17, 2023
Workshop Day: October 12, 2023, Vancouver, Canada

Call For Papers

Training machine learning systems in the areas of image recognition, object detection, and image segmentation often demands an immense volume of expert-annotated data to achieve high accuracy. A larger number of labeled images enhances the performance of machine learning models by promoting better generalization and reducing overfitting. This necessity is even more critical for advanced learning architectures, such as vision transformers. Consequently, the most renowned benchmark datasets for general image recognition tasks comprise tens of thousands to millions of images.

Regrettably, acquiring such vast quantities of labeled data presents significant challenges in the medical imaging domain due to the high cost of annotation by domain experts and the scarcity of high-quality anonymized data stemming from privacy concerns. Additionally, there are distinct challenges associated with collecting annotated medical datasets. For example, while difficult to procure, instances of rare pathological conditions are crucial for accurately representing the data distribution. Furthermore, variations often exist among experts who provide labels, particularly for conditions that cause confusion among human experts and require the most assistance.

The goal of this workshop is to bring together and create a discussion forum for researchers in the MICCAI community, including those:

i. interested in the rigorous study of medical data as it relates to machine learning systems, ii. developing and promoting novel directions of research in such techniques, iii. contributing benchmark datasets, open challenges, and tasks that enable fair comparisons among existing and new techniques, and iv. applying such techniques to improve the performance of medical image computing systems.

The workshop will feature invited speakers presenting popular and emerging data augmentation and contemporary approaches for learning from small and noisy medical data. The workshop welcomes submissions that present new ideas, new results, new datasets, as well as discussion and evaluation of existing approaches. The topics of interest include but are not limited to:

Training and evaluation with noisy or uncertain labels
Data annotation tools and practices
Synthetic data for medical image analysis
Data-related foundation models
Multi-modal learning
One-shot/few-shot learning
Active learning
Semi-, weakly-, self-supervised learning
Deep learning for small, noisy and imperfect data
Domain adaptation/generalization
Erroneous label detection
Data curation
Principles and/or case studies of annotated datasets and benchmarks
Anonymization, PHI detection
Other related topics

Submissions to our workshop will be managed using the same platform as the main MICCAI conference, using Microsoft CMT. Workshop paper submission website is at: https://cmt3.research.microsoft.com/DALI2023

The DALI workshop will employ the same reviewing standards as the main conference. DALI workshop paper submissions should be anonymized to accommodate a double-blind review. Papers should be formatted using LaTeX or MS Word templates available at Lecture Notes in Computer Science. Manuscripts should be up to 8 pages (text, figures, and tables) plus up to 2 pages of references. In submitting a paper, authors implicitly acknowledge that no paper of substantially similar content has been or will be submitted to another conference or workshop until the decisions have been made by our workshop. Supplemental material submission is optional, which may include:

Videos of results that cannot be included in the main paper
Anonymized related submissions to other conferences and journals
Appendices or technical reports containing extended proofs and mathematical derivations that are not essential for the understanding of the paper

Contents of the supplemental material should be referred to appropriately in the paper, and reviewers are not obliged to look at it.

Camera Ready Submission Guidelines

Please carefully address the feedback provided by the reviewers. Submit the revised materials to the DALI CMT site as a single zip archive, named in the format dali23_id-X.zip, with “X” being replaced by your unique paper ID.

Your submission should include:

Manuscript: Maximum of 8.5 pages, inclusive of text, figures, and tables, with an additional allowance of up to 2 pages for references. The file should be named manuscript.pdf.
Supplementary Material (Optional): Name the file supplementary_material.pdf. Note that source files for supplementary materials aren’t mandatory.
Changes Document: A detailed list of modifications made post-review. Name the file changes_after_review.pdf.
Copyright Form: Download and fill out the copyright form. The form should be signed by the corresponding author. Digital signatures will not be accepted. Save this document as copyright.pdf.
Source Files: Include a folder named src/, which houses the source files for your manuscript (e.g., .tex, .bib, .docx).

To ensure your paper is presented at MICCAI DALI 2023, a minimum of one paper author must register to attend on the second workshop day, October 12. As a general rule, this registration should be an “in-person” registration. The camera-ready submission portal will prompt you to provide the registration number of the author who will be presenting your work.

Program

Location

Workshop: Meeting Room 14, Vancouver Convention Center East Building Level 1
Coffee Break/Poster Session: The Poster Hall at Ground Level Exhibition B-C
Virtual Attendance: ConFLUX platform

Keynote Speakers

Dimitris N. Metaxas, Rutgers University, USA [Keynote Info]
Lena Maier-Hein, German Cancer Research Center, Germany [Keynote Info]
Paul M. Thompson, University of Southern California, USA [Keynote Info]

Schedule

1:30 - 1:40 PM PDT Opening, welcome and introduction
1:40 - 1:55 PM PDT Oral: Masked Conditional Diffusion Models for Image Analysis with Application to Radiographic Diagnosis of Infant Abuse, Andy Tsai (Boston Children’s Hospital and Harvard Medical School)
1:55 - 2:10 PM PDT Oral: Self-Supervised Single-Image Deconvolution with Siamese Neural Networks, Mikhail Papkov (University of Tartu)
2:10 - 2:50 PM PDT Keynote: The devil is in the details: On the importance of professionalizing the whole image analysis pipeline, Lena Maier-Hein (German Cancer Research Center); [Details]
2:50 - 3:30 PM PDT Keynote: Genetic Mutation and Biological Pathway Prediction from Whole Slide Images using Deep Learning for Cancer Detection and Diagnosis, Dimitris N. Metaxas (Rutgers University); [Details]
3:30 - 4:00 PM PDT Coffee Break/Poster Session (Ground Level Exhibition B-C)
4:00 - 4:30 PM PDT Poster Session (Ground Level Exhibition B)
4:30 - 5:10 PM PDT Keynote: AI and Data Efficient Deep Learning to Accelerate Large-Scale Studies of Brain Diseases, Paul M. Thompson (University of Southern California); [Details]
5:10 - 5:25 PM PDT Oral: Knowledge Graph Embeddings for Multi-Lingual Structured Representations of Radiology Reports, Tom J van Sonsbeek (University of Amsterdam)
5:25 - 5:40 PM PDT Oral: Data Augmentation Based on DiscrimDiff for Histopathology Image Classification, Xianchao Guan (Harbin Institute of Technology, Shenzhen)
5:40 - 5:55 PM PDT Oral: URL: Combating Label Noise for Lung Nodule Malignancy Grading, Xianze Ai (Northwestern Polytechnical University)
5:55 - 6:10 PM PDT Oral: A Realistic Collimated X-Ray Image Simulation Pipeline, Benjamin El-Zein (Friedrich-Alexander-University Erlangen-Nuremberg)
6:10 - 6:30 PM PDT Closing and award announcement

Poster List (Ground Level Exhibition B)

01 URL: Combating Label Noise for Lung Nodule Malignancy Grading
02 Zero-shot Learning of Individualized Task Contrast Prediction from Resting-state Functional Connectomes
03 Microscopy Image Segmentation via Point and Shape Regularized Data Synthesis
04 A Unified Approach to Learning with Label Noise and Unsupervised Confidence Approximation
05 Transesophageal Echocardiography Generation using Anatomical Models
06 Data Augmentation Based on DiscrimDiff for Histopathology Image Classification
07 Clinically Focussed Evaluation of Anomaly Detection and Localisation Methods Using Inpatient CT Head Data
08 LesionMix: A Lesion-Level Data Augmentation Method for Medical Image Segmentation
09 Knowledge Graph Embeddings for Multi-Lingual Structured Representations of Radiology Reports
10 Modular, Label-Efficient Dataset Generation for Instrument Detection for Robotic Scrub Nurses
11 Adaptive Semi-Supervised Segmentation of Brain Vessels with Ambiguous Labels
12 Proportion Estimation by Masked Learning from Label Proportion
13 Active Learning Strategies on a Real-World Thyroid Ultrasound Dataset
14 A Realistic Collimated X-Ray Image Simulation Pipeline
15 Masked Conditional Diffusion Models for Image Analysis with Application to Radiographic Diagnosis of Infant Abuse
16 Self-Supervised Single-Image Deconvolution with Siamese Neural Networks

Proceedings and CMIG Special Issue

Accepted DALI workshop papers will be published with MICCAI 2023 Proceedings in the Springer Lecture Notes in Computer Science (LNCS) series. A selection of the best papers will also be invited to submit revised and extended versions of their work to a Special Issue in CMIG (Computerized Medical Imaging and Graphics, IF: 5.7). The pre-selection of these papers will be based on editorial chair recommendations during the review process, with final decisions made by the program chairs. Extended papers should emphasize the application of DALI-related methods to address specific medical problems. Please note that even extended papers may be subject to rejection during the peer-review process of CMIG. The submission portal is open now. More details can be found here.

Awards and Sponsors

We are pleased to announce that two prestigious awards will be presented at the upcoming DALI workshop, and we are grateful to our generous sponsors for making these prizes possible. The Best Paper Award and People’s Choice Award, each with an amount of $500, are being sponsored by two leading companies in the industry: SenseTime and InferVision.

We are thrilled to have SenseTime and InferVision as sponsors for the DALI workshop and are grateful for their support in recognizing outstanding contributions in medical AI research. We encourage all attendees to take the opportunity to learn more about these companies and their groundbreaking work.

People

Program Chairs

Yuan Xue, Ohio State University, USA
Chen (Cherise) Chen, University of Oxford, UK
Chao Chen, Stony Brook University, USA

Editorial Chairs

Lianrui Zuo, Johns Hopkins University, USA
Yihao Liu, Johns Hopkins University, USA

Advisory Board

Sharon Xiaolei Huang, The Pennsylvania State University, USA
Hien V. Nguyen, University of Houston, USA
Nicholas Heller, University of Minnesota, USA
Stephen Wong, Houston Methodist Hospital, USA
Daniel Rueckert, Technische Universität München, Germany
Jerry Prince, Johns Hopkins University, USA
Dimitris N. Metaxas, Rutgers University, USA
Ehsan Adeli, Stanford University, USA

Program Committee

Amogh Subbakrishna Adishesha, The Pennsylvania State University
Cheng Ouyang, Imperial College London
Chenyu You, Yale University
Christoph M. Friedrich, University of Applied Sciences and Arts Dortmund
Edward Kim, Drexel University
Fan Wang, Stony Brook University
Gilbert Lim, SingHealth
Haomiao Ni, The Pennsylvania State University
Jiachen Liu, The Pennsylvania State University
Kexin Ding, University of North Carolina at Charlotte
Luyang Luo, The Chinese University of Hong Kong
Michael Goetz, University Hospital Ulm
Muchao Ye, The Pennsylvania State University
Nicha Dvornek, Yale University
Nicholas Heller, University of Minnesota
Peiyu Duan, Yale University
Peng Jin, The Pennsylvania State University
Ruochen Wang, AbleTo, Inc.
Samira Zare, University of Houston
Samuel Remedios, Johns Hopkins University
Saumya Gupta, Stony Brook University
Sharon Xiaolei Huang, The Pennsylvania State University
Weidong Cai, University of Sydney
Weimin Lyu, Stony Brook University
Yubo Fan, Vanderbilt University
Yuli Wang, Johns Hopkins University
Zeju Li, Imperial College London
Zhangxing Bian, Johns Hopkins University
Zuhui Wang, Stony Brook University

SenseTime graphic InferVision graphic

DALI @ MICCAI 2023, October 12, Vancouver, Canada