Overview of ImageCLEFmedical 2024 – Caption Prediction and Concept Detection

Abstract

The ImageCLEFmedical 2024 Caption task on caption prediction and concept detection follows similar challenges
held from 2017–2023. The goal is to extract Unified Medical Language System (UMLS) concept annotations and/or
generate captions from image data. Predictions are compared to the original image captions. Images for both tasks
are part of the Radiology Objects in COntext version 2 (ROCOv2) dataset. For concept detection, multi-label
predictions are compared against UMLS terms extracted from the original captions with additional manually
curated concepts via the F1-score. For caption prediction, the semantic similarity of the predictions to the original
captions is evaluated using the BERTScore. The task attracted strong participation: of 50 registered teams,
14 submitted 82 graded runs across the two subtasks. Participants mainly used multi-label classification
systems for the concept detection subtask; the winning team, DBS-HHU, utilized an ensemble of four different
Convolutional Neural Networks (CNNs). For the caption prediction subtask, most teams used encoder-decoder
frameworks with various backbones, including transformer-based decoders and Long Short-Term Memory networks
(LSTMs); the winning team, PCLmed, used medical vision-language foundation models (Med-VLFMs),
combining general and specialist vision models.
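The concept detection metric described above — comparing a set of predicted UMLS concepts per image against the reference set via the F1-score — can be sketched as follows. This is a minimal illustration of set-based, per-image F1 averaged over a run, assuming the common ImageCLEF-style evaluation setup; the concept identifiers and the exact tie-breaking for empty sets are illustrative assumptions, not taken from the task's official evaluation script.

```python
def image_f1(predicted: set, reference: set) -> float:
    """F1 between the predicted and reference concept sets for one image."""
    if not predicted and not reference:
        return 1.0  # both empty: treat as perfect agreement (assumption)
    tp = len(predicted & reference)  # true positives: concepts in both sets
    if tp == 0:
        return 0.0
    precision = tp / len(predicted)
    recall = tp / len(reference)
    return 2 * precision * recall / (precision + recall)

def mean_f1(predictions: dict, references: dict) -> float:
    """Average per-image F1 over a run (maps image id -> set of UMLS CUIs)."""
    return sum(image_f1(predictions[i], references[i]) for i in references) / len(references)

# Illustrative example with made-up concept identifiers:
refs  = {"img1": {"C1", "C2", "C3"}, "img2": {"C4"}}
preds = {"img1": {"C1", "C2"},       "img2": {"C4", "C5"}}
score = mean_f1(preds, refs)  # mean of 0.8 and 0.667
```

A run that predicts too many concepts is penalized through precision, while one that misses reference concepts is penalized through recall, so the averaged F1 rewards balanced predictions.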
