Skip to main content
Open access
Research Article
25 September 2024
26th Irish Machine Vision and Image Processing Conference (IMVIP 2024)

OxML challenge 2023: Carcinoma classification using data augmentation

Abstract

Carcinoma is the prevailing type of cancer and can manifest in various body parts. It is widespread and can potentially develop in numerous locations within the body. In the medical domain, data for carcinoma cancer is often limited or unavailable due to privacy concerns. Moreover, when available, it is highly imbalanced, with a scarcity of positive class samples and an abundance of negative ones. The OXML 2023 challenge provides a small and imbalanced dataset, presenting significant challenges for carcinoma classification. To tackle these issues, participants in the challenge have employed various approaches, relying on pre-trained models, preprocessing techniques, and few-shot learning. Our work proposes a novel technique that combines padding augmentation and ensembling to address the carcinoma classification challenge. In our proposed method, we utilize ensembles of five neural networks and implement padding as a data augmentation technique, taking into account varying image sizes to enhance the classifier’s performance. Using our approach, we made place into top three and declared as winner.

Formats available

You can view the full content in the following formats:

Information and Authors

Information

Published in

History

Published ahead of print: 25 September 2024
Published in print: 01 October 2024
Published online: 07 October 2024

Inspec keywords

  1. cancer
  2. data augmentation
  3. image classification
  4. learning (artificial intelligence)
  5. pattern classification

Keywords

  1. body parts
  2. can
  3. carcinoma cancer
  4. carcinoma classification challenge
  5. data augmentation technique
  6. imbalanced dataset
  7. medical domain
  8. negative ones
  9. numerous locations
  10. OXML 2023 challenge
  11. OxML challenge
  12. padding augmentation
  13. positive class samples
  14. prevailing type
  15. privacy concerns
  16. small dataset

Subjects

  1. Patient diagnostic methods and instrumentation
  2. Image recognition
  3. Biomedical measurement and imaging
  4. Computer vision and image processing techniques
  5. Data handling techniques
  6. Neural nets
  7. Biology and medical computing

Authors

Affiliations

Kislay Raj
CRT-AI Centre, School of Computing, Dublin City University, Republic of Ireland
Teerath Kumar
CRT-AI Centre, School of Computing, Dublin City University, Republic of Ireland
Alessandra Mileo
INSIGHT Research Centre, School of Computing, Dublin City University, Republic of Ireland
Malika Bendechache
Lero Research Centre, School of Computer Science, University of Galway, Republic of Ireland

Metrics and Citations

Metrics

Citations

If you have the appropriate software installed, you can download article citation data to the citation manager of your choice. Simply select your manager software from the list below and click Download.

View Options

View options

PDF

View PDF
Access content
Login options

Figures

Tables

Media

Share

Share

Copy the content Link

Share on social media