Skip to content

wenh06/cinc2024

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

386 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CinC2024

docker-ci-and-test format-check

Digitization and Classification of ECG Images: The George B. Moody PhysioNet Challenge 2024

Challenge Website

The figure below demonstrates the framework of the proposed method in this project (more details can be found in the conference paper).

The framework

💀💀💀 SUPER BIG MISTAKE: The loss function for the classification head of the official phase (multi-label) is NOT changed from the cross-entropy loss used in the unofficial phase (single-label) to the asymmetric loss. See the commit for the details. 💀💀💀

The Conference

Conference Website | Unofficial Phase Leaderboard | Official Phase Leaderboard

     

👉 Back to TOC

Description of the files/folders(modules)

Files

Click to view the details

Folders(Modules)

Click to view the details

👉 Back to TOC

Performance comparison of classification backbones

Curves of F1 score using different backbone sizes (all ConvNeXt architecture) are collected in the following image.

Final results table

Details of the final results can be found in the official results page.

F-measure SNR
Rank 9/16 13/16
Leaderboard 0.33 -0.733
Color scans of clean papers 0.332 -0.148
Black-and-white scans of clean papers 0.327 -1.267
Mobile phone photos of clean papers 0.306 -9.019
Mobile phone photos of stained papers 0.316 -8.545
Mobile phone photos of deteriorated papers 0.306 -6.398
Color scans of deteriorated papers 0.331 -1.636
Black-and-white scans of deteriorated papers 0.319 -3.559
Screenshots of computer monitor 0.288 -6.532

Possible solutions for the digitization task

Click to view the details
  • End-to-end model (NOT adopted): A single model that takes the input image and produces the digitized ECG signal directly.

  • Several-stage solution (adopted): A multi-stage solution that consists of several models, possibly including:

    • OCR model: Recognizes the ECG signal names and its locations in the input image, as well as other metadata. For example, using EasyOCR, or Tesseract, or TrOCR.

    • Object detection model: Detects the area (bounding box) of the ECG signal in the input image. This bounding box, together with the location of the ECG signal names, can be used to crop each channel of the ECG signal.

    • Edge sharpening algorithm: Enhances and extracts the grid lines and the ECG signal from the cropped patches of the input image.

    • Segmentation model: Segments the ECG signal from the cropped patches of the input image. This model can be a U-Net, a DeepLabV3, or a Mask R-CNN, etc.

The end-to-end model is simpler in terms of implementation, but it may be harder to train and optimize. Its effectiveness can not be guaranteed.

The several-stage solution may be easier to train and optimize. However, it requires more effort to design and implement the models and algorithms. (Actually a system of models and algorithms.)

👉 Back to TOC

Citation

If you find this repository useful for your research, please consider citing the following paper:

@inproceedings{Kang_cinc2024,
       title = {{A Multi-Stage Framework for Simultaneous Digitization and Classification of Electrocardiogram Images}},
      author = {Kang, Jingsu and WEN, Hao},
   booktitle = {{2024 Computing in Cardiology Conference (CinC)}},
      series = {{CinC2024}},
      volume = {51},
        issn = {2325-887X},
         doi = {10.22489/cinc.2024.128},
   publisher = {{Computing in Cardiology}},
        year = {2024},
       month = {12},
  collection = {{CinC2024}}
}

👉 Back to TOC

CinC2020 | CinC2021 | CinC2022 | CinC2023 | CinC2025

About

Digitization and Classification of ECG Images: The George B. Moody PhysioNet Challenge 2024

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Contributors