How You Split Matters: Data Leakage and Subject Characteristics Studies in Longitudinal Brain MRI Analysis

Dewinda Julianensi Rumala
Preprint Code Publication Page

MICCAI FAIMI 2023

This work has been honored with the Best Poster Presentation Award at MICCAI FAIMI 2023 workshop.

Download Poster View Award Certificate

Introduction

Methods

Table S1. Dataset Statistics. The ratio of females to males and the average age are based on the number of images rather than the number of subjects.

CollectionData GroupNo of SubjectsFemale / Male Age No of Scans
Before AugmentationAfter Augmentation
5-Fold dataCN4185/6576.68 ± 4.15150300
MCI4550/10074.19 ± 8.57150300
AD2530/2074.22 ± 8.9050300
Hold-out dataCN817/1374.81 ± 3.1330-
MCI118/2275.42 ± 7.2630-
AD716/178.26 ± 6.5230-

Evaluation Scheme

Data Splitting Strategies during CV:

Illustration showing different data split strategies for longitudinal brain MRI: subject-wise, record-wise, and late splitting.
A toy example of different data split strategies for longitudinal brain MRI. (a) Subject-wise splitting groups all image scans based on the subjects into k-folds. (b) Record-wise splitting groups image scans based on different visit times into k-folds. (c) Late splitting groups image scans based on transformation technique into k-folds.

Training Setups

Table S2. Overview of the parameters used across all experiments: the learning rate was reduced using the ReduceLROnPlateau scheduler from TensorFlow by a factor of 0.1 when validation loss did not decrease after 10 epochs. Adam was used as the optimizer.

ParameterValue
Learning rate0.0001
Epsilon0.0001
Beta 10.9
Beta 20.99
Epoch100
Batch size24

Results and Analysis

GradCAM Visualization

Gif Samples

Data GroupSubject-wise SplitRecord-wise SplitLate Split
CN
[O] Correctly classified

[O] Correctly classified

[O] Correctly classified

[X] Misclassified

[X] Misclassified

[X] Misclassified
AD
[O] Correctly classified

[O] Correctly classified

[O] Correctly classified

[X] Misclassified

[X] Misclassified

[X] Misclassified

Discussion

Limitations

References

  1. Chaibub Neto, E., Pratap, A., Perumal, T.M., Tummalacherla, M., Snyder, P., Bot, B.M., Trister, A.D., Friend, S.H., Mangravite, L., Omberg, L.: Detecting the impact of subject characteristics on machine learning-based diagnostic applications. npj Digital Medicine 2(1), 99 (Oct 2019). https://doi.org/10.1038/s41746-019-0178-x
  2. Yagis, E., Atnafu, S.W., García Seco de Herrera, A., Marzi, C., Scheda, R., Giannelli, M., Tessa, C., Citi, L., Diciotti, S.: Effect of data leakage in brain MRI classification using 2D convolutional neural networks. Scientific Reports 11(1), 22544 (Nov 2021). https://doi.org/10.1038/s41598-021-01681-w

Acknowledgement

Special thanks to Directorate General of Higher Education and Research Technology, Indonesia, Prof. I Ketut Eddy Purnama (Sepuluh Nopember Institute of Technology, Indonesia) and Prof. Tae-Seong Kim (Kyung Hee University, South Korea).