Towards Self-Configuring Radiomics for Robust and Reproducible Predictive Performance

Bohn, Jonas

[thumbnail of PhD_Thesis_Jonas_Bohn_29_10_2025_submitted.pdf]

Preview

PDF, English - main document
Download (103MB) | Terms of use

Citation of documents: Please do not cite the URL that is displayed in your browser location input, instead use the DOI, URN or the persistent URL below, as we can guarantee their long-time accessibility.

DOI: 10.11588/heidok.00038100
URN: urn:nbn:de:bsz:16-heidok-381000

Abstract

Introduction: Radiomics aims to extract quantitative features from medical images that capture underlying biological and clinical characteristics. Despite its promise for precision oncology, radiomics research continues to suffer from poor reproducibility and limited generalization across studies, software, and imaging modalities. This thesis addresses these fundamental limitations by systematically analyzing how methodological design choices—such as feature extraction, preprocessing, and model selection—affect the robustness and transferability of radiomic biomarkers. To enable this large-scale methodological investigation, I developed the Radiomics Processing Toolkit (RPTK), a fully automated and open-source framework that standardizes radiomics experimentation and benchmarking across heterogeneous datasets. Using RPTK, I conducted comprehensive evaluations on seven open-source cancer imaging cohorts and demonstrated the framework’s applicability in two clinical studies on lung cancer immunotherapy response prediction and colorectal neoplasia detection. Materials and Methods: My work integrates radiomics analyses performed on retrospective data, including seven public datasets and two proprietary cohorts, comprising 3,189 Computer Tomography (CT) and Magnetic Resonance (MR) scans from 3,116 patients with a total of 3,273 segmented regions of interest (ROI). The seven open-source datasets include retrospective MR and CT cancer imaging data concerning different tasks for cancer classification from 931 patients. The proprietary data collection include a multi-timepoint (prior treatment and during treatment) CT dataset for lung cancer immunotherapy treatment response prediction (the Predict study) consisting of 73 patients and a large-scale CT liver imaging datasets with 1,997 patients investigating in colorectal neoplasia detection (LiverCRC study). The mean patient age was 62 ± 17 years, and 53.6 % of participants were male. Radiomics features were extracted using two independent feature-extraction tools, PyRadiomics and Medical Image Radiomics Processor (MIRP), enabling standardized cross-extractor comparisons in compliance with the Image Biomarker Standardisation Initiative (IBSI). The RPTK framework integrates adaptive preprocessing, standardized feature extraction, and robust feature stability filtering to enhance reproducibility and robustness for subsequent model training. Six machine learning models were trained to predict tumor malignancy, treatment response, colorectal neoplasia, or cancer subtypes based on the selected feature sets from each extractor. The performance of RPTK was tested against a state-of-the-art radiomics tool (AutoRadiomics) and six different deep learning models. Results: Across seven open-source datasets, RPTK outperformed both AutoRadiomics and deep learning models (Residual Networks (ResNet) and Densely connected convolutional Networks (DenseNet)), achieving a mean test Area Under the Receiver Operating Characteristic curve (AUROC) of 0.81 ± 0.12 compared to 0.68 ± 0.15 and 0.60 ± 0.16, respectively. In the Predict study, longitudinal delta-radiomics analysis with RPTK improved early prediction of immunotherapy response compared to single-timepoint analyses with RPTK, and the inclusion of clinical variables further enhanced model performance in RPTK. RPTK achieved a test AUROC of 0.75 ± 0.10 using delta radiomics, outperforming AutoRadiomics (0.51 ± 0.14) and the best deep learning model (0.56 ± 0.14). In the LiverCRC study, RPTK reached a test AUROC of 0.86 ± 0.04, significantly exceeding AutoRadiomics (0.65 ± 0.03) and deep learning (0.60 ± 0.03), demonstrating scalability and generalization in large multi-thousandsample datasets. Beyond these comparisons, RPTK also matched or outperformed 12 additional published test AUROC values reported on the integrated open-source datasets. Conclusion: Collectively, the results demonstrate that RPTK provides robust, state-of-the-art predictive performance across diverse imaging datasets and clinically relevant tasks. Its modular design enables fair cross-framework benchmarking while maintaining flexibility for clinical data integration and ensuring methodological transparency. The open-source release of RPTK fosters community-driven validation and supports future clinical implementation. This work thus represents both a methodological advancement and a step toward reliable, reproducible, and clinically translatable radiomics.

Document type:	Dissertation
Supervisor:	Brors, Prof. Dr. Benedikt
Place of Publication:	Heidelberg
Date of thesis defense:	10 February 2026
Date Deposited:	16 Feb 2026 06:58
Date:	2026
Faculties / Institutes:	The Faculty of Bio Sciences > Dean's Office of the Faculty of Bio Sciences
DDC-classification:	500 Natural sciences and mathematics 570 Life sciences 600 Technology (Applied sciences)
Uncontrolled Keywords:	Radiomics, Medical Imaging