TY  - GEN
UR  - https://archiv.ub.uni-heidelberg.de/volltextserver/34903/
N2  - Spatial omics data shows potential to reveal novel insights into the underlying mechanisms of cancer. Yet the high-dimensional and highly correlated feature space imposes challenges on analysis. In this thesis, the implementation of convolutional autoencoders to extract explainable features for biomarker discovery is examined, exemplified on tumor hypoxia.

Mass spectrometry imaging and spatial transcriptomics experiments were performed on consecutive tissue slices of head and neck squamous cell carcinoma tumor models. To
advance accessibility of these spatial omics modalities, data was reduced by convolutional autoencoders and the resulting latent space features were ranked for association with tumor hypoxia through random forest feature importance measures. With the help of a newly proposed recovery method, the contribution of original features to a latent feature
was derived, thereby retaining biological relevant information. The derived genes and peptides were compared against the ranked genes and peptides of a random forest only model. The feature sets of the autoencoder approaches achieved consistently higher scores when evaluated using the structural similarity index measure. In contrast, the features of the random forest only models contained many more noisy hypoxia associations caused by the multicollinearity of features.

Several promising unimodal and multimodal biomarker candidates of mass spectrometry imaging and spatial transcriptomics data for tumor hypoxia were identified. Multimodal biomarkers were identified through correlation analysis of aligned serial tissue slices from
both spatial omics modalities in four samples. For a more elaborate integration, it was outlined how the molecular information of multiple spatial omics modalities may be
combined without error-prone alignment of consecutive tissue slices. Instead, the spatial omics modalities may be learned directly from the readily available microscopy images
using convolutional neural networks. Then, the learned molecular information may be predicted from microscopy images of other spatial omics modalities. Preliminary results demonstrated that the learning of the latent space features of autoencoders yielded more accurate predictions than when learning was performed on the raw and sparse spatial omics features. However, it necessitates further investigation whether also hypoxia-associated
features can be acquired accurately from microscopy images.

Overall, the findings show that convolutional autoencoders accompanied by random forest models retain more biological relevant information for biomarker discovery than without
prior feature extraction. Considering the increasing amount of available (spatial) omics data, deep learning feature extraction will become evermore important. This thesis
contributes to the overall understanding of autoencoders by showcasing how specific characteristics in spatial omics data reflect in the latent space and how they can be
addressed through hyperparameter configurations.
CY  - Heidelberg
AV  - public
TI  - Enhancing Biomarker Discovery in Tumor Hypoxia for Head and Neck Squamous Cell Carcinoma: Advancing Spatial Omics Data Accessibility through Convolutional Autoencoders
ID  - heidok34903
KW  - Autoencoder
KW  -  Random Forest
KW  -  Biomarker Discovery
KW  -  Mass Spectrometry Imaging
KW  -  Spatial Transcriptomics
A1  - Bitto, Verena
Y1  - 2024///
ER  -