title: Learning the Parts of Omics: Inference of Molecular Signatures with Non-negative Matrix Factorization creator: Quintero Moreno, Andrés Felipe subject: ddc-004 subject: 004 Data processing Computer science subject: ddc-500 subject: 500 Natural sciences and mathematics description: Background: Feature extraction and signature identification are two critical steps to understand diverse biological processes. Signatures are defined as groups of molecular features that are sufficient to identify certain genotype or phenotype. In particular, Non-negative Matrix Factorization (NMF) has been used to identify signatures in complex genomic datasets. However, running a basic NMF analysis is a challenging task with a steep learning curve and long computing time; furthermore, the usability of these algorithms is lessened by limited resources to interpret the results obtained from them. This creates a pressing need for the development of tools that mitigate such obstacles. Results: In this study we developed ButchR and ShinyButchR, a fast and user-friendly toolkit to decompose datasets (slicing genomics) and learn signatures using NMF. The package can be freely installed from GitHub at https://github.com/wurst-theke/ButchRr. We used ButchR to identify a new regulatory subtype in neuroblastoma, which showed mesenchymal characteristics and was phenotypically associated to multipotent Schwann cell precursors. Additionally, we created a new workflow to infer regulatory relationships between genes and their _cis_-regulatory elements for individual cells, followed by inference of regulatory-signatures. Conclusions: ButchR/ShinyButchR is an useful toolkit for analyzing multiple types of data, and inferring signatures that are able to capture relevant biological information. This toolkit is a new valuable resource to the scientific community, and it can be used to understand complex biological processes. date: 2021 type: Dissertation type: info:eu-repo/semantics/doctoralThesis type: NonPeerReviewed format: application/pdf identifier: https://archiv.ub.uni-heidelberg.de/volltextserverhttps://archiv.ub.uni-heidelberg.de/volltextserver/30629/1/thesis.pdf identifier: DOI:10.11588/heidok.00030629 identifier: urn:nbn:de:bsz:16-heidok-306292 identifier: Quintero Moreno, Andrés Felipe (2021) Learning the Parts of Omics: Inference of Molecular Signatures with Non-negative Matrix Factorization. [Dissertation] relation: https://archiv.ub.uni-heidelberg.de/volltextserver/30629/ rights: info:eu-repo/semantics/openAccess rights: http://archiv.ub.uni-heidelberg.de/volltextserver/help/license_urhg.html language: eng