Directly to content
  1. Publishing |
  2. Search |
  3. Browse |
  4. Recent items rss |
  5. Open Access |
  6. Jur. Issues |
  7. DeutschClear Cookie - decide language by browser settings

Method development for comparative cancer genomics

Pyl, Paul-Theodor

PDF, English - main document
Download (12MB) | Terms of use

Citation of documents: Please do not cite the URL that is displayed in your browser location input, instead use the DOI, URN or the persistent URL below, as we can guarantee their long-time accessibility.


With the cost of sequencing continuously dropping and the increased availability of se- quencing technologies the coming years will bring a wealth of sequencing data that will be tremendously interesting and challenging to analyse and interpret. It is becoming more and more clear that both the algorithmic approaches as well as the handling of the data itself will prove challenging and some of the legacy approaches and file formats that were developed in the wake of the first large-scale sequencing projects (e.g. the human genome sequencing project) will prove unsuitable or at least inconvenient to use once projects that aim to analyse sequencing data from thousands of samples become the norm rather than the exception. In this thesis I will discuss my work in the field of sequencing analysis during my PhD studies. I will touch upon some of the challenges and problems researchers are facing in the field today and will present the approaches and solutions I have developed to deal with those issues. To this end I will present my work in methods development for sequencing analy- sis in general and specifically in the context of cancer genomics and give examples of the application of those methods in projects that I have been involved in. My two main contribu- tions to available methods for sequencing analysis are the HTSeq Python Library and the h5vc R/Bioconductor package (Anders et al., 2014; Pyl et al., 2014). I co-developed the former with Simon Anders and am the lead developer of the latter. Both pieces of software are available through public repositories, and are well-documented and -maintained. The projects in which those methods have found application are the HeLa Kyoto sequenc- ing project (Landry et al., 2013) and a set of three cancer genomics projects involving cohorts of up to 18 whole genome sequencing (WGS) samples and up to 21 whole exome sequencing (WES) samples, respectively. I will discuss my methodological contributions to these projects as well as relevant biological results in Section 5.

Item Type: Dissertation
Supervisor: Bertone, Prof. Dr. Paul
Date of thesis defense: 26 November 2014
Date Deposited: 11 Dec 2014 07:56
Date: 2014
Faculties / Institutes: The Faculty of Bio Sciences > Dean's Office of the Faculty of Bio Sciences
Service facilities > European Molecular Biology Laboratory (EMBL)
About | FAQ | Contact | Imprint |
OA-LogoDINI certificate 2013Logo der Open-Archives-Initiative