Directly to content
  1. Publishing |
  2. Search |
  3. Browse |
  4. Recent items rss |
  5. Open Access |
  6. Jur. Issues |
  7. DeutschClear Cookie - decide language by browser settings

Using next-generation DNA sequence data for genetic association tests based on allele counts with and without consideration of zero inflation

González Silos, Rosa ; Karadag, Özge ; Peil, Barbara ; Fischer, Christine ; Kabisch, Maria ; Legrand, Carine ; Lorenzo Bermejo, Justo

In: BMC Proceedings, 10 (2016), Nr. 41. pp. 1-8. ISSN 1753-6561

[thumbnail of 12919_2016_Article_62.pdf]
Preview
PDF, English
Download (910kB) | Lizenz: Creative Commons LizenzvertragUsing next-generation DNA sequence data for genetic association tests based on allele counts with and without consideration of zero inflation by González Silos, Rosa ; Karadag, Özge ; Peil, Barbara ; Fischer, Christine ; Kabisch, Maria ; Legrand, Carine ; Lorenzo Bermejo, Justo underlies the terms of Creative Commons Attribution 3.0 Germany

Citation of documents: Please do not cite the URL that is displayed in your browser location input, instead use the DOI, URN or the persistent URL below, as we can guarantee their long-time accessibility.

Abstract

The relationship between genetic variability and individual phenotypes is usually investigated by testing for association relying on called genotypes. Allele counts obtained from next-generation sequence data could be used for this purpose too. Genetic association can be examined by treating alternative allele counts (AACs) as the response variable in negative binomial regression. AACs from sequence data often contain an excess of zeros, thus motivating the use of Hurdle and zero-inflated models. Here we examine rough type I error rates and the ability to pick out variants with small probability values for 7 different testing approaches that incorporate AACs as an explanatory or as a response variable. Model comparisons relied on chromosome 3 DNA sequence data from 407 Hispanic participants in the Type 2 Diabetes Genetic Exploration by Next-generation sequencing in Ethnic Samples (T2D-GENES) project 1 with complete information on diastolic blood pressure and related medication. Our results suggest that in the investigation of the relationship between AAC as response variable and individual phenotypes as explanatory variable, Hurdle-negative binomial regression has some advantages. This model showed a good ability to discriminate strongly associated variants and controlled overall type I error rates. However, probability values from Hurdle-negative binomial regression were not obtained for approximately 25 % of the investigated variants because of convergence problems, and the mass of the probability value distribution was concentrated around 1.

Document type: Article
Journal or Publication Title: BMC Proceedings
Volume: 10
Number: 41
Publisher: BioMed Central
Place of Publication: London
Date Deposited: 05 Dec 2016 13:47
Date: 2016
ISSN: 1753-6561
Page Range: pp. 1-8
Faculties / Institutes: Service facilities > German Cancer Research Center (DKFZ)
Medizinische Fakultät Heidelberg > Institut für Humangenetik
Medizinische Fakultät Heidelberg > Institut für Medizinische Biometrie und Informatik
DDC-classification: 610 Medical sciences Medicine
Additional Information: from Genetic Analysis Workshop 19, Vienna, Austria. 24-26 August 2014 erschienen in: BMC Proceedings 2016, 10 (Suppl 7):41
About | FAQ | Contact | Imprint |
OA-LogoDINI certificate 2013Logo der Open-Archives-Initiative