eprintid: 20230 rev_number: 11 eprint_status: archive userid: 1589 dir: disk0/00/02/02/30 datestamp: 2016-03-01 08:46:06 lastmod: 2024-03-21 05:36:04 status_changed: 2016-03-01 08:46:06 type: article metadata_visibility: show creators_name: Bermejo, Justo Lorenzo title: Above and beyond state-of-the-art approaches to investigate sequence data: summary of methods and results from the population-based association group at the Genetic Analysis Workshop 19 subjects: ddc-570 divisions: i-911800 note: erschienen in: BMC Genetics 2016, 17 (Suppl 2):2. Genetic Analysis Workshop 19, Vienna, Austria, 24-26 August 2014 abstract: This paper summarizes the contributions from the Population-Based Association group at the Genetic Analysis Workshop 19. It provides an overview of the new statistical approaches tried out by group members in order to take best advantage of population-based sequence data. Although contributions were highly heterogeneous regarding the applied quality control criteria and the number of investigated variants, several technical issues were identified, leading to practical recommendations. Preliminary analyses revealed that Hurdle-negative binomial regression is a promising approach to investigate the distribution of allele counts instead of called genotypes from sequence data. Convergence problems, however, limited the use of this approach, creating a technical challenge shared by environment-stratified models used to investigate rare variant-environment interactions, as well as by rare variant haplotype analyses using well-established public software. Estimates of relatedness and population structure strongly depended on the allele frequency of selected variants for inference. Another practical recommendation was that dissenting probability values from standard and small-sample tests of a particular hypothesis may reflect a lack of validity of large-sample approximations. Novel statistical approaches that integrate evolutionary information showed some advantage to detect weak genetic signals, and Bayesian adjustment for confounding was able to efficiently estimate causal genetic effects. Haplotype association methods may constitute a valuable complement of collapsing approaches for sequence data. This paper reports on the experience of members of the Population-Based Association group with several novel, promising approaches to preprocessing and analyzing sequence data, and to following up identified association signals. date: 2016 publisher: BioMed Central id_scheme: DOI ppn_swb: 1655313843 own_urn: urn:nbn:de:bsz:16-heidok-202308 language: eng bibsort: BERMEJOJUSABOVEANDBE2016 full_text_status: public publication: BMC Genetics volume: 17 number: 2 place_of_pub: London pagerange: 1-12 issn: 1471-2156 citation: Bermejo, Justo Lorenzo (2016) Above and beyond state-of-the-art approaches to investigate sequence data: summary of methods and results from the population-based association group at the Genetic Analysis Workshop 19. BMC Genetics, 17 (2). pp. 1-12. ISSN 1471-2156 document_url: https://archiv.ub.uni-heidelberg.de/volltextserver/20230/1/12863_2015_Article_310.pdf