UNIVERSITE
Vittorio Perduca (MAP5, Université Paris Descartes)
Phenotype simulation under a disease model and applications to power analysis of GWAs
Genome Wide Association studies (GWAs) are widely used to investigate the connection between genotypic and phenotypic variation with respect to a given trait (e.g. a given disease). Assessing the statistical power of such studies is crucial. Power is empirically estimated by simulating realistic samples under a disease model H1. For this purpose, the gold standard consists in simulating the genotypes given the observed phenotypes (case or control) ; thus ensuring that the total number of cases stays unchanged. This method is implemented in the software of reference Hapgen. I will present an alternative approach for simulating samples under H1 that does not require generating new genotypes for each simulation but only phenotypes. This method is based on a backward sampling algorithm and make it possible to simulate new phenotypic datasets under the constraints that a) the phenotypes are in accordance with the corresponding observed genotypes under the chosen model H1 ; b) the total number of cases is the same as in the observed dataset. I will show that our backward sampling algorithm outperforms other standard approaches such as simple rejection algorithm and MCMC. Moreover our algorithm is faster than Hapgen. At last, I will discuss the results of a power analysis on a fictive GWAs we conducted on real data from the 1000 Genomes Project.
This is joint work with Gregory Nuel (MAP5, Université Paris Descartes), Christine Sinoquet and Raphael Mourad (Université de Nantes).
Dans la même rubrique :
- Ségolen Geffray (IRMA, UMR 7501, Université de Strasbourg)
- Bertrand Michel (LSTA, Université Pierre et Marie Curie)
- Van Hanh Nguyen (Laboratoire Statistique et Génome, Université d’Evry et Université Paris-Sud 11)
- Tristan Mary-Huard (AgroParisTech, UMR INRA/AgroParisTech MIA 518)
- Yves Rozenholc (Université Paris Descartes)
- Sébastien Gerchinovitz (DMA, Ecole normale supérieure et Université Paris-Sud)
- Maud Delattre (Laboratoire de Mathématiques, Université Paris Sud)
- Serge Cohen (CNRS/UPS3352 IPANEMA / Synchrotron SOLEIL)
- Julien Stirnemann (MAP5, Maternité et médecine materno-foetale, GHU Necker-Enfants Malades, Université Paris Descartes et CNRS)
- Laureen Ribassin-Majed (MAP5, Université Paris Descartes et CNRS)
- Aurélie Fischer (MAP5 et LSTA, Universités Paris Descartes et Pierre et Marie Curie)
- Anne-Cécile Dragon (CEBC et MAP5, Université Paris Descartes et CNRS)
- Niels Keiding (Department of Biostatistics, University of Copenhagen)
- Christophe Pouzat (Laboratoire de Physiologie Cérébrale, Université Paris Descartes)
- Gaëlle Chagny (MAP5, Université Paris Descartes)
- Marc Vincent (Bases moléculaires de la réponse aux xénobiotiques, UMR-S775, Université Paris Descartes)
- Aurélien Garivier (LTCI Telecom ParisTech, CNRS UMR 5141)
- Pierre Neuvial (Laboratoire Statistique et Génome, Évry, UMR CNRS 8071/Université d’Evry/INRA)
- Simon Cauchemez (School of Public Health and Imperial College, London)
- Meïli Baragatti (IML, université de la Méditerranée et Ipsogen)
