Publication

Constructing Hypothetical Risk Data from the Area under the ROC Curve: Modelling Distributions of Polygenic Risk.

Downloadable Content

Persistent URL
Last modified
  • 02/20/2025
Type of Material
Authors
    Suman Kundu, Emory UniversityJannigje G. Kers, Vrije Universiteit AmsterdamAnna Janssens, Emory University
Language
  • English
Date
  • 2016
Publisher
  • Public Library of Science
Publication Version
Copyright Statement
  • © 2016 Kundu et al
License
Final Published Version (URL)
Title of Journal or Parent Work
ISSN
  • 1932-6203
Volume
  • 11
Issue
  • 3
Start Page
  • e0152359
End Page
  • e0152359
Grant/Funding Information
  • This work was supported by a Vidi grant from the Netherlands Organization for Scientific Research (NWO) and a consolidator grant from the European Research Council (GENOMICMEDICINE).
Supplemental Material (URL)
Abstract
  • BACKGROUND: Modeling studies using hypothetical polygenic risk data can be an efficient tool for investigating the effectiveness of downstream applications such as targeting interventions to risk groups to justify whether empirical investigation is warranted. We investigated the assumptions underlying a method that simulates risk data for specific values of the area under the receiver operating characteristic curve (AUC). METHODS: The simulation method constructs risk data for a hypothetical population based on the population disease risk, and the odds ratios and frequencies of genetic variants. By systematically varying the parameters, we investigated under what conditions AUC values represent unique ROC curves with unique risk distributions for patients and nonpatients, and to what extend risk data can be simulated for precise values of the AUC. RESULTS: Using larger number of genetic variants each with a modest effect, we observed that the distributions of estimated risks of patients and nonpatients were similar for various combinations of the odds ratios and frequencies of the risk alleles. Simulated ROC curves overlapped empirical curves with the same AUC. CONCLUSIONS: Polygenic risk data can be effectively and efficiently created using a simulation method. This allows to further investigate the potential applications of stratifying interventions on the basis of polygenic risk.
Author Notes
Keywords
Research Categories
  • Health Sciences, Epidemiology
  • Biology, Genetics
  • Biology, Biostatistics

Tools

Relations

In Collection:

Items