Publication

Prioritizing individual genetic variants after kernel machine testing using variable selection

Downloadable Content

Persistent URL
Last modified
  • 05/14/2025
Type of Material
Authors
    Qianchuan He, Fred Hutchinson Cancer Research CenterTianxi Cai, Harvard UniversityYan Liu, Emory UniversityNi Zhao, Fred Hutchinson Cancer Research CenterQuaker E. Harmon, National Institute of Environmental Health SciencesLynn Almli, Emory UniversityElisabeth Binder, Emory UniversityStephanie M. Engel, University of North CarolinaKerry Ressler, Emory UniversityKaren Conneely, Emory UniversityXihong Lin, Harvard UniversityMichael C. Wu, Fred Hutchinson Cancer Research Center
Language
  • English
Date
  • 2016-12-01
Publisher
  • Wiley: 12 months
Publication Version
Copyright Statement
  • © 2016 WILEY PERIODICALS, INC.
Final Published Version (URL)
Title of Journal or Parent Work
ISSN
  • 0741-0395
Volume
  • 40
Issue
  • 8
Start Page
  • 722
End Page
  • 731
Grant/Funding Information
  • This research was supported in part by NIH R21HD060207, R01HG007508, R01HG006292, the Fred Hutchinson Cancer Research Center Institutional Research Support, and the Intramural Research Program of the NIH, National Institute of Environmental Health Sciences.
Supplemental Material (URL)
Abstract
  • Kernel machine learning methods, such as the SNP-set kernel association test (SKAT), have been widely used to test associations between traits and genetic polymorphisms. In contrast to traditional single-SNP analysis methods, these methods are designed to examine the joint effect of a set of related SNPs (such as a group of SNPs within a gene or a pathway) and are able to identify sets of SNPs that are associated with the trait of interest. However, as with many multi-SNP testing approaches, kernel machine testing can draw conclusion only at the SNP-set level, and does not directly inform on which one(s) of the identified SNP set is actually driving the associations. A recently proposed procedure, KerNel Iterative Feature Extraction (KNIFE), provides a general framework for incorporating variable selection into kernel machine methods. In this article, we focus on quantitative traits and relatively common SNPs, and adapt the KNIFE procedure to genetic association studies and propose an approach to identify driver SNPs after the application of SKAT to gene set analysis. Our approach accommodates several kernels that are widely used in SNP analysis, such as the linear kernel and the Identity by State (IBS) kernel. The proposed approach provides practically useful utilities to prioritize SNPs, and fills the gap between SNP set analysis and biological functional studies. Both simulation studies and real data application are used to demonstrate the proposed approach.
Author Notes
  • Address for Correspondence: Qianchuan He, Ph.D and Michael C. Wu, Ph.D, Public Health Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, Phone: (206) 667-7068, (206) 667-6603, mcwu@fredhutch.org, ghe@fredhutch.org
Keywords
Research Categories
  • Health Sciences, Epidemiology
  • Health Sciences, Public Health
  • Biology, Genetics

Tools

Relations

In Collection:

Items