用户名: 密码: 验证码:
An efficient weighted tag SNP-set analytical method in genome-wide association studies
详细信息    查看全文
  • 作者:Bin Yan (1)
    Shudong Wang (1) (2) (3)
    Huaqian Jia (1)
    Xing Liu (1)
    Xinzeng Wang (1)

    1. College of Mathematics and Systems Science
    ; Shandong University of Science and Technology ; Qingdao ; Shandong ; 266590 ; China
    2. College of Computer and Communication Engineering
    ; China University of Petroleum ; Qingdao ; Shandong ; 266580 ; China
    3. State Key Laboratory of Mining Disaster Prevention and Control Co-founded by Shandong Province and the Ministry of Science and Technology
    ; Shandong University of Science and Technology ; Qingdao ; Shandong ; 266590 ; China
  • 关键词:Association test ; GWAS ; Linkage disequilibrium ; SNP ; set ; Tag SNP
  • 刊名:BMC Genetics
  • 出版年:2015
  • 出版时间:December 2015
  • 年:2015
  • 卷:16
  • 期:1
  • 全文大小:976 KB
  • 参考文献:1. Dering, C, Hemmelmann, C, Pugh, E, Ziegler, A (2011) Statistical analysis of rare sequence variants: an overview of collapsing methods. Genet Epidemiol 35: pp. S12-7 CrossRef
    2. Sasieni, PD (1997) From genotypes to genes: doubling the sample size. Biometrics. 53: pp. 1253-61 CrossRef
    3. Wang R, Peng J, Wang P. SNP set analysis for detecting disease association using exon sequence data. BMC Proc. 2011;5 Suppl 9:S91.
    4. Hunter, DJ, Kraft, P, Jacobs, KB, Cox, DG, Yeager, M, Hankinson, SE (2007) A genome-wide association study identifies alleles in FGFR2 associated with risk of sporadic postmenopausal breast cancer. Nat Genet. 39: pp. 870-4 CrossRef
    5. Yeager, M, Orr, N, Hayes, RB, Jacobs, KB, Kraft, P, Wacholder, S (2007) Genome-wide association study of prostate cancer identifies a second risk locus at 8q24. Nat Genet. 39: pp. 645-9 CrossRef
    6. Hageman, GS, Anderson, DH, Johnson, LV, Hancox, LS, Taiber, AJ, Hardisty, LI (2005) A common haplotype in the complement regulatory gene factor H (HF1/CFH) predisposes individuals to age-related macular degeneration. Proc Natl Acad Sci U S A. 102: pp. 7227-32 CrossRef
    7. Moskvina, V, Schmidt, KM (2008) On multiple-testing correction in genome-wide association studies. Genetic epidemiology. Genet Epidemiol. 32: pp. 567-73 CrossRef
    8. Wu, MC, Kraft, P, Epstein, MP, Taylor, DM, Chanock, SJ, Hunter, DJ (2010) Powerful SNP-set analysis for case鈥揷ontrol genome-wide association studies. Am J Hum Genet. 86: pp. 929-42 CrossRef
    9. Schaid, DJ, McDonnell, SK, Hebbring, SJ, Cunningham, JM, Thibodeau, SN (2005) Nonparametric tests of association of multiple genes with human disease. Am J Hum Genet. 76: pp. 780-93 CrossRef
    10. Fan, R, Knapp, M (2003) Genome association studies of complex diseases by case鈥揷ontrol designs. Am J Hum Genet. 72: pp. 850-68 CrossRef
    11. Mukhopadhyay, I, Feingold, E, Weeks, DE (2010) Association tests using kernel-based measures of multi-locus genotype similarity between individuals. Genet Epidemiol. 34: pp. 213-21
    12. Wessel, J, Schork, NJ (2006) Generalized genomic distance鈥揵ased regression methodology for multilocus association analysis. Am J Hum Genet. 79: pp. 792-806 CrossRef
    13. Jin, L, Zhu, W, Yu, Y, Kou, C, Meng, X, Tao, Y (2014) Nonparametric tests of associations with disease based on U-statistics. Ann Hum Genet. 78: pp. 141-53 CrossRef
    14. Gauderman, WJ, Murcray, C, Gilliland, F, Conti, D (2007) Testing association between disease and multiple SNPs in a candidate gene. Genet Epidemiol. 31: pp. 383-95 CrossRef
    15. Cristianini, N, Shawe-Taylor, J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge university press, Cambridge, UK CrossRef
    16. Zhang, K, Deng, M, Chen, T, Waterman, MS, Sun, F (2002) A dynamic programming algorithm for haplotype block partitioning. Proc Natl Acad Sci. 99: pp. 7335-9 CrossRef
    17. Ke, X, Cardon, LR (2003) Efficient selective screening of haplotype tag SNPs. Bioinformatics. 19: pp. 287-8 CrossRef
    18. Stram, DO, Haiman, CA, Hirschhorn, JN, Altshuler, D, Kolonel, LN, Henderson, BE (2003) Choosing haplotype-tagging SNPS based on unphased genotype data using a preliminary sample of unrelated subjects with an example from the Multiethnic Cohort Study. Hum Hered. 55: pp. 27-36 CrossRef
    19. Haplotype tagging SNP (htSNP) selection in the Multiethnic Cohort Study [http://www-hsc.usc.edu/~stram/tagsnps.html]
    20. Hill, WG, Robertson, A (1968) Linkage disequilibrium in finite populations. Theor Appl Genet. 38: pp. 226-31 CrossRef
    21. Miller, R, Siegmund, D (1982) Maximally selected chi square statistics. Biometrics 38: pp. 1011-1016 CrossRef
    22. Hoeffding, W (1948) A class of statistics with asymptotically normal distribution. Ann Math Stat. 19: pp. 293-325 CrossRef
    23. Kimeldorf, G, Wahba, G (1971) Some results on Tchebycheffian spline functions. J Math Anal Appl. 33: pp. 82-95 CrossRef
    24. Liu, D, Ghosh, D, Lin, X (2008) Estimation and testing for the effect of a genetic pathway on a disease outcome using logistic kernel machine regression via logistic mixed models. BMC bioinf. 9: pp. 1-11 CrossRef
    25. Zhang, D, Lin, X (2003) Hypothesis testing in semiparametric additive mixed models. Biostatistics. 4: pp. 57-74 CrossRef
    26. Basile, VS, Ozdemir, V, Masellis, M, Meltzer, HY, Lieberman, JA, Potkin, SG (2001) Lack of association between serotonin-2A receptor gene (HTR2A) polymorphisms and tardive dyskinesia in schizophrenia. Mol Psychiatry. 6: pp. 230-4 CrossRef
    27. Frisch, A, Michaelovsky, E, Rockah, R, Amir, I, Hermesh, H, Laor, N (2000) Association between obsessive-compulsive disorder and polymorphisms of genes encoding components of the serotonergic and dopaminergic pathways. Eur Neuropsychopharmacol. 10: pp. 205-9 CrossRef
    A haplotype map of the human genome. Nature 437: pp. 1299-320 CrossRef
    28. UCSC Genome Bioinformatics website Illumina Human Hap 650v3 array [https://cgwb.nci.nih.gov/cgi-bin/hgTrackUi?g=snpArray]
    29. Su, Z, Marchini, J, Donnelly, P (2011) HAPGEN2: simulation of multiple disease SNPs. Bioinformatics. 27: pp. 2304-5 CrossRef
  • 刊物主题:Life Sciences, general; Animal Genetics and Genomics; Microbial Genetics and Genomics; Plant Genetics & Genomics; Genetics and Population Dynamics;
  • 出版者:BioMed Central
  • ISSN:1471-2156
文摘
Background Single-nucleotide polymorphism (SNP)-set analysis in Genome-wide association studies (GWAS) has emerged as a research hotspot for identifying genetic variants associated with disease susceptibility. But most existing methods of SNP-set analysis are affected by the quality of SNP-set, and poor quality of SNP-set can lead to low power in GWAS. Results In this research, we propose an efficient weighted tag-SNP-set analytical method to detect the disease associations. In our method, we first design a fast algorithm to select a subset of SNPs (called tag SNP-set) from a given original SNP-set based on the linkage disequilibrium (LD) between SNPs, then assign a proper weight to each of the selected tag SNP respectively and test the joint effect of these weighted tag SNPs. The intensive simulation results show that the power of weighted tag SNP-set-based test is much higher than that of weighted original SNP-set-based test and that of un-weighted tag SNP-set-based test. We also compare the powers of the weighted tag SNP-set-based test based on four types of tag SNP-sets. The simulation results indicate the method of selecting tag SNP-set impacts the power greatly and the power of our proposed method is the highest. Conclusions From the analysis of simulated replicated data sets, we came to a conclusion that weighted tag SNP-set-based test is a powerful SNP-set test in GWAS. We also designed a faster algorithm of selecting tag SNPs which include most of information of original SNP-set, and a better weighted function which can describe the status of each tag SNP in GWAS.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700