Robust spline-based variable selection in varying coefficient model
详细信息    查看全文
  • 作者:Long Feng (1)
    Changliang Zou (1)
    Zhaojun Wang (1)
    Xianwu Wei (1)
    Bin Chen (2)

    1. LPMC and Institute of Statistics
    ; Nankai University ; Tianjin聽 ; 300071 ; China
    2. School of Mathematics and Statistics
    ; Jiangsu Normal University ; Xuzhou ; Jiangsu ; China
  • 关键词:KLASSO ; Oracle property ; Polynomial spline ; Rank regression ; Robust estimation ; Robust model selection ; SCAD
  • 刊名:Metrika
  • 出版年:2015
  • 出版时间:January 2015
  • 年:2015
  • 卷:78
  • 期:1
  • 页码:85-118
  • 全文大小:475 KB
  • 参考文献:1. Chiang CT, Rice JA, Wu CO (2001) Smoothing spline estimation for varying coefficient models with repeatedly measured dependent variables. J Am Stat Assoc 96:605鈥?19 CrossRef
    2. Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348鈥?360 CrossRef
    3. Fan J, Zhang W (1999) Statistical estimation in varying-coefficient models. Ann Stat 27:1491鈥?518 CrossRef
    4. Fan J, Zhang W (2008) Statistical methods with varying coefficient models. Stat Interface 1:179鈥?95 CrossRef
    5. Hastie TJ, Tibshirani RJ (1993) Varying-coefficient models. J R Stat Soc Ser B 55:757鈥?96 (with discussion)
    6. Hettmansperger TP, McKean JW (1998) Robust nonparametric statistical methods. Arnold, London
    7. Hoover DR, Rice JA, Wu CO, Yang L-P (1998) Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data. Biometrika 85:809鈥?22 CrossRef
    8. Huang JZ, Shen H (2004) Functional coefficient regression models for nonlinear time series: a polynomial spline approach. Scand J Statist 31:515鈥?34 CrossRef
    9. Huang JZ, Wu CO, Zhou L (2002) Varying-coefficient models and basis function approximations for the analysis of repeated measurements. Biometrika 89:111鈥?28 CrossRef
    10. Huang JZ, Wu CO, Zhou L (2004) Polynomial spline estimation and inference for varying coefficient models with longitudinal data. Stat Sinica 14:763鈥?88
    11. Jaeckel LA (1972) Estimating regression coefficients by minimizing the dispersion of residuals. Ann Math Stat 43:1449鈥?458 CrossRef
    12. Kauermann G, Tutz G (1999) On model diagnostics using varying coefficient models. Biometrika 86:119鈥?28 CrossRef
    13. Koul HL, Sievers GL, McKean JW (1987) An estimator of the scale parameter for the rank analysis of linear models under general score functions. Scand J Stat 14:131鈥?41
    14. Kim M-O (2007) Quantile regression with varying coefficients. Ann Stat 35:92鈥?08 CrossRef
    15. Leng C (2009) A simple approach for varying-coefficient model selection. J Stat Plan Inferface 139:2138鈥?146 CrossRef
    16. Leng C (2010) Variable selection and coefficient estimation via regularized rank regression. Stat Sinica 20:167鈥?81
    17. Li R, Liang H (2008) Variable selection in semi-parametric regression model. Ann Stat 36:261鈥?86 CrossRef
    18. Lin Y, Zhang HH (2006) Component selection and smoothing in smoothing spline analysis of variance models. Ann Stat 34:2272鈥?297 CrossRef
    19. Sievers G, Abebe A (2004) Rank estimation of regression coefficients using iterated reweighted least squares. J Stat Comput Simul 74:821鈥?31 CrossRef
    20. Tang Y, Wang H, Zhu Z, Song X (2012) A unified variable selection approach for varying coefficient models. Stat Sinica 22:601鈥?28 CrossRef
    21. Terpstra J, McKean J (2005) Rank-based analysis of linear models using R. J Stat Softw 14:1鈥?6
    22. Tibshirani RJ (1996) Regression shrinkage and selection via the LASSO. J R Stat Soc Ser B 58:267鈥?88
    23. Wang L, Chen G, Li H (2007) Group scad regression analysis for microarray time course gene expression data. Bioinformatics 23:1486鈥?494 CrossRef
    24. Wang L, Kai B, Li R (2009) Local rank inference for varying coefficient models. J Am Stat Assoc 104:1631鈥?645 CrossRef
    25. Wang L, Li H, Huang J (2008) Variable selection in nonparametric varying-coefficient models for analysis of repeated measurements. J Am Stat Assoc 103:1556鈥?569 CrossRef
    26. Wang L (2009) Wilcoxon-type generalized Bayesian information criterion. Biometrika 96:163鈥?73 CrossRef
    27. Wang L, Li R (2009) Weighted Wilcoxon-type smoothly clipped absolute deviation method. Biometrics 65:564鈥?71 CrossRef
    28. Wang H, Li G, Jiang G (2007) Robust regression shrinkage and consistent variable selection via the LAD-LASSO. J Bus Econ Stat 25:347鈥?55 CrossRef
    29. Wang H, Xia Y (2009) Shrinkage estimation of the varying coefficient model. J Am Stat Assoc 104:747鈥?57 CrossRef
    30. Wu CO, Chiang C-T, Hoover DR (1998) Asymptotic confidence regions for kernel smoothing of a varying-coefficient model with longitudinal data. J Am Stat Assoc 93:1388鈥?402 CrossRef
    31. Zhao P, Xue L (2009) Variable selection for semiparametric varying coefficient partially linear models. Stat Prob lett 79:2148鈥?157 CrossRef
    32. Zou H (2006) The adaptive LASSO and its oracle properties. J Am Stat Assoc 101:1418鈥?429 CrossRef
    33. Zou H, Yuan M (2008) Composite quantile regression and the oracle model selection theory. Ann Stat 36:1108鈥?126 CrossRef
  • 刊物类别:Mathematics and Statistics
  • 刊物主题:Statistics
    Statistics
    Statistics for Business, Economics, Mathematical Finance and Insurance
    Probability Theory and Stochastic Processes
    Economic Theory
  • 出版者:Physica Verlag, An Imprint of Springer-Verlag GmbH
  • ISSN:1435-926X
文摘
The varying coefficient model is widely used as an extension of the linear regression model. Many procedures have been developed for the model estimation, and recently efficient variable selection procedures for the varying coefficient model have been proposed as well. However, those variable selection approaches are mainly built on the least-squares (LS) type method. Although the LS method is a successful and standard choice in the varying coefficient model fitting and variable selection, it may suffer when the errors follow a heavy-tailed distribution or in the presence of outliers. To overcome this issue, we start by developing a novel robust estimator, termed rank-based spline estimator, which combines the ideas of rank inference and polynomial spline. Furthermore, we propose a robust variable selection method, incorporating the smoothly clipped absolute deviation penalty into the rank-based spline loss function. Under mild conditions, we theoretically show that the proposed rank-based spline estimator is highly efficient across a wide spectrum of distributions. Its asymptotic relative efficiency with respect to the LS-based method is closely related to that of the signed-rank Wilcoxon test with respect to the t test. Moreover, the proposed variable selection method can identify the true model consistently, and the resulting estimator can be as efficient as the oracle estimator. Simulation studies show that our procedure has better performance than the LS-based method when the errors deviate from normality.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700