In general, high-dimensional data often display heterogeneity. If the heteroscedasticity is neglected in the regression model, it will produce inefficient inferences of the regression coefficients. Quantile regression is not only robust to outliers, but also accommodates the heterogeneity. We hence apply multiple quantiles regression to identify the heteroscedasticity, seek for common features of quantile coefficients and eliminate irrelevant variables simultaneously based on the regularized approach. We also give the theoretical properties of the proposed method under some regularity conditions. Simulation studies are conducted to illustrate the finite sample performance of the proposed method, implying that our method is able to identify the covariates that affect the variance of the response. In addition, we apply the proposed method to analyze Boston house data, and find that this data set contains heteroscedasticity, which is caused by the variables of rm, dis, rad and ptratio.
会议ID:478 492 996
王明秋,副教授,硕士生导师。现为统计学院应用统计系主任,中国现场统计研究会数据科学与人工智能分会理事。研究兴趣包括稳健估计、非参数统计推断、变量选择、高维数据分析等。先后多次前往香港大学、南方科技大学进行学术访问。主持或参与国家自然科学基金4项、省部级项目3项。在国内外学术刊物上Journal of Statistical Planning and Inference、Journal of Nonparametric Statistics、Acta Mathematica Sinica-English Series等发表论文30余篇。先后被评为曲阜师范大学师德先进个人、曲阜师范大学优秀教师。