Methods and Theory on Model Selection and Model Averaging

Speaker:Prof.Yuhong Yang(University of Minnesota)
Schedule:9:50-12:15,Sept.21,Sept.22,Sept.29,Oct.6,2022 (updated)
Venue:近春园西楼第三会议室Conference Room 3,Jin Chun Yuan West Bldg.;Zoom Meeting ID: 271 534 5558 Passcode: YMSC


Model selection and its diagnosis are foundational elements in modern statistical and machine learning applications that serve the purpose of obtaining reliable information and reproducible results. In this short course, we introduce the principles and theories on model selection and model averaging and their applications in high-dimensional regression. Model selection methods include information criteria (AIC, BIC etc), cross validation, penalized regression (LASSO, SCAD, MCP) and more. We will learn to understand their differences, connections, performances, limitations, proper uses, and approaches to achieving the best performance without knowing which method is the best for the data at hand. In addition, we will study new tools to characterize model selection reliability. When model selection uncertainty is high, model averaging/combining typically offers more accurate prediction and more reliable conclusions. Theoretical results covered include model selection consistency, consistent cross validation, adaptive minimax optimal regression learning in high-dimensional regression, and optimalities of model averaging methods.