Author (English): NIAN, YOU-REN
Thesis title (English): Linear regression model with skew normal distribution
Advisor (English): Su, Nan-Cheng
Oral defense committee (English): Chang, Sheng-Mao; Su, Nan-Cheng; Huang, Chia-Hui
Keywords (English): Skew normal distribution; Multivariate linear regression model; Least squares estimator; Method of moment estimator
Whether the data are independent is always in question. When the error terms do not independently follow a normal distribution, the traditional linear regression model is not appropriate for the data. To handle data that are dependent or no longer normally distributed, we replace the normal distribution with the skew normal distribution as the assumption on the errors. Under this assumption, we derive the method of moments estimator, the least squares estimator, and the maximum likelihood estimator, and we use these estimators to fit two data sets. Finally, we extend the linear regression model to the linear mixed model.
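To make the setting in the abstract concrete, the following is a minimal sketch, not the thesis's own code, of a simple linear regression with independent skew normal errors fitted by least squares. It assumes the SN(0, 3, 1) notation means location 0, scale 3, and shape 1, draws errors through Azzalini's stochastic representation, and uses a hypothetical uniform covariate and a large hypothetical sample size. The function names `rskewnormal` and `least_squares` are illustrative only. Because skew normal errors with nonzero shape have a nonzero mean, the least squares slope should stay close to the true value while the intercept absorbs the error mean.

```python
import math
import random


def rskewnormal(rng, location, scale, shape):
    """Draw one SN(location, scale, shape) variate via Azzalini's representation.

    Assumption: `scale` is the scale parameter omega itself, not omega squared.
    """
    delta = shape / math.sqrt(1.0 + shape * shape)
    u0 = abs(rng.gauss(0.0, 1.0))   # half-normal component carries the skewness
    u1 = rng.gauss(0.0, 1.0)        # independent symmetric component
    z = delta * u0 + math.sqrt(1.0 - delta * delta) * u1
    return location + scale * z


def least_squares(x, y):
    """Closed-form least squares estimates (intercept, slope) for y = b0 + b1*x."""
    n = len(x)
    xbar = sum(x) / n
    ybar = sum(y) / n
    sxx = sum((xi - xbar) ** 2 for xi in x)
    sxy = sum((xi - xbar) * (yi - ybar) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return ybar - slope * xbar, slope


rng = random.Random(2024)            # fixed seed so the run is repeatable
beta0, beta1 = 1.0, 2.0              # beta = (1, 2)', one of the listed simulation settings
scale, shape = 3.0, 1.0              # errors ~ SN(0, 3, 1) under the stated assumption
n = 20000                            # hypothetical large n, chosen only to make the shift visible

x = [rng.uniform(0.0, 10.0) for _ in range(n)]   # hypothetical covariate design
y = [beta0 + beta1 * xi + rskewnormal(rng, 0.0, scale, shape) for xi in x]

b0_hat, b1_hat = least_squares(x, y)

# Under SN(0, omega, lambda), E[error] = omega * delta * sqrt(2 / pi), which is nonzero.
delta = shape / math.sqrt(1.0 + shape ** 2)
error_mean = scale * delta * math.sqrt(2.0 / math.pi)

print(f"slope estimate:     {b1_hat:.3f} (true {beta1})")
print(f"intercept estimate: {b0_hat:.3f} (true {beta0}, true + E[error] = {beta0 + error_mean:.3f})")
```

In this sketch the intercept estimate targets the true intercept plus the error mean rather than the true intercept itself, which is one reason estimators that model the skewness explicitly, such as the method of moments or maximum likelihood, are of interest.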
1 Introduction
2 Linear Regression Model
2.1 Method of moment estimate (MME)
2.2 Method of Least Squares Estimate
3 Skew-Normal Distribution
3.1 The errors are correlated
3.2 The errors are independent
4 Simulation study
4.1 Normal case
4.2 Skew-normal case
4.3 Independent Skew-normal case
5 Example
5.1 Example 1: House prices in Iowa
5.2 Example 2: University Admissions
6 Conclusion
References

List of Figures
4.1 n = 50, ε ~iid N(0, 9), β = (1, 0.01)'
4.2 n = 50, ε ~iid N(0, 9), β = (0.01, 0.01)'
4.3 n = 100, ε ~ SN_100(0, 9 × I_100, 1), β = (1, 2)'
4.4 n = 100, ε ~ SN_100(0, 9 × I_100, 1), β = (1, 0.01)'
4.5 n = 20, ε ~iid SN(0, 3, 1), β = (0.01, 0.01)'
4.6 n = 20, ε ~iid SN(0, 3, 1), β = (1, 2)'
4.7 n = 50, ε ~iid SN(0, 3, 1), β = (0.01, 0.01)'
4.8 n = 50, ε ~iid SN(0, 3, 1), β = (1, 2)'
4.9 n = 100, ε ~iid SN(0, 3, 1), β = (0.01, 0.01)'
4.10 n = 100, ε ~iid SN(0, 3, 1), β = (1, 2)'
4.11 n = 300, ε ~iid SN(0, 3, 1), β = (0.01, 0.01)'
4.12 n = 300, ε ~iid SN(0, 3, 1), β = (1, 2)'
5.1 Residual diagnostics for the house data
5.2 Residual diagnostics for the GPA data

List of Tables
4.1 ε ~iid N(0, 9), β = (1, 2)'
4.2 ε ~iid N(0, 9), β = (1, 0.01)'
4.3 ANOVA test on simulation data with ε ~iid N(0, 9), β = (1, 2)', n = 300
4.4 ANOVA test on simulation data with ε ~iid N(0, 9), β = (1, 0.01)', n = 300
4.5 ε ~ SN_100(0, 9 × I_100, 1), β = (1, 2)'
4.6 ε ~ SN_100(0, 9 × I_100, 1), β = (1, 0.01)'
4.7 ANOVA test on simulation data with ε ~ SN_100(0, 9 × I_100, 1), β = (1, 2)', n = 300
4.8 ANOVA test on simulation data with ε ~ SN_100(0, 9 × I_100, 1), β = (1, 0.01)', n = 300
4.9 ε ~iid SN(0, 3, 1), β = (1, 2)'
4.10 ε ~iid SN(0, 3, 1), β = (1, 0.01)'
4.11 ANOVA test on simulation data with ε ~iid SN(0, 3, 1), β = (1, 2)', n = 300
4.12 ANOVA test on simulation data with ε ~iid SN(0, 3, 1), β = (1, 0.01)', n = 300
5.1 Parameter estimates of house prices data
5.2 Parameter estimates of university admissions data
