Heteroscedasticity Detection in Cross-Sectional Diabetes Pedigree Function: A Comparison of Breusch-Pagan-Godfrey, Harvey and Glejser Tests

Omotayo Oluwatosin ILORI1*, & Fatai Olalekan TANIMOWO2
1,*
Department of Mathematics and Statistics, Austin Peay State University, Clarksville, Tennessee, USA.
2Department of Mathematics and Statistics, Austin Peay State University, Clarksville, Tennessee, USA.
DOI –
http://doi.org/10.37502/IJSMR.2022.51211

Abstract

Diabetes is a serious defect that does not make the body to have enough insulin, and thereby allowing blood sugar to stay in the bloodstream more than the body requires and over time causes serious problems relating to health. So, predicting if a person has diabetes or not using the linear model surface, but a major challenge arises if there is heteroscedasticity in the model, which can make the least square estimates inefficient. So, there is a need to know the method that is best for detecting heteroscedasticity so as not to rely on inefficient model for predicting diabetes. This research therefore aimed at comparing the Breusch-Pagan-Godfrey (BPG), Harvey and Glejser tests for detecting heteroscedasticity in cross-sectional data. To achieve this, data were collected on Diabetes Pedigree Function (DPF), Plasma glucose concentration a 2 hours in an oral glucose tolerance test (G), 2-Hour serum insulin (mu U/ml) (I), and Triceps skin fold thickness (mm) (S) from National Institute of Diabetes and Digestive and Kidney Diseases (1990) comprising 768 observations. The data was divided into two, small sample and large sample. The result of the regression analysis showed that skin fold thickness is the most important factor that can predict diabetes in a patient, followed by plasma glucose concentration, and then by insulin. The result for heteroscedasticity showed that, heteroscedasticity is not present in small dataset using the three tests. However, for the large sample, both the Breusch-Pagan-Godfrey and Glejser detect heteroscedasticity, but Harvey did not. Hence, it is advisable to use either Breusch-pagan or Glejser tests because they are more sensitive to heteroscedasticity in diabetes patient data.

Keywords: Cross-sectional, Diabetes pedigree function, Heteroscedasticity, Ordinary Least square, Residual.

References

  • Abdulhadi, N., Al-Mousa, A. (2021). Diabetes Detection Using Machine Learning Classification Methods, International Conference on Information Technology (ICIT), 2021.
  • Alabi, O. O., Ayinde, K., Babalola, O. E., Bello, H. A., Okon, E. C. (2020). Effects of Multicollinearity on Type I Error of Some Methods of Detecting Heteroscedasticity in Linear Regression Model. Open Journal of Statistics, 2020.
  • Baltagi, B.H., (1980): “On seemingly unrelated regressions with error components”, Econometrica 48, Pp 1547-1551.
  • Breusch, T.S. and Pagan, A.R. (1979): “A simple Test for Heteroscedasticity and Random Coefficient Variation”. Econometrica, vol. 47, No. 5 (September, 1979).
  • Breusch, T. and Pagan, A.R (1980): “The LM Test and Its Applications to Model Specification in Econometrics”. Review of Economic Studies, 47, 1980, Pp 239-254.
  • Center for Disease Control and Prevention (2022). Available at https://www.cdc.gov/diabetes/basics/diabetes. Accessed on 11th November, 2022.
  • Deysi, G. (2022). Type 2 Diabetes among Adult Latinas Living in California, California State University, Fresno.
  • Ekum, M.I., Farinde, D. A. and Ayoola, F.J. (2013): “Panel Data: The Effects of Some World Development Indicator (WDI) on GDP Per Capita of Selected African Union (AU) Countries (1981-2011)”, International Journal of Science and Technology (IJST). Vol. 2 No. 12, December, 2013.
  • Ekum, M.I., Akinmoladun, O.M., Aderele, O.R. and Esan, O.A. (2015). Application of Multivariate Analysis on the effects of World Development Indicators on GDP per capita of Nigeria (1981-2013). International Journal of Science and Technology (IJST); Vol. 4, No. 12, December, 2015, Pp 254-534.
  • Glejser, H. (1969). A New Test for Heteroscedasticity. Journal of the American Statistical Association, 64 (325), 316-323.
  • Golfield, S.M. and Quandt, R.E. (1965). Some Test for Homoscedasticity. Journal of the American Statistical Association, 60, 539-547.
  • Harvey, A.C (1976). Estimating Regression Models with Multiplicative Heteroscedasticity. Econometrica, 44 (3), 461-465.
  • Hildreth, C. and Houck, J.P. (1968): Some Estimators for a Linear Model with Random Coefficients. Journal of the American Statistical Association, 63 (1968), Pp 584-595.
  • Iluno, C., Taylor, J. I., Akinmoladun, O. M., Aderele, O. R., and Ekum, M. I. (2021). Modelling the effect of Covid-19 mortality on the economy of Nigeria. Research in Globalization, 3, 100050.
  • Kennedy, P. (1998): A Guide to Econometrics. Chapters 5,6,7 and 9.
  • Koenker, R. (1981). A Note on Studentizing a Test for Heteroscedasticity. Journal of Econometrics, 17, 107-112.
  • Mittelhammer R. C., Judge, G. G., Miller, D. J. (2000). Econometric Foundations. Cambridge University Press, Cambridge.
  • National Institute of Diabetes and Digestive and Kidney Diseases (1990).
  • Okunnu, M. A., Ekum, M. I. and Aderele, O. R. (2017). The Effects of Microeconomic Indicators on Economic Growth of Nigeria (1970 – 2015). American Journal of Theoretical and Applied Statistics; 2017; 6(6): 325-334.
  • Wu Y, Ding Y, Tanaka Y, Zhang W (2014). Risk factors contributing to type 2 diabetes and recent advances in the treatment and prevention. Int J Med Sci. 11(11),1185-200. doi: 10.7150/ijms.10001.
  • Zou Q, Qu K, Luo Y, Yin D, Ju Y and Tang H (2018). Predicting Diabetes Mellitus With Machine Learning Techniques. Front. Genet. 9:515. doi: 10.3389/fgene.2018.00515.