Multivariate Statistical Interpretation of Airborne Diseases using Principal Components Analysis
Downloads
Accurate and timely determination of relationships among communicable diseases is crucial in taking precautionary measures to control and prevent the transmission of infectious diseases. This will inform the government to make important policies that will help provide effective patient treatments. This study employed the principal components analysis (PCA) method to explore and interpret the relationship among airborne communicable diseases: Tuberculosis (TB), chickenpox, and measles based on a medical dataset obtained from Bibiani Government Hospital, Ghana. The Kaiser criterion was employed to determine a suitable number of principal components (PCs) to feature in the statistical analysis. A scree plot diagram was also used to affirm the number of PCs needed in the analysis. Projection of diseases on PC planes was also used to interpret the relationships among the diseases. From the study, statistical results revealed that the first principal component (PC1), second principal component (PC2), and third principal component (PC3) performed significantly well in the disease interpretation by explaining a total variation of 46.70994%, 30.61631%, and 22.67376, respectively of the useful information in the dataset. There were also marked strong correlations among the diseases concerning PCs. Due to the limited number of diseases considered, this study will serve as preliminary investigation on the use of PCA as a versatile and promising multivariate statistical technique that can be relied upon by public health experts and policymakers to interpret relationships among diseases and make informed decisions very well.
AHMAD, F. & DAR, W. M. (2018) Classification of alzheimer’s disease stages: an approach using PCA-based algorithm. American Journal of Alzheimer’s Disease and Other Dementias, 33, 433-439.
CHEN, X., YI, M. & HUANG, J. (2020) Application of a PCA-ANN-based cost prediction model for general aviation aircraft. IEEE Access, 8, 130124-130135, https://doi.org/10.1109/ACCESS.2020.3008442.
CHERRY, S. (1996) Some comments on singular value decomposition analysis. American. Meteorological Society, 10, 1759-1761.
COSTA, J. C. G. D., DA-SILVA, P. J. G., ALMEIDA, R. M. V. R. & INFANTOSI, A. F. C. (2014) Validation in principal components analysis applied to EEG data. Computational and Mathematical Methods in Medicine, 2014, 1-10. http://dx.doi.org/10.1155/2014/413801.
DASH, P., NAYAK, M. & DAS, G. P. (2014) Principal component analysis using singular value decomposition for image compression. International Journal of Computer Applications, 93(9), 1-8.
de-GRAFT AIKINS, A., ADDO, J., OFEI, F., BOSU, W. K. & AGYMANG, C. (2012) Ghana’s burden of chronic non-communicable diseases: future directions in research, practice and policy. Ghana Medical Journal, 46(2), 1-4
HUNTER, E., Namee, B. M. & Kelleher, J. D. (2020) A model for the spread of infectious diseases in a region. International Journal of Environmental Research and Public Health, 17, 3119, 1-19, https://doi.org/10.3390/ijerph17093119.
JANATI, A., HOSSEINY, M., GOUYA, M. M., MORADI, G. & GHADERI, E (2015) Communicable Disease Reporting Systems in the world: a systematic review article. Iran Journal Public Health, 44(11), 1453-1465.
KANYONGO, G. Y. (2005) Determining the correct number of components to extract from a principal components analysis: A Monte Carlo study of the accuracy of the scree plot. Journal of Modern Applied Statistical Methods: 4(1), 120-133, https://doi.org/10.22237/jmasm/1114906380.
KARAMIZADEH, S., ABDULLAH, S.M., MANAF, A. A. & ZAMANI, M. (2013) An overview of principal component analysis. Journal of Signal and Information Processing, 4, 173-175. https://doi.org/10.4236/jsip.2013.43B031.
KONATÈ, A. A., PAN, H., MA, H., CAO, X., ZIGGAH, Y. Y., OLOO, M. & KHAN, N. (2015) Application of dimensionality reduction technique to improve geophysical log data classification in crystalline rocks. Journal of Petroleum Science and Engineering, 133, 633-645.
LI, Y., LEUNG, G. M., TANG, J. W., YANG, X., CHAO, C. Y. H. & LI, J. Z. (2007) Role of ventilation in airborne transmission of infectious agents in the built environment – a multidisciplinary systematic review. Indoor Air, 17, 2-8. https://doi.org/10.1111_j.1600-0668.2006.00445.x.
LUO, Y., WU, J., LU, J., XU, X., LONG, W., YAN, G., TANG, M., Lou, Z., Xu, D., Zhou, P., SI, Q. & ZHENG, X. (2020) Investigation of COVID-19-related symptoms based on factor analysis. Annals of Palliative Medicine, 1-8, http://dx.doi.org/10.21037/.
MEGHRAOUI, D., BOUDRAA, B., MERAZI-MEKSEN, T. & BOUDRAA, M. (2016) Features dimensionality reduction and multi-dimensional voice processing program to Parkinson disease discrimination. Proceedings of 2016 4th International Conference on Control Engineering and Information Technology, 1-5.
METTLE, F. O., AFFI, P. O. & TWUMESI, C. (2020) Modeling the transmission dynamics of tuberculosis in the Ashanti region of Ghana. Interdisciplinary Perspectives on Infectious Diseases, 2020, 1-16, https://doi.org/10.1155/2020/4513854.
MOHAMMED, S., KHALID, A., OSMAN, S. E. & HELALI, R. G. M. (2016) Usage of principal component analysis (PCA) in AI applications. International Journal of Engineering Research and Technology, 5 (12), 372-375.
MUHAMMAD, M. U., JIADONG, R., Muhammad, N. S., Hussain, M. & Muhammad, I. (2019) Principal component analysis of categorized polytomous variable-based classification of diabetes and other chronic diseases. International Journal of Environmental Research and Public Health, 16, (3593), 1-15,
https://doi.org/10.3390/ijerph16193593.
NGO, A. N. & TURBOW, D. J. (2019) Principal component analysis of morbidity and mortality among the United States homeless population: a systematic review and meta-analysis. International Archives of Public Health and Community Medicine, 3(2), 1-9, https://doi.org/10.23937/2643-4512/1710025
ORUMIE, U. C. & OGBONNA, O. (2019) Principal component analysis and its derivation from singular value decomposition. International Journal of Statistics and Probability, 8(2), 183-191, https://doi.org/10.5539/ijsp.v8n2p183.
SALEM, N. & HUSSEIN, S. (2016) Data dimensional reduction and principal components analysis. Procedia Computer Science,163, 292–299.
SHARMA, S. K., GAJBHIYE, S. & TIGNATH, S. (2015) Application of principal component analysis in grouping geomorphic parameters of a watershed for hydrological modeling. Applied Water Science, 5(1), 89-96, https://doi.org/10.1007/s13201-014-0170-1.
SHILASKAR, S. & GHATOL, A. (2013) Dimensionality reduction techniques for improved diagnosis of heart disease. International Journal of Computer Applications, 61(5), 1-9.
WANG, X., PEI, T., LIU, Q., SONG, C., LIU, Y., CHEN, X., MA, J. & ZHANG, Z. (2020) Quantifying the time-lag effects of human mobility on the COVID-19 transmission: a multi-city study in China. IEEE Access, 8, 216752- 216761, https://doi.org/10.1109/ACCESS.2020.3038995.
WHITTAKER, E., LÓPEZ-VARELA, BRODERICK, C. & SEDDOB. J. A. (2019) Examining the complex relationship between tuberculosis and other infectious diseases in children. Frontiers in Pediatrics, 7(233),1-23, https://doi.org/10.3389/fped.2019.00233.
YADAV, R. & JAT, S. C. (2020) Feature selection and dimensionality reduction methods for chronic disease prediction. International Journal of Scientific and Technology Research, 9(4), 2912-2918.