Expanding the Horizons of Principal Component Analysis: Versatile Applications from Environmental Monitoring to Chemometrics
Downloads
Principal Component Analysis (PCA) is a popular statistical method for large dataset analysis and has established itself in many fields, ranging from environmental modeling to spectroscopy. In this work, we bring to light its use in monitoring NO2 air pollution from satellite data during the lockdown periods of COVID-19, incorporating the strengths of Weighted PCA and Rescaled PCA to achieve improved predictability. Additionally, PCA has been applied in resonant ultrasound spectroscopy to optimize measurement points, especially in samples with complex geometries, demonstrating its effectiveness in reducing data points of collection while maintaining accuracy. Further, PCA's application coupled with classification methods like LDA and SVM has effectively determined the geographic origin of Indonesian coconuts, demonstrating its effectiveness in enhancing classification accuracy and chemometric analysis. The versatility of PCA is also evidenced in its use in clustering high-dimensional data through Adaptive Local PCA, which employs a neural network-based approach with adaptive learning rates to enhance clustering quality in dynamic data environments. These examples show the flexibility and utility of PCA in big data analysis across different fields, and a greater application of PCA in gaseous pollutant analysis and other complex data issues is needed.
“, Ionel Haidu,” no. 2, 2023.
L. Beardslee, P. Shokouhi, and T. J. Ulrich, “Optimal measurement point selection for resonant ultrasound spectroscopy of complex-shaped specimens using principal component analysis,” NDT and E International, vol. 141, no. October 2023, p. 103000, 2024, doi: 10.1016/j.ndteint.2023.103000.
R. Hayati, A. A. Munawar, E. Lukitaningsih, N. Earlia, T. Karma, and R. Idroes, “Combination of PCA with LDA and SVM classifiers: A model for determining the geographical origin of coconut in the coastal plantation, Aceh Province, Indonesia,” Case Studies in Chemical and Environmental Engineering, vol. 9, no. August 2023, p. 100552, 2024, doi: 10.1016/j.cscee.2023.100552.
N. Migenda, R. Möller, and W. Schenck, “Adaptive local Principal Component Analysis improves the clustering of high-dimensional data,” Pattern Recognit, vol. 146, no. October 2023, p. 110030, 2024, doi: 10.1016/j.patcog.2023.110030.
F. R. Tato and I. M. Ibrahim, “Bio-Inspired Algorithms in Healthcare,” vol. 07, no. 02, pp. 233–239, 2024.
A. C. Ikegwu, H. F. Nweke, E. Mkpojiogu, C. V. Anikwe, S. A. Igwe, and U. R. Alo, “Recently emerging trends in big data analytic methods for modeling and combating climate change effects,” Energy Informatics, vol. 7, no. 1, 2024, doi: 10.1186/s42162-024-00307-5.
L. Jiasen, W. X. An, L. Guofeng, Y. Dan, and Z. Jindan, “Improved secure PCA and LDA algorithms for intelligent computing in IoT-to-cloud setting,” Comput Intell, vol. 40, no. 1, 2024, doi: 10.1111/coin.12613.
A. Ciacci and L. Penco, “Business model innovation: harnessing big data analytics and digital transformation in hostile environments,” Journal of Small Business and Enterprise Development, vol. 31, no. 8, pp. 22–46, 2023, doi: 10.1108/JSBED-10-2022-0424.
G. T. Reddy et al., “Analysis of Dimensionality Reduction Techniques on Big Data,” IEEE Access, vol. 8, pp. 54776–54788, 2020, doi: 10.1109/ACCESS.2020.2980942.
T. Zhang, J. Wang, Q. Ma, and L. Fu, “Improving the Detection Effect of Long-Baseline Lightning Location Networks Using PCA and Waveform Cross-Correlation Methods,” Remote Sens (Basel), vol. 16, no. 5, 2024, doi: 10.3390/rs16050885.
T. E. Beavers et al., “Data Nuggets: A Method for Reducing Big Data While Preserving Data Structure,” Journal of Computational and Graphical Statistics, pp. 1–31, 2024, doi: 10.1080/10618600.2024.2341896.
M. Faaique, “Overview of Big Data Analytics in Modern Astronomy,” International Journal of Mathematics, Statistics, and Computer Science, vol. 2, pp. 96–113, 2023, doi: 10.59543/ijmscs.v2i.8561.
Y. Xie, T. Wang, Y. S. Jeong, A. Tosyali, and M. K. Jeong, “True sparse PCA for reducing the number of essential sensors in virtual metrology,” Int J Prod Res, vol. 62, no. 6, pp. 2142–2157, 2024, doi: 10.1080/00207543.2023.2217282.
“Eido1822025AJRCOS129749 نسخة النشر.pdf.”
B. S. Ali et al., ICS-IDS: application of big data analysis in AI-based intrusion detection systems to identify cyberattacks in ICS networks, vol. 80, no. 6. Springer US, 2024. doi: 10.1007/s11227-023-05764-5.
X. Li and Y. S. Lee, “Customer Segmentation Marketing Strategy Based on Big Data Analysis and Clustering Algorithm,” Journal of Cases on Information Technology, vol. 26, no. 1, pp. 1–16, 2024, doi: 10.4018/JCIT.336916.
R. Avdal and H. Maseeh, “Advancing Cybersecurity through Machine Learning : Bridging Gaps , Overcoming Challenges , and Enhancing Protection,” vol. 18, no. 2, pp. 206–217, 2025.
L. Jin et al., “Big data, machine learning, and digital twin assisted additive manufacturing: A review,” Mater Des, vol. 244, no. June, p. 113086, 2024, doi: 10.1016/j.matdes.2024.113086.
P. Manoharan and K. D. Kokkotas, “Finding universal relations using statistical data analysis,” Physical Review D, vol. 109, no. 10, pp. 1–20, 2024, doi: 10.1103/PhysRevD.109.103033.
Zhang. Wang, C., Zhang, Y., Ding, H., “Applied Mathematics and Nonlinear Sciences,” Applied Mathematics and Nonlinear Sciences, vol. 8, no. 2, pp. 3383–3392, 2023.
Pereza, “Pr ep rin t n ot pe er r ev Pr ep ot pe er”.
Baidya, “Pr ep rin t n ot pe er r iew Pr ep t n ot pe er ed”.
O. Christensen, A. Bagger, and J. Rossmeisl, “The Missing Link for Electrochemical CO2 Reduction: Classification of CO vs HCOOH Selectivity via PCA, Reaction Pathways, and Coverage Analysis,” ACS Catal, vol. 14, no. 4, pp. 2151–2161, 2024, doi: 10.1021/acscatal.3c04851.
K. Mehmood, F. Jabeen, M. Rashid, S. M. Alshibani, A. Lanteri, and G. Santoro, “Unraveling the transformation: the three-wave time-lagged study on big data analytics, green innovation and their impact on economic and environmental performance in manufacturing SMEs,” European Journal of Innovation Management, 2024, doi: 10.1108/EJIM-10-2023-0903.
D. C. Wu, S. Zhong, J. Wu, and H. Song, “Tourism and Hospitality Forecasting With Big Data: A Systematic Review of the Literature,” Journal of Hospitality and Tourism Research, 2024, doi: 10.1177/10963480231223151.
Y. Zhao and G. Liang, “A study on the innovative model of foreign language teaching in universities using big data corpus,” Journal of Computational Methods in Sciences and Engineering, vol. 24, no. 1, pp. 87–103, 2024, doi: 10.3233/JCM-237113.
R. Sharma and S. Gurung, “Implementing Big Data Analytics and Machine Learning for Predictive Maintenance in Manufacturing Facilities in South Korea,” AI, IoT and the Fourth Industrial Revolution …, vol. 14, no. 1, 2024, [Online]. Available: https://scicadence.com/index.php/AI-IoT-REVIEW/article/view/36%0Ahttps://scicadence.com/index.php/AI-IoT-REVIEW/article/download/36/34
G. Gandaglia et al., “Clinical Characterization of Patients Diagnosed with Prostate Cancer and Undergoing Conservative Management: A PIONEER Analysis Based on Big Data,” Eur Urol, vol. 85, no. 5, pp. 457–465, 2024, doi: 10.1016/j.eururo.2023.06.012.
J. Nyangon and R. Akintunde, “Principal component analysis of day-ahead electricity price forecasting in CAISO and its implications for highly integrated renewable energy markets,” Wiley Interdiscip Rev Energy Environ, vol. 13, no. 1, 2024, doi: 10.1002/wene.504.
A. Mukherjee, S. Abraham, A. Singh, S. Balaji, and K. S. Mukunthan, “From Data to Cure: A Comprehensive Exploration of Multi-omics Data Analysis for Targeted Therapies,” Mol Biotechnol, no. 0123456789, 2024, doi: 10.1007/s12033-024-01133-6.
J. Li, M. S. Othman, H. Chen, and L. M. Yusuf, “Optimizing IoT intrusion detection system: feature selection versus feature extraction in machine learning,” J Big Data, vol. 11, no. 1, 2024, doi: 10.1186/s40537-024-00892-y.
Z. Wang et al., “Ultra-Short-Term Offshore Wind Power Prediction Based on PCA-SSA-VMD and BiLSTM,” Sensors, vol. 24, no. 2, 2024, doi: 10.3390/s24020444.