Evaluating The Performance of Machine Learning Models in Audit Opinion Prediction – A Study in Vietnam
Downloads
This study investigates the effectiveness of machine learning models in predicting audit opinions using a dataset from the FiinPro-X platform, comprising 9,783 audited consolidated financial statements from public companies listed on Vietnamese stock exchanges from 2016 to 2023. The dataset spans various industries, excluding banks and financial institutions, and focuses on identifying key financial, non-financial, and qualitative variables that influence audit opinions. Six supervised learning algorithms were applied—Logistic Regression, K-Nearest Neighbors (KNN), Decision Trees, Random Forests, Support Vector Machines (SVM), and Naive Bayes—evaluated based on their ability to predict both fully acceptable (unqualified) and non-fully acceptable audit opinions. All data processing and model training were implemented in a Python environment. The Random Forest model demonstrated the best overall performance, achieving an accuracy of 0.868 and an AUC-ROC of 0.87, though its F1 score for predicting non-fully acceptable audit opinions was lower (0.585). This suggests that while machine learning models can improve prediction accuracy, challenges remain in handling imbalanced data and non-linear relationships among input variables. The study also reduced the number of features by 30%, improving the models’ performance. Future research should further refine data and feature construction processes to ensure comparability and practical applicability.
Barr-Pulliam, D., Brown-Liburd, H. L., & Munoko, I. (2022). The Effects of Person-specific, Task, and Environmental Factors on Digital Transformation and Innovation in Auditing: A Review of the Literature. Journal of International Financial Management & Accounting, 33 (2):, 33(2), 337-374.e3
IAASB. (2022). Handbook of International Quality Management, Auditing, Review, Other Assurance,and Related Services Pronouncements. Retrieved from https://www.iaasb.org/publications/2022-handbook-international-quality-management-auditing-review-other-assurance-and-related-services
Kirkos, E., Spathis, C., Nanopoulos, A., & Manolopoulos, Y. (2007). Identifying Qualified Auditors’ Opinions: A Data Mining Approach. Journal of Emerging Technologies in Accounting, 4, 183-197. doi:10.2308/jeta.2007.4.1.183
Pham Thi Thu Oanh, Dang Ngoc Hung & Vu Thi Thuy Van. (2024). Forecasting audit opinions on financial statements: statistical algorithm or machine learning. Electronic Journal of Applied Statistical Analysis, 17(01), 133-152. doi:10.1285/i20705948v17n1p133
Saif, S. M., Sarikhani, M., & Ebrahimi, F. (2012). Finding rules for audit opinions prediction through data mining methods. European Online Journal of Natural and Social Sciences, 1(2), 28-36.
Sánchez-Serrano, J. R., Alaminos, D., & García-Lagos, F. (2020). Predicting Audit Opinion in Consolidated Financial Statements with Artificial Neural Networks. Mathematics, 8(1288). doi:10.3390/math8081288
Stanišić, N., Radojević, T., & Stanić, N. (2019). Predicting the Type of Auditor Opinion: Statistics, Machinelearning, or a Combination of the Two? 16(2), 1-58.
Yaşar, A., Yakut, E., & Gutnu, M. M. (2015). Predicting Qualified Audit Opinions Using Financial Ratios: Evidence from the Istanbul Stock Exchange. International Journal of Business and Social Science, 6(8(1)), 57-67.
Zarei, H., Yazdifar, H., Ghaleno, M. D., & Azhmaneh, R. (2020). Predicting auditors’ opinions using financial ratios and non-financial metrics: evidence from Iran. Journal of Accounting in Emerging Economies, 10(3), 425-446. doi:10.1108/JAEE-03-2018-0027
Zeng, S., Li, Y., & Li, Y. (2022). Research on Audit Opinion Prediction of Listed Companies Based on Sparse Principal Component Analysis and Kernel Fuzzy Clustering Algorithm. Mathematical Problems in Engineering. doi:10.1155/2022/4053916