Classification of Stunting Prevalence Levels in Indonesia

Authors

  • Dewi Fadriani Sebelas Maret University image/svg+xml Author
  • Dewi Retno Sari Saputro Universitas Sebelas Maret, Surakarta, Indonesia Author
  • Nughthoh Arfawi Kurdhi Universitas Sebelas Maret, Surakarta, Indonesia Author

Keywords:

Dimension Reduction, Classification, Autoencoder, K-Nearest Neighbors, Stunting

Abstract

Stunting is a problem of chronic malnutrition in children that affects their productivity in adults, cognitive development, and learning capacity. The high prevalence of stunting in Indonesia is caused by various health, social, and environmental factors that are different in each region, making it necessary to classify regions to identify areas with a high risk of stunting. Data with many variables present high-dimensionality issues that can reduce the accuracy of distance-based classification due to the curse of dimensionality. This study applies Autoencoder and K-Nearest Neighbors (KNN) on stunting data in Indonesia. The Autoencoder is used to reduce data dimensions through a process of reduction and reconstruction, thereby generating latent features for KNN input. The results show that the Autoencoder reduces 11 input variables to 4 latent features. KNN classification with K=7 categorizes provinces into 12 provinces with medium risk, 21 provinces with high risk, and 3 provinces with very high risk, with an accuracy of 75%. This study showed that dimensionality reduction using an Autoencoder improves data representation, enabling KNN to classify stunting more effectively

References

[1] W. Yanti Sihotang et al., “Determinants of stunting in children under five: a scoping review,” 2023.

[2] Kemenkes RI, “SSGI 2024 DALAM ANGKA,” Jakarta, 2025.

[3] A. N. Widya, B. R. Samudro, and E. Gravitiani, “Economics Development Analysis Journal Stunting In Java Island: Spatial and Risk Factor Analysis,” / Economics Development Analysis Journal, vol. 13, no. 2, 2024, [Online]. Available: http://journal.unnes.ac.id/sju/index.php/edaj

[4] A. A. Soofi and A. Awan, “Classification Techniques in Machine Learning: Applications and Issues,” Journal of Basic & Applied Sciences, vol. 13, pp. 459–465, 2017.

[5] J. Salvador-Meneses, Z. Ruiz-Chavez, and J. Garcia-Rodriguez, “Compressed kNN: K-nearest neighbors with data compression,” Entropy, vol. 21, no. 3, Mar. 2019, doi: 10.3390/e21030234.

[6] “Review of Literature on Improving the KNN Algorithm,” Transactions on Machine Learning and Artificial Intelligence, vol. 11, no. 3, Jun. 2023, doi: 10.14738/tecs.113.14768.

[7] P. Cunningham and S. J. Delany, “k-Nearest Neighbour Classifiers: 2nd Edition (with Python examples),” Apr. 2020, doi: 10.1145/3459665.

[8] N. Ukey, Z. Yang, B. Li, G. Zhang, Y. Hu, and W. Zhang, “Survey on Exact kNN Queries over High-Dimensional Data Space,” Jan. 01, 2023, MDPI. doi: 10.3390/s23020629.

[9] A. A. Amer, S. D. Ravana, and R. A. A. Habeeb, “Effective k-nearest neighbor models for data classification enhancement,” J. Big Data, vol. 12, no. 1, Dec. 2025, doi: 10.1186/s40537-025-01137-2.

[10] S. Chen and W. Guo, “Auto-Encoders in Deep Learning—A Review with New Perspectives,” Apr. 01, 2023, MDPI. doi: 10.3390/math11081777.

[11] K. Berahmand, F. Daneshfar, E. S. Salehi, Y. Li, and Y. Xu, “Autoencoders and their applications in machine learning: a survey,” Artif. Intell. Rev., vol. 57, no. 2, Feb. 2024, doi: 10.1007/s10462-023-10662-6.

[12] S. Syahrial, R. Ilham, Z. F. Asikin, and St. S. I. Nurdin, “Stunting Classification in Children’s Measurement Data Using Machine Learning Models,” Journal La Multiapp, vol. 3, no. 2, pp. 52–60, Mar. 2022, doi: 10.37899/journallamultiapp.v3i2.614.

[13] Syahrani Lonanf, “Klasifikasi Status Stunting Pada Balita Menggunakan K-Nearest Neighbor Dengan Feature Selection Backward Elimination,” Jurnal Media Informatika Budidarma, vol. 5, pp. 999–999, Mar. 2021, doi: 10.30865/mib.v5i1.2293.

[14] F. J. Pulgar, F. Charte, A. J. Rivera, and M. J. Del Jesus, “AEkNN: An autoencoder kNN–based classifier with built-in dimensionality reduction,” International Journal of Computational Intelligence Systems, vol. 12, no. 1, pp. 436–452, Nov. 2018, doi: 10.2991/ijcis.2018.125905686.

[15] K. Berahmand, F. Daneshfar, E. S. Salehi, Y. Li, and Y. Xu, “Autoencoders and their applications in machine learning: a survey,” Artif. Intell. Rev., vol. 57, no. 2, Feb. 2024, doi: 10.1007/s10462-023-10662-6.

[16] C. Park, J. Lee, Y. Kim, J. G. Park, H. Kim, and D. Hong, “An Enhanced AI-Based Network Intrusion Detection System Using Generative Adversarial Networks,” IEEE Internet Things J., vol. 10, no. 3, pp. 2330–2345, Feb. 2023, doi: 10.1109/JIOT.2022.3211346.

[17] J. P. Roche, O. Niggemann, and J. Friebe, “Using Autoencoders and Automatic Differentiation to Reconstruct Missing Variables in a Set of Time Series,” SN Comput. Sci., vol. 6, no. 4, Apr. 2025, doi: 10.1007/s42979-025-03798-5.

[18] M. Abu, B. Siddique, S. Sakib, and A. Rahman, “Performance Analysis of Deep Autoencoder and NCA Dimensionality Reduction Techniques with KNN, ENN and SVM Classifiers.”

[19] M. Wildi, “Sign Accuracy, Mean-Squared Error and the Rate of Zero Crossings: a Generalized Forecast Approach,” Jan. 2026, [Online]. Available: http://arxiv.org/abs/2601.06547

[20] M. Garouani, A. Ahmad, M. Bouneffa, and M. Hamlich, “Autoencoder-kNN meta-model based data characterization approach for an automated selection of AI algorithms,” J. Big Data, vol. 10, no. 1, Dec. 2023, doi: 10.1186/s40537-023-00687-7.

[21] A. F. Pulungan, M. Zarlis, and S. Suwilo, “Analysis of Braycurtis, Canberra and Euclidean Distance in KNN Algorithm,” SinkrOn, vol. 4, no. 1, p. 74, Sep. 2019, doi: 10.33395/sinkron.v4i1.10207.

[22] R. Rahim and A. S. Ahmar, “Cross-Validation and Validation Set Methods for Choosing K in KNN Algorithm for Healthcare Case Study,” JINAV: Journal of Information and Visualization, vol. 3, no. 1, pp. 57–61, Jul. 2022, doi: 10.35877/454ri.jinav1557.

[23] S. Uddin, I. Haque, H. Lu, M. A. Moni, and E. Gide, “Comparative performance analysis of K-nearest neighbour (KNN) algorithm and its different variants for disease prediction,” Sci. Rep., vol. 12, no. 1, Dec. 2022, doi: 10.1038/s41598-022-10358-x.

[24] S. Sathyanarayanan, “Confusion Matrix-Based Performance Evaluation Metrics,” African Journal of Biomedical Research, pp. 4023–4031, Nov. 2024, doi: 10.53555/ajbr.v27i4s.4345.

[25] G. Zeng, “Invariance Properties and Evaluation Metrics Derived from the Confusion Matrix in Multiclass Classification,” Mathematics, vol. 13, no. 16, Aug. 2025, doi: 10.3390/math13162609.

Downloads

Published

2026-05-08

How to Cite

Classification of Stunting Prevalence Levels in Indonesia . (2026). Proceeding International Conference on Multidisciplinary Engagement, 1(1), 423-431. https://prosiding.gerakanedukasi.com/index.php/income/article/view/112

Most read articles by the same author(s)

Similar Articles

You may also start an advanced similarity search for this article.