The predictive model of higher education guidance using integrated techniques for imbalanced data of learner groups

Main Article Content

Atsawin Surawatchayotin
Worapat Paireekreng


An inappropriate selection of higher education entrance came from insufficient experience of student and in-depth knowledge of the subject area. Most of students rely on social norms and values to make a decision without consideration of individual skill. Therefore, in order to select the appropriate subject area, the integrated prediction model of higher education entrance based on student’s multiple-intelligence and skills is necessary. According to the research, there is the imbalance data of subject areas. Hence, the integrated techniques which combined Single Imputation, SMOTE, and feature Selection were used to create prediction model. It also implemented Bagging, Boosting and Stacking in the experiments to address the problems. The experimental results found that the most importance skill using the stacking technique appeared to subject area of mathematical logic. There is an accuracyrate of 77% performed better which is compared to other techniques. This subject is the fundamental of other knowledge areas. It affects the admission to higher education.


Download data is not yet available.

Article Details

How to Cite
A. Surawatchayotin and W. Paireekreng, “The predictive model of higher education guidance using integrated techniques for imbalanced data of learner groups”, JIST, vol. 11, no. 1, pp. 65-74, Jun. 2021.
Research Article: Soft Computing (Detail in Scope of Journal)


[1] I. Tongsamsi, "Labor Market and Educational Mismatch A Case Study of New Community Development Workers," presented at The 10th Hatyai National and International Conference, Songkla, Thailand, 2019.

[2] S. Jantapong, "The Skills Mismatch and the Challenges to Education 4.0," MER 2018, Bangkok, Thailand, 2018.

[3] P. Kanjanasamranwong, "THE NEEDS OF ATTENDING IN GRADUATE INSTITUTE OF GRADE 12," Journal of Education Research Faculty of Education, vol.11, no.1, pp. 75-88, 2018.


[5] N. Supanatsetakul, "Interdisciplinary studies: humanities, social sciences and medicine for applying in the medical instruction," AMJAM , vol. 13, no. 3, pp. 380-392, 2013.

[6] Office of the Higher Education Commission, "New student information," OHEC, 2020. [Online]. Available: [Accessed: Jan.12, 2021].

[7] A. Phaeobang, "Adjusting the Imbalanced Data with 5 Classification Methods," Thai Journal of Science and Technology, vol. 9, no.4, July - August, 2020.

[8] D. Noppamas, "Comparison of Imbalanced Data Problem Solvingfor Income Classification of Type I Pharmacies Entrepreneur," in The 9th STOU National Research Conference, pp. 1578-1586, 2018.

[9] W. Jaidee, "The Study of Factors Affecting for On-time Graduating of Ungraduated Student Using Feature Selection Technique on Imbalanced Datasets," Journal of Information Science and Technology, vol.10, no. 1, pp. 75-84, 2020.

[10] N. Akarachantachote, "Feature Selection for High-dimensional data in Classification," Research Methodology & Cognitive Science, vol. 8, no. 2, pp. 1-13, October 2010 – March 2011.

[11] P. Pramokchon, "Filter-Based Feature Selection for Data Classification in IoT," FEU Academic Review, vol. 11, no. 3,pp. 98-113, 2017.

[12] C. Songsiri, T. Rakthanmanon, and K. Waiyamai, "Applying a data mining technique to help students in selecting their majors," In Proc. The 39th Kasetsart University Annual Conference, pp. 43-50, 2001.

[13] W. Pimpakun, "Ensemble Learning Model for Cardiotocography Classification," In Proc. Graduate Research Conference, pp. 333-340, 2012.

[14] P. Promla, "The Comparison of Efficiency on The Analysis of Satisfaction TeachingPerformance using Sentiment Analysis by Ensemble Technique," KKU Research Journal, vol. 20, no. 4, pp. 140-149, 2020.

[15] W. Paireekreng, "An ensemble learning based model for real estate project classification," ScienceDirect, vol. 3, pp. 3852 – 3859, 2015.

[16] P. Thanvanon, "Predict stock price trends in Stock Exchange of Thailand usingEnsemble Model," JOURNAL OF INFORMATION SCIENCE AND TECHNOLOGY, vol. 7, no. 1, pp. 12-21, 2017.

[17] L. Breiman, "Bagging Predictors," Machine Learning, vol. 24, pp. 123-140, 1996.

[18] K. Nasritha, "Comparison of sampling techniques for imbalanced data classification," JOURNAL Of APPLIED INFORMATICS AND TECHNOLOGY, vol. 1, pp. 20-35, 2018.

[19] S. Wang, "Negative correlation learning for classification ensembles," presented at the International Joint Conference on Neural Networks, IJCNN 2010, Barcelona, Spain, 2010.

[20] C. Onsang, "Combining Multiple Models for Predicting the Risk of DiabetesUsing Multi-Kernel Learning," In Proc. NCCIT2019, pp. 1-6, 2019.

[21] N. Sritrakul and T. Hudakorn, "The economic value and satisfaction of substituting LPG in households by a biogas network," Energy Reports, pp. 565-571, 2020.

[22] W. Sookcharoen, "Dealing with Missing Data," ROMPHRUEK JOURNAL KRIRK UNIVERSITY, vol. 33, no. 2,pp. 11-32, 2015.