PREDICTING GRADUATE EMPLOYABILITY USING ENSEMBLE LEARNING ON RESUME DATA, INTERNSHIPS, AND SOFT SKILL SCORES

Authors

  • Dr. Maroti V. Kendre Assistant Professor, School of Liberal Arts, Pimpri Chinchwad University, Pune, Maval (PMRDA), Dist. Pune-412106, Maharashtra, India
  • Dr. Ashok Rajaram Suryawanshi Department of Electronics and Telecommunication Engineering, Pimpri Chinchwad College of Engineering, Pune, Maharashtra, India
  • Mrs. Priyanka S. Utage Assistant Professor, Walchand Institute of Technology, Solapur, Maharashtra, India
  • Dr. Prasad Ghodke Assistant Professor, Department of MBA, Modern Institute of Business Studies, Nigdi, Pune (Savitribai Phule Pune University), Maharashtra, India
  • Dr. Manoj Dnyanaba Mate Assistant Professor, CSMSS Chhatrapati Shahu College of Engineering, Chhatrapati Sambhajinagar, Maharashtra, India
  • Ashish Vilas Jawake Training and Placement Officer, Dr. D. Y. Patil Technical Campus, Varale–Talegaon, Pune, Maharashtra, India

DOI:

https://doi.org/10.29121/shodhkosh.v6.i4s.2025.6951

Keywords:

Graduate Employability, Ensemble Learning, Resume Analysis, Internship Evaluation, Soft Skills, Stacked Model, NLP, Education Data Mining

Abstract [English]

Graduate hiring has become an important way to judge how well higher education systems work, so it's important to use data-driven methods to correctly predict job results. This study suggests using ensemble learning to guess how likely it is that a graduate will be able to get a job after graduation. It does this by combining different types of data, such as resume material, internship experience, and soft skill assessment scores. Natural language processing (NLP) methods are used to pull out meaning information from resume data, like school successes, professional skills, certifications, and leisure activities. Some aspects of internships, like length, importance to the job, and company rank, are measured and used as number forecasts. Psychometric scores and friend evaluations are used to standardize and add to the feature area the level of soft skill proficiency. The forecast model uses a group of ensemble learning methods, such as Random Forests, Gradient Boosting Machines (GBM), and Extreme Gradient Boosting (XGBoost). It also uses a meta-learner that is applied through stacking to improve its ability to generalize. Feature importance analysis shows how important cognitive, behavioral, and practical traits are in terms of finding key employment markers. Cross-validation and hyperparameter tuning are used to make sure that the model is stable, and the ensemble model performs better than individual classifiers in terms of accuracy, precision, recall, and F1-score. This work not only creates a scalable method for predicting how employable graduates will be, but it also gives schools useful information that they can use to improve their courses and gives students useful information that they can use to customize how they prepare for careers.

References

Albina, A., and Sumagaysay, L. (2020). Employability Tracer Study of Information Technology Education Graduates from a State University in the Philippines. Social Sciences and Humanities Open, 2, 100055. https://doi.org/10.1016/j.ssaho.2020.100055 DOI: https://doi.org/10.1016/j.ssaho.2020.100055

Assegie, T. A., Salau, A. O., Chhabra, G., Kaushik, K., and Braide, S. L. (2024). Evaluation of Random Forest and Support Vector Machine Models in Educational Data Mining. In Proceedings of the 2nd International Conference on Advancement in Computation and Computer Technologies (InCACCT) (131–135). IEEE. https://doi.org/10.1109/InCACCT61598.2024.10551110 DOI: https://doi.org/10.1109/InCACCT61598.2024.10551110

Aviso, K. B., Janairo, J. I. B., Lucas, R. I. G., Promentilla, M. A. B., Yu, D. E. C., and Tan, R. R. (2020). Predicting Higher Education Outcomes with Hyperbox Machine Learning: What Factors Influence Graduate Employability? Chemical Engineering Transactions, 81, 679–684.

Celine, S., Dominic, M. M., and Devi, M. S. (2020). Logistic Regression for Employability Prediction. International Journal of Innovative Technology and Exploring Engineering, 9(3), 2471–2478. https://doi.org/10.35940/ijitee.C8170.019320 DOI: https://doi.org/10.35940/ijitee.C8170.019320

Chopra, A., and Saini, M. L. (2023). Comparison Study of Different Neural Network Models for Assessing Employability Skills of IT Graduates. In Proceedings of the International Conference on Sustainable Communication Networks and Application (ICSCNA) (189–194). IEEE. https://doi.org/10.1109/ICSCNA58489.2023.10368605 DOI: https://doi.org/10.1109/ICSCNA58489.2023.10368605

Maaliw, R. R., Quing, K. A. C., Lagman, A. C., Ugalde, B. H., Ballera, M. A., and Ligayo, M. A. D. (2022). Employability Prediction of Engineering Graduates Using Ensemble Classification Modeling. In Proceedings of the IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC) (288–294). IEEE. https://doi.org/10.1109/CCWC54503.2022.9720783 DOI: https://doi.org/10.1109/CCWC54503.2022.9720783

Monteiro, S., Almeida, L., Gomes, C., and Sinval, J. (2020). Employability Profiles of Higher Education Graduates: A Person-Oriented Approach. Studies in Higher Education, 1–14. https://doi.org/10.1080/03075079.2020.1761785 DOI: https://doi.org/10.1080/03075079.2020.1761785

Nordin, N. I., Sobri, N. M., Ismail, N. A., Mahmud, M., and Alias, N. A. (2022). Modelling Graduate Unemployment from Students’ Perspectives. Journal of Mathematical, Computational and Statistical Sciences, 8(2), 68–78. https://doi.org/10.24191/jmcs.v8i2.6986 DOI: https://doi.org/10.24191/jmcs.v8i2.6986

Oliver, N., Pérez-Cruz, F., Kramer, S., Read, J., and Lozano, J. A. (2021). Machine Learning and Knowledge Discovery in Databases. Springer Nature. https://doi.org/10.1007/978-3-030-86486-6 DOI: https://doi.org/10.1007/978-3-030-86523-8

Philippine Statistics Authority. (2021). Unemployment Rate in September 2021 is Estimated at 8.9 Percent.

Shahriyar, J., Ahmad, J. B., Zakaria, N. H., and Su, G. E. (2022). Enhancing Prediction of Employability of Students: Automated Machine Learning Approach. In Proceedings of the 2nd International Conference on Intelligent Cybernetics Technology and Applications (ICICyTA) (87–92). IEEE. https://doi.org/10.1109/ICICyTA57421.2022.10038231 DOI: https://doi.org/10.1109/ICICyTA57421.2022.10038231

Shuker, F. M., and Sadik, H. H. (2024). A Critical Review on Rural Youth Unemployment in Ethiopia. International Journal of Adolescence and Youth, 29(1), 1–17. https://doi.org/10.1080/02673843.2024.2322564 DOI: https://doi.org/10.1080/02673843.2024.2322564

Tamene, E. H., Salau, A. O., Vats, S., Kaushik, K., Molla, T. L., and Tin, T. T. (2024). Predictive Analysis of Graduate Students’ Employability using Machine Learning Techniques. In Proceedings of the International Conference on Artificial Intelligence and Emerging Technology (Global AI Summit) (557–562). IEEE. https://doi.org/10.1109/GlobalAISummit62156.2024.10947923 DOI: https://doi.org/10.1109/GlobalAISummit62156.2024.10947923

Downloads

Published

2025-12-25

How to Cite

Kendre, M. V., Suryawanshi, A. R., Utage, P. S., Ghodke, P., Mate, M. D., & Jawake, A. V. (2025). PREDICTING GRADUATE EMPLOYABILITY USING ENSEMBLE LEARNING ON RESUME DATA, INTERNSHIPS, AND SOFT SKILL SCORES. ShodhKosh: Journal of Visual and Performing Arts, 6(4s), 723–734. https://doi.org/10.29121/shodhkosh.v6.i4s.2025.6951