Application of Named Entity Recognition (NER) in Job Vacancy Matching Using an Ontology-Based Approach (Case Study: Information Technology Sector)
DOI:
https://doi.org/10.35194/mji.v17i2.5675Keywords:
Job vacancy application, Named Entity Recognition, Taxonomy, OntologyAbstract
The dissemination of job vacancies through online platforms still faces limitations in understanding the semantic relationships between the skills possessed by job seekers and the qualifications required by a job position. This mismatch results in an inefficient search process and longer search times. This study aims to develop a semantic-based job vacancy recommendation application (talent matching) using a skill ontology approach. One of the main challenges in developing the ontology is the lack of standardized data structures in job vacancy postings, particularly in the job description section. To address this issue, Named Entity Recognition (NER) techniques are applied to automatically extract skill entities from job description texts. The extracted results are then classified into a taxonomy structure using SkillsGPT, thereby forming a hierarchical skill concept model semantically represented within the ontology using Protégé. The matching process between user skills and job qualifications is conducted through semantic similarity calculations employing the Sánchez Similarity method. Job vacancy data are collected via web scraping, while system development follows the Rational Unified Process (RUP) methodology and is evaluated using Black Box testing. Evaluation results demonstrate that the developed system is capable of providing semantically relevant job vacancy recommendations according to the user's skill profile. Therefore, this study contributes both theoretically and practically to the development of ontology-based recommendation systems, particularly in the automated modeling of skill taxonomies from unstructured data.References
[1] Z. Elkaimbillah, B. El Asri, M. Mikram, and M. Rhanoui, “Construction of an Ontology-based Document Collection for the IT Job Offer in Morocco,” Int. J. Adv. Comput. Sci. Appl., vol. 14, no. 7, pp. 453–461, 2023, doi: 10.14569/IJACSA.2023.0140749.
[2] PT. Glints Indonesia Group, “Lowongan Kerja Kompuer dan Teknologi Informasi (TI),” 2025. https://glints.com/id/job-category/computer-technology (accessed May 04, 2025).
[3] Kalibrr Technology Ventures Inc., “IT and Software,” 2025. https://www.kalibrr.id/id-ID/home/i/it-and-software (accessed May 04, 2025).
[4] A. Habous and E. H. Nfaoui, “A fuzzy logic and ontology-based approach for improving the CV and job offer matching in recruitment process,” Int. J. Metadata, Semant. Ontol., vol. 15, no. 2, pp. 104–120, 2021, doi: 10.1504/IJMSO.2021.120278.
[5] S. Paudel and A. Shakya, “Ontology based Job-Candidate Matching using Skill Sets,” Proc. IOE Grad. Conf., vol. 8914, pp. 251–258, 2017.
[6] S. ScholarWorks and Y. Bashyam Balachander, “Ontology Based Technical Skill Similarity,” Ontol. Based Tech. Ski. Similarity, 2018.
[7] A. Shakya and S. Paudel, “Job-Candidate Matching using ESCO Ontology,” J. Inst. Eng., vol. 15, no. 1, pp. 1–13, 2019, doi: 10.3126/jie.v15i1.27699.
[8] X. Zheng, “Import and edit ontology with Neo4j,” vol. 1613, no. 2020, 2022.
[9] T. Petrican, C. Stan, M. Antal, I. Salomie, T. Cioara, and I. Anghel, “Ontology-based skill matching algorithms,” Proc. - 2017 IEEE 13th Int. Conf. Intell. Comput. Commun. Process. ICCP 2017, no. January 2020, pp. 205–211, 2017, doi: 10.1109/ICCP.2017.8117005.
[10] D. Sánchez, M. Batet, D. Isern, and A. Valls, “Ontology-based semantic similarity: A new feature-based approach,” Expert Syst. Appl., vol. 39, no. 9, pp. 7718–7728, 2012, doi: 10.1016/j.eswa.2012.01.082.
[11] M. A. Khder, “Web scraping or web crawling: State of art, techniques, approaches and application,” Int. J. Adv. Soft Comput. its Appl., vol. 13, no. 3, pp. 144–168, 2021, doi: 10.15849/ijasca.211128.11.
[12] T. Gelar, A. Nanda, and A. Bakhrun, “Serverless Named Entity Recognition untuk Teks Instruksional Pertanian Kota,” J. Tek. Inform. dan Sist. Inf., vol. 8, no. 3, pp. 597–606, 2022, doi: 10.28932/jutisi.v8i3.5447.
[13] C. Larman, “Applying UML and Patterns: An Introduction to Object-Oriented Analysis and Design and the Unified Process (2nd Edition),” p. 656, 2001, [Online]. Available: http://www.amazon.com/Applying-UML-Patterns-Introduction-Object-Oriented/dp/0130925691.
[14] A. Thukral, S. Dhiman, R. Meher, and P. Bedi, “Knowledge graph enrichment from clinical narratives using NLP, NER, and biomedical ontologies for healthcare applications,” Int. J. Inf. Technol., vol. 15, no. 1, pp. 53–65, 2023, doi: 10.1007/s41870-022-01145-y.
[15] T. Pratama and Suharjito, “IndoXLNet: Pre-Trained Language Model for Bahasa Indonesia,” Int. J. Eng. Trends Technol., vol. 70, no. 5, pp. 366–380, 2022, doi: 10.14445/22315381/IJETT-V70I5P240.
[16] “SkillsGPT,” 2025. https://www.skillsgpt.ai/.
[17] Buildmedia.readthedocs.org, “protege-tutorial Documentation,” 2017, [Online]. Available: https://buildmedia.readthedocs.org/media/pdf/go-protege-tutorial/latest/go-protege-tutorial.pdf.
[18] T. Derave, F. Gailly, T. P. Sales, and G. Poels, “A taxonomy and ontology for digital platforms,” Inf. Syst., vol. 120, no. October 2023, p. 102293, 2024, doi: 10.1016/j.is.2023.102293.
[19] “Neosemantics(n10s) User Guide - Neosemantics.” https://neo4j.com/labs/neosemantics/4.0/ (accessed Nov. 11, 2021).
[20] J. Barrasa and A. Cowley, “Neosemantics (n10s): A Linked Data Toolkit for Neo4j,” YouTube, 2020, [Online]. Available: https://www.youtube.com/watch?v=LO-OvQaBq8s&mkt_tok=eyJpIjoiT0RkbE16UmxaVEF3TXpJeiIsInQiOiJ2bHZHdFZCV0xMRVRlWGw2U3dDdXk1NVNhK0JzT2xQT01lTmc5RGw2RWUzWWNCZ0RRRkpcL2x6ZE9FNmViajBzbG0yYWlCWFRucndlbHBDRWd2bVVWeGVVMmZnY3g5TnZHWmZDa044WUp3M1R2RlwvMkErcERnYWdRVk8.
[21] Tecoholic, “NER Annotator,” 2025. https://arunmozhi.in/ner-annotator.
[22] Explosion, “spaCy,” 2025. https://spacy.io/usage/training.
[23] Kamran, “Roadmap.sh,” 2024. https://roadmap.sh/computer-science.
[2] PT. Glints Indonesia Group, “Lowongan Kerja Kompuer dan Teknologi Informasi (TI),” 2025. https://glints.com/id/job-category/computer-technology (accessed May 04, 2025).
[3] Kalibrr Technology Ventures Inc., “IT and Software,” 2025. https://www.kalibrr.id/id-ID/home/i/it-and-software (accessed May 04, 2025).
[4] A. Habous and E. H. Nfaoui, “A fuzzy logic and ontology-based approach for improving the CV and job offer matching in recruitment process,” Int. J. Metadata, Semant. Ontol., vol. 15, no. 2, pp. 104–120, 2021, doi: 10.1504/IJMSO.2021.120278.
[5] S. Paudel and A. Shakya, “Ontology based Job-Candidate Matching using Skill Sets,” Proc. IOE Grad. Conf., vol. 8914, pp. 251–258, 2017.
[6] S. ScholarWorks and Y. Bashyam Balachander, “Ontology Based Technical Skill Similarity,” Ontol. Based Tech. Ski. Similarity, 2018.
[7] A. Shakya and S. Paudel, “Job-Candidate Matching using ESCO Ontology,” J. Inst. Eng., vol. 15, no. 1, pp. 1–13, 2019, doi: 10.3126/jie.v15i1.27699.
[8] X. Zheng, “Import and edit ontology with Neo4j,” vol. 1613, no. 2020, 2022.
[9] T. Petrican, C. Stan, M. Antal, I. Salomie, T. Cioara, and I. Anghel, “Ontology-based skill matching algorithms,” Proc. - 2017 IEEE 13th Int. Conf. Intell. Comput. Commun. Process. ICCP 2017, no. January 2020, pp. 205–211, 2017, doi: 10.1109/ICCP.2017.8117005.
[10] D. Sánchez, M. Batet, D. Isern, and A. Valls, “Ontology-based semantic similarity: A new feature-based approach,” Expert Syst. Appl., vol. 39, no. 9, pp. 7718–7728, 2012, doi: 10.1016/j.eswa.2012.01.082.
[11] M. A. Khder, “Web scraping or web crawling: State of art, techniques, approaches and application,” Int. J. Adv. Soft Comput. its Appl., vol. 13, no. 3, pp. 144–168, 2021, doi: 10.15849/ijasca.211128.11.
[12] T. Gelar, A. Nanda, and A. Bakhrun, “Serverless Named Entity Recognition untuk Teks Instruksional Pertanian Kota,” J. Tek. Inform. dan Sist. Inf., vol. 8, no. 3, pp. 597–606, 2022, doi: 10.28932/jutisi.v8i3.5447.
[13] C. Larman, “Applying UML and Patterns: An Introduction to Object-Oriented Analysis and Design and the Unified Process (2nd Edition),” p. 656, 2001, [Online]. Available: http://www.amazon.com/Applying-UML-Patterns-Introduction-Object-Oriented/dp/0130925691.
[14] A. Thukral, S. Dhiman, R. Meher, and P. Bedi, “Knowledge graph enrichment from clinical narratives using NLP, NER, and biomedical ontologies for healthcare applications,” Int. J. Inf. Technol., vol. 15, no. 1, pp. 53–65, 2023, doi: 10.1007/s41870-022-01145-y.
[15] T. Pratama and Suharjito, “IndoXLNet: Pre-Trained Language Model for Bahasa Indonesia,” Int. J. Eng. Trends Technol., vol. 70, no. 5, pp. 366–380, 2022, doi: 10.14445/22315381/IJETT-V70I5P240.
[16] “SkillsGPT,” 2025. https://www.skillsgpt.ai/.
[17] Buildmedia.readthedocs.org, “protege-tutorial Documentation,” 2017, [Online]. Available: https://buildmedia.readthedocs.org/media/pdf/go-protege-tutorial/latest/go-protege-tutorial.pdf.
[18] T. Derave, F. Gailly, T. P. Sales, and G. Poels, “A taxonomy and ontology for digital platforms,” Inf. Syst., vol. 120, no. October 2023, p. 102293, 2024, doi: 10.1016/j.is.2023.102293.
[19] “Neosemantics(n10s) User Guide - Neosemantics.” https://neo4j.com/labs/neosemantics/4.0/ (accessed Nov. 11, 2021).
[20] J. Barrasa and A. Cowley, “Neosemantics (n10s): A Linked Data Toolkit for Neo4j,” YouTube, 2020, [Online]. Available: https://www.youtube.com/watch?v=LO-OvQaBq8s&mkt_tok=eyJpIjoiT0RkbE16UmxaVEF3TXpJeiIsInQiOiJ2bHZHdFZCV0xMRVRlWGw2U3dDdXk1NVNhK0JzT2xQT01lTmc5RGw2RWUzWWNCZ0RRRkpcL2x6ZE9FNmViajBzbG0yYWlCWFRucndlbHBDRWd2bVVWeGVVMmZnY3g5TnZHWmZDa044WUp3M1R2RlwvMkErcERnYWdRVk8.
[21] Tecoholic, “NER Annotator,” 2025. https://arunmozhi.in/ner-annotator.
[22] Explosion, “spaCy,” 2025. https://spacy.io/usage/training.
[23] Kamran, “Roadmap.sh,” 2024. https://roadmap.sh/computer-science.
Published
2025-12-31
Issue
Section
Articles