# Resume data extract and job recruitment Chatbot features for AI-based resume screening & analytics

**Authors:** Kar Weng Chong, Kok Why Ng, Yong Hong Fu

PMC · DOI: 10.1016/j.mex.2025.103775 · MethodsX · 2025-12-17

## TL;DR

This paper presents an AI-based recruitment system combining resume screening and a chatbot to improve hiring efficiency and candidate experience.

## Contribution

A novel integrated AI recruitment system with resume screening and chatbot features for enhanced hiring processes.

## Key findings

- The resume screening AI effectively parses and evaluates resumes using document processing and language models.
- The chatbot improves accessibility and interaction during recruitment with speech-to-text and FAQ automation.
- The system is designed to be scalable and efficient for diverse hiring needs.

## Abstract

AI technologies are changing the field of manpower recruitment since they make it much more efficient, accurate, and scalable than conventional approaches. The project builds an AI recruitment system that is combined to have two main elements of an integrated AI recruitment system including a Resume Screening AI and a Job Recruitment Chatbot, which have the goal of improving the process in hiring as well as making the experience of the candidates better in the process.

The Resume Screening AI uses a mixed approach incorporating both classical document processing, and capabilities of advanced language models. The unstructured data of raw resumes are mined and normalized into standard forms to allow evaluation and ranking of the candidates in an organized and systematic manner according to the position’s requirements. The Job Recruitment Chatbot entails a programmed chat system of interactive communication during the job recruitment procedure comprising the component of FAQ, conversation-based direction, and voice-to-text dynamic to make the system more accessible to a diverse group of users.•Document Processing Pipeline: Parsed all-format resumes by the means of PyPDF2 and python-docx libraries and programmed the data in a structured manner, via Google Gemini 1.5 Flash API and engineered special prompts to validate the set of JSON-Schema.•Intelligent Screening System: Created automated candidate screening based on (large language model) inference process to compare resume text with job requirements, producing relevance scores and classified evaluations.•Interactive Chatbot Development: Developed natural language processing AI interface with chat capabilities and with speech-to-text and FAQ automation that could be used to answer candidate questions and optimize the recruitment process.

Document Processing Pipeline: Parsed all-format resumes by the means of PyPDF2 and python-docx libraries and programmed the data in a structured manner, via Google Gemini 1.5 Flash API and engineered special prompts to validate the set of JSON-Schema.

Intelligent Screening System: Created automated candidate screening based on (large language model) inference process to compare resume text with job requirements, producing relevance scores and classified evaluations.

Interactive Chatbot Development: Developed natural language processing AI interface with chat capabilities and with speech-to-text and FAQ automation that could be used to answer candidate questions and optimize the recruitment process.

Image, graphical abstract

## Full-text entities

- **Genes:** F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}
- **Diseases:** physically (MESH:D059445)
- **Chemicals:** Dialogue (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12808514/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12808514/full.md

## References

32 references — full list in the complete paper: https://tomesphere.com/paper/PMC12808514/full.md

---
Source: https://tomesphere.com/paper/PMC12808514