AI-Driven Classification of Personality Items: A Study on Large Language Models
DOI:
https://doi.org/10.1590/1982-4327e3609Keywords:
personality, natural language processing, artificial intelligence, psychometrics, validityAbstract
This study investigates the effectiveness of large language models (LLMs) in classifying items that evaluate the Big Five personality dimensions, focusing on performance variations across traits and comparing local and cloud-based models. Five Natural Language Processing (NLP) models were used to classify 385 personality items, employing three levels of detail in prompt design. The results indicate that larger generative models, such as ChatGPT-4o and Gemini 1.5 Pro, outperformed smaller models in terms of accuracy, both for the five personality factors and overall. However, the Llama 3.1 model, run locally, showed adequate results for judge-based analyses, offering a viable alternative for those prioritizing data privacy. The study highlights the potential of LLMs as complementary tools in the development of psychological assessment instruments.
Downloads
Posted
How to Cite
Section
Copyright (c) 2026 Carlos Henrique Sancineto da Silva Nunes, Alexandre José de Souza Peres, Leonardo de Barros Mose, Petar Čolović, Ricardo Primi, Vithor Rosa Franco

This work is licensed under a Creative Commons Attribution 4.0 International License.
Plaudit
Data statement
-
The research data is contained in the manuscript


