Preprint / Version 1

DISCOURSE ANALYSIS FOR THE DEVELOPMENT OF A CYBERGROOMING DETECTION MODEL ON ROBLOX

##article.authors##

  • Ana Paola Castañón Marroquín Meritorious Autonomous University of Puebla image/svg+xml https://orcid.org/0009-0007-3411-0418
    • Conceptualization
    • Formal Analysis
    • Investigation
    • Methodology
    • Validation
    • Visualization
    • Writing – Review & Editing
  • Brenda Ailed Rodríguez Colis Meritorious Autonomous University of Puebla image/svg+xml https://orcid.org/0009-0002-2694-6542
    • Conceptualization
    • Formal Analysis
    • Investigation
    • Methodology
    • Validation
    • Visualization
    • Writing – Original Draft Preparation
    • Writing – Review & Editing
  • Andrea Bazán Durán Meritorious Autonomous University of Puebla image/svg+xml https://orcid.org/0009-0001-7722-1878
    • Conceptualization
    • Formal Analysis
    • Investigation
    • Methodology
    • Project Administration
    • Resources
    • Validation
    • Visualization
    • Writing – Original Draft Preparation
    • Writing – Review & Editing
  • Luis Enrique Colmenares-Guillen Meritorious Autonomous University of Puebla image/svg+xml https://orcid.org/0000-0002-9921-8813
    • Conceptualization
    • Investigation
    • Methodology
    • Project Administration
    • Resources
    • Supervision
    • Validation
    • Writing – Original Draft Preparation
    • Writing – Review & Editing

DOI:

https://doi.org/10.1590/SciELOPreprints.16320

Keywords:

Discourse analysis, computer crime, child abuse, videogame, computational linguistics

Abstract

Cybergrooming represents a growing threat on online gaming platforms such as Roblox, where anonymity and frequent interaction among child users create conditions conducive to child abuse and sexual harassment. The objective of the research that led to this article was to identify linguistic patterns in the discourse of groomers in Spanish-speaking Roblox communities and incorporate them into a computational model for the automatic detection of this cybercrime through text. To this end, a mixed-methods approach was developed, integrating Corpus-Assisted Discourse Studies with the CRISP-DM data mining methodology. A specialized corpus of 25 conversations was compiled and processed, then subjected to detailed analysis. As a main result, a pattern of discursive organization consisting of a sequence of seven conversational modules with specific predictive value and a set of 21 functional lexicogrammatical patterns with 224 associated collocations were identified, described, and subsequently incorporated into a text classification model capable of distinguishing grooming conversations with 93.33% accuracy. In this way, the study demonstrated the efficacy of discourse analysis as a basis for the development of systems for the automatic detection of cybercrimes against minors.

Downloads

Download data is not yet available.

Author Biographies

Ana Paola Castañón Marroquín, Meritorious Autonomous University of Puebla

Facultad de Filosofía y Letras, Licenciatura en Lingüística y Literatura Hispánica

Brenda Ailed Rodríguez Colis, Meritorious Autonomous University of Puebla

Facultad de Ciencias de la Computación, Ingeniería en Ciencias de la Computación

Andrea Bazán Durán, Meritorious Autonomous University of Puebla

Facultad de Ciencias de la Computación, Ingeniería en Ciencias de la Computación

Luis Enrique Colmenares-Guillen, Meritorious Autonomous University of Puebla

Profesor investigador de la Facultad de Ciencias de la Computación y Coordinador del Laboratorio de Análisis Forense Digital en la Benemérita Universidad Autónoma de Puebla en México. En la Facultad, ha impartido las cátedras de Sistemas Operativos, Administración de proyectos, Sistemas Distribuidos, Procesamiento Digital de imágenes, Sistemas de tiempo real, Recuperación de información, administración de proyectos, Proyectos I+D. Actualmente ha desarrollado algoritmos y sistemas clasificadores para el área de la Inteligencia artificial y reconocimiento de patrones.

Posted

05/28/2026

How to Cite

DISCOURSE ANALYSIS FOR THE DEVELOPMENT OF A CYBERGROOMING DETECTION MODEL ON ROBLOX. (2026). In SciELO Preprints. https://doi.org/10.1590/SciELOPreprints.16320

Section

Human Sciences

Plaudit

Data statement