Preprint / Versión 1

Exploring Interoperability Between Local and Global Databases in Scientometrics: Lattes, Capes, and OpenAlex

article.authors6a0a59f4e0a2a

DOI:

https://doi.org/10.1590/SciELOPreprints.12668

Keywords:

bibliometric coverage, Lattes CV, Scientometrics

Resumen

Numerous initiatives are currently underway to disambiguate databases worldwide. In this paper, we propose a methodology for disambiguating research entities using big data techniques, adopting an approach that goes from local to global databases. Our objective is to enhance the quality of data in the OpenAlex database by leveraging information from Brazilian databases, particularly data from the Lattes Platform and the Brazilian Federal Agency for Support and Evaluation of Graduate Education. We compare similar names of authors and institutions, employing Digital Object Identifiers to link entities, along with an adaptation of the Levenshtein distance algorithm. The proposed method is straightforward to implement in tabular databases and facilitates disambiguation, thereby contributing to open science practices and providing an effective solution for research information systems. The findings indicate the potential for integrating local and global databases to address issues related to ambiguous names and incomplete metadata.

Downloads

Los datos de descarga aún no están disponibles.

Postado

21/07/2025

Cómo citar

Exploring Interoperability Between Local and Global Databases in Scientometrics: Lattes, Capes, and OpenAlex. (2025). In SciELO Preprints. https://doi.org/10.1590/SciELOPreprints.12668

Serie

Ciencias Sociales Aplicadas

Datos de los fondos

Plaudit

Declaración de datos

  • Los datos de investigación están disponibles a petición, condición justificada en el manuscrito