Preprint / Version 1

IDENTIFICATION OF MISSING DATA AND IMPUTATION IN VACCINATION RATES FOR AN ECOLOGICAL STUDY OF THE SOUTHERN CONE OF SOUTH AMERICA IN FOUR COUNTRIES

##article.authors##

DOI:

https://doi.org/10.1590/SciELOPreprints.12147

Keywords:

Harmonization, Imputation, Missing data, Reproducible research, Visualization

Abstract

In epidemiological studies with ecological design, it is common practice to work with country-level data, which arise from secondary data sources. For this reason, a harmonization process must be carried out in order to ensure the quality and completeness of the data, as a fundamental element of what is known as reproducible research. This allows other researchers to replicate the results by accessing the same repositories, which are usually international portals that publish statistical data in the health area, such as the WHO, the World Bank or the statistical institutes or health ministries of each country. This harmonization process that guarantees reproducibility has as a fundamental stage the transparency of the process of identifying missing data and any modification that is made when imputing and allowing for complete data, which expands the use of different statistical techniques. This work shows the entire process followed for a group of vaccination rates, including visualization and various imputation methods that are appropriate to the nature of the data, for 4 countries in the Southern Cone for a period of 20 years. The details of this project are on the OSF platform in the project Analysis of different demographic, social, and economic indicators and their association with a group of immunopreventable diseases for 4 countries in the Southern Cone of South America at https://osf.io/6r3ew/

Downloads

Download data is not yet available.

Author Biography

Ramón Álvarez-Vaz, University of the Republic of Uruguay

My undergraduate training is in Statistics and postgraduate in Epidemiology (Master's) and Health Sciences (PhD).
I currently work at the Facultad de Ciencias Económicas y de Administración de la Universidad de la República, Uruguay. I do research in Applied Statistics, among other topics, to Marketing, Finance, Public Health Economics and Epidemiology, as well as statistical modeling in Demography and Actuarial problems. My work has several lines that revolve around the use and incorporation of statistical tools suitable for research in the field of all these disciplines. This implies adapting, their correct use, the statistical methods of supervised and unsupervised learning, sampling, time series, multilevel analysis, network analysis, agent-based models, simulation and Bayesian methods, among others.

Posted

06/16/2025

How to Cite

IDENTIFICATION OF MISSING DATA AND IMPUTATION IN VACCINATION RATES FOR AN ECOLOGICAL STUDY OF THE SOUTHERN CONE OF SOUTH AMERICA IN FOUR COUNTRIES. (2025). In SciELO Preprints. https://doi.org/10.1590/SciELOPreprints.12147

Section

Health Sciences

Plaudit

Data statement

  • The research data is contained in the manuscript

  • The research data is available in one or more data repository(ies)