This preprint has been published elsewhere.
DOI of the published version: https://doi.org/10.5007/1518-2924.2025.e101283
Preprint / Version 1

Machado de Assis's Literary Gazetteer

DOI:

https://doi.org/10.1590/SciELOPreprints.9474

Keywords:

Semantic Web, Machado de Assis, Geolocation, Brazilian Literature, Digital Humanities

Abstract

This study aims to develop a semantic web application that maps the geographic locations cited in Machado de Assis's works and stores them in a triplestore. By integrating data from the machadodeassis.net encyclopedia with geographic coordinates from Geonames.org and Google Maps, the project offers an interactive, map-based reading experience that helps readers follow the spatial references the writer made in the 19th century. Using the Python library BeautifulSoup, the application extracts citations, structures them according to the schema.org vocabulary, and submits them to the gpt3.5-instruct and gpt4-turbo models to identify the current name and the classification of each location under the Geonames.org ontology. Finally, SPARQL queries against dados.literaturabrasileira.ufsc.br retrieve a unique identifier for each book, integrating maps, citations, and full texts in line with Linked Data standards.
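
As a rough illustration of the structuring step described in the abstract, the sketch below wraps one extracted citation in schema.org vocabulary as JSON-LD. It is not the authors' implementation: the function name `citation_to_jsonld`, the choice of the `Quotation`, `Book`, `Place` and `GeoCoordinates` types, and the sample passage and coordinates are all assumptions made for this example.

```python
import json


def citation_to_jsonld(book_title, quote_text, place_name, latitude, longitude):
    """Describe one citation with schema.org types (hypothetical helper)."""
    return {
        "@context": "https://schema.org",
        "@type": "Quotation",
        "text": quote_text,
        # The work in which the passage appears.
        "isPartOf": {
            "@type": "Book",
            "name": book_title,
            "author": {"@type": "Person", "name": "Machado de Assis"},
        },
        # The place the passage refers to, with its coordinates.
        "spatialCoverage": {
            "@type": "Place",
            "name": place_name,
            "geo": {
                "@type": "GeoCoordinates",
                "latitude": latitude,
                "longitude": longitude,
            },
        },
    }


# Illustrative values only: the passage is invented and the
# coordinates are approximate, not taken from the project's data.
record = citation_to_jsonld(
    book_title="Dom Casmurro",
    quote_text="Example passage mentioning Rua do Ouvidor.",
    place_name="Rua do Ouvidor",
    latitude=-22.9035,
    longitude=-43.1770,
)

print(json.dumps(record, ensure_ascii=False, indent=2))
```

A record in this shape can be serialized to RDF and loaded into a triplestore, which is one plausible way to connect citations, places and books as Linked Data.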

Author Biographies

Dilvan de Abreu Moreira, Universidade de São Paulo

Postdoctoral fellowship in Biomedical Informatics at Stanford University (2008), Ph.D. in Electronics Engineering from the University of Kent at Canterbury (1995), master's degree in Microelectronics from the State University of Campinas (1991), and bachelor's degree in Electrical Engineering from the Federal University of Bahia (1988). Currently Associate Professor at the University of São Paulo. Ad hoc consultant for FAPESP, CNPq, CAPES and FNR Luxembourg. Member of the IEEE and ACM. Reviewer for Bioinformatics (Oxford). CNPq research productivity fellow for 9 years and holder of CNPq and FAPESP research grants. My research focuses on applying Web technologies, especially the Semantic Web, to problems in biomedicine and bioinformatics so that machines can interpret biomedical data. Recently I have collaborated with BMIR at Stanford University on the semantic annotation of medical images and with INPA/Embrapa on the annotation and semantic search of biodiversity data. I have more than 20 years of experience in computer research and engineering: distributed client/server and Web applications, including technologies such as Web services, ontologies (Semantic Web, OWL) and the languages C, C++, Clojure and Java on Linux, Windows and Mac. I also have experience managing research laboratories in the area.

Davi Machado da Rocha, Secretaria da Educação do Estado de São Paulo

Master's degree in History from Universidade Estadual Paulista and database technologist at FIAP, with interests in creating and managing databases of historical documents, big data and digital humanities. He was a special postgraduate student at the University of São Paulo, completing courses in Introduction to the Semantic Web, Unstructured Data Mining, Artificial Intelligence and Natural Language Processing between 2023 and 2024. He has worked as a researcher, editorial board member and editor for academic journals, and has teaching and research experience in the field of education.

Posted

07/22/2024

Section

Linguistic, literature and arts

Data statement

  • The research data is contained in the manuscript