Skip to content

Commit 8c7534e

Browse files
committed
actualización traducciones
1 parent dc235c4 commit 8c7534e

File tree

7 files changed

+116
-6
lines changed

7 files changed

+116
-6
lines changed

_i18n/en.yml

+5
Original file line numberDiff line numberDiff line change
@@ -32,3 +32,8 @@ form:
3232
email: Email
3333
message: Message
3434
send: Send
35+
36+
bibliotecadigital:
37+
heading: "Access our digital editions"
38+
viajes: "Chronicles and travelogues"
39+
miscelanea: "Miscellaneous texts"

_i18n/en/bibliotecadigital.md

+12
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,12 @@
1+
2+
Digital Scholarly Editing is one of the most representative areas within the field of Digital Humanities. Since early days, initiatives like [Project Gutenberg](https://www.gutenberg.org/) and [Perseus](http://www.perseus.tufts.edu/hopper/) have considered how to approach the digitization of texts in the Humanities not only as static resources for consultation, but also as enriched and searchable texts.
3+
4+
All the digital editions in our Digital Library follow the XML-TEI standard and the principles of minimal computing, favoring the use of open technologies, governance over data and code, and the reduction of computing infrastructure. The enriched texts with notes and maps are complemented with data visualizations and many distant reading resources.
5+
6+
<!-- # Access our digital editions
7+
8+
9+
## Chronicles and travelogues
10+
11+
12+
## Miscellaneous texts -->

_i18n/en/explora.md

+70
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,70 @@
1+
# Explora
2+
3+
The methodology used for digital textual analysis assumes that texts acquire meaning not in isolation, but as part of a system, in other terms, a large corpus. In large digital textual corpora, elements can appear repeatedly but human reading might overlook because of focusing solely on detail and linearity. Macroanalysis, machine reading, or distant reading, in the words of Matthew Jockers, Stephen Ramsay, or Franco Moretti, brings in computational and algorithmic elements and applies quantitative methods that do not exclude qualitative approaches: computers process data firstly, and then (ideally) humans read and interpret it as text.
4+
5+
In this section, we offer some experiences in textual analysis using texts from our [Digital Library]({{site.baseurl}}/bibliotecadigital.md): [Relación de las cosas sucedidas en el Río de la Plata](pero-site) by Pero Hernández, la [Relación de un viaje al Río de la Plata](acarete-site) buy Acarete du Biscay, [La Argentina Manuscrita](ruy-diaz-site) by Ruy Díaz de Guzmán, [Argentina y la conquista del Río de la Plata](centenera-site) *Poema histórico* by Martín del Barco Centenera, and [Viaje al Río de la Plara de Ulrico Schmidel](ulrich-site) by Ulrico Schmidel.
6+
7+
For a more in-depth macroanalytical reading, we invite you to consult this [notebook](https://colab.research.google.com/drive/1l487LB-PUqgs24DFqSTq2tPZ0OrTUbHB?usp=sharing).
8+
9+
## Morphosyntactic Tagging
10+
11+
Morphosyntactic tagging is an automatic annotation process that assigns a grammatical category (or part of speech) to each token in a text. This technique allows for the visualization of syntactic dependencies or investigation of the correlation between a certain class of words and other discursive features, such as sentiment polarity or writing quality.
12+
13+
![dependencias acarete](/assets/img/explora/dependencias-dubiscay.png)
14+
15+
The grammatical categories identified through morphosyntactic tagging can be used for co-occurrence analysis, providing a quick overview of the content of a textual corpus. Co-occurrence analysis measures the appearance of two or more tokens within the same context (sentence, paragraph, etc.). For example, analyzing noun co-occurrences in *Relación de un viaje al Río de la Plata*, as shown in the following graph, helps highlight the commercial dimension and territorial description in Acarete du Biscay’s text through the high frequency of pairs like “barco-mercadería” (ship-merchandise), “mina-plata” (mine-silver), “corona-libra” (crown-pound), and also “legua-río” (league-river), “ciudad-río” (city-river), “montaña-plata” (mountain-silver).
16+
17+
![co-occurrence graph – Acarete](/assets/img/explora/cooc_sust_sust_dubiscay.png)
18+
19+
20+
## Semantic Tagging
21+
22+
Semantic annotation typically aims to distinguish categories such as person, place, and event. However, it can also rely on a taxonomy to identify more specific classes, such as [professional hierarchies](https://github.com/hdcaicyt/Relacion-de-las-cosas-sucedidas/blob/master/assets/taxonomia-personajes-pero-hernandez.pdf), administrative divisions, or types of geographical features.
23+
24+
![bar plot facciones](/assets/img/explora/conteo_personajes_categ.png)
25+
26+
Semantic enrichment allows for, for instance, counting the number of characters by origin, faction, or position, or tracing intersections between these categories throughout the textual narrative, as shown in the following [narrative diagram](http://hdlab.space/Relacion-de-las-cosas-sucedidas/narrative-chart/) for [_Relación de las cosas sucedidas en el Río de la Plata_](http://hdlab.space/Relacion-de-las-cosas-sucedidas/):
27+
28+
<a href="http://hdlab.space/Relacion-de-las-cosas-sucedidas/narrative-chart/" target="_blank"><img src="http://hdlab.space/Relacion-de-las-cosas-sucedidas/assets/img/narrative-chart.png" alt="narrative chart"></a>
29+
30+
31+
## Topic modeling
32+
33+
Topic modeling is a method for organizing large corpora based on machine learning. This technique identifies the topics present in a collection of texts, the documents related to each topic, and the words that represent them—without the need for labeled data.
34+
35+
![topics pelagios](/assets/img/explora/topic_texts_pelagios.png)
36+
37+
The following chart displays the ten most representative words for each topic identified through topic modeling in _Relación de las cosas sucedidas_, _Relación de un viaje al Río de la Plata_, and _La Argentina Manuscrita_. Topic 2 features words strongly tied to _Relación de las cosas sucedidas_ (the text describes the conflict between Domingo de Irala and the Governor, Álvar Núñez), while Topic 1 includes words like “capitán” (captain) and “pedro,” which are characteristic of _La Argentina Manuscrita_ (which recounts the founding of Asunción by one of Pedro de Mendoza’s captains). It's worth noting that some words appear in multiple topics—“río” (river), “indios” (Indians), “plata” (silver)—which makes sense given that all the texts deal with the colonization of the Río de la Plata region in the late 16th and early 17th centuries.
38+
39+
![topics pelagios](/assets/img/explora/topic_words_pelagios.png)
40+
41+
42+
## Stylometric Analysis with Stylo
43+
44+
Stylometry is the quantitative study of the linguistic style of written texts. It has a variety of applications, such as determining authorship, verifying the authenticity of texts, and classifying documents. The *Stylo* software—developed by Maciej Eder, Jan Rybicki, Mike Kestemont, and Steffen Pielstroem—enables easy stylometric analysis and generates visual representations that help evaluate and compare stylistic similarities or differences.
45+
For a stylometric analysis experience with Stylo, we added the following works from our Digital Library to the corpus:
46+
47+
* _Descripción e Historia del Paraguay y del Río de la Plata_, by by Félix de Azara
48+
* _Días de ocio_, by William H. Hudson
49+
* _Viaje a la Patagonia Austral_, by Francisco Pascasio Moreno
50+
* _Primer viaje alrededor del mundo_, by Antonio Pigafetta
51+
* _Diario de un viaje a la costa de Patagonia, para reconocer los puntos en donde establecer poblaciones_, by Antonio de Viedma
52+
* _Relación de la entrada de los Chiriguanos_, by Ruy Díaz de Guzmán.
53+
54+
The dendrogram below shows the texts grouped according to their degree of stylistic similarity:
55+
56+
![macroanalisis1](/assets/img/explora/macroanalisis-ngram-eder-delta.png)
57+
58+
Texts that appear on the same branch have greater stylistic similarities, while greater distances between nodes represent more marked stylistic differences. Thus, in this analysis, two main branches can be observed: the upper branch contains most of the texts, while the smaller, lower branch contains a node grouping two stylistically similar works by the same author: _La Argentina Manuscrita_ by Rui Díaz de Guzmán and a text identified as _Chiriguanos_, which corresponds to the *Relación de la entrada de los Chiriguanos* written by the same author between 1617 and 1618.
59+
60+
This analysis can be complemented and confirmed by applying other stylometric analysis algorithms. Consensus trees, for example, are produced by iterative algorithms that provide more robust results.
61+
62+
![macroanalisis2](/assets/img/explora/macroanalisis-consensus-tree.png)
63+
64+
The consensus tree of our corpus confirms the authorship of the *Chiriguanos* text by grouping it with the other analyzed work by Díaz de Guzmán. This model is a powerful tool in stylometry, facilitating comparison and verification of authorship.
65+
66+
[pero-site]: https://hdlab.space/Relacion-de-las-cosas-sucedidas
67+
[acarete-site]: https://hdlab.space/Relacion-de-un-viaje
68+
[ruy-diaz-site]: https://hdlab.space/La-Argentina-Manuscrita/
69+
[centenera-site]: https://hdlab.space/argentina-y-conquista-del-rio-de-la-plata/
70+
[ulrich-site]: https://hdlab.space/viaje-al-rio-de-la-plata/

_i18n/en/investigacion.md

+7
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,7 @@
1+
The research line of the Digital Humanities Laboratory at CONICET (HD LAB) originates from the institutional project *Methodologies in Digital Tools for Research in Social Sciences and Humanities*, initiated in 2014 by Drs. Mela Bosch and Gimena del Rio Riande.
2+
3+
This project led to outreach events and workshops in Digital Humanities: Day of DH 2015 and 2016, a plenary by Mela Bosch at the AAHD 2014 Digital Humanities Conference, the ABGRA Symposium in 2015 and 2016, and a Digital Humanities event at the National University of Mar del Plata, among others.
4+
5+
As research and collaborations advanced (with the CIM at the National University of Rosario, LINHD Madrid, TaDIRAH - DARIAH, Big Data Machine, Pelagios Commons), the Digital Humanities Laboratory began to take shape with its own professional core and agenda.
6+
7+
HD LAB was founded in 2018. According to **Resolution No. 4436/17**, the Digital Humanities Laboratory was institutionally consolidated through the incorporation of the study and development of digital tools for research in Digital Humanities into the missions of CAICYT. Furthermore, since 2022, HD LAB has been based at the [Institute for Bibliographic Research and Textual Criticism (IIBICRIT-CONICET)](https://iibicrit.conicet.gov.ar/) as a laboratory dedicated to research in Digital Humanities.

_i18n/es.yml

+5
Original file line numberDiff line numberDiff line change
@@ -32,3 +32,8 @@ form:
3232
email: Email
3333
message: Mensaje
3434
send: Enviar
35+
36+
bibliotecadigital:
37+
heading: "Acceda a nuestras ediciones digitales"
38+
viajes: "Relatos de viaje"
39+
miscelanea: "Misceláneas"

_i18n/es/bibliotecadigital.md

+5-2
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,10 @@ La edición digital de textos constituye una de las líneas más representativas
22

33
Todas las ediciones digitales de nuestra Biblioteca Digital siguen el estándar XML-TEI y los principios de la minimal computing, y privilegian el uso de tecnologías abiertas, la gobernanza sobre datos y código y la reducción de infraestructura informática. Se completan los textos enriquecidos con notas y mapas con visualizaciones de datos y diferentes recursos de lectura distante.
44

5+
<!-- # Acceda a nuestras ediciones digitales
56
6-
# Acceda a nuestras ediciones digitales
77
8-
## Relatos de viaje
8+
## Relatos de viaje
9+
10+
11+
## Miscelánea -->

_pages/bibliotecadigital.html

+12-4
Original file line numberDiff line numberDiff line change
@@ -14,12 +14,21 @@ <h1 class="hero__title">{% t titles.library %}</h1>
1414

1515
<div class="container">
1616
<div class="page-content" id="main">
17-
17+
{% if page.lang == "en" %}
18+
{% assign t = site.data.en %}
19+
{% else %}
20+
{% assign t = site.data.es %}
21+
{% endif %}
22+
1823
{% tf bibliotecadigital.md %}
24+
<h1>{{ t.bibliotecadigital.heading }}</h1>
25+
1926

27+
<h2>{{ t.bibliotecadigital.viajes }}</h2>
2028
<div class="container">
2129
<div class="post-list" itemscope="" itemtype="http://schema.org/Blog">
2230
<!-- Card LAM -->
31+
2332
<div class="post-card" itemprop="blogPosts" itemscope="" itemtype="http://schema.org/BlogPosting" style="box-shadow: 0 1px 3px rgba(0, 0, 0, 0.35);">
2433
<a href="http://hdlabconicet.github.io/La-Argentina-Manuscrita" target="_blank"><img src="{{site.baseurl}}/assets/img/chico_arg-manus-pagina-uno.png" align="center"/></a>
2534
<a class="post-card__inner" href="http://hdlabconicet.github.io/La-Argentina-Manuscrita" target="_blank">
@@ -70,9 +79,8 @@ <h2>Viaje al Río de la Plata</h2>
7079
</div>
7180
</div>
7281

73-
74-
75-
<h2>Miscelánea</h2>
82+
<h2>{{ t.bibliotecadigital.miscelanea }}</h2>
83+
<!-- <h2>Miscelánea</h2> -->
7684

7785
<div class="container">
7886
<div class="post-list" itemscope="" itemtype="http://schema.org/Blog">

0 commit comments

Comments
 (0)