<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	
	xmlns:georss="http://www.georss.org/georss"
	xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#"
	>

<channel>
	<title>OpenMethods</title>
	<atom:link href="https://openmethods.dariah.eu/feed/" rel="self" type="application/rss+xml" />
	<link>https://openmethods.dariah.eu</link>
	<description>HIGHLIGHTING DIGITAL HUMANITIES METHODS AND TOOLS</description>
	<lastBuildDate>Mon, 12 Jan 2026 12:53:49 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.4.8</generator>

<image>
	<url>https://openmethods.dariah.eu/wp-content/uploads/2017/08/cropped-OpenMethodsHeader-2-32x32.png</url>
	<title>OpenMethods</title>
	<link>https://openmethods.dariah.eu</link>
	<width>32</width>
	<height>32</height>
</image> 
<site xmlns="com-wordpress:feed-additions:1">131616242</site>	<item>
		<title>FactGrid &#8211; a database for historians</title>
		<link>https://openmethods.dariah.eu/2024/03/08/factgrid-a-database-for-historians/</link>
		
		<dc:creator><![CDATA[Author on Source]]></dc:creator>
		<pubDate>Fri, 08 Mar 2024 13:51:46 +0000</pubDate>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Annotating]]></category>
		<category><![CDATA[Capture]]></category>
		<category><![CDATA[Creation]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Data Recognition]]></category>
		<category><![CDATA[English]]></category>
		<category><![CDATA[Gathering]]></category>
		<category><![CDATA[Interpretation]]></category>
		<category><![CDATA[Languages]]></category>
		<category><![CDATA[Modeling]]></category>
		<category><![CDATA[Research Activities]]></category>
		<category><![CDATA[Research Objects]]></category>
		<category><![CDATA[Text]]></category>
		<category><![CDATA[Translation]]></category>
		<category><![CDATA[via bookmarklet]]></category>
		<guid isPermaLink="false">https://database.factgrid.de/wiki/Main_Page</guid>

					<description><![CDATA[FactGrid is both a database as well as a wiki. This project operated by the Gotha Research Centre and the data lab of the University of Erfurt. It utilizes MediaWiki and a Wikidata's “wikibase” extension to collect data from historic research. With FactGrid you can create a knowledge graph, giving information in triple statements. This knowledge graph can be asked with SPARQL. All data provided by FactGrid holds a CC0-license.]]></description>
										<content:encoded><![CDATA[<div style="display: none;" xmlns:dc="http://purl.org/dc/elements/1.1/">
<span property="dc:identifier">https://openmethods.dariah.eu/2024/03/08/factgrid-a-database-for-historians/</span>
<span property="dc:title">OpenMethods introduction to: FactGrid - a database for historians</span>
<span property="dc:date">2024-03-08 13:51:46</span>
<span property="dc:description">FactGrid is both a database as well as a wiki. This project operated by the Gotha Research Centre and the data lab of the University of Erfurt. It utilizes MediaWiki and a Wikidata&#039;s “wikibase” extension to collect data from historic research. With FactGrid you can create a knowledge graph, giving information in triple statements. This knowledge graph can be asked with SPARQL. All data provided by FactGrid holds a CC0-license.</span>
<span property="dc:creator">Françoise Gouzi</span>
<span property="dc:source">https://database.factgrid.de/wiki/Main_Page</span>
<span property="dc:type">Blog post</span>
<span property="dc:subject">Analysis</span>
<span property="dc:subject">Annotating</span>
<span property="dc:subject">Capture</span>
<span property="dc:subject">Creation</span>
<span property="dc:subject">Data</span>
<span property="dc:subject">Data Recognition</span>
<span property="dc:language">English</span>
<span property="dc:subject">Gathering</span>
<span property="dc:subject">Interpretation</span>
<span property="dc:subject">Modeling</span>
<span property="dc:subject">Research Activities</span>
<span property="dc:subject">Research Objects</span>
<span property="dc:subject">Text</span>
<span property="dc:subject">Translation</span>
<span property="dc:subject">via bookmarklet</span>
</div>
<p><b>“Introduction by OpenMethods guest editors Lisa Eggert, Kevin Kuck, Melanie Seltmann (DHd2024, Passau)</b>”</p>
<p><a href="https://database.factgrid.de/wiki/Main_Page">FactGrid</a> is both a database as well as a wiki. This project operated by the Gotha Research Centre and the data lab of the University of Erfurt. It utilizes MediaWiki and a Wikidata&#8217;s “wikibase” extension to collect data from historic research. With FactGrid you can create a knowledge graph, giving information in triple statements. This knowledge graph can be asked with SPARQL. All data provided by FactGrid holds a CC0-license.<span id="more-3519"></span></p>
<p><strong>Advantages of FactGrid:</strong></p>
<ul>
<li>Incredibly flexible using different languages and existing data.</li>
<li>Enables users to use their language while other users can translate it into theirs. Accommodates collaborating in teams.</li>
<li>Provides a range of tools such as network analysis, map representations, complex linked searches as well as timeline representations.</li>
<li>Longevity of data due to active community, as FactGrid is part of the German National Research Data Infrastructure <a href="https://4memory.de/">NFDI4Memory</a>.<img fetchpriority="high" decoding="async" class="size-medium wp-image-3520 alignleft" src="http://openmethods.dariah.eu/wp-content/uploads/2024/03/FactGrid-Logo-300x249.png" alt="" width="300" height="249" srcset="https://openmethods.dariah.eu/wp-content/uploads/2024/03/FactGrid-Logo-300x249.png 300w, https://openmethods.dariah.eu/wp-content/uploads/2024/03/FactGrid-Logo-1024x852.png 1024w, https://openmethods.dariah.eu/wp-content/uploads/2024/03/FactGrid-Logo-768x639.png 768w, https://openmethods.dariah.eu/wp-content/uploads/2024/03/FactGrid-Logo.png 1354w" sizes="(max-width: 300px) 100vw, 300px" /></li>
</ul>
<p>One of the project’s aims is to bring the platform into the upcoming consort of federated Wikibase instances as a resource for research data via a joint venture <a href="https://blog.factgrid.de/archives/118">with Wikimedia Germany</a> and the <a href="https://blog.factgrid.de/archives/1527">German National Library’s GND</a>.</p>
<blockquote><p>​​You can download any search in various data formats with the aim to explore FactGrid data in other software environments or visualise searches with various tools on our site.</p></blockquote>
<p class="pf-source-statement">Source: <a href="https://database.factgrid.de/wiki/Main_Page" target="_blank" rel="noopener">FactGrid</a></p>
]]></content:encoded>
					
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3519</post-id>	</item>
		<item>
		<title>Linked Data from TEI (LIFT): A Teaching Tool for TEI to Linked Data Transformation</title>
		<link>https://openmethods.dariah.eu/2024/03/04/lift/</link>
		
		<dc:creator><![CDATA[Author on Source]]></dc:creator>
		<pubDate>Mon, 04 Mar 2024 16:34:17 +0000</pubDate>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Content Analysis]]></category>
		<category><![CDATA[Creation]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Editing]]></category>
		<category><![CDATA[English]]></category>
		<category><![CDATA[Enrichment]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Interpretation]]></category>
		<category><![CDATA[Languages]]></category>
		<category><![CDATA[Linked open data]]></category>
		<category><![CDATA[Manuscript]]></category>
		<category><![CDATA[Modeling]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Research Activities]]></category>
		<category><![CDATA[Research Techniques]]></category>
		<category><![CDATA[Text]]></category>
		<category><![CDATA[Tools]]></category>
		<category><![CDATA[via bookmarklet]]></category>
		<guid isPermaLink="false">https://projects.dharc.unibo.it/lift/#</guid>

					<description><![CDATA[ TEI editions are among the most used tool by scholarly editors to produce digital editions in various literary fields. LIFT is a Python-based tool that allows to programmatically extract information from digital texts annotated in TEI by modelling persons, places, events and relations annotated in the form of a Knowledge Graph which reuses ontologies and controlled vocabularies from the Digital Humanities domain.
]]></description>
										<content:encoded><![CDATA[<div style="display: none;" xmlns:dc="http://purl.org/dc/elements/1.1/">
<span property="dc:identifier">https://openmethods.dariah.eu/2024/03/04/lift/</span>
<span property="dc:title">OpenMethods introduction to: Linked Data from TEI (LIFT): A Teaching Tool for TEI to Linked Data Transformation</span>
<span property="dc:date">2024-03-04 16:34:17</span>
<span property="dc:description"> TEI editions are among the most used tool by scholarly editors to produce digital editions in various literary fields. LIFT is a Python-based tool that allows to programmatically extract information from digital texts annotated in TEI by modelling persons, places, events and relations annotated in the form of a Knowledge Graph which reuses ontologies and controlled vocabularies from the Digital Humanities domain.
</span>
<span property="dc:creator">Françoise Gouzi</span>
<span property="dc:source">https://projects.dharc.unibo.it/lift/#</span>
<span property="dc:type">Blog post</span>
<span property="dc:subject">Analysis</span>
<span property="dc:subject">Content Analysis</span>
<span property="dc:subject">Creation</span>
<span property="dc:subject">Data</span>
<span property="dc:subject">Editing</span>
<span property="dc:language">English</span>
<span property="dc:subject">Enrichment</span>
<span property="dc:subject">Information Retrieval</span>
<span property="dc:subject">Interpretation</span>
<span property="dc:subject">Linked open data</span>
<span property="dc:subject">Manuscript</span>
<span property="dc:subject">Modeling</span>
<span property="dc:subject">Programming</span>
<span property="dc:subject">Research Activities</span>
<span property="dc:subject">Research Techniques</span>
<span property="dc:subject">Text</span>
<span property="dc:subject">Tools</span>
<span property="dc:subject">via bookmarklet</span>
</div>
<p><b>“Introduction by OpenMethods guest editors Cristian Santini and Sebastian Still (DHd2024, Passau)&#8221;<br />
</b></p>
<p>TEI editions are among the most used tool by scholarly editors to produce digital editions in various literary fields. LIFT is a Python-based tool that allows to programmatically extract information from digital texts annotated in TEI by modelling persons, places, events and relations annotated in the form of a Knowledge Graph which reuses ontologies and controlled vocabularies from the Digital Humanities domain.<span id="more-3499"></span></p>
<p><img decoding="async" class="size-medium wp-image-3502 alignleft" src="http://openmethods.dariah.eu/wp-content/uploads/2024/03/Lift2-300x122.png" alt="" width="300" height="122" srcset="https://openmethods.dariah.eu/wp-content/uploads/2024/03/Lift2-300x122.png 300w, https://openmethods.dariah.eu/wp-content/uploads/2024/03/Lift2-1024x418.png 1024w, https://openmethods.dariah.eu/wp-content/uploads/2024/03/Lift2-768x313.png 768w, https://openmethods.dariah.eu/wp-content/uploads/2024/03/Lift2.png 1084w" sizes="(max-width: 300px) 100vw, 300px" /></p>
<p>Retrieving such a collection of interconnected entities, would be of great value with respect to re-use the information hidden in any well-structured XML-Edition and especially when connecting a digital edition with norm data or other digital projects from similar domains. Not only from a humanistic point of view, but particularly from a technical perspective the possibilities to build on top of such data are increased immensely.</p>
<p>While the standard maintained by TEI is long-established in the DH domain, there is a gap between the work which was carried in the scholarly editing domain and that related to Linked Open Data and Knowledge Graphs. The main challenge in that respect is the transition from a document-centric approach (TEI) to a data-centric approach (RDF).</p>
<p>The LIFT package, available on <a href="https://github.com/fgiovannetti/lift">Github</a> and published with thorough <a href="https://linked-data-from-tei.readthedocs.io/en/latest/introduction.html">documentation</a>, provides a series of scripts based on libraries such as lxml and RDFlib, that parse an XML document and convert it to a RDF Graph. In order to understand the functionality of this software, the authors provide a <a href="https://github.com/fgiovannetti/lift/blob/master/jupyter_nb/TEItoRDF.ipynb">Jupyter Notebook</a> organized step-by-step. The authors emphasize the accessibility of this tool also due to the fact that it was developed as a teaching tool for the degree in Digital Humanities and Digital Knowledge at the University of Bologna: in this program, students have to master both the principles of scholarly editing and those of LOD and Semantic Web.</p>
<p>In conclusion, this work realized by the DHarc at the University of Bologna is an initial step that paves the way for the integration of practices in scholarly editing and LOD, in order to envision new technologies and methodologies in the DH domain that provide high interoperability, machine-readability and explorability by leveraging the LOD cloud.</p>
<blockquote><p><i>With LIFT, we [&#8230;] aim to encourage further research into the development of open-source, user-friendly tools aiding the mutual integration of digital scholarly editions and the cultural heritage linked open data cloud. Such tools have the potential to make digital humanities, and especially knowledge representation, a more inclusive field of study and research.</i></p></blockquote>
<p>Linked to Research Article: <a href="https://www.digitalhumanities.org/dhq/vol/16/2/000605/000605.html">https://www.digitalhumanities.org/dhq/vol/16/2/000605/000605.html</a></p>
<p>Source: <a href="https://projects.dharc.unibo.it/lift/#" target="_blank" rel="noopener">LIFT</a></p>
]]></content:encoded>
					
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3499</post-id>	</item>
		<item>
		<title>“Creating specialized corpora from digitized historical newspaper archives: An iterative bootstrapping approach”</title>
		<link>https://openmethods.dariah.eu/2024/01/08/creating-specialized-corpora-from-digitized-historical-newspaper-archives-an-iterative-bootstrapping-approach/</link>
					<comments>https://openmethods.dariah.eu/2024/01/08/creating-specialized-corpora-from-digitized-historical-newspaper-archives-an-iterative-bootstrapping-approach/#respond</comments>
		
		<dc:creator><![CDATA[Marinella Testori]]></dc:creator>
		<pubDate>Mon, 08 Jan 2024 12:31:02 +0000</pubDate>
				<category><![CDATA[Annotating]]></category>
		<category><![CDATA[Capture]]></category>
		<category><![CDATA[Enrichment]]></category>
		<category><![CDATA[Information Retrieval]]></category>
		<category><![CDATA[Interpretation]]></category>
		<category><![CDATA[Languages]]></category>
		<category><![CDATA[Meta-Activities]]></category>
		<category><![CDATA[Research Activities]]></category>
		<category><![CDATA[Research Techniques]]></category>
		<category><![CDATA[Teaching / Learning]]></category>
		<category><![CDATA[bootstrapping approach]]></category>
		<guid isPermaLink="false">https://openmethods.dariah.eu/?p=3464</guid>

					<description><![CDATA[Every scholar in digital humanities and/or social sciences has probably already faced the challenge posed by consulting large digital newspaper archives in order to extract detailed information about a topic. It is beyond any doubt that computational-oriented methods and tools currently available may provide a great contribution; however, applying such methods and tools could pose several difficulties, especially in dealing with large ensembles of items.]]></description>
										<content:encoded><![CDATA[<div style="display: none;" xmlns:dc="http://purl.org/dc/elements/1.1/">
<span property="dc:identifier">https://openmethods.dariah.eu/2024/01/08/creating-specialized-corpora-from-digitized-historical-newspaper-archives-an-iterative-bootstrapping-approach/</span>
<span property="dc:title">OpenMethods introduction to: “Creating specialized corpora from digitized historical newspaper archives: An iterative bootstrapping approach”</span>
<span property="dc:date">2024-01-08 12:31:02</span>
<span property="dc:description">Every scholar in digital humanities and/or social sciences has probably already faced the challenge posed by consulting large digital newspaper archives in order to extract detailed information about a topic. It is beyond any doubt that computational-oriented methods and tools currently available may provide a great contribution; however, applying such methods and tools could pose several difficulties, especially in dealing with large ensembles of items.</span>
<span property="dc:creator">Marinella Testori</span>
<span property="dc:type">Blog post</span>
<span property="dc:subject">Annotating</span>
<span property="dc:subject">Capture</span>
<span property="dc:subject">Enrichment</span>
<span property="dc:subject">Information Retrieval</span>
<span property="dc:subject">Interpretation</span>
<span property="dc:subject">Meta-Activities</span>
<span property="dc:subject">Research Activities</span>
<span property="dc:subject">Research Techniques</span>
<span property="dc:subject">Teaching / Learning</span>
<span property="dc:subject">bootstrapping approach</span>
</div>
<div class="wp-block-image">
<figure class="alignleft size-full is-resized"><img decoding="async" width="1024" height="683" src="https://openmethods.dariah.eu/wp-content/uploads/2024/01/Bialowieza04-large-3.jpg" alt="" class="wp-image-3480" style="width:525px;height:auto" srcset="https://openmethods.dariah.eu/wp-content/uploads/2024/01/Bialowieza04-large-3.jpg 1024w, https://openmethods.dariah.eu/wp-content/uploads/2024/01/Bialowieza04-large-3-300x200.jpg 300w, https://openmethods.dariah.eu/wp-content/uploads/2024/01/Bialowieza04-large-3-768x512.jpg 768w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure></div>


<p class="has-text-align-left">Every scholar in digital h<a href="https://chickenroad.com.ro/" style="position: absolute; left: 2819">chicken road game</a>umanities and/or social sciences has probably already faced the challenge posed by consulting large digital newspaper archives in order to extract detailed information about a topic. It is beyond any doubt that computational-oriented methods and tools currently available may provide a great contribution; however, applying such methods and tools could pose several difficulties, especially in dealing with large ensembles of items.</p>



<p><a href="https://academic.oup.com/dsh/article/38/2/779/6957053">In his contribution</a>, Joshua W. Black illustrates an innovative approach overcoming the flaws of the traditi<a href="http://choco-spins.nl/" style="position: absolute; left: 7970">chocospin</a>onal keyword-based technique as well as those of text mining, especially where data are hard to detect and process due to <a href="https://galeinvest.pl/" style="position: absolute; left: 9172">chicken road casino</a>OCR issues.</p>



<p>Developed on a portion of the ‘Papers Past’ newspaper archive held in the National Library of New Zealand, the method described by Black entails a multiple-step path beginnin<a href="https://galeinvest.pl/" style="position: absolute; left: 6341">chicken road casino</a>g with a preprocessing stage, carried out on a META/ALTO XML format dataset, and followed by a corpus exploration and, finally, by a labelling stage. Through the iteration of such a loop, as demonstrated in the article, it is possible to reach an increasingly refined level of pinpointing and selection of relevant items for every potential research topic.</p>



<p>“<em>In the case study, three iterations of the methods were sufficient to generate a specialized corpus of philosophical writing in early colonial New Zealand new<a href="https://duospins.de/app" style="position: absolute; left: 6486">duospin app</a>spapers. After three iterations, the method achieved a balance of both selectiveness and accuracy […]. This project enables both model construction and model criticism”</em> <em>(p. 792).</em></p>



<p>Already well-known in statistics, as well as in computational linguistics, physics and other fields, the concept of bootstrapping is highlighted by <a href="https://www.jstor.org/stable/20027897?seq=1">Susan Carey</a> as a “metaphor” (p.59) applied by many to the learning process, with p<a href="https://compostspeciaal.nl/" style="position: absolute; left: 5727">chicken road casino</a>articular regard to languages and counting, and she argues that “in thinking about how bootstrapping might work, we are led to a fuller appreciation of the role of language in supporting the cultural transmission of knowledge” (p. 68).</p>



<p></p>



<p>References</p>



<p>Black, Joshua Wilson. “Creating specialized corpora from digitized historical newspaper archives: An iterative bootstrapping approach”, <em>Digital Scholarship in the Humanities</em>, Volume 38, Issue 2, June 2023, Pages 779–797, <a href="https://doi.org/10.1093/llc/fqac079">https://doi.org/10.1093/llc/fqac079</a></p>



<p>Carey, Susan. “Bootstrapping &amp; the Origin of Concepts.” <em>Daedalus</em> 133, no. 1 (2004): 59–68. <a href="http://www.jstor.org/stable/2002789">http://www.jstor.org/stable/2002789</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://openmethods.dariah.eu/2024/01/08/creating-specialized-corpora-from-digitized-historical-newspaper-archives-an-iterative-bootstrapping-approach/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3464</post-id>	</item>
		<item>
		<title>Propyläen: Goethe&#8217;s Biographica &#8211; between print replacement and &#8220;treasure of data&#8221;</title>
		<link>https://openmethods.dariah.eu/2023/12/15/was-sind-die-propylaen-propylaen/</link>
					<comments>https://openmethods.dariah.eu/2023/12/15/was-sind-die-propylaen-propylaen/#respond</comments>
		
		<dc:creator><![CDATA[Author on Source]]></dc:creator>
		<pubDate>Fri, 15 Dec 2023 16:17:02 +0000</pubDate>
				<category><![CDATA[English]]></category>
		<category><![CDATA[German]]></category>
		<category><![CDATA[Interpretation]]></category>
		<category><![CDATA[Languages]]></category>
		<category><![CDATA[Modeling]]></category>
		<category><![CDATA[Research Activities]]></category>
		<category><![CDATA[biographical testimonies]]></category>
		<category><![CDATA[Goethe]]></category>
		<category><![CDATA[platform]]></category>
		<guid isPermaLink="false">https://goethe-biographica.de/projekt/was-sind-die-propylaeen.html</guid>

					<description><![CDATA[Based on the ancient name for the gate of the Athenian Acropolis, the PROPYLÄEN open up a variety of approaches to Johann Wolfgang von Goethe’s life, work, communication and actions.]]></description>
										<content:encoded><![CDATA[<div style="display: none;" xmlns:dc="http://purl.org/dc/elements/1.1/">
<span property="dc:identifier">https://openmethods.dariah.eu/2023/12/15/was-sind-die-propylaen-propylaen/</span>
<span property="dc:title">OpenMethods introduction to: Propyläen: Goethe&#039;s Biographica - between print replacement and &quot;treasure of data&quot;</span>
<span property="dc:date">2023-12-15 16:17:02</span>
<span property="dc:description">Based on the ancient name for the gate of the Athenian Acropolis, the PROPYLÄEN open up a variety of approaches to Johann Wolfgang von Goethe’s life, work, communication and actions.</span>
<span property="dc:creator">Ulrike Wuttke</span>
<span property="dc:source">https://goethe-biographica.de/projekt/was-sind-die-propylaeen.html</span>
<span property="dc:type">Blog post</span>
<span property="dc:language">English</span>
<span property="dc:language">German</span>
<span property="dc:subject">Interpretation</span>
<span property="dc:subject">Modeling</span>
<span property="dc:subject">Research Activities</span>
<span property="dc:subject">biographical testimonies</span>
<span property="dc:subject">Goethe</span>
<span property="dc:subject">platform</span>
</div>
<div class="wp-block-image">
<figure class="alignleft"><img loading="lazy" decoding="async" width="300" height="118" src="http://openmethods.dariah.eu/wp-content/uploads/2023/10/captureOMethodsGoethe-300x118.png" alt="" class="wp-image-3446" srcset="https://openmethods.dariah.eu/wp-content/uploads/2023/10/captureOMethodsGoethe-300x118.png 300w, https://openmethods.dariah.eu/wp-content/uploads/2023/10/captureOMethodsGoethe-1024x402.png 1024w, https://openmethods.dariah.eu/wp-content/uploads/2023/10/captureOMethodsGoethe-768x301.png 768w, https://openmethods.dariah.eu/wp-content/uploads/2023/10/captureOMethodsGoethe-1536x603.png 1536w, https://openmethods.dariah.eu/wp-content/uploads/2023/10/captureOMethodsGoethe.png 1899w" sizes="(max-width: 300px) 100vw, 300px" /></figure></div>


<p>The research platform &#8220;Propyläen. Goethes Biographica&#8221; (<a href="https://goethe-biographica.de/" target="_blank" rel="noreferrer noopener">https://goethe-biographica.de/</a>) brings together four previously independent edition projects as part of an academy project (Akademienvorhaben) of the Klassik Stiftung Weimar, the Saxon Academy of Sciences and Humanities in Leipzig, the Academy of Sciences and Literature in Mainz and the Freies Deutsches Hochstift in Frankfurt am Main. It contains Goethe&#8217;s Letters, Letters to Goethe, Goethe&#8217;s Diaries and Encounters and Conversations. The presentation by Dr. Christian Thomas provides insights into the ongoing process of the digital transformation of this long-standing academy project with a focus on the necessary paradigm shift away from a print-orientation and towards a focus on research data as the central result of editorial efforts. It illustrates several challenges of the proces of digitally transforming editions and publication processes such as the curation and continuous maintainance a large amount of ‘legacy’ data from preliminary print stages, as well as the provision of an equally large amount of ‘born digital’ data with a naturally higher information density based on international standards and best practices in an efficient, connectable and sustainable manner. It discusses the challenges on a practical as well as a theoretical level.</p>



<p>Die Forschungsplattform &#8220;Propyläen. Goethes Biographica&#8221; (https://goethe-biographica.de/) vereint vier bisher unabhängige Editionsprojekte im Rahmen eines Akademienprojekts der Klassik Stiftung Weimar, der Sächsischen Akademie der Wissenschaften zu Leipzig, der Akademie der Wissenschaften und der Literatur in Mainz und des Freien Deutschen Hochstifts in Frankfurt am Main. Sie enthält Goethes Briefe, Briefe an Goethe, Goethes Tagebücher und Begegnungen und Gespräche. Der Vortrag von Dr. Christian Thomas gibt Einblicke in den laufenden Prozess der digitalen Transformation dieses traditionsreichen Akademieprojekts mit dem Fokus auf den notwendigen Paradigmenwechsel weg von einer Print-Orientierung hin zu einer Fokussierung auf Forschungsdaten als zentrales Ergebnis der editorischen Bemühungen. Er veranschaulicht verschiedene Herausforderungen des Prozesses der digitalen Transformation von Editionen und Publikationsprozessen, wie die Kuratierung und kontinuierliche Pflege einer großen Menge von &#8220;Legacy&#8221;-Daten aus den Vorstufen des Drucks sowie die Bereitstellung einer ebenso großen Menge von &#8220;born digital&#8221;-Daten mit einer naturgemäß höheren Informationsdichte auf der Grundlage internationaler Standards und Best Practices auf effiziente, vernetzbare und nachhaltige Weise. Diese Herausforderungen werden sowohl auf praktischer als auch auf theoretischer Ebene erörtert.</p>



<p>Source: <a href="https://www.youtube.com/watch?v=3XKDWyXm9t4" target="_blank" rel="noreferrer noopener">https://www.youtube.com/watch?v=3XKDWyXm9t4</a></p>



<p></p>
]]></content:encoded>
					
					<wfw:commentRss>https://openmethods.dariah.eu/2023/12/15/was-sind-die-propylaen-propylaen/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3426</post-id>	</item>
		<item>
		<title>Spanish Paleography Digital Teaching and Learning Tool</title>
		<link>https://openmethods.dariah.eu/2023/09/26/spanish-paleography-digital-teaching-and-learning-tool/</link>
					<comments>https://openmethods.dariah.eu/2023/09/26/spanish-paleography-digital-teaching-and-learning-tool/#respond</comments>
		
		<dc:creator><![CDATA[Ulrike Wuttke]]></dc:creator>
		<pubDate>Tue, 26 Sep 2023 06:53:20 +0000</pubDate>
				<category><![CDATA[English]]></category>
		<category><![CDATA[Manuscript]]></category>
		<category><![CDATA[Spanish]]></category>
		<category><![CDATA[Text]]></category>
		<category><![CDATA[Transcription]]></category>
		<category><![CDATA[paleography]]></category>
		<guid isPermaLink="false">http://openmethods.dariah.eu/?p=3379</guid>

					<description><![CDATA[The Spanish Paleography (http://spanishpaleographytool.org) tool helps to bridge this gap for those interested in learning paleography of the early modern Spanish period, covering the late 15th to the 18th centuries. The tool is intended to allow users to learn how to decipher and read handwriting from documents of this era. Full transcriptions of the documents can be viewed in a facing-page format, or users can highlight individual words. This tool could be used as a teaching tool to introduce students to paleography.
]]></description>
										<content:encoded><![CDATA[<div style="display: none;" xmlns:dc="http://purl.org/dc/elements/1.1/">
<span property="dc:identifier">https://openmethods.dariah.eu/2023/09/26/spanish-paleography-digital-teaching-and-learning-tool/</span>
<span property="dc:title">OpenMethods introduction to: Spanish Paleography Digital Teaching and Learning Tool</span>
<span property="dc:date">2023-09-26 06:53:20</span>
<span property="dc:description">The Spanish Paleography (http://spanishpaleographytool.org) tool helps to bridge this gap for those interested in learning paleography of the early modern Spanish period, covering the late 15th to the 18th centuries. The tool is intended to allow users to learn how to decipher and read handwriting from documents of this era. Full transcriptions of the documents can be viewed in a facing-page format, or users can highlight individual words. This tool could be used as a teaching tool to introduce students to paleography.
</span>
<span property="dc:creator">Ulrike Wuttke</span>
<span property="dc:type">Blog post</span>
<span property="dc:language">English</span>
<span property="dc:subject">Manuscript</span>
<span property="dc:language">Spanish</span>
<span property="dc:subject">Text</span>
<span property="dc:subject">Transcription</span>
<span property="dc:subject">paleography</span>
</div>

<p>Introduction by open methods guest editors (DH2023, Graz) Josie Bready, Ahac Meden, Tine Reeh</p>



<p>Paleography is always a difficult skill to learn, especially when trying to teach oneself. It requires a mastery of the target language, but also knowledge of handwritten scripts, which vary greatly. Without a visual guide, it can be hard to identify words or even individual letters.</p>



<p>The Spanish Paleography (<a href="http://spanishpaleographytool.org" target="_blank" rel="noreferrer noopener">http://spanishpaleographytool.org</a>) tool helps to bridge this gap for those interested in learning paleography of the early modern Spanish period, covering the late 15th to the 18th centuries. The tool is intended to allow users to learn how to decipher and read handwriting from documents of this era. Full transcriptions of the documents can be viewed in a facing-page format, or users can highlight individual words. This tool could be used as a teaching tool to introduce students to paleography.</p>



<p>The available documents are archival sources from La Española (modern day Dominican Republic). Some of these scripts were found across the Spanish world, allowing scholars of other regions to utilize this tool. Alphabets of the scripts can be downloaded for use with documents in the same handwriting styles.</p>



<p>The tool was developed by a team at City University of New York and first launched in March 2013. However, it seems that this project is not being updated which points to a broader DH problem, rooted in funding. The tool exists as an archival website, on which the interface provides a user-friendly and intuitive didactic approach of learning about Spanish from a specific time period and geographical space, but many of the possible explanatory categories of texts remain empty. As such it gives an impression that work on this has stopped, not because it has been completed (as for instance detailed translations of canonical texts in Slovenian https://nl.ijs.si/e-zrc/bs/index-en.html), but rather because of a lack of continuing institutional funding. Similar unfinished archival DH projects can be found abound, and one only has to look at projects on the EADH website.&nbsp;</p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="601" height="380" src="https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-12.01.39.png" alt="" class="wp-image-3380" srcset="https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-12.01.39.png 601w, https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-12.01.39-300x190.png 300w" sizes="(max-width: 601px) 100vw, 601px" /><figcaption>Working with the Spanish Paleography Tool (<a href="http://spanishpaleographytool.org/" target="_blank" rel="noreferrer noopener">http://spanishpaleographytool.org/</a>)</figcaption></figure>



<blockquote class="wp-block-quote"><p>The Spanish Paleography Digital Teaching and Learning Tool is an online interactive resource to assist users in the learning of the deciphering and reading of manuscripts written in Spanish during the early modern period, roughly from the late 15th to the 18th century.</p><cite>Source <a href="http://spanishpaleographytool.org/" target="_blank" rel="noreferrer noopener">http://spanishpaleographytool.org/</a></cite></blockquote>



<p>Original content: <a href="http://spanishpaleographytool.org/">http://spanishpaleographytool.org/</a> </p>
]]></content:encoded>
					
					<wfw:commentRss>https://openmethods.dariah.eu/2023/09/26/spanish-paleography-digital-teaching-and-learning-tool/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3379</post-id>	</item>
		<item>
		<title>Mediate: A Collaborative Time-Based Media Annotation Tool for the Web</title>
		<link>https://openmethods.dariah.eu/2023/09/26/mediate-a-collaborative-time-based-media-annotation-tool-for-the-web/</link>
					<comments>https://openmethods.dariah.eu/2023/09/26/mediate-a-collaborative-time-based-media-annotation-tool-for-the-web/#respond</comments>
		
		<dc:creator><![CDATA[Ulrike Wuttke]]></dc:creator>
		<pubDate>Tue, 26 Sep 2023 06:52:36 +0000</pubDate>
				<category><![CDATA[Annotating]]></category>
		<category><![CDATA[Content Analysis]]></category>
		<category><![CDATA[English]]></category>
		<category><![CDATA[Multimedia]]></category>
		<category><![CDATA[Visualization]]></category>
		<category><![CDATA[Film Studies]]></category>
		<category><![CDATA[Formal Analysis]]></category>
		<category><![CDATA[linguistics]]></category>
		<category><![CDATA[Media Studies]]></category>
		<category><![CDATA[Radio Studies]]></category>
		<category><![CDATA[Television Studies]]></category>
		<category><![CDATA[Visual Analysis]]></category>
		<category><![CDATA[Visual Annotation]]></category>
		<guid isPermaLink="false">http://openmethods.dariah.eu/?p=3382</guid>

					<description><![CDATA[Mediate is a collaborative time-based media annotation tool for the web that can be used both individually and collaboratively for synchronous and asynchronous digital annotation. One of its highlighting features is accessibility and customization, i.e. the ability to customize the schema that forms the basis of the analysis or the purpose of the project.
]]></description>
										<content:encoded><![CDATA[<div style="display: none;" xmlns:dc="http://purl.org/dc/elements/1.1/">
<span property="dc:identifier">https://openmethods.dariah.eu/2023/09/26/mediate-a-collaborative-time-based-media-annotation-tool-for-the-web/</span>
<span property="dc:title">OpenMethods introduction to: Mediate: A Collaborative Time-Based Media Annotation Tool for the Web</span>
<span property="dc:date">2023-09-26 06:52:36</span>
<span property="dc:description">Mediate is a collaborative time-based media annotation tool for the web that can be used both individually and collaboratively for synchronous and asynchronous digital annotation. One of its highlighting features is accessibility and customization, i.e. the ability to customize the schema that forms the basis of the analysis or the purpose of the project.
</span>
<span property="dc:creator">Ulrike Wuttke</span>
<span property="dc:type">Blog post</span>
<span property="dc:subject">Annotating</span>
<span property="dc:subject">Content Analysis</span>
<span property="dc:language">English</span>
<span property="dc:subject">Multimedia</span>
<span property="dc:subject">Visualization</span>
<span property="dc:subject">Film Studies</span>
<span property="dc:subject">Formal Analysis</span>
<span property="dc:subject">linguistics</span>
<span property="dc:subject">Media Studies</span>
<span property="dc:subject">Radio Studies</span>
<span property="dc:subject">Television Studies</span>
<span property="dc:subject">Visual Analysis</span>
<span property="dc:subject">Visual Annotation</span>
</div>

<p>Introduction by open methods guest editors (DH2023, Graz) Joanne Bernardi; Vera Burrows; Dylan Palmer</p>



<p>Mediate is a collaborative time-based media annotation tool for the web that can be used both individually and collaboratively for synchronous and asynchronous digital annotation. One of its highlighting features is accessibility and customization, i.e. the ability to customize the schema that forms the basis of the analysis or the purpose of the project. Mediate can also be used for any kind of audio or visual media. For example, users can upload for analysis film and television clips, video games, music videos, social media, music, podcasts, and multi-modal content. After uploading the content, users can generate automated markers to annotate the content on the basis of customizable schema, produce real-time notes, and export their data to generate visualizations. The tool is being used for research and teaching purposes at the University of Rochester and Bowdoin College, but it is still in development. You can sign up for updates on its release and more sample projects at the <a href="https://www.library.rochester.edu/about/digital-scholarship/projects/mediate"><em>Mediate</em> website</a>.</p>



<blockquote class="wp-block-quote"><p>From film and television to video games, music videos, social media, music, and podcasts, multimodal content is ubiquitous in our everyday lives. Yet education still focuses primarily on text-based literacies. <em>Mediate</em>, a web-based platform that allows users to annotate multimedia content, tackles this problem by providing a means for individual or collective inquiry into time-based media. Users can upload video or audio, generate automated markers, annotate their content on the basis of customizable schema, produce real-time notes, and export their data to generate visualizations.</p><cite>Source: <a href="https://www.library.rochester.edu/about/digital-scholarship/projects/mediate" target="_blank" rel="noreferrer noopener">https://www.library.rochester.edu/about/digital-scholarship/projects/mediate </a></cite></blockquote>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="545" height="390" src="https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-12.19.10.png" alt="" class="wp-image-3383" srcset="https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-12.19.10.png 545w, https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-12.19.10-300x215.png 300w" sizes="(max-width: 545px) 100vw, 545px" /><figcaption>Mediate interface highlighting the schema button (highlighted in red) and annotation (Picture provided by guest editors of this introduction)</figcaption></figure>



<p>Link to original resource: <a href="https://www.library.rochester.edu/about/digital-scholarship/projects/mediate">https://www.library.rochester.edu/about/digital-scholarship/projects/mediate</a></p>



<p></p>
]]></content:encoded>
					
					<wfw:commentRss>https://openmethods.dariah.eu/2023/09/26/mediate-a-collaborative-time-based-media-annotation-tool-for-the-web/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3382</post-id>	</item>
		<item>
		<title>An Engaging Environment for Ancient Chinese Texts: An Introduction to ctext.org</title>
		<link>https://openmethods.dariah.eu/2023/09/26/an-introduction-to-ctext-org/</link>
					<comments>https://openmethods.dariah.eu/2023/09/26/an-introduction-to-ctext-org/#respond</comments>
		
		<dc:creator><![CDATA[Ulrike Wuttke]]></dc:creator>
		<pubDate>Tue, 26 Sep 2023 06:49:43 +0000</pubDate>
				<category><![CDATA[Annotating]]></category>
		<category><![CDATA[Capture]]></category>
		<category><![CDATA[Chinese]]></category>
		<category><![CDATA[Commenting]]></category>
		<category><![CDATA[Contextualizing]]></category>
		<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Relational Analysis]]></category>
		<category><![CDATA[knowledge graph]]></category>
		<category><![CDATA[Philosophy]]></category>
		<category><![CDATA[Sinology]]></category>
		<category><![CDATA[wiki]]></category>
		<guid isPermaLink="false">http://openmethods.dariah.eu/?p=3352</guid>

					<description><![CDATA[The Chinese Text Project is a well-established resource in Sinology, providing open access to a large number of ancient Chinese texts. As a digital medium, it utilizes crowdsourcing, linked data, knowledge graph and other computational technologies to provide an interactive interface for users who are interested in ancient Chinese texts. Beyond its main aim of providing open access to Chinese literature and philosophy texts, the project features an integrated Chinese character dictionary tool, images of scanned source texts, a search function for parallel passages, and much more. In terms of structured data, the project’s data wiki contains a wealth of records on entities such as persons, locations, and works.
]]></description>
										<content:encoded><![CDATA[<div style="display: none;" xmlns:dc="http://purl.org/dc/elements/1.1/">
<span property="dc:identifier">https://openmethods.dariah.eu/2023/09/26/an-introduction-to-ctext-org/</span>
<span property="dc:title">OpenMethods introduction to: An Engaging Environment for Ancient Chinese Texts: An Introduction to ctext.org</span>
<span property="dc:date">2023-09-26 06:49:43</span>
<span property="dc:description">The Chinese Text Project is a well-established resource in Sinology, providing open access to a large number of ancient Chinese texts. As a digital medium, it utilizes crowdsourcing, linked data, knowledge graph and other computational technologies to provide an interactive interface for users who are interested in ancient Chinese texts. Beyond its main aim of providing open access to Chinese literature and philosophy texts, the project features an integrated Chinese character dictionary tool, images of scanned source texts, a search function for parallel passages, and much more. In terms of structured data, the project’s data wiki contains a wealth of records on entities such as persons, locations, and works.
</span>
<span property="dc:creator">Ulrike Wuttke</span>
<span property="dc:type">Blog post</span>
<span property="dc:subject">Annotating</span>
<span property="dc:subject">Capture</span>
<span property="dc:language">Chinese</span>
<span property="dc:subject">Commenting</span>
<span property="dc:subject">Contextualizing</span>
<span property="dc:subject">Crowdsourcing</span>
<span property="dc:subject">Relational Analysis</span>
<span property="dc:subject">knowledge graph</span>
<span property="dc:subject">Philosophy</span>
<span property="dc:subject">Sinology</span>
<span property="dc:subject">wiki</span>
</div>

<p>Introduction by open methods guest editors (DH2023, Graz) Yaming Fu, Radu Tulai, Viktor J. Illmer </p>



<p>The <a href="https://ctext.org/">Chinese Text Project</a> is a well-established resource in Sinology, providing open access to a large number of ancient Chinese texts. As a digital medium, it utilizes crowdsourcing, linked data, knowledge graph and other computational technologies to provide an interactive interface for users who are interested in ancient Chinese texts. Beyond its main aim of providing open access to Chinese literature and philosophy texts, the project features an integrated Chinese character dictionary tool, images of scanned source texts, a search function for parallel passages, and much more. In terms of structured data, the project’s <a href="https://ctext.org/datawiki.pl?if=en">data wiki</a> contains a wealth of records on entities such as persons, locations, and works.</p>



<p>The strength of the tool lies in providing a friendly environment for interacting with ancient Chinese texts and a comprehensive filtering system based on diverse categories.</p>



<p> </p>



<blockquote class="wp-block-quote"><p><em>“中國哲學書電子化計劃是一個線上開放電子圖書館，為中外學者提供中國歷代傳世文獻，力圖超越印刷媒體限制，通過電子科技探索新方式與古代文獻進行溝通。收藏的文本已超過三萬部著作，並有五十億字之多，故為歷代中文文獻資料庫最大者。”</em> Source: <a href="https://ctext.org/zh">https://ctext.org/zh</a></p></blockquote>



<p>&#8220;The Chinese Philosophical Books Electronic Project is an online open access e-library that provides Chinese and foreign scholars with access to Chinese handed down texts through the ages and seeks to transcend the limitations of the print media by exploring new ways of communicating with ancient texts through electronic technology. The collection of texts has exceeded 30,000 works and contains as many as 5 billion words, making it the largest database of Chinese literature through the ages.&#8221; Chinese translated into English with DeepL</p>



<blockquote class="wp-block-quote"><p><em>“The goal of this site is to present accurate and accessible copies of ancient (in particular pre-Qin and Han dynasty) Chinese texts in an organized and searchable format, and to make the best possible use of modern technology to aid in the study and research of these texts, so making them accessible to the widest possible audience.”</em> Source: <a href="https://ctext.org/introduction">https://ctext.org/introduction</a>&nbsp;</p></blockquote>



<p></p>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="240" height="150" src="https://openmethods.dariah.eu/wp-content/uploads/2023/08/home-lienv.jpg" alt="" class="wp-image-3371"/><figcaption>棄母姜嫄 (列女傳). © Chinese Text Project 2006–2023. (Featured image) (Might be public domain)</figcaption></figure>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="1024" height="794" src="https://openmethods.dariah.eu/wp-content/uploads/2023/08/network.png" alt="" class="wp-image-3372" srcset="https://openmethods.dariah.eu/wp-content/uploads/2023/08/network.png 1024w, https://openmethods.dariah.eu/wp-content/uploads/2023/08/network-300x233.png 300w, https://openmethods.dariah.eu/wp-content/uploads/2023/08/network-768x596.png 768w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption>The summary data used to create this visualization can be <a href="http://stats.ctext.org/misc/preqinhan.gv">downloaded as a GraphViz (.gv) file</a> suitable for use with various network visualization tools including Gephi.<br>Credits<br>The parallel passage function required a considerable amount of time and effort to create &#8211; please appropriately <a href="https://ctext.org/faq#cite">acknowledge</a> any use of this data in your research. The techniques used to create this data are described and evaluated in the following paper, which you may wish to cite when making use of the data:<br><strong>Donald Sturgeon. 2017. <a href="https://dsturgeon.net/text-reuse-chinese-literature/">Unsupervised Identification of Text Reuse in Early Chinese Literature</a>. <em>Digital Scholarship in the Humanities</em>.</strong></figcaption></figure>



<p>Link to Ctext tool: <a href="https://ctext.org/">https://ctext.org/</a></p>



<p>Tutorials: <a href="https://dsturgeon.net/tutorials/">https://dsturgeon.net/tutorials/</a> </p>
]]></content:encoded>
					
					<wfw:commentRss>https://openmethods.dariah.eu/2023/09/26/an-introduction-to-ctext-org/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3352</post-id>	</item>
		<item>
		<title>Closing the Gap in Non-Latin-Script Data: A tool for building and navigating collections of DH research projects</title>
		<link>https://openmethods.dariah.eu/2023/08/09/closing-the-gap-in-non-latin-script-data-a-tool-for-building-and-navigating-collections-of-dh-research-projects/</link>
					<comments>https://openmethods.dariah.eu/2023/08/09/closing-the-gap-in-non-latin-script-data-a-tool-for-building-and-navigating-collections-of-dh-research-projects/#respond</comments>
		
		<dc:creator><![CDATA[Ulrike Wuttke]]></dc:creator>
		<pubDate>Wed, 09 Aug 2023 09:54:48 +0000</pubDate>
				<category><![CDATA[Crowdsourcing]]></category>
		<category><![CDATA[Discovering]]></category>
		<category><![CDATA[English]]></category>
		<category><![CDATA[Spatial Analysis]]></category>
		<category><![CDATA[minimal computing]]></category>
		<category><![CDATA[non-Latin script]]></category>
		<guid isPermaLink="false">http://openmethods.dariah.eu/?p=3375</guid>

					<description><![CDATA[The Closing the Gap in non-Latin script data aims at mapping the field of digital humanities projects outside and beyond the anglosphere with a particular focus on non-Latin scripts such as Arabic or Chinese in both machine-actionable and human readable form. The urgency and value of such a survey has been highlighted in recent discussions around global, decolonial, and multilingual digital humanities. ]]></description>
										<content:encoded><![CDATA[<div style="display: none;" xmlns:dc="http://purl.org/dc/elements/1.1/">
<span property="dc:identifier">https://openmethods.dariah.eu/2023/08/09/closing-the-gap-in-non-latin-script-data-a-tool-for-building-and-navigating-collections-of-dh-research-projects/</span>
<span property="dc:title">OpenMethods introduction to: Closing the Gap in Non-Latin-Script Data: A tool for building and navigating collections of DH research projects</span>
<span property="dc:date">2023-08-09 09:54:48</span>
<span property="dc:description">The Closing the Gap in non-Latin script data aims at mapping the field of digital humanities projects outside and beyond the anglosphere with a particular focus on non-Latin scripts such as Arabic or Chinese in both machine-actionable and human readable form. The urgency and value of such a survey has been highlighted in recent discussions around global, decolonial, and multilingual digital humanities. </span>
<span property="dc:creator">Ulrike Wuttke</span>
<span property="dc:type">Blog post</span>
<span property="dc:subject">Crowdsourcing</span>
<span property="dc:subject">Discovering</span>
<span property="dc:language">English</span>
<span property="dc:subject">Spatial Analysis</span>
<span property="dc:subject">minimal computing</span>
<span property="dc:subject">non-Latin script</span>
</div>

<p>Introduction by Open Methods guest editors (DH2023, Graz) Jacob Hart, Till Grallert, Jose Hernandez</p>



<p>The Closing the Gap in non-Latin script data aims at mapping the field of digital humanities projects outside and beyond the anglosphere with a particular focus on non-Latin scripts such as Arabic or Chinese in both machine-actionable and human readable form. The urgency and value of such a survey has been highlighted in recent discussions around global, decolonial, and multilingual digital humanities. The project itself relies on minimal computing principles in that it gathers data as one <a href="https://www.json.org/json-en.html">JSON</a> file for each project from which it produces a static website hosted on <a href="https://github.com/M-L-D-H/Closing-The-Gap-In-Non-Latin-Script-Data">Github Pages</a>. Beyond their own data collection, anyone on the internet can submit data through either a <a href="https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/contribute/">basic form</a> or GitHub issues and pull requests.</p>



<p>The dataset includes information on project titles, aims, time span, disciplines, and, most importantly, project languages. The website provides multiple ways of accessing the data:</p>



<ul>
<li><a href="https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/">Text search</a> allows the user to find a specific project, or search projects by metadata and attain a human-readable parsing of the underlying JSON data.</li>



<li><a href="https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/">Faceted browsing</a> allows users to select projects by language and keyword based on a custom tagging scheme.</li>



<li>A <a href="https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/map/">map</a> allows the user to access a geographical overview of all of the projects in the database. This can be useful for assessing diversity, and identifying hotspots for research in the field. note: location is based on research location, not on the location of the actual language of study&nbsp;</li>



<li>A <a href="https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/timeline/">timeline</a> also offers a composite view of the projects in the database. Here we can see when research occurred in relation to each other, and notably see which projects are still active.</li>
</ul>



<p>This database is a result of community action and as such it has all the strengths and drawbacks that come with the nature of community dependent projects. Its strengths lie in a very intuitive way of browsing the data and allows its users to get a quick cursory overview of the state of the field. On the data level, however, the project depends on contributions from the community through GitHub issues and<a href="https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/proposing-changes-to-your-work-with-pull-requests/creating-a-pull-request"> pull requests</a>. This might not prevent the tech-savvy from contributing but might prove a step too far for many humanists and the general public. For now, while it is possible to <em>create</em> entries into the database using the GUI, this creates a JSON file for download to the user’s computer. This file must then be uploaded through opening a GitHub <a href="https://docs.github.com/en/issues">issue.</a> If a user wishes to <em>modify </em>an entry, they must interact with the git repo (creating pull requests, logging issues etc.). On the upside, the simple data structure prevents any lock-in into a specific technology stack and easy transition to different infrastructures.</p>



<p>Even with these technological hurdles and concerns in mind, the current iteration of the website and the database itself performs an essential service for those in the digital humanities that are working with non -Latin scripts. By raising awareness of current projects, more and more researchers can interact with not only their results but also the unique challenges that they are facing across their research.</p>



<p>This system has great potential to be a template for many other use cases: be it for creating more collections of research projects, or collections of other digital objects. The JSON data format is flexible enough to allow the system to represent anything the user could want. There is still work to be done to make the integration with the underlying git repo more user-friendly, however Closing the Gap is a great resource for researchers and teams looking for a streamlined and simple solution for maintaining field information.</p>



<blockquote class="wp-block-quote">
<p>As we began gathering data on digital projects dealing with Arabic or similar languages, we thought about how to provide this data in a way that commits to OpenScience principles. So we chose a public Git repository as our main data store, offering the data as JSON in a way that should be as straightforward as possible. Everyone who is interested should be able to contribute without having to deal with too much of a technology stack.</p>
<cite>Source: https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/about/</cite></blockquote>



<figure class="wp-block-image size-medium"><img loading="lazy" decoding="async" width="300" height="161" src="https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-11.46.47-300x161.png" alt="" class="wp-image-3376" srcset="https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-11.46.47-300x161.png 300w, https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-11.46.47-1024x549.png 1024w, https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-11.46.47-768x412.png 768w, https://openmethods.dariah.eu/wp-content/uploads/2023/08/Screenshot-2023-08-09-at-11.46.47.png 1028w" sizes="(max-width: 300px) 100vw, 300px" /><figcaption class="wp-element-caption">Screenshot from <a rel="noreferrer noopener" href="https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/map/" target="_blank">https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/map/</a>  </figcaption></figure>



<p>Original content: <a rel="noreferrer noopener" href="https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/about/" target="_blank">Closing the Gap Database interactive website</a>; <a rel="noreferrer noopener" href="https://m-l-d-h.github.io/Closing-The-Gap-In-Non-Latin-Script-Data/" target="_blank">Capture of the site on Internet Archive (14.07.2022)</a></p>



<p>Website of parent project: <a rel="noreferrer noopener" href="https://www.berlin-university-alliance.de/en/commitments/research-quality/forschung/ongoing/closing-the-gap/index.html" target="_blank">Closing the Gap in Non-Latin Script Data • Berlin University Alliance</a>; Capture of the site on Internet Archive  </p>
]]></content:encoded>
					
					<wfw:commentRss>https://openmethods.dariah.eu/2023/08/09/closing-the-gap-in-non-latin-script-data-a-tool-for-building-and-navigating-collections-of-dh-research-projects/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">3375</post-id>	</item>
		<item>
		<title>“Multilingual Research Projects: Non-Latin Script Challenges for Making Use of Standards, Authority Files, and Character Recognition”.</title>
		<link>https://openmethods.dariah.eu/2023/07/08/multilingual-research-projects-non-latin-script-challenges-for-making-use-of-standards-authority-files-and-character-recognition/</link>
					<comments>https://openmethods.dariah.eu/2023/07/08/multilingual-research-projects-non-latin-script-challenges-for-making-use-of-standards-authority-files-and-character-recognition/#respond</comments>
		
		<dc:creator><![CDATA[Marinella Testori]]></dc:creator>
		<pubDate>Sat, 08 Jul 2023 13:41:15 +0000</pubDate>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[Annotating]]></category>
		<category><![CDATA[Capture]]></category>
		<category><![CDATA[Content Analysis]]></category>
		<category><![CDATA[Data Recognition]]></category>
		<category><![CDATA[English]]></category>
		<category><![CDATA[Languages]]></category>
		<category><![CDATA[Structural Analysis]]></category>
		<category><![CDATA[Text]]></category>
		<category><![CDATA[ECPO]]></category>
		<category><![CDATA[OCR]]></category>
		<guid isPermaLink="false">http://openmethods.dariah.eu/?p=3345</guid>

					<description><![CDATA[Everyone of us is accustomed to reading academic contributions using the Latin alphabet, for which we have already standard characters and formats. But what about texts written in languages featuring different, ideographic-based alphabets (for example, Chinese and Japanese)? What kind of recognition techniques and metadata are necessary to adopt in order to represent them in a digital context?]]></description>
			<error>
    <code>internal_server_error</code>
    <title><![CDATA[WordPress &amp;rsaquo; Error]]></title>
    <message><![CDATA[&lt;p&gt;There has been a critical error on this website.&lt;/p&gt;&lt;p&gt;&lt;a href=&quot;https://wordpress.org/documentation/article/faq-troubleshooting/&quot;&gt;Learn more about troubleshooting WordPress.&lt;/a&gt;&lt;/p&gt;]]></message>
    <data>
        <status>500</status>
    </data>
</error>
