{"status":"ok","message-type":"work","message-version":"1.0.0","message":{"indexed":{"date-parts":[[2025,6,19]],"date-time":"2025-06-19T04:46:58Z","timestamp":1750308418723,"version":"3.41.0"},"reference-count":33,"publisher":"Association for Computing Machinery (ACM)","issue":"2","license":[{"start":{"date-parts":[[2022,4,7]],"date-time":"2022-04-07T00:00:00Z","timestamp":1649289600000},"content-version":"vor","delay-in-days":0,"URL":"https:\/\/www.acm.org\/publications\/policies\/copyright_policy#Background"}],"funder":[{"name":"Israeli Ministry of Sciences and Technology"},{"name":"French Ministry of Sciences, Higher Education and Innovation"},{"name":"French Ministry of European and Foreign Affairs in the frame of the PHC-Maimonide","award":["41146YC"],"award-info":[{"award-number":["41146YC"]}]},{"name":"European Union\u2019s Horizon 2020 Research and Innovation Program","award":["871127"],"award-info":[{"award-number":["871127"]}]}],"content-domain":{"domain":["dl.acm.org"],"crossmark-restriction":true},"short-container-title":["J. Comput. Cult. Herit."],"published-print":{"date-parts":[[2022,6,30]]},"abstract":"<jats:p>Making ancient handwritten manuscripts accessible to the general public is challenging, for several reasons. Foremost, they are handwritten. Each and every one is unique, so there is a need for manual transcription for providing enough examples for training a machine-learning-based algorithm to automatically transcribe the handwritten text. Moreover, the quality of the text is diverse\u2014over time the ink faded, pages were damaged, and so forth. Furthermore, the boundaries of the textual regions on a page and the lines of text are not standard. Sometimes there are corrections above the lines, the lines are curved, there are comments and annotations on the margins, and more. A possible solution for these challenges is having a \u201cperson in the loop.\u201d However, manual correction brings with it another challenge\u2014how to address disagreement between annotations (as usually several corrections are considered before a decision is taken about the correct transcription). Tikkoun-Sofrim is a system that integrates automatic handwritten text recognition with manual, crowdsourced error correction, introducing an automatic decision process about when to stop asking for additional transcription and selecting the best transcription, declaring it as the recommended agreed reading. The system was applied to several manuscripts of \u201cMidrash Tanhuma,\u201d a medieval Hebrew rabbinic homiletic text, achieving a high level of success.<\/jats:p>","DOI":"10.1145\/3476776","type":"journal-article","created":{"date-parts":[[2022,4,8]],"date-time":"2022-04-08T06:25:06Z","timestamp":1649399106000},"page":"1-20","update-policy":"https:\/\/doi.org\/10.1145\/crossmark-policy","source":"Crossref","is-referenced-by-count":2,"title":["Tikkoun Sofrim: Making Ancient Manuscripts Digitally Accessible: The Case of Midrash Tanhuma"],"prefix":"10.1145","volume":"15","author":[{"ORCID":"https:\/\/orcid.org\/0000-0003-4914-8949","authenticated-orcid":false,"given":"Alan J.","family":"Wecker","sequence":"first","affiliation":[{"name":"University of Haifa, Haifa, Israel"}]},{"given":"Vered","family":"Raziel-Kretzmer","sequence":"additional","affiliation":[{"name":"University of Haifa, Haifa, Israel"}]},{"given":"Benjamin","family":"Kiessling","sequence":"additional","affiliation":[{"name":"AOrOc (UMR 8546) EPHE, PSL, Paris, France"}]},{"given":"Daniel St\u00f6kl Ben","family":"Ezra","sequence":"additional","affiliation":[{"name":"AOrOc (UMR 8546) EPHE, PSL, Paris, France"}]},{"given":"Moshe","family":"Lavee","sequence":"additional","affiliation":[{"name":"University of Haifa, Haifa, Israel"}]},{"given":"Tsvi","family":"Kuflik","sequence":"additional","affiliation":[{"name":"University of Haifa, Haifa, Israel"}]},{"given":"Dror","family":"Elovits","sequence":"additional","affiliation":[{"name":"University of Haifa, Haifa, Israel"}]},{"given":"Moshe","family":"Schorr","sequence":"additional","affiliation":[{"name":"University of Haifa, Haifa, Israel"}]},{"given":"Uri","family":"Schor","sequence":"additional","affiliation":[{"name":"University of Haifa, Haifa, Israel"}]},{"given":"Pawel","family":"Jablonski","sequence":"additional","affiliation":[{"name":"AOrOc (UMR 8546) EPHE, PSL, Paris, France"}]}],"member":"320","published-online":{"date-parts":[[2022,4,7]]},"reference":[{"key":"e_1_3_3_2_2","unstructured":"Avraham Meir Rozen (Ed.). 1877. Midrash Tanhuma. Unterhendler Warsow. https:\/\/tablet.otzar.org\/pages\/?&pagenum=1&book=5159."},{"key":"e_1_3_3_3_2","doi-asserted-by":"crossref","unstructured":"Mathieu Andro Imad Saleh. 2017. Digital libraries and crowdsourcing: a review. In Collective Intelligence and Digital Archives: Towards Knowledge Ecosystems Samuel Szoniecky and Nasreddine Bouha\u00ef (Eds.). ISTE; Wiley 135\u2013162. https:\/\/hal.archives-ouvertes.fr\/hal-01436766\/document.","DOI":"10.1002\/9781119384694.ch5"},{"key":"e_1_3_3_4_2","doi-asserted-by":"publisher","DOI":"10.1145\/3355610"},{"key":"e_1_3_3_5_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2017.59"},{"key":"e_1_3_3_6_2","doi-asserted-by":"publisher","DOI":"10.31826\/9781463209445"},{"key":"e_1_3_3_7_2","volume-title":"Midrash Tanhuma,","author":"Buber Solomon","year":"1885","unstructured":"Solomon Buber (Ed.). 1885. Midrash Tanhuma, 2 vols. Romm, Vilna."},{"key":"e_1_3_3_8_2","volume-title":"7th Digital Humatities Benelux 2020","author":"Caceres Anna","year":"2020","unstructured":"Anna Caceres, Andreas Weber, and Lambert Schomaker. 2020. MONK in practice: Indexing heterogeneous handwritten collections. In 7th Digital Humatities Benelux 2020, Vol. 7. DH Benelux, Leiden, Netherlands."},{"key":"e_1_3_3_9_2","doi-asserted-by":"publisher","DOI":"10.3366\/ijhac.2014.0119"},{"key":"e_1_3_3_10_2","unstructured":"David Ben-Gurion. 1950. Letter to Eliezer Kaplan. 128 pages. https:\/\/www.archives.gov.il\/...\/0b0...\/File\/0b07170680967faf."},{"key":"e_1_3_3_11_2","doi-asserted-by":"publisher","DOI":"10.1145\/2595188.2595199"},{"key":"e_1_3_3_12_2","doi-asserted-by":"publisher","DOI":"10.1145\/2960811.2960815"},{"key":"e_1_3_3_13_2","first-page":"446,450","volume-title":"Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC\u201918)","author":"Granet Adeline","year":"2018","unstructured":"Adeline Granet, Benjamin Hervy, Geoffrey Roman-Jimenez, Marouane Hachicha, Emmanuel Morin, Harold Mouch\u00e8re, Solen Quiniou, Guillaume Raschia, Fran\u00e7oise Rubellin, and Christian Viard-Gaudin. 2018. Crowdsourcing-based annotation of the accounting registers of the Italian comedy. In Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC\u201918). European Language Resources Association (ELRA), 446,450. https:\/\/www.aclweb.org\/anthology\/L18-1069."},{"key":"e_1_3_3_14_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDAR.2017.307"},{"key":"e_1_3_3_15_2","doi-asserted-by":"publisher","DOI":"10.34894\/Z9G2EX"},{"key":"e_1_3_3_16_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICFHR2020.2020.00064"},{"key":"e_1_3_3_17_2","doi-asserted-by":"publisher","DOI":"10.1109\/ICDARW.2019.10032"},{"key":"e_1_3_3_18_2","doi-asserted-by":"publisher","DOI":"10.1145\/3394486.3403102"},{"key":"e_1_3_3_19_2","doi-asserted-by":"publisher","DOI":"10.1108\/so-10-2013-0019"},{"key":"e_1_3_3_20_2","doi-asserted-by":"publisher","DOI":"10.1109\/HICSS.2013.85"},{"volume-title":"Crowdsourcing Our Cultural Heritage","year":"2014","key":"e_1_3_3_21_2","unstructured":"Mia Ridge (Ed.). 2014. Crowdsourcing Our Cultural Heritage. Ashgate Publishing, Ltd., Farnham, England."},{"key":"e_1_3_3_22_2","volume-title":"Coaching the Crowd: Tutorial Design for Zooniverse Projects","author":"Rosser Holly Kathryn","year":"2018","unstructured":"Holly Kathryn Rosser. 2018. Coaching the Crowd: Tutorial Design for Zooniverse Projects. Ph.D. Dissertation. University of Nebraska at Omaha."},{"key":"e_1_3_3_23_2","doi-asserted-by":"publisher","DOI":"10.1145\/2494266.2494294"},{"key":"e_1_3_3_24_2","volume-title":"VREs, an Ancient Manuscripts Conference 2020","author":"Schor Uri","year":"2020","unstructured":"Uri Schor, Vered Raziel-Kretzmer, Moshe Lavee, and Tsvi Kuflik. 2020. Digital research library for multi-hierarchical interrelated texts from Tikkoun Sofrim text production to text modeling. In VREs, an Ancient Manuscripts Conference 2020. Swiss National Science Foundation, Lausanne."},{"key":"e_1_3_3_25_2","doi-asserted-by":"publisher","DOI":"10.1145\/3151509.3151521"},{"key":"e_1_3_3_26_2","doi-asserted-by":"publisher","DOI":"10.1145\/2567948.2579215"},{"key":"e_1_3_3_27_2","unstructured":"Manuscript Studies Z-profile: Holistic preprocessing applied to Hebrew manuscripts for HTR with Ocropy and Kraken"},{"key":"e_1_3_3_28_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-319-19390-8_29"},{"key":"e_1_3_3_29_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-0-85729-479-1_3"},{"key":"e_1_3_3_30_2","volume-title":"Midrash Tanhuma: Translated Into English with Introduction, Indices, and Brief Notes","author":"Townsend John T.","year":"1989","unstructured":"John T. Townsend (Ed.). 1989. Midrash Tanhuma: Translated Into English with Introduction, Indices, and Brief Notes. KTAV Publishing House, Hoboken, NJ."},{"key":"e_1_3_3_31_2","doi-asserted-by":"publisher","DOI":"10.1145\/3314183.3324972"},{"key":"e_1_3_3_32_2","doi-asserted-by":"publisher","DOI":"10.1145\/3386392.3402436"},{"key":"e_1_3_3_33_2","doi-asserted-by":"publisher","DOI":"10.1007\/978-3-030-01231-1_23"},{"key":"e_1_3_3_34_2","doi-asserted-by":"crossref","unstructured":"D. St\u00f6kl Ben Ezra B. Brown-DeVost P. Jablonski B. Kiessling E. Lolli and H. Lapin. 2021. BiblIA \u2013 a General Model for Medieval Hebrew Manuscripts and an Open Annotated Dataset. HIP@ICDAR 2021.","DOI":"10.1145\/3476887.3476896"}],"container-title":["Journal on Computing and Cultural Heritage"],"original-title":[],"language":"en","link":[{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3476776","content-type":"unspecified","content-version":"vor","intended-application":"text-mining"},{"URL":"https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3476776","content-type":"unspecified","content-version":"vor","intended-application":"similarity-checking"}],"deposited":{"date-parts":[[2025,6,18]],"date-time":"2025-06-18T17:49:20Z","timestamp":1750268960000},"score":1,"resource":{"primary":{"URL":"https:\/\/dl.acm.org\/doi\/10.1145\/3476776"}},"subtitle":[],"short-title":[],"issued":{"date-parts":[[2022,4,7]]},"references-count":33,"journal-issue":{"issue":"2","published-print":{"date-parts":[[2022,6,30]]}},"alternative-id":["10.1145\/3476776"],"URL":"https:\/\/doi.org\/10.1145\/3476776","relation":{},"ISSN":["1556-4673","1556-4711"],"issn-type":[{"type":"print","value":"1556-4673"},{"type":"electronic","value":"1556-4711"}],"subject":[],"published":{"date-parts":[[2022,4,7]]},"assertion":[{"value":"2020-11-01","order":0,"name":"received","label":"Received","group":{"name":"publication_history","label":"Publication History"}},{"value":"2021-06-01","order":1,"name":"accepted","label":"Accepted","group":{"name":"publication_history","label":"Publication History"}},{"value":"2022-04-07","order":2,"name":"published","label":"Published","group":{"name":"publication_history","label":"Publication History"}}]}}