#HTR - kbin.social

litteracarolina, 1 month ago to random

I’m really excited to be part of this incredible project. To learn more, see the press release (in English and German) on my university’s website: https://www.uni-goettingen.de/de/3240.html?id=7427

From: @InsularMSS
https://mastodon.social/@InsularMSS/112253102162278330

reply

expand (1)

collapse (1)

report

activity

copy /kbin url

copy original url

open original url

Loading...

litteracarolina, 1 month ago

I should also say that @InsularMSS will involve not only #medieval #manuscript studies but also #digitalhumanities and scientific research: palaeoproteomics, #multispectral and photometric stereo #imaging, handwritten text recognition (#HTR), Raman and XRF #spectroscopy, infra-red #microscopy & much more. Perhaps most important, though, will be the freedom and time to do this important work!

@medievodons @historikerinnen @histodons

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ oblomov

DavidH, 3 months ago to random

Why do people volunteer as citizen scientist transcribers (transcrptors?)? What feelings are evoked by their participation?

Some questions begun to be answered by my team and our partners at the Gothenburg University, https://publications.archivists.org.au/index.php/asa/article/view/10937

#CitizenScience #HTR #Archives

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ indieterminacy

bibelexegese, 4 months ago to antiquidons German

Morgen im Digital Classicist Seminar Berlin (16:15 Uhr vor Ort in der @BBAW und online): Daniel Stökl Ben Ezra @d_stoekl (Paris) spricht über „Lösungsansätze und offene Fragen zur Massentranskription historischer Handschriften“ und stellt #eScriptorium #HTR (https://escriptorium.readthedocs.io/).
Alle weiteren Informationen: https://digiclass.bbaw.de/seminar.html @antiquidons

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

mhucka, 4 months ago to machinelearning

People who work in AI and libraries/archives/museums, we need your help! 👋🏻

A few of us maintain an "awesome-ai4lam" 🕶️ list at https://github.com/AI4LAM/awesome-ai4lam and we need your help finding more things to add. Please tell us what we missed!

You can just reply to this toot, or open an issue/ticket in the GitHub repo, or email me, or whatever is easiest for you.

Please boost this to reach more people! 📣
#MachineLearning #Libraries #AI #ArtificialIntelligence #ChatGPT #GPT #LLMs #OCR #HTR

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ kellogh, teksquisite

petersuber, 6 months ago to random

Project Ansund at Trinity College Dublin (@tcddublin) will use handwritten text recognition (#HTR) "to build an exhaustive, #OpenAccess digital corpus of #OldEnglish texts, that transcribes all surviving Old English for the first time, and in an unparalleled level of detail."
https://theconversation.com/how-ai-could-reveal-secrets-of-thousands-of-handwritten-documents-from-medieval-manuscripts-to-hieroglyphics-218734

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

DigitalHistory, 6 months ago to histodons German

Umfangreiche französischsprachige Quellenkorpora des Mittelalters maschinell erschließen?

Im nächsten #DigitalHistoryOFK nimmt Pauline Spychala (DHI Paris) die Texterkennungsplattformen #eScriptorium & #Transkribus unter die Lupe. Ziel ihres Projektes ist die Entwicklung eines Workflows, der beide Tools effektiv kombiniert, um u.a. den Eigenschaften der untersuchten Quellen gerecht zu werden.

🔜 Mi, 22. Nov., 4-6 pm - via Zoom

ℹ️ Info: https://dhistory.hypotheses.org/6384

#DigitalHistory #HTR @histodons

reply

expand (3)

collapse (3)

report

activity

copy /kbin url

copy original url

open original url

Loading...

daieuxetdailleurs, 6 months ago to Quebec French

[#veille #HTR] Déchiffrer des documents d’archives manuscrits à l’aide de l’intelligence artificielle | UdeMNouvelles
https://nouvelles.umontreal.ca/article/2023/11/08/dechiffrer-des-documents-d-archives-manuscrits-a-l-aide-de-l-intelligence-artificielle/
#UdeM #transkribus #IA #archives #quebec @archivistodon

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

phillipstroebel, 7 months ago to random

The next PATT meeting will be about teaching automatic text recognition methods to researchers in the #humanities. Interested people are welcome to join (sign up here: Alicia Schümperli, sekteuscher@hist.uzh.ch). #HTR #digitalhumanities

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ CerstinMahlow

DavidH, 7 months ago to ai

1/3

Today, we at the National Archives have officially released an open base-model for handwriting recognition. It works best on Swedish manuscripts from about 1650 - 1900.

By basic model it is meant that it has two intended areas of use:

To HTR large amounts of images of handwritten text with good enough quality to index the text for search.

To function as a starting point for using own training data to create more specialized HTR models.

#HTR #AI #DigitalHumanities #opensource

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ mrundkvist, acdha

dhiparis, 8 months ago to random German

Dominique Stutzmann s’est adressé à l'IHA aux jeunes historiennes et historiens qui souhaitent utiliser la reconnaissance automatique de texte. Il s'agit de comprendre l'évolution et son impact sur la science historique. Vous avez raté ça? L’enregistrement audio est maintenant disponible sur notre site web!

👉https://ow.ly/LvpW50POfJR

#OCR #HTR #DigitalHumanities #HumanitesNumeriques

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ linusable

phillipstroebel, 9 months ago to philosophy

My #PhD #thesis 'Flexible Techniques for Automatic Text Recognition of Historical Documents' has been published & is now available: https://doi.org/10.5167/uzh-234886

I recommend Chapter 4 if you are interested in #HTR & what #Transformers can do for #historical #documents. #digitalhumanities #OCR

Thanks again to my supervisors Martin Volk & @thist, & everyone at the Department of Computational Linguistics from the University of Zurich!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ mwfc

petrichor, 9 months ago to random

Here's a few more details about my progress training a handwriting model with #Transkribus

https://erambler.co.uk/blog/training-a-handwriting-model-update-1/

#htr

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ brainwane

petrichor, 9 months ago to random

OK, I've finally got round to transcribing enough pages of my own handwriting to train up a model with #Transkribus, and the results are surprisingly good! I expected to need more than the minimal 25 pages to get a decent level of accuracy but it's already noticeably better than the generic recognition on my reMarkable tablet or OneNote.

#htr

reply

expand (5)

collapse (5)

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ brainwane

petrichor, 9 months ago

Since #PyLaia is open source, it should be possible now to recreate this training on my own desktop with the same parameters, and apply the model to recognise new pages, and from there figure out a workflow to simplify getting handwritten notes into plain text for reference or publication.

Has done any of these stages? Any pointers?

#Transkribus #htr

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ brainwane

DavidH, 9 months ago to UX

A very belated #introduction !

I'm David, once an archaeologist and classicist, pretty much always a nerd. Currently working at the Swedish National #Archives leading its little Digital Experiences team.

Of special interest at the moment:

#UX of archival discovery systems

#OpenData and #API s in archives

#AI and esp. #HTR as applied to archives

Citizen science and #crowdsourcing in archives

Happy to receive reading tips and examples of the above! Will do the same.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

+ fiee, joeroe, mwfc, josh +2 more

Private

dta_cthomas, 10 months ago

@jacobward @polarbear @histodons

have you heard of/tried #Transkribus https://app.transkribus.eu/? (NB They also have a very handy "Scan Tent", https://readcoop.eu/de/scantent/).

They also have a pricing scheme, but you might get far enough with your free start credits.

Would allow you to concentrate on photographing in the archive, and then batch- #OCR resp. #HTR your document folders later.

You could even train your own model, but the existing ones work for regular scripts well enough, I guess.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

jonaskjoller, 1 year ago to earlymodern

I hear a frequent complaint about applying quantitative methods on texts that have been through #HTR tools, such as #Transkribus, that the expected error rate means that you will miss too many occurrences of the word you are looking for. (1/n)

@histodons @digitalhumanities @earlymodern

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...