Alix Chagué 🌈
banner
alix-tz.bsky.social
Alix Chagué 🌈
@alix-tz.bsky.social
PhD candidate at @Inria and @UMontreal working on automatic transcription of manuscripts (HTR). Posts about DH stuff and #HTR_United. More on my research blog: alix-tz.github.io/phd
Reposted by Alix Chagué 🌈
Does anyone have a dataset of 1,000 + pages of handwritten text on Transkribus that they want to use for finetuning a VLM? If so, please let me know. This would be for any language and any script.
October 27, 2025 at 5:56 PM
Reposted by Alix Chagué 🌈
It's been brewing for months: @inriaparisnlp.bsky.social releases CoMMA (Corpus of Multilingual Medieval Archives) !

📚 2.5bn tokens of mostly Latin and French texts
🕰️ 800→1600 CE
📜 23k manuscripts
🖥️ 18k on the reading interface: comma.inria.fr
🔍 Paper: inria.hal.science/hal-05299220v1

(1/🧵)
CoMMA
comma.inria.fr
October 15, 2025 at 2:51 PM
Reposted by Alix Chagué 🌈
🚨Job ALERT🚨! My old postdoc is available!

I cannot emphasize enough how much a life-altering position this was for me. It gave me the experience that I needed for my current role. As a postdoc, I was able to define my projects and acquire a lot of new skills as well as refine some I already had.
September 24, 2025 at 1:50 PM
Reposted by Alix Chagué 🌈
I'm sorry, worldwide, irrevocable, non-exclusive, transferable permission to my voice and likeness? For what now? In any manner for any purpose???

This is in academia/.edu's new ToS, which you're prompted to agree to on login. Anyway I'll be jumping ship. You can find my stuff at hcommons.org.
September 17, 2025 at 5:16 PM
Reposted by Alix Chagué 🌈
Coming up on Oct 3-4, 2025 at Central European University: OCR/HTR Workshop for Under-resourced &Under-represented Languages in DH, funded by the Cluster of Excellence EurAsian Transformations &CLARIAH-AT! (Main organizer: yours truly) #digitalhumanities #multilingualdh #textrecognition #ocr #htr
September 1, 2025 at 11:00 AM
Reposted by Alix Chagué 🌈
Have high quality data sitting on Transkribus? Want to make it available on @hf.co with a single command line? Introducing Transkribus-HF which allows you to take a Transkribus export zip and make it into a HF dataset! It can parse pages, regions, lines, or windows!

github.com/wjbmattingly...
June 25, 2025 at 2:03 AM
Already on my way back, but these past few days I was in Princeton at the IAS to talk about HTR and its future with a whole lot of interesting people I was happy to meet or see again!
The IAS campus is quite a unique place, I'm very happy I was given the opportunity to travel there! 🦌🌳
June 14, 2025 at 2:36 PM
Reposted by Alix Chagué 🌈
Tomorrow (Wednesday, 11 June), if you’re in Paris, come see the many presentations open to the public. #medieval

Demain, le mercredi 11 juin, rejoignez-nous à Paris pour la partie de la conférence ouverte au public. Il y aura plein d’interventions passionnantes.

École nationale des chartes, Paris
June 10, 2025 at 9:00 PM
Reposted by Alix Chagué 🌈
Call for Nominations: The Rahtz Prize for TEI Ingenuity 2025 is now open. Submit your nomination or self-nomination by 30 June 2025. @teiconsortium.bsky.social

tei-c.org/activities/r...
June 4, 2025 at 8:41 AM
Reposted by Alix Chagué 🌈
Team: Get infected with enthusiasm for Death by Numbers and 17th century London with @jmotis.bsky.social
on the @oieahc.bsky.social's Digital Humanities Coffeehouse series: youtu.be/MmfJidjs370?...
Death By Numbers: Morality Records & Plague in Britain
YouTube video by Omohundro Institute
youtu.be
May 27, 2025 at 11:05 AM
Reposted by Alix Chagué 🌈
The results from #DH Awards 2024 are released!

Results: dhawards.org/dhawards2024...

Statistics: dhawards.org/dhawards2024...

Winners can use the corresponding icon, I mean, if they want to.
DH Awards 2024 Results | Digital Humanities Awards
dhawards.org
April 10, 2025 at 8:30 PM
Reposted by Alix Chagué 🌈
#digiclass If you have #Epidoc inscriptions / papyri, I would love to have a test run with this...
If you are a Python developer and are interested in Distributed Text Services, I have just adapted some of my code to deal with milestone/lb/pb in Dapytains when used in conjunction with CiteStructure.

I'd like to have testers and test corpora for this: github.com/distributed-...
Add support for one level of non-containing elements by PonteIneptique · Pull Request #8 · distributed-text-services/MyDapytains
This pull request add supports for the commonly used lb or milestone elements. This should be quite helpful to quite a lot of people and is a clear improvement over MyCapitains...
github.com
March 22, 2025 at 8:02 AM
#DHSI this year will take place at @umontreal.ca between the 26th of May and the 6th of June. I have the pleasure to co-chair the DHSI Aligned Conference with Moni Razavi (@uottawa.bsky.social). It's a series of short conference sessions held at the end of two days each week of the DHSI!

➡️ dhsi.org
Digital Humanities Summer Institute
dhsi.org
March 5, 2025 at 4:15 PM
Reposted by Alix Chagué 🌈
Are you interested in cultural transmission, medieval manuscripts or digital humanities, and want to pursue a PhD in a city bustling with intellectual and cultural life ? Come work with us !
📣 Job offers
We are recruiting two PhD students in computational philology, to work on textual transmission with a methodology blending cultural evolution models, philology and data analysis 🧪
Deadline: 31 March 2025
February 18, 2025 at 4:19 PM
Reposted by Alix Chagué 🌈
Dear friends, do I know anyone here who has experience in (digital) numismatics, in particular coin finds from Roman Antiquity? (Please repost if you might know someone!)

Thx for any pointers.
February 10, 2025 at 9:00 PM
Reposted by Alix Chagué 🌈
We'd love to expand a little more to Old/Middle English manuscripts in the context of CATMuS, but we do not have much contact in this area.

Would anyone be interested ?
January 22, 2025 at 9:24 AM
Reposted by Alix Chagué 🌈
🦋 We’ve made the move! The ALMAnaCH project-team at Inria Paris is now posting here. Follow us for news and updates about our research and seminar announcements!
January 29, 2025 at 9:54 AM
Reposted by Alix Chagué 🌈
Nominate now before it is too late! #DH #DigitalHumanities
Last reminder!

Nominate something for DH Awards 2024 before Monday! Do it now!

Don't worry, we'll remove duplicates, but we don't add things to the ballot you've not nominated! Are all your 2024-updated resources nominated?

Link at the bottom of:

dhawards.org/dhawards2024...
DH Awards 2024 – Call For Nominations | Digital Humanities Awards
dhawards.org
January 24, 2025 at 6:01 PM
Reposted by Alix Chagué 🌈
Call for book chapter proposals: Critical Approaches to Automated Text Recognition. Edited by me, Paul Gooding, @semames.bsky.social & @jnockels.bsky.social. Proposals on any critical topic relating to automated and advanced text recognition (including OCR, HTR, etc) are welcome by 31st March 25.
2025 Call for Chapter Submissions: Critical Approaches to Automated Text Recognition
Researchers and practitioners are invited to submit to a collection of essays tentatively entitled Critical Approaches to Automated Text Recognition, to be edited by Melissa Terras, Paul Goodi…
melissaterras.org
January 22, 2025 at 6:15 PM
Reposted by Alix Chagué 🌈
🚨 Reminder
🌟 Join the ALMAnaCH team at Inria Paris as a Data Librarian for Ancient Greek Corpora.

📚 Apply here: jobs.inria.fr/public/class...

📅 Contract Type: Fixed-term (1 year)
🎓 Required Degree: Master's or equivalent
🇫🇷 Location: Paris, France
🏧 Salary: Starting 2.7k€ gross
Data Librarian pour les corpus de Grec Ancien
Offre d'emploi Inria
jobs.inria.fr
January 16, 2025 at 9:06 AM
Reposted by Alix Chagué 🌈
Does anybody know if there's an escriptorium model that might work for 15th century Old Swedish (gothic cursiva)? And, how strict is the invitation system?
Thank you!
December 26, 2024 at 10:41 AM
Reposted by Alix Chagué 🌈
First job: Data Librarian for Ancient Greek
Starting ~ 2700 € (gross salary)
Duration: A year
Objectives: Managing the partners in the project, establishing objectives in terms of corpus and producing/controlling the quality of TEI XML
Full description: recrutement.inria.fr/public/class...
Data Librarian pour les corpus de Grec Ancien
Offre d'emploi Inria
recrutement.inria.fr
December 20, 2024 at 9:42 AM
Reposted by Alix Chagué 🌈
First job: OCR / Machine Learning Engineer
Starting ~ 2700 € (gross salary)
Duration: A year
Objectives: Enhancing / building a pipeline for producing XML TEI documents from raw scans.
Full description: recrutement.inria.fr/public/class...
Ingénieur pour l'OCR et la structuration de documents imprimés
Offre d'emploi Inria
recrutement.inria.fr
December 20, 2024 at 9:42 AM
Reposted by Alix Chagué 🌈
The news is official: starting in March, I will lead the project Corpus Liberatum Linguae Graecae which aims to complement the available work in Perseus and the Patristik Text Arkiv in order to help researcher have free access to Ancient Greek Texts.

So JOB KLAXON #DH #DigiClass #NLP
December 20, 2024 at 9:42 AM