Thibault Clérice
@ponteineptique.bsky.social
On vacation.
Digital humanist; loves Python, making data, talking to data, reusing data.
Researcher @ ALMAnaCh, Inria Paris.
There seems to be a scam / phishing attempt targeting #CHR2025
I received an email from TravelHousing [.] net warning me that my booking was not complete, even though I booked through my own institution.
It seems related to an alert from Elsevier ( www.elsevier.com/events/confe... )
Elsevier scam alert for Exhibitor Housing Services (EHS) and Exhibitor Housing Management (EHM)
Warning about scam alert for Exhibitor Housing Services (EHS) and Exhibitor Housing Management (EHM)
www.elsevier.com
November 9, 2025 at 10:30 AM
Reposted by Thibault Clérice
Thrilled to release Gaperon, an open LLM suite for French, English and Coding 🧀
We trained 3 models - 1.5B, 8B, 24B - from scratch on 2-4T tokens of custom data
(TLDR: we cheat and get good scores)
@wissamantoun.bsky.social @rachelbawden.bsky.social @bensagot.bsky.social @zehavoc.bsky.social
November 7, 2025 at 9:11 PM
Reposted by Thibault Clérice
What happens when we model the detective archetype at scale? 🕵️‍♂️📚
Our new paper, accepted for #CHR2025 combines literary history and computational modeling to trace how the figure of the detective evolves across 150 years of French fiction.
arxiv.org/pdf/2511.00627
November 4, 2025 at 5:36 PM
Reposted by Thibault Clérice
As DH grows, it’s increasingly important to publish conference papers, but there hasn’t been a clear venue for that.
So I’m thrilled to share this new home for DH proceedings, which will include CHR papers & more.
Thanks to @taylor-arnold.bsky.social for leading this effort!
bit.ly/ach-anthology
October 29, 2025 at 3:39 PM
Call to French speakers in France: I have quite a few single issues (small magazines?) of DC comics that I would like to have bound, since they will never be published in a collected edition.
Do you have any contacts or ideas, or have you already done this kind of thing? #comics
October 24, 2025 at 7:22 AM
Of course a bug appeared at the last minute.
Of course...
It's been brewing for months: @inriaparisnlp.bsky.social releases CoMMA (Corpus of Multilingual Medieval Archives) !
📚 2.5bn tokens of mostly Latin and French texts
🕰️ 800→1600 CE
📜 23k manuscripts
🖥️ 18k on the reading interface: comma.inria.fr
🔍 Paper: inria.hal.science/hal-05299220v1
(1/🧵)
CoMMA
comma.inria.fr
October 15, 2025 at 4:32 PM
It's been brewing for months: @inriaparisnlp.bsky.social releases CoMMA (Corpus of Multilingual Medieval Archives) !
📚 2.5bn tokens of mostly Latin and French texts
🕰️ 800→1600 CE
📜 23k manuscripts
🖥️ 18k on the reading interface: comma.inria.fr
🔍 Paper: inria.hal.science/hal-05299220v1
(1/🧵)
CoMMA
comma.inria.fr
October 15, 2025 at 2:51 PM
Reposted by Thibault Clérice
🧵 Five years ago @yaelrice.bsky.social and I published this so that no one would have to reinvent the wheel of revealing why research like this is so misguided it defies sense. hyperallergic.com/604897/how-s...
September 28, 2025 at 11:44 AM
Reposted by Thibault Clérice
but, because it is LONG and "detailed", it is given more weight.
If you are out there doing this, then why are you in the academy? This is a research conference, not an exam, and I am not your student to be 'corrected'.
September 19, 2025 at 7:53 PM
End of TranscriboQuest 2025, funded by @biblissima.bsky.social and @atrium-eu.bsky.social, and we are starting to present each team's datasets.
First, medieval vernaculars: German, Swedish, Irish, Spanish
September 5, 2025 at 11:41 AM
Distributed Text Services just reached 1.0 Release Candidate 1!
We strongly encourage you to implement DTS now and share your feedback. The specification is feature-complete and stable, and we expect only minor clarifications before the final 1.0 release in 3 months (11/10/2025). w3id.org/dts/
Distributed Text Services (DTS)
The Distributed Text Services (DTS) Specification defines a Hypermedia-Driven Web API for working with collections of text as machine-actionable data.
w3id.org
July 15, 2025 at 12:53 PM
Reposted by Thibault Clérice
In #DigiClass seminar today, @ponteineptique.bsky.social talking about contributions of Distributed Text Services to TEI citation structures handling. www.youtube.com/live/s1po4jV...
#TEIFriday
Distributed Text Services for Digital Classics
YouTube video by Digital Classicist London Seminars
www.youtube.com
July 11, 2025 at 4:22 PM
As of version 2.0.0, Hooktest will no longer support Capitains as its base library, nor the CTS Guidelines, and will move to dapytains. If Hooktest is critical to your workflows, pin the version to HookTest<1.0.0.
For the new hooktest, see github.com/cllg-project...
GitHub - cllg-project/hooktest
Contribute to cllg-project/hooktest development by creating an account on GitHub.
github.com
July 11, 2025 at 8:55 AM
Tomorrow, at 5PM BST, I will talk about Distributed Text Services in the context of Classics (I'll be on site, in London) and about recent progress in tooling. I hope to have more of a discussion after an introduction to the topic :) #Digiclass
classicalassociation.org/events/digit...
Digital Classicist London 2025 seminar: Thibault Clérice on 'Distributed Text Services for Digital Classics' - The Classical Association
The organisers of Digital Classicist London 2025 (Gabriel Bodard and Elizabeth Koch-Kölük at the ICS, Stephen Kay at the British School in Rome, and Katharine Shields at King’s College London) are del...
classicalassociation.org
July 10, 2025 at 2:52 PM
Reposted by Thibault Clérice
I'm happy to share with you that my article, "La traduction automatique dialectale: état de l'art et étude préliminaire sur le continuum dialectal de l'occitan", received the 🥇 Best Paper Award in the RJC track of the TALN 2025 conference!
🔗 lnkd.in/eYM_J-pM
July 7, 2025 at 9:07 AM
Reposted by Thibault Clérice
The draft program is available and early bird registration for the TEI Annual Meeting in Krakow is open! Register by July 8 for the lower rate!
tei-c.org/news/2025/06...
Draft Programme and Early Bird Registration for the TEI Conference
tei-c.org
July 6, 2025 at 9:39 PM
One DTS service 🥰
Live from Kraków: our own @danja.bsky.social presenting results from WP 7, focusing on the @dracor.org prototype, at the CLS INFRA Closing Event. #digitalhumanities
July 2, 2025 at 5:33 PM
Reposted by Thibault Clérice
💥 New in OA from me & friends! www.cambridge.org/core/journal...
I had this 'lol, maybe?' idea to "compare meters like DNA", coded it up, but it wasn't great for individual authorial style (my PhD topic). But! Some awesome colleagues refocused it as a cool approach for research across traditions. 😊
Metronome: tracing variation in poetic meters via local sequence alignment | Computational Humanities Research | Cambridge Core
Metronome: tracing variation in poetic meters via local sequence alignment - Volume 1
www.cambridge.org
June 26, 2025 at 12:06 PM
I missed this information. Wonderful news.
We've updated our anonymity guidelines for CHR submissions! 📝 The anonymity period now runs until acceptance notification (Sept 18), and authors can post preprints on arXiv/Zenodo before submission. Learn more: 2025.computational-humanities-research.org/news/anonymi...
June 12, 2025 at 10:48 AM
Reposted by Thibault Clérice
Check out our new paper led by @srishtiy.bsky.social and @nolauren.bsky.social! This work brings together computer vision, cultural theory, semiotics, and visual studies to provide new tools and perspectives for the study of ~culture~ in VLMs.
I am excited to announce our latest work 🎉 "Cultural Evaluations of Vision-Language Models Have a Lot to Learn from Cultural Theory". We review recent works on culture in VLMs and argue for deeper grounding in cultural theory to enable more inclusive evaluations.
Paper 🔗: arxiv.org/pdf/2505.22793
June 2, 2025 at 12:37 PM
Hey #digiclass :) Are there any fonts for Ancient Greek (free or not) that come close to the type used in pre-modern printing of Ancient Greek (humanistic cursive)?
Feel free to RT :)
May 16, 2025 at 12:00 PM
Reposted by Thibault Clérice
We have an attractive vacancy for a PhD student in computational literary studies in our team at @uantwerpen.be
Do you love the Middle Ages, songs, and code? Definitely apply!
Supervision with @remcosleiderink.bsky.social
www.uantwerpen.be/nl/jobs/vaca...
PhD scholarship in computational literary studies within the interuniversity research project "New perspectives on medieval and renaissance courtly song" | Universiteit Antwerpen
YUFE vacancy
www.uantwerpen.be
May 9, 2025 at 12:34 PM
While I am at it, has there been any public response to the question of preprints (publication before review), which are common at most NLP/CV conferences and help with CVs? @comphumresearch.bsky.social
May 4, 2025 at 2:45 PM
@comphumresearch.bsky.social Hey :) The link under the text "2025 conference website" is not updated on computational-humanities-research.org/conference/ (but it is on the "2025" in the table)
Computational Humanities Research
computational-humanities-research.org
May 4, 2025 at 12:33 PM