Sebastian Nehrdich
snehrdich.bsky.social
Sebastian Nehrdich
@snehrdich.bsky.social
CTO of the MITRA project @BAIR, UC Berkeley.
Research in ancient Asian low resource languages, especially text reuse, machine translation, semantic similarity search.
Buddhist studies MA, now PhD in computational linguistics @Duesseldorf university.
Dharmamitra has recently seen the integration of the fantastic Digital Dictionary of Buddhism for Chinese and the equally great dictionary for Tibetan by Christian Steinert into the “English (explained)” translation mode!
November 4, 2025 at 10:31 PM
Dharmamitra got a significant update: We now feature fast OCR for Sanskrit, Tibetan, etc. powered by Gemini. You can upload images and PDFs. We also added an option to translate from files directly, instead of needing to go through OCR manually first!
June 19, 2025 at 5:02 AM
I will be giving a presentation on the effectiveness of semantic similarity models for textual reuse detections in Buddhist source languages with focus on Buddhist Chinese in this great online workshop coming up at Bochum University!
www.oaw.ruhr-uni-bochum.de/forschung/ha...
June 6, 2025 at 3:23 PM
Happy to announce that I will be joining Tohoku University as tenure track assistant professor in autumn this year! My position will be at the intersection of Buddhist & Japanese Studies and machine learning / digital humanities.
June 3, 2025 at 4:02 PM
I will be presenting the various MITRA tools and what exciting new features we will offer in 2025 at the IABS conference in Leipzig on Tuesday August 12 in this panel organized by Marcus Bingenheimer!
June 3, 2025 at 10:48 AM
Cherry bloom in Sendai, Japan
April 13, 2025 at 11:45 PM
I had a great time teaching a workshop on LLMs and AI technology for Asian studies at the AAS conference in Columbus, Ohio this year!
March 18, 2025 at 2:38 PM
The world is trim enough these days. Here is the view of downtown San Francisco and the bay bridge taken from Kensington.
March 4, 2025 at 2:41 AM
Gemini 2 is perhaps the best available AI model out there that few people talk about. In our tests on translating Classical/Buddhist Chinese into English, Gemini 2 Flash + RAG augmentation is competitive to Claude 3.5 Sonnet while being more than a hundred times cheaper. Google is 🔥🔥
February 15, 2025 at 9:55 PM
The Dharmamitra chrome extension also got a significant update. In addition to a better underlying model, It now supports multiple output languages. Just right-click on the dharmamitra icon, open "options" and then set it to the language you prefer.
Enjoy!
February 15, 2025 at 9:43 PM
Dharmamitra just got a whole lot better at just about everything, especially when it comes to translating into or out of non-English languages! It’s still not perfect but it now again beats claude etc. by a margin. Bigger update will come with new functionality in the coming days, stay tuned!
February 9, 2025 at 1:29 AM
Chinese New Year's gift: MITRA will very soon significantly increase its capabilities across the board, including translation into modern Chinese. Its going to be 🔥🔥
January 29, 2025 at 7:11 AM
Japan has always been very kind to me. In 2018, Tsukuba university invited me for a young scholars workshop, that was my first visit. From 2018-2020 I spent excellent years in Kyoto. Now I could come back for the 100 year celebration of the 大正大蔵経/30 years of SAT. ありがとう!
December 25, 2024 at 12:15 PM
I will be in Naples Nov 24-27 to present dharmamitra at this workshop: ow.ly/IH7O50UbG41
November 21, 2024 at 4:58 PM
My talk on dharmamitra, what it can do now and what it will be able to do in the future this week! Afaik it is going to be public, via the QR code in the announcement you should be able to join.
November 19, 2024 at 8:03 PM
BuddhaNexus 2.0 is maturing and will be ready for release soon! This is going to be 🔥🔥🔥
November 18, 2024 at 4:36 PM
github.com/dharmamitra/... dharmamitra emacs extension is great fun! I am working on a chrome extension next to make it easy to translate inside the web browser
November 9, 2024 at 3:29 PM
Out of this life and into the next
You paid for this but they give you that
November 8, 2024 at 6:25 PM
New publication @EMNLP 2024 Findings: One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit
NLP Tasks
arxiv.org/pdf/2409.13920 New SOTA model for a number of Sanskrit NLP tasks: Word segmentation, lemmatization, and morphosyntactic tagging.
September 24, 2024 at 3:56 AM