michael ginn
mginn.bsky.social
compling phd student @ boulder
rare languages, morphology, finite state automata
michaelginn.com
Find out in our new paper, which my colleague Ray Groshan will present at ACL!!

arxiv.org/abs/2506.03593
Is linguistically-motivated data augmentation worth it?
Data augmentation, a widely-employed technique for addressing data scarcity, involves generating synthetic data examples which are then used to augment available training data. Researchers have seen s...
June 5, 2025 at 6:53 AM
Well, isn’t the idea that the entire layer defines a high-dimensional space, where each neuron is a dimension?
January 14, 2025 at 7:37 PM
Probably doesn’t help that there is effectively an online cult promoting all of this
January 8, 2025 at 7:55 AM
It’s interesting how they describe patches of bytes that are determined by changes in entropy, without making any reference to morphology… Zellig Harris did basically the same thing 50 years ago
December 13, 2024 at 10:03 PM
Reposted by michael ginn
Can RAG+LLM systems help boost small models for rare languages?

Find out in “Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation” by Bhargav Shandilya and @alexispalmer.bsky.social

arxiv.org/abs/2410.00387
Boosting the Capabilities of Compact Models in Low-Data Contexts with Large Language Models and Retrieval-Augmented Generation
The data and compute requirements of current language modeling technology pose challenges for the processing and analysis of low-resource languages. Declarative linguistic knowledge has the potential ...
December 2, 2024 at 6:43 PM
Also shout-out to the morphology reviewers!
November 27, 2024 at 9:35 PM
🙋‍♂️
November 23, 2024 at 11:29 PM
Hi, unfortunately the pack is now full; however, @datatherapist.bsky.social started a third one! go.bsky.app/CUuio7g
November 23, 2024 at 1:48 AM