Morgan Sonderegger
msonderegger.bsky.social
Morgan Sonderegger
@msonderegger.bsky.social
Linguistics, statistics, speech, cognitive science | McGill University Department of Linguistics
whoops, yes
March 31, 2025 at 2:17 PM
At least some of the papers I'm an author on measuring English VOT (Stuart-Smith et al. Labphon; Sonderegger et al Language x2) are large corpus studies that consider stress. Not word position, iirc.
March 27, 2025 at 1:34 PM
I sympathize with the problem, though, and have seen people in this situation just exclude all words with frequency < some_constant, which seems worse.
December 6, 2024 at 8:25 PM
So there'd be a pseudo-word called "low_frequency_word"? I think I wouldn't do this, because it loses a lot of information... but I'm not sure what harm it'd do, if any. Probably affect the random-effect variance estimate and thus maybe SEs for word-level predictors? Curious what you find.
December 6, 2024 at 8:24 PM
could you add me? Thanks for doing this!
November 29, 2024 at 11:44 AM
not sure if you're doing Bayesian or frequentist models, but I've also used the neutralization dataset for different topics in Bayesian models here (feel free to take anything)
people.linguistics.mcgill.ca/~morgan/ling...

"smallest effect size of interest" connects nicely to ROPE
November 22, 2024 at 8:28 PM
I like your incomplete neutralization dataset! There are extensive exercises with it in my book. The 'english' and 'french_cdi' (word learning) datasets there, are also simple and intuitive, I've used them in data science courses.
November 22, 2024 at 7:44 PM
Ooh, agree. There's a section on this in my book -- random intercepts for "nuisance variables" with many levels. Shamelessly mentioning in case it helps more than 2 people read it...
October 9, 2024 at 11:42 PM
You can kind of do this using webR "Line-by-line Execution":

quarto-webr.thecoatlessprofessor.com/qwebr-code-c...

not exactly what you're saying, but close?
Exploring Interactive Code Cells – quarto-webr
quarto-webr.thecoatlessprofessor.com
October 9, 2024 at 5:57 PM
Deadline extension! Abstracts for CorpusPhon are now due Wednesday, March 13 AoE.

We are also excited to announce Dr. Michael McAuliffe @mmcauliffe.bsky.social
as our invited speaker.

Hope you can join us!
March 5, 2024 at 9:58 PM