Michael McAuliffe
@memcauliffe.com
Linguist, computational phonetician, recovering academic.

Maintainer of Montreal Forced Alignment

Sometimes my tinkerings are useful

https://memcauliffe.com/

https://github.com/mmcauliffe

he/him
Oh yeah, that's probably an error, I'll correct it. The way it works for <cutoff> is that it'll assume it's a cutoff of the immediately following word, so yeah, it should be "<cutoff-longword> longword word".
July 31, 2025 at 8:24 PM
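For illustration only, here is a minimal Python sketch of the labeling convention described in that reply: a bare <cutoff> is read as a cutoff of the word right after it. The function name label_cutoffs is hypothetical and this is not MFA's actual code. Leaving a <cutoff> unchanged when no plain word follows it is an assumption made for this sketch.

```python
def label_cutoffs(tokens):
    """Rewrite each bare "<cutoff>" as "<cutoff-WORD>", where WORD is the next token.

    Hypothetical helper for illustration; not part of any library.
    """
    labeled = []
    for i, token in enumerate(tokens):
        nxt = tokens[i + 1] if i + 1 < len(tokens) else None
        # Assumption: only a plain word (not another <...> token) can be the target.
        if token == "<cutoff>" and nxt is not None and not nxt.startswith("<"):
            labeled.append(f"<cutoff-{nxt}>")
        else:
            labeled.append(token)
    return labeled


print(" ".join(label_cutoffs("<cutoff> longword word".split())))
# prints: <cutoff-longword> longword word
```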
Definitely a bug. There is logic to decrease the probability of spn for cutoffs (but keep it as a possibility vs. subwords), but either it's not working correctly, or it's treating it as an <unk> token explicitly, in which case the cutoff model isn't being applied. I'll try to take a look at it this weekend!
July 31, 2025 at 8:07 PM