Soon @cnrs.fr @univ-amu.fr; currently postdoc at MIT, Cambridge, US.
This has the advantage of being easy to explain/implement and the disadvantage of being quite inefficient.
You could also add a speaker criterion if you want to matched segments both based on onset/offset and speaker identity
This has the advantage of being easy to explain/implement and the disadvantage of being quite inefficient.
You could also add a speaker criterion if you want to matched segments both based on onset/offset and speaker identity
Assign this automatic segment to the manual segment with the greatest overlap.
Assign this automatic segment to the manual segment with the greatest overlap.
Assign all of your manual segments a number: 1, 2, 3, 4,...
Assign all of your automatic segments a number: 1, 2, 3, 4,...
Assign all of your manual segments a number: 1, 2, 3, 4,...
Assign all of your automatic segments a number: 1, 2, 3, 4,...
pyannote.github.io/pyannote-met...
@hbredin.bsky.social 🙏
pyannote.github.io/pyannote-met...
@hbredin.bsky.social 🙏
I bet no 🙃
I bet no 🙃
The hype around LMs is very new, but there's a long tradition of using them to model lang. acq. in children.
The hype around LMs is very new, but there's a long tradition of using them to model lang. acq. in children.
and many more that I missed!
and many more that I missed!
That model is directly trained on child-centered long-form recordings (those collected by @bergelsonlab.bsky.social); spoiler: learning from spontaneous & noisy speech makes the problem even more difficult, but also more interesting in my opinion!
That model is directly trained on child-centered long-form recordings (those collected by @bergelsonlab.bsky.social); spoiler: learning from spontaneous & noisy speech makes the problem even more difficult, but also more interesting in my opinion!
1) onlinelibrary.wiley.com/doi/10.1111/... on early sound and word (form) acquisition in SSL models; many analyses about what the learned tokens look like. Carried out with @maureendeseyssel.bsky.social
1) onlinelibrary.wiley.com/doi/10.1111/... on early sound and word (form) acquisition in SSL models; many analyses about what the learned tokens look like. Carried out with @maureendeseyssel.bsky.social