🏂🎹🚵♀️🥋
- FV heads have relatively high induction scores and vice versa compared to other heads
- FV heads emerge later in training than induction heads
- ICL accuracy rises around the same time induction emerges during training, but increases more gradually
- FV heads have relatively high induction scores and vice versa compared to other heads
- FV heads emerge later in training than induction heads
- ICL accuracy rises around the same time induction emerges during training, but increases more gradually
Several instances of FV heads have a high induction score earlier in training (around when induction heads first emerge). However, the reverse (induction heads with high FV scores earlier) does not occur.
Several instances of FV heads have a high induction score earlier in training (around when induction heads first emerge). However, the reverse (induction heads with high FV scores earlier) does not occur.
Our ablations show that FV heads are crucial for few-shot ICL, whereas induction heads are not necessary.
Our ablations show that FV heads are crucial for few-shot ICL, whereas induction heads are not necessary.
We find that recently discovered "function vector" heads, which encode the ICL task, are the actual primary mechanisms behind few-shot ICL!
arxiv.org/abs/2502.14010
🧵👇
We find that recently discovered "function vector" heads, which encode the ICL task, are the actual primary mechanisms behind few-shot ICL!
arxiv.org/abs/2502.14010
🧵👇
Perceptual effort to distinguish between 2 handshapes is very weakly correlated with how often the 2 letters appear in similar contexts in English, and in the "wrong" direction for efficiency. 7/8
Perceptual effort to distinguish between 2 handshapes is very weakly correlated with how often the 2 letters appear in similar contexts in English, and in the "wrong" direction for efficiency. 7/8
No significant correlation between fingerspelling handshapes and English letter frequency! 6/8
No significant correlation between fingerspelling handshapes and English letter frequency! 6/8
In core signs native to ASL (left), frequent handshapes are easier to produce!
In initialized and loan signs borrowed from English (right), no correlation! 5/8
In core signs native to ASL (left), frequent handshapes are easier to produce!
In initialized and loan signs borrowed from English (right), no correlation! 5/8
When two handshapes have similar finger joint angles, they appear more alike, making it harder to distinguish between them perceptually. 4/8
When two handshapes have similar finger joint angles, they appear more alike, making it harder to distinguish between them perceptually. 4/8
The more variation there is in finger joint angles within a handshape, the more difficult it is to produce that handshape. 3/8
The more variation there is in finger joint angles within a handshape, the more difficult it is to produce that handshape. 3/8
Spoken languages exhibit communicative efficiency by minimizing speaker+listener effort.
What about signed languages?
ASL handshapes reflect efficiency pressures - but only in native signs, not signs borrowed from English!
aclanthology.org/2024.acl-lon... 🧵
Spoken languages exhibit communicative efficiency by minimizing speaker+listener effort.
What about signed languages?
ASL handshapes reflect efficiency pressures - but only in native signs, not signs borrowed from English!
aclanthology.org/2024.acl-lon... 🧵
We find that contrastive learning significantly improves model accuracy.
6/8
We find that contrastive learning significantly improves model accuracy.
6/8
We propose a new task - *automatic sign suggestion*: given an English sentence and an ASL video, a model detects when the interpreter fingerspells, and suggests ASL signs to use instead.
5/8
We propose a new task - *automatic sign suggestion*: given an English sentence and an ASL video, a model detects when the interpreter fingerspells, and suggests ASL signs to use instead.
5/8
Our dataset reflects how interpreters often use fingerspelling (spelling out the English word using letter signs) for technical terms when the ASL sign is unknown.
4/8
Our dataset reflects how interpreters often use fingerspelling (spelling out the English word using letter signs) for technical terms when the ASL sign is unknown.
4/8
We release this dataset alongside our paper, identifying several use cases for ASL STEM Wiki informed by its unique characteristics.
3/8
We release this dataset alongside our paper, identifying several use cases for ASL STEM Wiki informed by its unique characteristics.
3/8
We release ASL STEM Wiki: the first signing dataset of STEM articles!
📰 254 Wikipedia articles
📹 ~300 hours of ASL interpretations
👋 New task: automatic sign suggestion to make STEM education more accessible
microsoft.com/en-us/resear...
🧵 #EMNLP2024
We release ASL STEM Wiki: the first signing dataset of STEM articles!
📰 254 Wikipedia articles
📹 ~300 hours of ASL interpretations
👋 New task: automatic sign suggestion to make STEM education more accessible
microsoft.com/en-us/resear...
🧵 #EMNLP2024
food in miami was so good thanks emnlp
food in miami was so good thanks emnlp