Would love to meet and chat ☕️💬
If you’re attending #ACL2025, feel free to stop by and say hi! 👋
🧵[4/4]
Would love to meet and chat ☕️💬
If you’re attending #ACL2025, feel free to stop by and say hi! 👋
🧵[4/4]
We track how factual knowledge forms in OLMo over training by analyzing the evolving roles of Attention Heads and FFNs.
Heads are dynamic and often repurposed; FFNs are stable and keep refining facts.
By: A. Dawar Hakimi
arxiv.org/abs/2506.03434
🧵[3/4]
We track how factual knowledge forms in OLMo over training by analyzing the evolving roles of Attention Heads and FFNs.
Heads are dynamic and often repurposed; FFNs are stable and keep refining facts.
By: A. Dawar Hakimi
arxiv.org/abs/2506.03434
🧵[3/4]
A method for assessing the multilingual capabilities of English-centric LLMs using parallel sentences. It estimates how many languages an LLM covers and at what level.
By: @kargaranamir.bsky.social
x.com/amir_nlp/sta...
🧵[2/4]
A method for assessing the multilingual capabilities of English-centric LLMs using parallel sentences. It estimates how many languages an LLM covers and at what level.
By: @kargaranamir.bsky.social
x.com/amir_nlp/sta...
🧵[2/4]
📄: arxiv.org/abs/2502.05167
🔗: github.com/adobe-resear...
🤗: huggingface.co/datasets/amo...
This work was my internship project at
@adobe.com, in collaboration with my mentors there and Hinrich Schütze.
📄: arxiv.org/abs/2502.05167
🔗: github.com/adobe-resear...
🤗: huggingface.co/datasets/amo...
This work was my internship project at
@adobe.com, in collaboration with my mentors there and Hinrich Schütze.
work with an amazing team:
@mohsen-fayyaz.bsky.social,
Hinrich Schütze,
@violetpeng.bsky.social
paper: arxiv.org/abs/2503.05037
dataset 🤗: t.co/QZFyCLqP0P
Cross-post from x.com/mohsen_fayyaz
work with an amazing team:
@mohsen-fayyaz.bsky.social,
Hinrich Schütze,
@violetpeng.bsky.social
paper: arxiv.org/abs/2503.05037
dataset 🤗: t.co/QZFyCLqP0P
Cross-post from x.com/mohsen_fayyaz
📉 Answer-containing docs ranked <3% of the time over a synthetic biased doc with no answer!
📉 Answer-containing docs ranked <3% of the time over a synthetic biased doc with no answer!
We design controlled experiments by repurposing a relation extraction dataset, exposing serious flaws in models like Dragon+ and Contriever.
We design controlled experiments by repurposing a relation extraction dataset, exposing serious flaws in models like Dragon+ and Contriever.