Lightnews — Scholar-powered news

Reposted by Griffon

Andi

@admiralandrea.bsky.social

All borders are arbitrary and created by humans

February 3, 2025 at 10:48 AM

Griffon

@ryancallihan.bsky.social

Drowning in Documents: Consequences of Scaling Reranker Inference

This paper conducts a simple test of the effectiveness of rerankers on large amounts of documents. It's really important to think about if you are using RAG a lot.

December 9, 2024 at 10:30 AM

Reposted by Griffon

Hellina Hailu Nigatu

@hellinanigatu.bsky.social

I hope I am not late to the party (was away post-quals chilling) but here are some thoughts on why this is bad IMO:

First, a disclaimer that I am writing this as an African who is a speaker of multiple African languages, NLP researcher of African languages, and HCI researcher focusing broadly on..

Dr Abeba Birhane @abeba.bsky.social · Nov 26

this is a green flag for openai & meta to formally be arbitrators of our languages & mass exploit the population (& researcher that've poured their souls into low resource languages),all to throw unreliable AI that has so far proven to result in more harm than benefit
www.reuters.com/technology/a...

Orange enlists Meta and OpenAI to develop AI language models in Africa

Orange will enlist OpenAI and Meta to fine-tune AI large language models (LLMs) to translate regional African languages for the French telecoms operator, it said on Tuesday.

www.reuters.com

December 2, 2024 at 11:43 PM

Reposted by Griffon

M.J. Crockett

@mjcrockett.bsky.social

Anyone saying The Left must stay on Twitter to save democracy doesn’t understand how Twitter affects our psychology. Twitter makes money by disconnecting us from social reality and making us feel shitty about ourselves and each other.

December 1, 2024 at 10:48 PM

Reposted by Griffon

M.J. Crockett

@mjcrockett.bsky.social

Is it Bad to leave Twitter? No. Here are 7+ years of insights from my lab’s research that explain why.

Featuring work w/ @williambrady.bsky.social @killianmcloughlin.bsky.social

🧵

December 1, 2024 at 10:48 PM

Griffon

@ryancallihan.bsky.social

Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations

Love love love this. Those who know me know that I am generally grumpy about evaluation metrics for LLMs and the building culture of benchmark beating that has been going on for awhile now. More of this instead please.

December 3, 2024 at 10:03 AM

Griffon

@ryancallihan.bsky.social

Nonmyopic Generation of Language Models for Reasoning and Planning

A big issue with LLM reasoning is poisoning downstream tasks. In a reasoning chain if 1 instruction is nonoptimal that has a huge impact on later steps and the most popular frameworks don’t have a way to account for that

December 1, 2024 at 11:04 AM

Griffon

@ryancallihan.bsky.social

Does domain adaptive pretraining make a difference? Possibly not. In this paper, the authors find that FT models (medical domain) outperform base models 12% of the time. Fine tuning can be fun, but is it worth it?

arxiv.org/abs/2411.04118

Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress?

Several recent works seek to develop foundation models specifically for medical applications, adapting general-purpose large language models (LLMs) and vision-language models (VLMs) via continued pret...

arxiv.org

November 28, 2024 at 8:30 PM

Griffon

@ryancallihan.bsky.social

AI tools can widen skill gaps: the top decile of researchers at a material science company using AI tools saw 81% more productivity, while the bottom third had less than 25%.

“AI won’t take your job, but those who use it will” feels uncomfortably true.

November 27, 2024 at 9:21 AM

Griffon

@ryancallihan.bsky.social

Ive been thinking about agents recently, specifically multi-agent systems. This paper touches on an aspect of that, specifically, assigning "expert" roles to produce a better-formed output. Sometimes LLMs can be a little unidimentional and more of this might change that
arxiv.org/abs/2411.00492