Lightnews — Scholar-powered news

Reposted by Pushpdeep

Interdisciplinary Institute for Societal Computing

@i2sc.net

The lecture was followed by a hands-on activity where Abhisek Dash and @pushpdeep.bsky.social (MPI-SWS) guided the participants through the challenges of validating labels of content posted on Bluesky.

September 12, 2025 at 11:22 AM

Reposted by Pushpdeep

Christopher Barrie

@cbarrie.bsky.social

📄NEW PAPER📄

Ever wondered content people actually pay *attention* to online? Our new research reveals that you likely pay attention to far more varied political content than your likes and shares suggest

March 25, 2025 at 12:04 PM

Reposted by Pushpdeep

arxiv cs.CL

@arxiv-cs-cl.bsky.social

Matthias Orlikowski, Jiaxin Pei, Paul R\"ottger, Philipp Cimiano, David Jurgens, Dirk Hovy
Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals' Subjective Text Perceptions
https://arxiv.org/abs/2502.20897

March 3, 2025 at 7:40 AM

Reposted by Pushpdeep

arxiv cs.CL

@arxiv-cs-cl.bsky.social

Artem Vazhentsev, Ivan Sviridov, Alvard Barseghyan, Gleb Kuzmin, Alexander Panchenko, Aleksandr Nesterov, Artem Shelmanov, Maxim Panov
Uncertainty-aware abstention in medical diagnosis based on medical texts
https://arxiv.org/abs/2502.18050

February 26, 2025 at 8:07 AM

Reposted by Pushpdeep

arxiv cs.CL

@arxiv-cs-cl.bsky.social

Tushar Aggarwal, Kumar Tanmay, Ayush Agrawal, Kumar Ayush, Hamid Palangi, Paul Pu Liang
Language Models' Factuality Depends on the Language of Inquiry
https://arxiv.org/abs/2502.17955

February 26, 2025 at 8:26 AM

Reposted by Pushpdeep

Vicki

@vickiboykis.com

Every day we get closer and closer to OG information retrieval , imagine spending billions of dollars to do the same thing that Tf-IDF did

noperator.dev/posts/docume...

Hard problems that reduce to document ranking

There are two claims I’d like to make: LLMs can be used effectively1 for listwise document ranking. Some complex problems can (surprisingly) be solved by transforming them into document ranking proble...

noperator.dev

February 26, 2025 at 2:07 AM

Reposted by Pushpdeep

Linguistic Discovery

@linguisticdiscovery.com

A new book on mixed languages in South Asia has just been released! @PenguinBooks

February 23, 2025 at 8:23 PM

Reposted by Pushpdeep

Jessy Li

@jessyjli.bsky.social

Do you want to know what information LLMs prioritize in text synthesis tasks? Here's a short 🧵 about our new paper, led by Jan Trienes: an interpretable framework for salience analysis in LLMs.

First of all, information salience is a fuzzy concept. So how can we even measure it? (1/6)

February 21, 2025 at 6:15 PM

Reposted by Pushpdeep

Hiba Ahsan

@hibaahsan.bsky.social

LLMs are known to perpetuate social biases in clinical tasks. Can we locate and intervene upon LLM activations that encode patient demographics like gender and race? 🧵

Work w/ @arnabsensharma.bsky.social, @silvioamir.bsky.social, @davidbau.bsky.social, @byron.bsky.social

arxiv.org/abs/2502.13319

February 22, 2025 at 4:18 AM

Reposted by Pushpdeep

Thomas Wolf

@thomwolf.bsky.social

After 6+ months in the making and over a year of GPU compute, we're excited to release the "Ultra-Scale Playbook": hf.co/spaces/nanot...

A book to learn all about 5D parallelism, ZeRO, CUDA kernels, how/why overlap compute & coms with theory, motivation, interactive plots and 4000+ experiments!

The Ultra-Scale Playbook - a Hugging Face Space by nanotron

The ultimate guide to training LLM on large GPU Clusters

hf.co

February 19, 2025 at 6:10 PM

Reposted by Pushpdeep

Adi Simhi

@adisimhi.bsky.social

🚨New arXiv preprint!🚨
LLMs can hallucinate - but did you know they can do so with high certainty even when they know the correct answer? 🤯
We find those hallucinations in our latest work with @itay-itzhak.bsky.social, @fbarez.bsky.social, @gabistanovsky.bsky.social and Yonatan Belinkov

February 19, 2025 at 3:50 PM

Reposted by Pushpdeep

Ishika Agarwal

@wonderingishika.bsky.social

🚀Very excited about my new paper!

NN-CIFT slashes data valuation costs by 99% using tiny neural nets (205k params, just 0.0027% of 8B LLMs) while maintaining top-tier performance!

February 17, 2025 at 4:06 AM

Reposted by Pushpdeep

Christopher Barrie

@cbarrie.bsky.social

We have a new version of our 𝕡𝕣𝕠𝕞𝕡𝕥𝕤𝕥𝕒𝕓𝕚𝕝𝕚𝕥𝕪 paper now on arxiv!

This is a significant update that test *a lot* more data, suggests post-processing techniques, outlines how to compare across models, and tests with new models...

February 18, 2025 at 11:46 AM

Reposted by Pushpdeep

Linguistic Discovery

@linguisticdiscovery.com

Read more in this issue of the Linguistic Discovery newsletter, where I explore the Trisolaran language from the Three-Body Problem and how it compares to human language!

https://buff.ly/3ErakOl

#Trisolarans #aliens #xenolinguistics #ThreeBodyProblem #linguistics #language #SciFi #review

What if we could hear each other's thoughts? The linguistics of The Three-Body Problem

Imagine if every word you thought could be heard by everyone around you. In this world, thinking would be the same as communicating. What would language—and society—be like?

buff.ly

February 8, 2025 at 3:25 PM

Reposted by Pushpdeep

Queer in AI

@queerinai.com

Our workshop has been extended till Feb 20. We are looking forward for your papers at NAACL's Queer in AI workshop.

Queer in AI @queerinai.com · Dec 9

1/7 🌈 BIG NEWS ALERT! The hottest Queer in AI workshop is back - and this time we're official! We're thrilled to announce we'll be at #NAACL2025 as an official workshop, meaning your work can now be published in the ACL anthology! 🎉

February 3, 2025 at 2:10 PM

Reposted by Pushpdeep

Shaily

@shaily99.bsky.social

This is a good start!! More *CL conferences should come 😍
@aaclmeeting.bsky.social

February 1, 2025 at 1:32 PM

Reposted by Pushpdeep

Ólafur Waage

@olafurw.com

Programming languages: "We are just a way to operate computers in a way that makes sense to humans."

Programming languages [takes a big joint hit]: "What if there were 5 kinds of nothingness?"

January 31, 2025 at 8:51 AM

Reposted by Pushpdeep

Nathan Lambert

@natolambert.bsky.social

A useful oversimplification
Instruction finetuning (IFT/SFT): imprinting features or shape in responses
Preference finetuning (RLHF/DPO/etc): style
Reinforcement finetuning (RFT/RLVR/etc): learning new behaviors

January 31, 2025 at 2:31 PM

Reposted by Pushpdeep

Dustin Wright

@dustinbwright.com

📄 New preprint: "Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence"

We show: fact checking w/ crowd workers is more efficient when using LLM summaries, quality doesn't suffer.

arxiv.org/abs/2501.18265

Collecting Cost-Effective, High-Quality Truthfulness Assessments with LLM Summarized Evidence

With the degradation of guardrails against mis- and disinformation online, it is more critical than ever to be able to effectively combat it. In this paper, we explore the efficiency and effectiveness...

arxiv.org

January 31, 2025 at 1:18 PM

Reposted by Pushpdeep

Mor Geva

@megamor2.bsky.social

How can we interpret LLM features at scale? 🤔

Current pipelines use activating inputs, which is costly and ignores how features causally affect model outputs!
We propose efficient output-centric methods that better predict the steering effect of a feature.

New preprint led by @yoav.ml 🧵1/

January 28, 2025 at 7:34 PM

Reposted by Pushpdeep

Nature

@nature.com

Roughly 6,000 readers answered our poll, with many declaring that Bluesky was nicer, kinder and less antagonistic to science than X

https://go.nature.com/42tH8Ai

Bluesky’s science takeover: 70% of Nature poll respondents use platform

Roughly 6,000 readers answered our poll, with many declaring that Bluesky was nicer, kinder and less antagonistic to science than X.

go.nature.com

January 24, 2025 at 11:52 AM

Reposted by Pushpdeep

MilaNLP Lab

@milanlp.bsky.social

#ThrowbackThursday #NLProc

"My Answer is C" by Wang et al. highlights that first-token evaluation does not accurately reflect LLM behavior in user interactions, urging against sole reliance on this method.

“My Answer is C”: First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language Models

Xinpeng Wang, Bolei Ma, Chengzhi Hu, Leon Weber-Genzel, Paul Röttger, Frauke Kreuter, Dirk Hovy, Barbara Plank. Findings of the Association for Computational Linguistics: ACL 2024. 2024.

buff.ly

January 16, 2025 at 3:00 PM

Reposted by Pushpdeep

Miriam Schirmer

@miriamschirmer.bsky.social

📕 My dissertation on #NLP for #Violence Studies has been published: mediatum.ub.tum.de?id=1751256

I've been looking at #abusive behavior online, as well as sharing of personal experiences with violence, incl. psychological #trauma.

Excited to push this research forward and connect with others 🌐

mediaTUM - Medien- und Publikationsserver

mediatum.ub.tum.de

January 17, 2025 at 6:12 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news