ruggsea
@ruggsea.bsky.social
AI/NLP research at @uni-graz.at, (Data) Journalism for scomodo.org.

Before: AI policy & Data science @_interface_eu, geopolitical analysis for @Geopoliticainfo
The more I spend time in Austria, the more this feels true
November 4, 2025 at 4:58 PM
We are hiring a Senior Scientist! If questions like "how much of Common Crawl can we actually store and process on our infrastructure?" get you excited, you would prob love this job. Come do research at fun scales with us
My lab is looking for a Senior Scientist (= PostDoc with option of permanency)!

We are looking for someone interested in doing cutting-edge computational social science + helping us with data & software engineering 🤓.

See job ad for details jobs.uni-graz.at/en/jobs/7d14...
Universität Graz
jobs.uni-graz.at
October 28, 2025 at 6:07 PM
A thing I've been working on for the past year: an LLM benchmark built on the dreaded entrance exam for Italian medical faculties. I will present it in two weeks at @ailc-nlp.bsky.social CLiC-it in Cagliari!
Ruggero Marino Lazzaroni, Alessandro Angioi, Michelangelo Puliga, Davide Sanna, Roberto Marras
MedBench-IT: A Comprehensive Benchmark for Evaluating Large Language Models on Italian Medical Entrance Examinations
https://arxiv.org/abs/2509.07135
September 10, 2025 at 12:47 PM
from this awesome blogpost: snats.xyz/pages/articl...
September 9, 2025 at 9:27 AM
Life is going from one cool spot to train AI models to another

In this fancy Berlin library you can listen to vinyls while you do it!
July 30, 2025 at 9:26 AM
Me looking very dumb while pointing at things at two academic events:

1. Pointing at logits inside @repligate.bsky.social's loom for the "Braive New World" conference at @uni-graz.at
2. Pointing at my poster at @ic2s2.bsky.social last week on improving LLM agentic natural conversation synthesis
July 28, 2025 at 1:56 PM
Only prisoners have time to read, and if you want to engage in a twenty-year long research project funded by the state, you will have to kill someone.

Sorry for the Fisher posting but it's so good
was true in 2012 and it is true now
July 27, 2025 at 1:36 PM
I look really bad/funny in this picture but I am glad I was given the opportunity to talk about LLM research to an audience of cool researchers!

Fun fact: I also held a hands-on session that involved playing around with Bluesky data!
Scientists know it's happening: #LLMs like #ChatGPT are quietly transforming #academia —helping write papers, draft grants & process data. CSH's @lespin.bsky.social organized a 3-day workshop to explore their ethical, practical use in research—from writing to coding to data annotation.
July 10, 2025 at 2:39 PM
Karpathy on training Neural Networks: you should go slowly and be paranoid

me, while vibecoding torch code: what if i just increase the paranoid part?
June 9, 2025 at 2:27 PM
how not to name your ai agents
June 3, 2025 at 7:19 PM
genz semantic embeddings engineering from my colleague
May 27, 2025 at 10:48 AM
backpropagation was inspired by freud
May 21, 2025 at 10:53 AM
“How about we pull over for a bit and get some rest?” - GPT4, when it's their turn to drive
May 20, 2025 at 9:23 PM
I wish all textbooks were written like this one from @jurafsky.bsky.social
May 18, 2025 at 11:14 PM
was true in 2012 and it is true now
May 16, 2025 at 9:56 AM
Stumbled upon new ammunition for my personal struggle against RLHF: the tendency of models trained this way to give correct-sounding answers makes us believe in wrong things more

arxiv.org/abs/2409.12822
Language Models Learn to Mislead Humans via RLHF
Language models (LMs) can produce errors that are hard to detect for humans, especially when the task is complex. RLHF, the most popular post-training method, may exacerbate this problem: to achieve h...
arxiv.org
May 15, 2025 at 1:03 AM
Looks important for my rec-algorithm people: hard confirmation that Google uses essentially hybrid rankings for search, combining PageRank, BERT embeddings, and a secret ingredient (user data)
New Google/DOJ docs released, and to my knowledge this is the first time we've had a primary source specifically saying Google uses Chrome data in a popularity signal for ranking. 👀
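Not a claim about how Google actually combines these, just a toy sketch of what a "hybrid" score blending a link-graph signal, an embedding similarity, and a popularity signal could look like (all names and weights made up):

```python
from dataclasses import dataclass

@dataclass
class PageSignals:
    pagerank: float       # link-graph authority score, normalized to [0, 1]
    embedding_sim: float  # semantic similarity between query and page embeddings
    popularity: float     # behavioral signal (e.g. visits), normalized to [0, 1]

def hybrid_score(s: PageSignals,
                 w_rank: float = 0.3,
                 w_sem: float = 0.5,
                 w_pop: float = 0.2) -> float:
    """Toy linear blend of the three signal families; weights are arbitrary."""
    return w_rank * s.pagerank + w_sem * s.embedding_sim + w_pop * s.popularity

# Rank a couple of candidate pages by the blended score.
candidates = {
    "page_a": PageSignals(pagerank=0.9, embedding_sim=0.4, popularity=0.2),
    "page_b": PageSignals(pagerank=0.3, embedding_sim=0.8, popularity=0.7),
}
ranking = sorted(candidates, key=lambda k: hybrid_score(candidates[k]), reverse=True)
print(ranking)
```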
May 13, 2025 at 9:07 PM
All of these mentions of Zero make me think about Slothrop
seems interesting, perhaps even promising

arxiv.org/pdf/2505.03335
May 8, 2025 at 10:31 AM
Reposting from the other site to spread it here: apparently, the idea that RLHF irons out creativity in LLMs is now corroborated by this paper arxiv.org/pdf/2505.00047
May 6, 2025 at 11:45 PM
Protestant-Catholic German beefs are my favorite kind of beefs
April 9, 2025 at 1:23 PM
At Weizenbaum today, talking about slightly more complicated chatbots!
April 3, 2025 at 12:04 PM
A friend and colleague at my lab (the CS² under @janalasser.bsky.social) is organizing a super nice school on democracy and social media here in Graz!

If you're interested, check it out and share! Link: sicss.io/2025/graz/
March 27, 2025 at 2:46 PM
Claude, my friend, are you ok?
March 24, 2025 at 3:56 PM
Has anybody ever tried GRPO training with annealing w.r.t. the generation temperature?

I feel like that could be a way to explore more of the search space at the beginning of training, and then be less "creative" once the model is hopefully already on the right track
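Roughly what I mean, as a minimal sketch: a linear decay of the sampling temperature over training steps (start/end temperatures are made-up values, and the training-loop names in the comments are placeholders, not a real library API):

```python
def annealed_temperature(step: int, total_steps: int,
                         t_start: float = 1.2, t_end: float = 0.7) -> float:
    """Linearly decay the sampling temperature over GRPO training steps.

    Hotter sampling early -> more diverse completions per prompt (wider
    exploration of the search space); cooler sampling later -> exploit what
    the policy has already learned. t_start / t_end are illustrative only.
    """
    frac = min(step / max(total_steps, 1), 1.0)
    return t_start + (t_end - t_start) * frac


# Where it would plug into a GRPO-style loop (policy, reward_fn, grpo_loss
# are placeholders for whatever training stack is actually used):
# for step in range(total_steps):
#     temp = annealed_temperature(step, total_steps)
#     completions = policy.generate(prompts, do_sample=True,
#                                   temperature=temp, num_return_sequences=G)
#     rewards = reward_fn(prompts, completions)                 # shape (batch, G)
#     advantages = (rewards - rewards.mean(-1, keepdim=True)) \
#                  / (rewards.std(-1, keepdim=True) + 1e-8)
#     grpo_loss(policy, completions, advantages).backward()
```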
March 20, 2025 at 3:22 PM
And so it begins
March 5, 2025 at 2:33 AM