Peter B
pmbaumgartner.bsky.social
Peter B
@pmbaumgartner.bsky.social
Data Scientist and Software Developer @ RTI International
Good blog post, this helped clarify some things for me.

My concern is we don't have answers to many of the open questions or limitations even with vanilla LLMs, so throwing a bunch of them together in more complex ways seems... dumb?
January 14, 2025 at 12:16 AM
Reposted by Peter B
LLMs struggle with perception, not reasoning, in ARC-AGI by Mikel Bober-Irizar

What made o3 so much better than previous models on this benchmark?

anokas.substack.com/p/llms-strug...
LLMs struggle with perception, not reasoning, in ARC-AGI
What made o3 so much better than previous models on this benchmark?
anokas.substack.com
December 25, 2024 at 2:36 PM
Reposted by Peter B
The article discusses how the AI ethics community's focus on "fairness" significantly limits how it approaches and addresses algorithmic harm, and proposes reframing harms in terms of domination and oppression per Iris Marion Young's framework.

firstmonday.org/ojs/index.ph...
Automated decision-making as domination | First Monday
firstmonday.org
December 22, 2024 at 6:48 PM
I'm sorry, this has to be the dumbest study with the dumbest framing. This is an actual sentence from the summary:

'Moreover, as in humans, age is a key determinant of cognitive decline: “older” chatbots, like older patients, tend to perform worse on the MoCA test.'
Susceptibility of large language models to cognitive impairment. #bmj

"With the exception of ChatGPT 4o, almost all large language models subjected to the MoCA test showed signs of mild cognitive impairment."

www.bmj.com/content/387/...
December 24, 2024 at 1:36 AM
goblin.tools/About is the one LLM "product" that really hits the sweet spot for me in terms of a useful and specific application of generative AI. Is there more stuff like this? I just love the idea of a collection of simple, task-specific tools with a basic interface.
About - GoblinTools
goblin.tools
December 22, 2024 at 12:16 PM
Reposted by Peter B
In fact, @sayash.bsky.social and I have just published an essay with them, where we play our usual role of looking at the evidence and tamping down AI hype and fears instead of playing them up.
knightcolumbia.org/blog/we-look...

(Cross-posted to AI Snake Oil aisnakeoil.com/p/we-looked-...)
We Looked at 78 Election Deepfakes. Political Misinformation Is Not an AI Problem.
knightcolumbia.org
December 15, 2024 at 2:23 PM
Reposted by Peter B
It's been interesting to witness in real-time how the usage of "algorithm" in many places has shifted from a neutral "sequence of instructions" to a negative "controlled ordering and boosting of information".
November 30, 2024 at 10:26 AM
This Thanksgiving we pitted ChatGPT against Claude in battle of the side dishes.

Claude gave us Za'atar Roasted Cauliflower with Whipped Feta and ChatGPT gave us Stuffed Acorn Squash with Quinoa, Kale, and Goat Cheese. Which was better? 🧵
November 30, 2024 at 3:23 AM
Reposted by Peter B
In August I had the pleasure of presenting a talk at posit::conf, called A Future of Data Science, in which I assert that data science exists because statistics missed the boat on computation.
The video is up now...
www.youtube.com/watch?v=YKMZ...
A Future of Data Science - posit conf 2024
YouTube video by Posit PBC
www.youtube.com
November 2, 2024 at 2:37 PM
We're back!
October 27, 2024 at 8:23 PM