Martin Goodson
martingoodson.bsky.social
CEO Evolution AI. Organiser of the London Machine Learning Meetup (the largest network of AI practitioners in Europe) https://www.meetup.com/london-machine-learning-meetup/
Reposted by Martin Goodson
Changing conspiracy theory beliefs is very hard, but a replicated finding shows a short chat with GPT-4 changes people’s belief in conspiracy theories for the long term.

Why? It isn’t rhetorical tricks, it's that AI provides relevant facts and evidence tailored to each person's specific beliefs.
February 19, 2025 at 1:22 PM
Reposted by Martin Goodson
OpenAI launched a new benchmark based on real-world software engineering tasks from Upwork

Scores are awarded monetarily, by how much an AI could theoretically earn

And Sonnet is currently the top model. Bold.

arxiv.org/abs/2502.12115
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?
We introduce SWE-Lancer, a benchmark of over 1,400 freelance software engineering tasks from Upwork, valued at $1 million USD total in real-world payouts. SWE-Lancer encompasses both independent engi...
arxiv.org
February 19, 2025 at 11:28 AM
Erm, the rate of innovation is already a derivative. Can't you just look at that?
February 18, 2025 at 8:46 PM
Reposted by Martin Goodson
"Who sows the wind shall reap the whirlwind." (Hosea 8:7)
January 29, 2025 at 5:52 AM
Reposted by Martin Goodson
One of the funniest parts about this story rn is that DeepSeek probably trained their model on the output of OpenAI's models. Extreme schadenfreude watching a company built on theft have their IP stolen, improved, then given away for free.
DeepSeek’s R1 model is challenging the very foundations of the past two years of generative AI hype, wiping $1 trillion off the value of major US tech companies.

But it’s also another example that the US strategy to try to contain Chinese tech is failing — and pushing them to get even more creative.
DeepSeek shows the US failure to contain Chinese tech
Despite chip restrictions, Chinese AI threatens the foundation of the generative AI hype cycle
www.disconnect.blog
January 28, 2025 at 5:30 AM
Maybe use a different model to find out about Tiananmen Square. Or look it up in Wikipedia or something.
January 28, 2025 at 7:35 AM
Oh dear
January 27, 2025 at 9:05 AM
Reposted by Martin Goodson
What could contribute more to government efficiency than securing lucrative contracts for your crypto donors to implement inefficient technology that is uniquely suited to solving problems you don’t have?

www.bloomberg.com/news/article...
Musk Exploring Blockchain Use in US Government Efficiency Effort
Elon Musk has initiated conversations about using blockchain technology at the new Department of Government Efficiency, according to people with knowledge of the discussions. It’s the latest sign of t...
www.bloomberg.com
January 25, 2025 at 8:19 PM
How would the US government respond to this level of devastation if it were caused by a terrorist organisation? Decades of war and trillions of dollars spent?

But since it was just caused by climate change, not so much.
January 11, 2025 at 8:37 PM
I've never read anything by Melanie Mitchell I didn't like but this, on the recent breakthrough by OpenAI, is particularly good.
December 24, 2024 at 7:59 AM