https://selfsupervised.substack.com/
I've build an app to explore open-ended idea discovery using LLMs in a multi-agent evolutionary algorithm.
I've also open source the code so you can play with yourself
I've build an app to explore open-ended idea discovery using LLMs in a multi-agent evolutionary algorithm.
I've also open source the code so you can play with yourself
Hopefully the was something more exciting in the deal.
Hopefully the was something more exciting in the deal.
#neosky #medsky
www.irishtimes.com/business/202...
#neosky #medsky
www.irishtimes.com/business/202...
Lots of work went into this one co-led with @johnotoole.bsky.social.
Blog: www.cergenx.com/blog/scaling...
Paper: www.nature.com/articles/s41...
#Neosky
Lots of work went into this one co-led with @johnotoole.bsky.social.
Blog: www.cergenx.com/blog/scaling...
Paper: www.nature.com/articles/s41...
#Neosky
Here, the model’s latent representations show a grid structure matching the task.
#MLSKy #NeuroAI
Here, the model’s latent representations show a grid structure matching the task.
#MLSKy #NeuroAI
New blog post
open.substack.com/pub/selfsupe...
New blog post
open.substack.com/pub/selfsupe...
youtu.be/UhG56kltfP4?...
youtu.be/UhG56kltfP4?...
This is a great example of 'emergent phenomenon':
- None of the ants understand the problem they're solving.
- None of them can see the whole shape.
- A series of small decisions or rules add up to something with a new layer of complexity.
This is a great example of 'emergent phenomenon':
- None of the ants understand the problem they're solving.
- None of them can see the whole shape.
- A series of small decisions or rules add up to something with a new layer of complexity.
I think there are some strong parallelism between what's happened in the last few years of AI with what happened in last few centuries of physics.
Check it out here open.substack.com/pub/selfsupe...
I think there are some strong parallelism between what's happened in the last few years of AI with what happened in last few centuries of physics.
Check it out here open.substack.com/pub/selfsupe...
Check it out here open.substack.com/pub/selfsupe...
Is _anyone_ surprised?
Is _anyone_ surprised?
blog.google/products/gem...
blog.google/products/gem...
Excellent write up of the surface code and Google's result with their new quantum computer chip Willow 🧪⚛️
www.quantamagazine.org/quantum-comp...
Excellent write up of the surface code and Google's result with their new quantum computer chip Willow 🧪⚛️
www.quantamagazine.org/quantum-comp...
So in some ways the more interesting question to me at the moment is: Why aren’t we in a pandemic yet?
Story here, 🧵 to come:
🧪#IDSky
So in some ways the more interesting question to me at the moment is: Why aren’t we in a pandemic yet?
Story here, 🧵 to come:
🧪#IDSky
🧪
renaissancephilanthropy.org/initiatives/...
🧪
renaissancephilanthropy.org/initiatives/...
We overlapped in TPPC group in King's College London during our PhDs. An huge amount PhDs from there are now working with AI for various applications in science and industry.
arxiv.org/abs/2411.02453
- very nice results
- interesting that TTT helps smaller models more (sign that scale is disproportionately useful for more memorization?)
- What a pain, I really hope we don't have to do this 😅
arxiv.org/abs/2411.07279
- very nice results
- interesting that TTT helps smaller models more (sign that scale is disproportionately useful for more memorization?)
- What a pain, I really hope we don't have to do this 😅
arxiv.org/abs/2411.07279
Highlighted difference between screening and clinical settings. AI can actually increase workload in clinical settings when an abnormality is found due to time spent in differential diagnosis.
jamanetwork.com/journals/jam...
Highlighted difference between screening and clinical settings. AI can actually increase workload in clinical settings when an abnormality is found due to time spent in differential diagnosis.
jamanetwork.com/journals/jam...
They found you can totally destroy a model by pruning a single weight.
Also have some nice results that use this to improve quantization.
My main concern though is what this tells us about how brittle these models can be.
arxiv.org/html/2411.07...
They found you can totally destroy a model by pruning a single weight.
Also have some nice results that use this to improve quantization.
My main concern though is what this tells us about how brittle these models can be.
arxiv.org/html/2411.07...