Trying to learn jazz piano.
Probably camping.
I believe in you.
📍 DC, SF, Taiwan
If you zoomed into a workflow node, you'd say: "oh yeah, this is agentic computing"
But if you zoom out, you'd say: "this is a classic workflow engine"
If you zoomed into a workflow node, you'd say: "oh yeah, this is agentic computing"
But if you zoom out, you'd say: "this is a classic workflow engine"
It's difficult from any one point on the frontier to make strong inferences about adjacent points.
E.g. AI can draft top-quality research memos, but still struggles at cocktail party chat.
It's difficult from any one point on the frontier to make strong inferences about adjacent points.
E.g. AI can draft top-quality research memos, but still struggles at cocktail party chat.
..in which the first key in this pic is an example of "ugly", while the second key in this pic is an example of "beautiful"
..in which the first key in this pic is an example of "ugly", while the second key in this pic is an example of "beautiful"
If a government trained an LLM on its intelligence materials, the model weights would be the most sensitive asset in its possession.
The opposite of compartmentalized information.
If a government trained an LLM on its intelligence materials, the model weights would be the most sensitive asset in its possession.
The opposite of compartmentalized information.
It's bound to happen..
It's bound to happen..
Eg with an imperfect agent, you can:
- Maximize shots on goal, then detect which went in
- Maximize shot opportunities, then just take the best K
Eg with an imperfect agent, you can:
- Maximize shots on goal, then detect which went in
- Maximize shot opportunities, then just take the best K
ChatGPT is better at GSheets formula help than the in-app Gemini.
ChatGPT is better at GSheets formula help than the in-app Gemini.
T-3 years to my dream of playing Mario Kart inside of the Google Maps dataset.
deepmind.google/discover/blo...
T-3 years to my dream of playing Mario Kart inside of the Google Maps dataset.
deepmind.google/discover/blo...
Fine-grained permissions with OAuth is a tornado of pain even for developers.
Shopify nailed it -- and for non-developers no less!
Fine-grained permissions with OAuth is a tornado of pain even for developers.
Shopify nailed it -- and for non-developers no less!
- The Recipe Blog Lobby
- The Recipe Blog Lobby
LLMs don't yet universally outperform those task-specific models yet.. but it's pretty clear that they're on a path to.
.. So does Big Tech just freeze those products in place to wait?
LLMs don't yet universally outperform those task-specific models yet.. but it's pretty clear that they're on a path to.
.. So does Big Tech just freeze those products in place to wait?
When OCRing a full-page of text w/ an LLM, it can go off the rails and, when it does - it usually stays off the rails.
Feels like an interesting substrate to create experiments to study hallucination.
When OCRing a full-page of text w/ an LLM, it can go off the rails and, when it does - it usually stays off the rails.
Feels like an interesting substrate to create experiments to study hallucination.
We've been doing "agent progress bar" experiments at @everpilotapp that let you know what's happening & also invite you to collab with the agent.
We've been doing "agent progress bar" experiments at @everpilotapp that let you know what's happening & also invite you to collab with the agent.
I hope that’s not happening right now..
I hope that’s not happening right now..
I think the final step in empowering users would be tags on posts that work with feeds.
I could auth a 3rd party service to tag posts to me to up/down rank them in my own feed.
Keep $USER but not their rants about $TOPIC
I think the final step in empowering users would be tags on posts that work with feeds.
I could auth a 3rd party service to tag posts to me to up/down rank them in my own feed.
Keep $USER but not their rants about $TOPIC
Here is an agent that can negotiate prices, make package deals, and actually sell you candy online:
hawke.bot
Who wants to be the first human in history to buy something from an AI street hawker?
Blog post about it:
edwardbenson.com/2024/11/the-...
Here is an agent that can negotiate prices, make package deals, and actually sell you candy online:
hawke.bot
Who wants to be the first human in history to buy something from an AI street hawker?
Blog post about it:
edwardbenson.com/2024/11/the-...