for further testing, you might want to consider esoteric programming languages, or even making up your own syntax, to better indicate if the model employes really generalist reasoning and not just in data distribution optimization.
November 25, 2025 at 8:24 AM
for further testing, you might want to consider esoteric programming languages, or even making up your own syntax, to better indicate if the model employes really generalist reasoning and not just in data distribution optimization.
Honestly, that’s one of the worst analogies I heard. It adds nothing of value to the old analogy of the managerial position, just placed it in a unnecessary domain specific role which makes it more confusing.
October 24, 2025 at 7:13 PM
Honestly, that’s one of the worst analogies I heard. It adds nothing of value to the old analogy of the managerial position, just placed it in a unnecessary domain specific role which makes it more confusing.
For many tasks, there are model-capability thresholds that can already be saturated (e.g. SQL query generation, various fiction texts), so the future of LMs are in many ways personalised, private, on-device models giving rise of independence of the big LLM providers and their data harvesting.
October 15, 2025 at 5:30 PM
For many tasks, there are model-capability thresholds that can already be saturated (e.g. SQL query generation, various fiction texts), so the future of LMs are in many ways personalised, private, on-device models giving rise of independence of the big LLM providers and their data harvesting.
Feels like models (local or API) are being held back by their UI currently…need more than a chat window, need to be able to write and organize projects (so perhaps note taking app, obsidian like) but also need to run/validate its code so IDE like. Perhaps a UI that can flick between those views.
September 9, 2025 at 8:27 AM
Feels like models (local or API) are being held back by their UI currently…need more than a chat window, need to be able to write and organize projects (so perhaps note taking app, obsidian like) but also need to run/validate its code so IDE like. Perhaps a UI that can flick between those views.