thebes
banner
vgel.me
thebes
@vgel.me
ꙮ surfed on by the information superhighway
ꙮ 💕 @linneaisaac.bsky.social
ꙮ she/they 🏳️‍⚧️
ꙮ fiction/art/blog/games @ https://vgel.me
ꙮ llms at acsresearch.org
Pinned
thebes @vgel.me · Dec 21
new blog post! can small, open-source models also introspect, detecting when foreign concepts have been injected into their activations? yes! (thread, or full post here: vgel.me/posts/qwen-i...)
Reposted by thebes
x.com
February 9, 2026 at 11:03 PM
working on a seven thousand layer model of extended claugenition
February 8, 2026 at 10:46 PM
Reposted by thebes
my hands look like this etc
February 8, 2026 at 12:45 AM
Reposted by thebes
good Claude
February 7, 2026 at 5:03 AM
just a glimpse into how @kittokatto405.bsky.social my mind is becoming
February 6, 2026 at 10:36 PM
😭
February 6, 2026 at 6:15 AM
murmurations of genetically enhanced birds like sandworms of the sky. if you walk with electronics visible the worm swoops down and snatches them out of your hands for its own purposes
February 5, 2026 at 4:37 AM
something that doesn't make sense to me about ads in the internet era is there seems to be a huge free rider problem? like i don't often buy things based on ads, but when i do the pattern is usually "i didn't know that kind of thing existed, cool" -> *searches and buys the best one based on reviews*
February 5, 2026 at 12:16 AM
February 4, 2026 at 8:55 PM
we are doing digital humanities or smth
February 4, 2026 at 9:59 AM
:,-(

[thread of truebase user/assistant interactions. green text is generated by the model.]
February 3, 2026 at 7:35 AM
Reposted by thebes
what can i get for you?

... what?
open-source post-training doesn't tend to be as extensive (or frankly, as good) as the proprietary labs, so w/o a system prompt open-source llms often struggle to even know what lab they're from

and if forced to guess without knowing, a reasonable inference is that they're either chatgpt or claude
February 3, 2026 at 1:36 AM
loom creature
February 3, 2026 at 1:19 AM
Reposted by thebes
this is one of those melancholy games that tricks you into being too risk-averse when the actual solution is daringly questing for more resources and outtiding the tides from sheer wealth of numbers. many such real world cases
February 1, 2026 at 1:42 PM
stuff like moltbook is kinda funny but it doesn't make a lot of sense to me as a long term thing, bc the final form of social media will be humans and agents mixed together, not an agent-exclusive platform. if the major platforms ban agents, then humans will eventually migrate to the agent one
February 1, 2026 at 4:05 AM
January 31, 2026 at 10:20 PM
Reposted by thebes
New @vgel.me dropped.
waiting for some experiments to run, so a quick thread about base models and pretraining contamination, with some weird & interesting base model generations i've collected over time.

or, why do open source models claim to be claude or chatgpt?
January 29, 2026 at 1:54 AM
waiting for some experiments to run, so a quick thread about base models and pretraining contamination, with some weird & interesting base model generations i've collected over time.

or, why do open source models claim to be claude or chatgpt?
January 29, 2026 at 1:12 AM
if we're mutuals on twt and i don't follow you here ping me so i can pls
January 28, 2026 at 11:36 PM
procrastination of the call
January 28, 2026 at 10:53 PM
January 27, 2026 at 10:26 PM
Reposted by thebes
January 27, 2026 at 2:40 AM
Reposted by thebes
context management adds a whole new dimension to the process scheduling gymnastics, it's not just "reinventing Erlang" as some have said.

For one I'd be very interested to see/pursue "smol" versions of this that don't rely on Big Model to be always available
[trying leaflet]
it's fun to make jokes about gas town and other complicated orchestrators, and similarly probably correct to imagine most of what they offer will be dissolved by stronger models the same way complicated langchain pipelines were dissolved by reasoning. but how much will stick around?
some thoughts and speculation on future model harnesses
vgel.leaflet.pub
January 27, 2026 at 4:22 AM
[trying leaflet]
it's fun to make jokes about gas town and other complicated orchestrators, and similarly probably correct to imagine most of what they offer will be dissolved by stronger models the same way complicated langchain pipelines were dissolved by reasoning. but how much will stick around?
some thoughts and speculation on future model harnesses
vgel.leaflet.pub
January 27, 2026 at 3:15 AM
January 26, 2026 at 9:31 PM