prolewiki.bsky.social
@prolewiki.bsky.social
Communism doesn't belong only to those who can afford to pay for it. We remain committed to our mission of offering proletarian education without discrimination.
December 16, 2025 at 4:58 PM
Unfortunately no ETA yet on our RAG files but they have also worked on the MIA alongside the PW archive they are making: github.com/percy-raskov...
December 16, 2025 at 4:58 PM
But it's also going to be FOSS and usable by anyone for any purpose.

I don't know about you but this seems much more beneficial to the workers than another SaaS you have to pay for to regurgitate only things the service deems acceptable.
December 16, 2025 at 4:58 PM
This means you will be able to make RAG models out of local models too without having to go through us every time. Or fine-tune them with our content. Or anything you like.

Sure, this is less "sexy" than a neat react.js webapp with animations and a ready-made commercial model.
December 16, 2025 at 4:58 PM
We ourselves are making ProleWiki available for RAG.

We are working with a dev who has received our entire wiki corpus, and will make it available open source once it's parsed and cleaned up.
December 16, 2025 at 4:58 PM
One last thing on their AI - since it communicates with API, there is also not only the model's bias (both GPT and Deepseek) but also a lighter system prompt injected by both. APIs are generally less censored than the web interfaces, but still have *some* system prompt in it.
December 16, 2025 at 4:58 PM
Is this really what a "major breakthrough" looks like (to reuse their super lengthy self-congratulatory press release), or is this yet another profit-driven SaaS?
December 16, 2025 at 4:58 PM
But they did not publish this documentation anywhere to help other comrades build their own systems. Instead, they want to sell paid tiers on a cloud service that talks to OpenAI.

Guess if you can't afford it you can't be a communist.
December 16, 2025 at 4:58 PM
The takeaway is the WSWS is very cryptic about their technical implementation and very protective of their system. You don't just feed the AI the blog links, at least, if you do RAG correctly. You clean it in a way the AI can parse it - whether they did that we can't say.
December 16, 2025 at 4:58 PM
As for how we found the info - it's all there in the HTML code since it needs to call JS and JSON files to work. Nothing was "reverse-engineered" (which their ToS says is verbotten 😡)
December 16, 2025 at 4:58 PM
Disclaimer, it is possible our investigation is wrong in some way. We looked at the JSON and JS files they call in the HTML of their homepage, so the information is there - whether it's used in the way it's presented is something else.
December 16, 2025 at 4:58 PM
RAG is legitimate, and it's also the least-effort service to make LLM models more specific. You start with a fully-trained model and then just tell it "look at these pages first before answering".
December 16, 2025 at 4:58 PM
In this method the LLM looks up a corpus of data before answering the user query. It can indeed look at the WSWS whopping 250k magazine articles of the past 25 years (do trots do nothing but write all day long?) and makes an answer on it.
December 16, 2025 at 4:58 PM
This gives some indication as to how their AI works. First, your chats are sent to OpenAI. So maybe don't discuss things too openly with Socialism AI.

Second, it seems to be using RAG, or retrieval augmented generation.
December 16, 2025 at 4:58 PM
Our investigation shows they call two models: GPT4.1 and Deepseek.
GPT is OpenAI and is a closed-source model. So they won't have downloaded it to fine-tune train it - not possible.

Instead you have to communicate with GPT through OpenAI's API endpoint.
December 16, 2025 at 4:58 PM
They're also very protective of how exactly their AI works, how it was made, etc, but we looked into it. Let's untangle it.
December 16, 2025 at 4:58 PM
Let's start the privacy policy. Lots of mentions for paid tiers and copyright retention.

They claim everything "Socialism AI" outputs belong solely to WSWS.

The trotskist revolution will not be televised - because the workers won't have paid the broadcasting fees.
December 16, 2025 at 4:58 PM
So we took a technical look at the new "Socialism AI" from WSWS - the trots.
tl;dr: lots of intellectual property protection, paid services, and your chats get routed to commercial AI services (GPT and Deepseek).

A short thread on what we found and what we'd do different. 🧵
December 16, 2025 at 4:58 PM
The most effective way to achieve revolution in the imperial core countries is to whine about Joseph Stalin at every opportunity
September 12, 2025 at 5:28 PM
What's your favorite fruit and why is it the fruits of your labor?
August 8, 2025 at 5:28 PM
Squirrel and Hedgehog is a cartoon from the DPRK which ended up becoming so popular that it eventually reached foreign audiences, even in the West. It shows an espionage story within a war between Flower Hill and the invading Weasel Empire. You can watch the series here: archive.org/search?query...
Internet Archive: Digital Library of Free & Borrowable Texts, Movies, Music & Wayback Machine
Redirecting you to a lite version of archive.org...
archive.org
August 1, 2025 at 3:07 PM
Hello
May 17, 2025 at 8:27 PM