micaebe
micaebe
@micaebe.bsky.social
You can literally train it yourself relatively easily using e.g veRL starting from a base/instruct model
January 27, 2025 at 3:05 AM