🚴♂️🧗♂️🏃♂️🚄🚎⛴️
Average enjoyer of the outdoors and public transit
Definitely see this as some level of proof that smaller companies could still train effective LLMs with fewer resources.
Additionally, the availability of the DeepSeek weights will let even more people fine-tune it, or train distilled models.
Definitely see this as some level of proof that smaller companies could still train effective LLMs with fewer resources.
Additionally, the availability of the DeepSeek weights will let even more people fine-tune it, or train distilled models.