Is there anywhere I can find media coverage about this that isn't a press release or clickbait about "controversial startup is doing something scary"?
Is there anywhere I can find media coverage about this that isn't a press release or clickbait about "controversial startup is doing something scary"?
Not sure what the gripe is here, its not like this is meant to be a playable replacement for quake
Not sure what the gripe is here, its not like this is meant to be a playable replacement for quake
Their most expensive output tokens (after current promotional discount is over) is $2.19/million tokens (~11hrs of continuous output?)
The figure I've found for industrial energy pricing in Hangzhou is $0.091/kwh
Their most expensive output tokens (after current promotional discount is over) is $2.19/million tokens (~11hrs of continuous output?)
The figure I've found for industrial energy pricing in Hangzhou is $0.091/kwh
I found this for apple silicon, but it's worth mentioning this is a 3-bit quantization
Sorry for the X link
x.com/awnihannun/s...
I found this for apple silicon, but it's worth mentioning this is a 3-bit quantization
Sorry for the X link
x.com/awnihannun/s...
(Or one macbook, if you're okay with slow performance)
The larger distilled models are still very impressive from what I've read
(Or one macbook, if you're okay with slow performance)
The larger distilled models are still very impressive from what I've read
I guess they could publicize their utility bills
I guess they could publicize their utility bills
The weights Deepseek published are just sets of numbers, so if you have the VRAM to run the model on your local machine, there's not much of a program that could even be spyware - its just matrices that your GPU runs math on
The weights Deepseek published are just sets of numbers, so if you have the VRAM to run the model on your local machine, there's not much of a program that could even be spyware - its just matrices that your GPU runs math on
& that data could be cross-referenced with however carbon-intensive the grid is in Hangzhou
& that data could be cross-referenced with however carbon-intensive the grid is in Hangzhou