Robert Nishihara
robertnishihara.bsky.social
Robert Nishihara
@robertnishihara.bsky.social
Co-founder of Anyscale. Co-creator of Ray. Previously PhD ML at Berkeley.
DeepSeek released smallpond, a big data processing framework built on top of Ray.
- Smallpond targets high performance data processing.
- It provides a high-level dataframe API
- Targets petabyte-level scaling

The challenges around training data prep only grow when you include multimodal data.
March 4, 2025 at 6:34 AM