Ning
banner
ningcao.bsky.social
Ning
@ningcao.bsky.social
Thinking about Data, a lot
If you’re as excited as we are about pushing the boundaries of data curation, stop by booth 303 at NeurIPS to chat with us! We’re also hiring across Research and Engineering: jobs.ashbyhq.com/DatologyAI
DatologyAI Jobs
DatologyAI Jobs
jobs.ashbyhq.com
November 26, 2024 at 1:35 AM
I’m incredibly grateful to have contributed to this mission of building the best LLM data pipeline. Collaborating with the team, I’ve learned so much about Data-Centric ML, designing thoughtful and rigorous experiments, and the engineering principles behind creating a resilient data pipeline.
November 26, 2024 at 1:35 AM
Over the past few months, we’ve run hundreds of ablations, rigorously tested hypotheses, and experimented relentlessly to ensure our results are both scalable and robust. Read more here: www.datologyai.com/post/technic...
Technical Deep-Dive: Curating Our Way to a State-of-the-Art Text Dataset
Our data curation pipeline to obtain substantial improvements in LLM quality, training speed, and inference efficiency.
www.datologyai.com
November 26, 2024 at 1:35 AM
Thrilled to share that we’ve surpassed DCLM and built a state-of-the-art data curation pipeline to enable better, faster, and more cost-efficient LLMs!
DatologyAI Jobs
DatologyAI Jobs
jobs.ashbyhq.com
November 26, 2024 at 1:35 AM