Jacob Portes
banner
jacobianneuro.bsky.social
Jacob Portes
@jacobianneuro.bsky.social
Research Scientist @MosaicMLxDatabricks. I like it when neuroscience inspires AI 🧠+🖥️
Reposted by Jacob Portes
We're probably a little too obsessed with zero-shot retrieval. If you have documents (you do), then you can generate synthetic data, and finetune your embedding. Blog post lead by @jacobianneuro.bsky.social shows how well this works in practice.

www.databricks.com/blog/improvi...
Improving Retrieval and RAG with Embedding Model Finetuning
Fine-tune embedding models on Databricks to enhance retrieval and RAG accuracy with synthetic data—no manual labeling required.
www.databricks.com
February 26, 2025 at 12:48 AM
Reposted by Jacob Portes
What’s the most effective way to add new domain knowledge into an open LLM? A new blog post from my team covers experiments we did at the beginning of the year to start answering this question. It starts, unsurprisingly, with sweeping your learning rate… www.databricks.com/blog/charact...
Characterizing Datasets and Building Better Models with Continued Pre-Training
www.databricks.com
November 25, 2024 at 11:29 PM