Alexander Junge
jungealexander.bsky.social
Alexander Junge
@jungealexander.bsky.social
CTO & Co-founder at http://amass.tech | PhD bioinformatics | Applies machine learning to understand what humans write and say.
💬 Reach out if you want to know more about our study and related work at amass.
February 28, 2025 at 9:06 AM
⁉️ What this means for your RAG applications:
1) Advanced re-rankers aren't always worth the cost.
2) Real-world data brings unique challenges not captured in standard benchmarks.
3) Domain- and dataset-specific evaluation is critical to decide when to use re-rankers or simpler alternatives.
February 28, 2025 at 9:06 AM
📊 We found that computationally expensive neural re-rankers can struggle to outperform simple keyword matching in real-world scenarios. This is an issue because re-ranking is often seen as a way to "fix" impresice document search and retrieval.
February 28, 2025 at 9:06 AM
...to decrease latency for each request, you need a bit more. Read more here and take this primarily as a nudge to run your own analyses for your application: www.alexanderjunge.net/blog/fastapi...
Basic async performance testing with FastAPI and Locust
www.alexanderjunge.net
January 18, 2025 at 11:57 AM