InterviewDroid.com
"Semantic search" has mathematical blind spots. If you abandon exact keywords for pure vector search, you lose visibility on logic-based queries.
arxiv.org/pdf/2508.21038
❌ SOTA Embedding Models (Google, OpenAI) failed hard, scoring <20% recall.
✅ Old-school BM25 (Keyword matching) scored nearly 100%.
Embeddings are great at concepts, but bad at precise boolean logic.
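To make the keyword side of that comparison concrete, here is a minimal sketch of BM25 scoring in plain Python. The toy corpus, the query, the `tokenize` and `bm25_scores` helpers, and the `k1`/`b` defaults are all assumptions made up for illustration; this is not the benchmark or the code from the linked paper, just the standard Okapi BM25 formula with a smoothed IDF.

```python
import math
from collections import Counter


def tokenize(text):
    # Naive whitespace tokenizer (an assumption; real systems stem and drop stopwords).
    return text.lower().split()


def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Score every document against the query with Okapi BM25."""
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(toks) for toks in tokenized) / n_docs

    # Document frequency: in how many documents each term appears.
    df = Counter()
    for toks in tokenized:
        df.update(set(toks))

    scores = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue  # a term that is absent contributes nothing
            # Smoothed IDF, always positive.
            idf = math.log((n_docs - df[term] + 0.5) / (df[term] + 0.5) + 1.0)
            # Term-frequency saturation with document-length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


# Hypothetical toy corpus: only the first document contains the rare term.
docs = [
    "jon likes quokkas and apples",
    "ovid likes tigers and rabbits",
    "leslie likes apples and candy",
]
scores = bm25_scores("who likes quokkas", docs)
best = max(range(len(docs)), key=scores.__getitem__)
print(docs[best])
```

The exact-match term ("quokkas") dominates the score because it is rare in the corpus, which is why keyword scoring handles this kind of precise lookup well. An embedding model has to compress the same information into a single fixed-size vector.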
Would be awesome to chat when you feel the timing’s right. Here's my calendar link if and when you feel up to it! meetnicolas.com.
larslofgren.com/forbes-marke...
Long read but a good one.
written by: @larslofgren.bsky.social
3: The breakdown is for a single industry and results may vary for other industries (though I doubt it).
4: Similarity scores on page titles aren't the *BEST* way to determine content relevance. But it's not a bad way either. More factors are at play, such as content quality, etc...
Limitation 2: The analysis was done on a mix of non-branded head and tail terms geared more towards informational queries.
* PS - I hate this character count limit!
Limitation 1: This is based on one of the "big 3" SEO tool providers, so it could very well be that the other tools' KW Difficulty Scores are more accurate. I will not name the company because the purpose of this is not to "throw any shade". The assumption is that they are all equally problematic.