Jan
janschulte.com
Jan
@janschulte.com
indiehacker, solopreneur, deep learning

building: https://searchattention.com (launching soon!)
consulting: https://betalyra.pt
So pretty similar to a typical RAG pipeline but on a giant scale.

The Search API announcement:
www.perplexity.ai/hub/blog/int...

The insights article:
research.perplexity.ai/articles/arc...

#aio #aiso #geo #ai #llm #aeo
Architecting and Evaluating an AI-First Search API
Building a scalable Search API that handles 200 million daily queries using hybrid retrieval and intelligent context curation for AI models
research.perplexity.ai
September 25, 2025 at 8:03 PM
The gist is:
- they start with semantic embeddings and lexical retrieval (probably BM25 or similar) for a very fast first retrieval pass
- then they apply heuristics-based filtering and basic filters for things like stale content
- then they use cross-encoders (aka rerankers) to rank the final result sets
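
The three steps above can be sketched roughly like this in TypeScript. This is a toy illustration only: the `Doc` shape, the 0.5/0.5 score weights, the term-overlap function standing in for BM25, and the pluggable `PairScorer` standing in for a cross-encoder are all my assumptions, not anything taken from Perplexity's article.

```typescript
// Toy three-stage search pipeline. All names and weights here are
// hypothetical illustrations, not Perplexity's implementation.
interface Doc {
  id: string;
  text: string;
  embedding: number[];
  ageDays: number;
}

type PairScorer = (query: string, doc: Doc) => number;

// Cosine similarity between two equal-length vectors.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    const x = a[i] ?? 0;
    const y = b[i] ?? 0;
    dot += x * y;
    na += x * x;
    nb += y * y;
  }
  return na === 0 || nb === 0 ? 0 : dot / Math.sqrt(na * nb);
}

// Crude lexical score: fraction of query terms found in the text.
// A real system would use BM25 or similar here.
function lexicalScore(query: string, text: string): number {
  const terms = query.toLowerCase().split(/\s+/).filter(Boolean);
  if (terms.length === 0) return 0;
  const words = new Set(text.toLowerCase().split(/\W+/));
  return terms.filter((t) => words.has(t)).length / terms.length;
}

// Stage 1: fast hybrid retrieval (semantic + lexical), keep the top k.
function retrieve(query: string, queryEmbedding: number[], docs: Doc[], k: number): Doc[] {
  return docs
    .map((doc) => ({
      doc,
      score: 0.5 * cosine(queryEmbedding, doc.embedding) + 0.5 * lexicalScore(query, doc.text),
    }))
    .sort((a, b) => b.score - a.score)
    .slice(0, k)
    .map((s) => s.doc);
}

// Stage 2: heuristic filter, here only dropping stale documents.
function dropStale(docs: Doc[], maxAgeDays: number): Doc[] {
  return docs.filter((d) => d.ageDays <= maxAgeDays);
}

// Stage 3: rerank the survivors with a (query, document) pair scorer,
// standing in for a cross-encoder model.
function rerank(query: string, docs: Doc[], scorer: PairScorer): Doc[] {
  return [...docs].sort((a, b) => scorer(query, b) - scorer(query, a));
}

// Made-up sample data: "c" is relevant but stale, "b" is fresh but off-topic.
const docs: Doc[] = [
  { id: "a", text: "hybrid retrieval with rerankers", embedding: [1, 0], ageDays: 10 },
  { id: "b", text: "cooking pasta at home", embedding: [0, 1], ageDays: 5 },
  { id: "c", text: "hybrid retrieval overview", embedding: [1, 0], ageDays: 900 },
];

const candidates = retrieve("hybrid retrieval", [1, 0], docs, 3);
const fresh = dropStale(candidates, 365);
const results = rerank("hybrid retrieval", fresh, (q, d) => lexicalScore(q, d.text));
console.log(results.map((d) => d.id));
```

With this sample data the stale document "c" is dropped in stage 2 and "a" ends up ranked above "b". The point of the split is cost: the cheap scores run over everything, while the expensive pair scorer only sees the few candidates that survive.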
September 25, 2025 at 8:03 PM
If true, this would make it an amazing API for AIO research.

They also published an article giving some super interesting insights into how their index is constructed.
September 25, 2025 at 8:03 PM
Yes exactly, there is no need for a special treatment of schemas
August 19, 2025 at 11:26 AM
So the model "knows" that these tokens belong together in this context of a json schema.
Here is an example of how that looks in practice for the prompt "What organization is that?", where you can see that the model looks at the context & type
August 19, 2025 at 11:21 AM
The tokens are transformed into embeddings, and positional information is mixed in (e.g. with RoPE, which rotates the query and key vectors by position), so tokens close to each other tend to look more "similar" to the attention mechanism than tokens further away. This way the attention mechanism learns to attend to these tokens together (having been trained on thousands of schema examples).
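
A minimal sketch of the RoPE idea, assuming the common formulation where each pair of dimensions is rotated by an angle proportional to the token position (base 10000). The function names `rope` and `attentionScore` are made up for illustration. What it shows is that the query/key dot product only depends on how far apart the two tokens are, not on their absolute positions.

```typescript
// Rotary position embedding (RoPE) on a single vector of even length.
// Each pair of dimensions (i, i + 1) is rotated by position * base^(-i / d).
function rope(vec: number[], position: number, base = 10000): number[] {
  const d = vec.length;
  const out: number[] = [];
  for (let i = 0; i < d; i += 2) {
    const theta = position * Math.pow(base, -i / d);
    const cos = Math.cos(theta);
    const sin = Math.sin(theta);
    const x = vec[i] ?? 0;
    const y = vec[i + 1] ?? 0;
    out.push(x * cos - y * sin, x * sin + y * cos);
  }
  return out;
}

// Unnormalized attention score: dot product of the rotated query and key.
function attentionScore(q: number[], qPos: number, k: number[], kPos: number): number {
  const rq = rope(q, qPos);
  const rk = rope(k, kPos);
  let dot = 0;
  for (let i = 0; i < rq.length; i++) {
    dot += (rq[i] ?? 0) * (rk[i] ?? 0);
  }
  return dot;
}

// Made-up query and key vectors, identical on purpose.
const q = [1, 0, 1, 0];
const k = [1, 0, 1, 0];

const samePosition = attentionScore(q, 4, k, 4);   // offset 0
const offsetTwoEarly = attentionScore(q, 3, k, 5); // offset 2
const offsetTwoLate = attentionScore(q, 10, k, 12); // offset 2, shifted
console.log({ samePosition, offsetTwoEarly, offsetTwoLate });
```

The two offset-2 scores come out equal (up to floating point error), and for identical vectors the offset-0 score is the largest possible one. How strongly scores drop with distance in a real model depends on the learned query and key projections, so treat the "closer is more similar" intuition as a tendency rather than a rule.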
August 19, 2025 at 11:21 AM
Can recommend better-auth which has organization support
June 19, 2025 at 11:30 AM
There are many good corporate blogs that are highly informative; blog.logrocket.com, for example, is one I hold in high esteem
May 7, 2025 at 7:51 AM
import { oc } from '@orpc/effect' ;)
April 8, 2025 at 2:53 PM
awesome! would be great if you could add support for Effect- and Effect<Stream>-based handlers ;)
Introduction to Streams
Learn the fundamentals of streams, a powerful tool for emitting multiple values, handling errors, and working with finite or infinite sequences in your applications.
effect.website
April 8, 2025 at 2:49 PM
AI generated marketing videos saas
March 10, 2025 at 7:53 AM
Yeah not saying this is the case with every traditional bank and some even have fast transfers. But there are also the traditional banks where you wait 3 days for an intra-EU transfer, speaking from experience, while with my modern bank the transfer arrives in seconds.
March 5, 2025 at 1:07 PM
just give me real-time indexing. It's 2025!! The technology is there. Perplexity & co can leverage this to appear always up to date in comparison to Google, Bing & Co
March 5, 2025 at 9:16 AM
This feels like traditional banking vs. modern banking like Revolut. Google feels like the traditional banks that rely on batch COBOL jobs some dude wrote in the 70s, which run once a night and, if they fail, run again the night after. Also not running on weekends.
March 5, 2025 at 9:16 AM