Lightnews — Scholar-powered news

Jan

@janschulte.com

200 followers 240 following 35 posts

indiehacker, solopreneur, deep learning

building: https://searchattention.com (launching soon!)
consulting: https://betalyra.pt

Posts Replies Media Videos

Jan

@janschulte.com

So pretty similar to a typical RAG pipeline but on a giant scale.

The search api announcement:
www.perplexity.ai/hub/blog/int...

The insights article
research.perplexity.ai/articles/arc...

hashtag#aio hashtag#aiso hashtag#geo hashtag#ai hashtag#llm hashtag#aeo

Architecting and Evaluating an AI-First Search API

Building a scalable Search API that handles 200 million daily queries using hybrid retrieval and intelligent context curation for AI models

research.perplexity.ai

September 25, 2025 at 8:03 PM

Jan

@janschulte.com

The gist is:
- they start with semantic embeddings and lexical retrieval (probably BM25 and similar) for a first very fast retrieval
- then they apply heuristics based filtering and base filters for things like stale content
- then using cross-encoders aka rerankers to rank the final result sets

September 25, 2025 at 8:03 PM

Jan

@janschulte.com

if true this would make it an amazing api for AIO research.

They also published an article giving some super interesting insights into how their index is constructed.

September 25, 2025 at 8:03 PM

Jan

@janschulte.com

Yes exactly, there is no need for a special treatment of schemas

August 19, 2025 at 11:26 AM

Jan

@janschulte.com

So the model "knows" that these tokens belong together in this context of a json schema.
Here is an example of how that looks in practise for the prompt "What organization is that?" where you see that the model looks at the context & type

August 19, 2025 at 11:21 AM

Jan

@janschulte.com

The tokens are transformed into embeddings with a positional encoding added to them (e.g. RoPE), so tokens close to each other are more "similar" than tokens further away. This way the attention mechanism learns to attend to these tokens together (being trained on thousands of schema examples).

August 19, 2025 at 11:21 AM

Jan

@janschulte.com

Can recommend better-auth which has organization support

June 19, 2025 at 11:30 AM

Jan

@janschulte.com

There are many good corporate blogs that are highly informative, blog.logrocket.com eg is one I have in high esteem

May 7, 2025 at 7:51 AM

Jan

@janschulte.com

import { oc } from '@orpc/effect' ;)

April 8, 2025 at 2:53 PM

Jan

@janschulte.com

awesome! would be great if you could add support for Effect- and Effect<Stream>-based handlers ;)

Introduction to Streams

Learn the fundamentals of streams, a powerful tool for emitting multiple values, handling errors, and working with finite or infinite sequences in your applications.

effect.website

April 8, 2025 at 2:49 PM

Jan

@janschulte.com

AI generated marketing videos saas

March 10, 2025 at 7:53 AM

Jan

@janschulte.com

Yeah not saying this is the case with every traditional bank and some even have fast transfers. But there are also the traditional banks where you wait 3 days for an intra-EU transfer, speaking from experience, while with my modern bank the transfer arrives in seconds.

March 5, 2025 at 1:07 PM

Jan

@janschulte.com

just give me real-time indexing. It's 2025!! The technology is there. Perplexity & co can leverage this to appear always up to date in comparison to Google, Bing & Co

March 5, 2025 at 9:16 AM

Jan

@janschulte.com

This feels like traditional banking vs. modern banking like revolut. Google feels like traditional banks that rely on some batch COBOL jobs that some dude in the 70s wrote and that run once a night and if it fails, will run again the night after. also not running on the weekends.

March 5, 2025 at 9:16 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news