Topic
RAG
Field reports on retrieval-augmented generation: chunking, ranking, and what moves the needle.
2 posts
RAG LLMs
Retrieval Is Mostly Data Work
After a year of building RAG systems, the pattern is clear: the interesting problems are upstream of the vector database.
• 9 min
RAG Infra
Latency Budgets for RAG Applications
If you haven't written the budget down, the model is deciding it for you. Here is the template I use.
• 5 min