Blog

LLM

LLM-on-Spark: Four Patterns That Actually Scale

"Just call the LLM in a loop." 9.6 years later, you finish. Here are the 4 patterns that actually scale to a billion rows: Spark UDFs, Ray+vLLM, warehouse-native SQL, or the Batch API. Code + costs.

AI

Learning in the age of AI

AI already knows more than you ever will, so knowing more is no longer the advantage. Your edge is simple: ask better questions, get better answers.

LLM AI Agents

How LLM applications learned to remember

We went from 4K token context windows to virtual memory filesystems in four years. Here's the engineering story of how LLM memory evolved - and what you should actually use today.

LLM AI Agents

What Happens When You Let an AI Rewrite Its Own Instructions?

Most of us are stuck on the prompt treadmill - manually tweaking instructions that break every time the task shifts. This post lays out an architecture where the AI agent grades its own work, rewrites its own prompts, builds its own tools, and rolls back when things get worse. Every idea is backed by published research. No jargon, just the blueprint.

LLM AI Agents

What If Your AI Agents Could Find Each Other?

You're copying the same agents into every new workflow. There's a better way: a self-organizing architecture with RAG-based discovery, reputation scores, budget-aware planning, and dynamic composition that solves problems nobody anticipated.
