Sunil Mallya
Writing about LLMs, reinforcement learning, and building intelligent systems.
Writing
-
Models Eat Their Scaffolding
AI platforms mature by absorbing the scaffolding built around them. What starts as wrappers, retrieval layers, tool glue, and agent frameworks turns into model primitives or platform infrastructure.
AI Platforms, AI Agents
-
AgentDiff: git blame for vibe coding
When an AI writes your code, git can tell you what changed, but not why. AgentDiff captures every decision your coding agent makes: the prompt, the reasoning, and which requirement it maps to.
Developer Tools, AI Agents
-
The Great Compression
How AI is collapsing the enterprise stack. When models can reason, act, and interact, the layers built for human navigation of software start to hollow out from the middle. Five layers become three.
Enterprise AI, AI Agents
-
The Myth of RLVR
Why verifiable rewards aren't enough and what it takes to build open-world agents. The belief that correctness is the bottleneck is the myth. The real bottlenecks are information flow, credit assignment, and trajectory-level learning.
Reinforcement Learning, AI Agents