MCP vs. API: Why the AI Industry is Shifting
AITechDad
"Standards are for discovery; APIs are for performance."
If you've been following the world of AI agents, you've likely heard of **MCP** — the Model Context Protocol. Released by Anthropic, it was hailed as the "universal connector" for AI. But just months later, industry giants like Perplexity are shifting their focus back to traditional **APIs** and **CLIs**. If you're building AI tech or automated agents, you need to understand why this shift is happening — and why it matters for your bottom line.
Section 1: What is MCP?
The **Model Context Protocol (MCP)** is a standardized way for AI models (like Claude) to connect to external data sources and tools. Think of it like a USB port for LLMs. Instead of writing custom code to let an AI read your Google Drive, search the web, or check your database, you just use an MCP server. It's excellent for developers who want to quickly experiment with "tool-use" inside their IDEs (like Claude Code or Cursor).
⚡ Atomic Fact: MCP's greatest strength is its ability to "announce" tools to a model automatically, eliminating the need for hard-coded schemas in the initial development phase.
Section 2: CLI vs. API — The Basics
Before we dive into the shift, let's define the two pillars of production AI:
- API (Application Programming Interface): A machine-to-machine connection. It's how one piece of software talks to another reliably and at scale. It’s structured, predictable, and fast.
- CLI (Command Line Interface): A way for humans (or agents) to interact with software via text commands in a terminal. Tools like
claude-codeoperate here.
Section 3: The "Context Tax"
Perplexity’s CTO, **Denis Yarats**, recently signaled a move away from MCP for production-grade systems. The reason? **The Context Tax.**
MCP works by sending detailed descriptions of every available tool to the model in every single request. In a production environment with hundreds of tools, this "bloats" the context window. It makes requests more expensive, slower, and prone to "hallucinating" which tool to use.
Section 4: Garry Tan’s Take
**Garry Tan**, President of Y Combinator, has been even more vocal. He recently shared his frustration with the protocol's overhead and context consumption:
MCP sucks honestly
— Garry Tan (@garrytan) March 12, 2026
It eats too much context window and you have to toggle it on and off and the auth sucks
I got sick of Claude in Chrome via MCP and vibe coded a CLI wrapper for Playwright tonight in 30 minutes only for my team to tell me Vercel already did it lmao
But it… https://t.co/fWGVqIHK3I
Tan emphasized that while MCP is great for "vibing" (quick, instruction-led coding), it often lacks the reliability needed for high-stakes business logic. Description of many early MCP implementations as "clunky shims" over existing APIs highlights the need for a more hardened approach.
Tan emphasizes a shift toward **"purpose-built engineering"**. Instead of a general protocol, he advocates for building agents that use optimized CLI wrappers and direct API calls. This "lean" approach prevents **Agent Drift** — where an agent gets confused by too many options and starts making mistakes in its logic.
Section 5: Why This Matters for You
In the context of building AI tech and automated agents, you are caught between two worlds:
- Experimentation (The MCP Phase): Use MCP when you’re exploring what’s possible. It’s fast, it’s standardized, and it gets your agent talking to tools in minutes.
- Production (The API/CLI Phase): Once you know *exactly* what your agent needs to do, move to direct APIs. This is where you gain speed (latency), save money (token costs), and ensure reliability.
The most successful AI startups in 2026 use MCP during their weekly hackathons to prototype features, but their core production "vibe" is built on hardened, optimized APIs and custom CLI wrappers. Build for speed, but architect for scale.
Conclusion: The Future is Hybrid
The industry isn't "killing" MCP; it's maturing past the hype. We are moving from a world where we "chat with everything" to a world where our agents execute precisely. Understanding when to use the universal connector (MCP) and when to use the high-performance pipeline (API) is what separates the weekend hobbyists from the founders building the next generation of AI-first companies.