Back to Articles

AI Models, Agentic Tools, and Great Yogurt Economy

By Ehsan Tork · · 11–13 min read

AI development tools comparison showing multiple AI models and code interfaces
Cover: When AI is not magic but electricity with a billing department

A very serious technical analysis written with a straight face.


The Year I Learned AI Is Not Magic, It's a Meter

There was a time when I thought AI was a friendly ghost living inside my editor.

You ask nicely. It helps. End of story.

Then one day my plan finished in two days.

That was the day I realized:

AI is not magic. AI is electricity with a billing department.

The Early Days: When AI Was Cheap and Innocent

Back in the peaceful era, I used tools like Cody.

I even forgot to pay once. They froze my account, gave me $4 credit, then said:

"Okay fine, take Pro forever."

That should have been my first warning sign.

Soon after, Cody was sunset. Only later did I understand why.

I was using AI to apply diffs carefully. Others were using AI to vibe-code startups in 10 minutes.

Guess which group gets more YouTube views.

Agentic AI: When Tools Started Looking Busy

Then came the agentic era.

Agents that:

Progress felt dramatic. Results felt optional.

One memorable session:

But wow, the AI was working very hard.

Claude: The Brilliant Consultant Who Sends Invoices

Claude is smart. Sometimes extremely smart.

It will:

It will also:

Claude charges you for thinking.

Sometimes that's worth it. Sometimes you just wanted the bug gone.

Codex: The Calm Engineer Who Respects Your Repo

Then I met Codex.

Codex doesn't:

Codex:

Most tasks take 12%. Some tasks also take 12%.

And you never feel stressed.

That's not an accident. That's product design.

Cursor: Heaven for New Users

Cursor did something radical:

Heavy task? Same counter. Light task? Same counter.

Cursor absorbs complexity so users can learn.

This is why beginners love it. This is why professionals respect it.

AMP: The Quiet Student Eating Yogurt Alone

Sourcegraph AMP is powerful. Very powerful.

It just doesn't tell you:

No dropdown. No override. Just trust.

AMP isn't dumb. It's cost-optimized.

Internally it likely does:

  1. cheap model
  2. medium model
  3. expensive model (very carefully)

Great for providers. Frustrating for people who value time over tokens.

AMP sits in the last row, eating yogurt, solving enterprise problems quietly.

Grok: Potential, Strategy Sold Separately

Grok has potential.

It also has:

At $300/month, users expect:

"Use me everywhere."

Instead they get:

"Think carefully before clicking."

Developers don't like thinking about invoices mid-flow.

OpenCode: Free Haircuts for Training Models

OpenCode feels… fine.

Because it's a training ground.

New models show up like junior hairdressers:

"Free haircut! Please sit!"

You leave with:

The real payment is data.

Some of these models will grow up. Some will start charging. Everyone will say they got better suddenly.

They didn't. They just finished their apprenticeship.

The Final Lesson

In 2026:

The real differentiator isn't intelligence.

It's friction per task.

And if your AI tool makes you forget the meter exists…

That's real magic.