Tokens Still Sell — Agent Power Sells Too. Gemma's Signal Flare, AgentZero's Wedge

Tone of this piece: Help business readers grab the AI shift quickly and answer the question — "so where do I make money?"

Question parked at the front: Now that giant LLMs are practically commoditized, where does differentiation in Agent come from — and what does a small tool wedged between the giants, like AgentZero, need to add to actually make money?

1. The signal flare: Gemma 4, the start of "an agent that runs inside my phone"

In April 2026, Google released Gemma 4. Not just "another generation of an open model." From a business angle, three lines tell the story:

Agent capability now runs directly on the phone. Android Studio integrated first; under the name "Agent Skills," multi-step autonomous workflows execute on-device.

4× faster, 60% less battery. Compared to the previous generation. Past "yeah, it works" — into "actually usable."

Apache 2.0, free, commercial use OK. The 31B Dense model beats many closed giant models on benchmarks.

For non-developers, one line:

"Now, even without internet, my phone has started handling tasks on its own."

Until last year the mood was "bigger model, bigger datacenter." After Gemma 4 the mood shifted: weight is moving toward "a small model sized exactly for the job." The industry slogan is moving from Bigger is Better to Right-sized is Smarter.

Market signal: NVIDIA immediately announced Gemma 4 acceleration on RTX/Spark lines, and variants that run on Raspberry Pi-class boards shipped alongside. Hardware vendors treating this wave as a "real market" is the proof.

2. The landscape: nobody monopolizes — the "AI Agent CLI three-way war"

In coding and workflow automation, a different scene plays out. Agents that work on the terminal in place of a human — so-called AI Agent CLI — three companies are clashing, each with their own weapon.

Tool	Strength	Business positioning
Claude Code (Anthropic)	Best-in-class reasoning & architecture awareness. Real-bug-fix bench 80%+	"Expensive but smartest senior"
Codex CLI (OpenAI)	Auto-sandbox execution, open source, bundled with ChatGPT Plus	"The mid-tier that runs safely on autopilot"
Gemini CLI (Google)	1M-token context, generous free tier	"Free with giant memory — frictionless entry"

The funny truth:

Nobody monopolizes, and anyone doing serious work installs all three. The "OpenAI vs Anthropic vs Google" clash you read about in the news shows up on a working professional's laptop as three coexisting tabs.

A small insight falls out of this:

If a market needs all three, the seat for "a tool that handles all three" is empty.

AgentZero aims at exactly that gap.

"You three compete. I'll drive all three CLIs from one screen."

3. AgentZero — "so we built it"

AgentZero is an AI-assistant cockpit running on the Windows desktop. Metaphor: a small office on a single window where several AI workers (Claude, Codex, Gemini) and one human sit around the same desk.

What's already built (six core features as of v1.4)

Multi-terminal cockpit — PowerShell, CMD, Claude Code, Codex, etc. all bundled as tabs in one window, run in parallel.

AgentBot chat panel — Human chats a command, panel forwards it to the active terminal. Shift+Tab once toggles between chat / direct keys / AI mode.

AI delegation modes — Hand the wheel entirely to Claude Code, OR put Gemini in the secretary seat with 4 Claudes + 3 Codexes as workers below it — multi-agent orchestration.

Desktop automation (AgentScrap + CLI) — Scrape text from other windows, auto-capture infinite scroll, perform coordinate clicks and keystrokes headlessly. Pulls full Slack / Teams / KakaoTalk threads too.

Approval toasts — Small safety gate that pops up when the AI issues a risky command. Human responds Yes / No / Explain.

Harness monitor + Pencil renderer — See the rules / personas / evaluation logs the agents must follow, all in one place. Draw .pen design files directly to visualize the workflow.

Why we built it this way

While the giants shout "I'll do everything inside my model," the people doing the actual work were keeping three tools open at the same time and orchestrating them by hand on their desks. AgentZero turns that orchestration itself into a tool. There are plenty of friends already great at reasoning — so we picked the path of being "the manager who gathers the smart friends in one place and decides who does what."

4. The new three axes of AI business — not LLMs, but Agents

If the last 2–3 years of competition were "bigger LLM," then from 2026 the real money line is Agent. The LLM itself has effectively entered a leveling zone (Claude vs GPT vs Gemini differ by single-digit benchmark percentages). Differentiation comes not from the model but from how you make the model work — i.e., the Agent.

OpenAI putting "agent" front and center in its messaging, and openly studying / borrowing Claude's multi-agent patterns, is itself a signal of this current. "Senpai, allow me to learn from you" — that mood is in the air across the market.

From a business angle, the Agent market splits roughly into three axes.


flowchart TD
    Hub["AI Agent business — 3 axes"]
    Hub --> A1["① Lifecycle Agent<br/>(phone, OS, wearables, cars)<br/><br/>Owned by Apple, Microsoft, Google"]
    Hub --> A2["② Code &amp; Specialist Agent<br/>(Claude Code, Codex, Gemini Code)<br/><br/>Owned by Anthropic, OpenAI, Google"]
    Hub --> A3["③ Workflow Agent ★<br/>(office tedium, desktop ops,<br/>internal rule automation)<br/><br/>★ The opportunity zone<br/>(packed with tacit knowledge)"]

① Lifecycle Agent — practically no opening

Phone, OS, cars, Google Glass, Apple Vision. The 24/7 secretary that follows the user's daily life. Apple, Microsoft, and Google will eat this directly with their devices and OSes. Platform game — almost no room for outside players to wedge in.

② Code & Specialist Agent — head-on competition is hard

Claude Code, Codex CLI, and Gemini CLI already own the seats. Head-on competition is effectively impossible. Even in South Korea (대한민국), Naver and Kakao have failed to claim a meaningful spot in this game. The smart move isn't "build my own again" — it's "use the latest and best of what's already there, well."

③ Workflow Agent — ★ the biggest gap

This is the real opportunity. Every company, every team, every person has piles of "tacit knowledge" — who fills what form with what value, where it gets sent, how the response gets organized. A giant LLM can't solve this in one swing. And now we can throw on-device agents (4B-class models like Gemma 4) at it too. That means: data doesn't leave the building, costs are negligible, small models handle it.

Akka CEO Tyler Jewell's words apply here:

"95% of Agentic AI projects fail — not from a lack of imagination, but from a lack of infrastructure engineering."

The Workflow Agent isn't a giant-model bragging contest. It's a game of field detail and stability — terrain that favors small players.

5. So what does AgentZero need to add to make money?

From here this is "feature ideation." As a small tool aiming at the Workflow Agent gap, what should AgentZero add and what should it cut in the next six months?

What to add

On-device model slot — Gemma 4 as secretary / remote control

Leave reasoning to Claude / Gemini / Codex, but hand command routing, summaries, approval gates — the frequent and lightweight work — to a Gemma 4 4B running on the phone or laptop. Fast response, effectively zero cost, data never leaves the device.

Workflow recorder → skill automation

When the user demonstrates a task once (Slack capture → summarize → publish to Confluence), automatically congeal it into a skill. AgentZero already has desktop capture and multi-terminal — only one step further to go. "Macro-ize my office work" — a value prop that even non-developers will salivate over.

Approval gates as policy

Extend today's simple toast (Yes / No) into per-department, per-document-type, per-time-window policies. Security teams will love it. A decisive differentiator for B2B sales.

Agent team Kanban board

If you're commanding 4 Claudes + 3 Codexes, you need to see who's doing what at a glance. The seat for "external GUI on top of Claude's Agent Teams feature." A Kanban-shaped PM dashboard is intuitive for non-developer managers.

Marketplace — "buy someone else's workflow"

A bazaar where you download "Weekly Report Auto-Writer skill" or "Excel-cleanup-then-PPT-auto-generate skill" and use it instantly. Users buy time, makers earn revenue. The decisive move that turns AgentZero from a tool into an ecosystem.

Mobile companion

Desktop is the home base, but when the user is away from the desk they need to approve actions or check progress from their phone. Natural extension in an era where Gemma 4 runs inside the phone.

What to cut

The "yet another LLM chat window" temptation — Don't compete on chatbot itself. AgentZero's essence is orchestration, not conversation.

Training / tuning your own reasoning model — Don't enter the giant-model game. The moment a manager starts coding directly, his value as a manager drops.

All-OS support — Going deep into the narrow territory of the Windows desktop is the small-tool path. macOS / Linux comes back on a later branch.

Closing — A question for those making money with AI

LLMs have already become infrastructure. They're commoditizing like electricity.

Money doesn't come from electricity itself — it comes from the appliances that run on electricity.

Right now — somewhere in the company, customer, or industry you're working with, is there "work that follows the same pattern every day, that a person still does by hand"? A giant LLM is overkill for that work. For that work, a small cockpit like AgentZero plus one Gemma 4 running inside the phone might be all you need.

Big tech builds the giant models.

The next cycle's winners are the ones who build small offices on top — and collect the rent.

AgentZero is a toddler-walking-stage experiment, but the empty seat is clearly visible. On the screen you're looking at right now, there is probably still at least one full page of automation whitespace left.

References

Gemma 4: Byte for byte, the most capable open models — Google Blog

Bring state-of-the-art agentic skills to the edge with Gemma 4 — Google Developers Blog

Android Developers Blog: Gemma 4 — The new standard for local agentic intelligence on Android

Claude Code vs Codex CLI vs Gemini CLI: Which AI Terminal Agent Wins in 2026? — DEV Community

AgentZero — GitHub Pages

Memorizer internal: Agentic AI: Why experience matters more than hype (Akka CEO perspective), Edge AI revolution — Gemma3 running in 500MB, Andrej Karpathy on the 2026 AI dev paradigm shift, Claude Code Agent Teams + TaskForce AI MCP

TECH LINKS

𝕏 @webnori

📘 Akka Labs (Facebook group)

📝 More writings — webnori wiki