· · AI Strategy

CEO of Tallyfy · AI advisor at Blue Sheen for mid-size companies

Claude Computer Use - why the Chrome plugin misses the point

Claude Computer Use from Anthropic scores 61.4 percent on the OSWorld benchmark for full-desktop control. Yet everyone rushes to build Chrome extensions that cover roughly 10 percent of the real problem. Browser automation is not the revolution.

If you remember nothing else:

  • Computer Use coordinates your entire digital workspace, not just the browser. Chrome extensions solve maybe 10% of the problem.
  • Context switching between disconnected tools is the real productivity killer, and browser-only automation doesn't fix it
  • You shouldn't need to know which application contains which data. That's what a universal software translator changes.

The wrong tool for a very big problem

The tech world got handed a full symphony orchestra and decided they only need the second violin. That’s Claude’s Computer Use as of 2025.

Dario Amodei’s Anthropic built something that can see and interact with your entire digital workspace. Every application, every window, every pixel on your screen. Claude Sonnet 4.5 scores 61.4% on OSWorld, a benchmark measuring computer control capabilities across real applications. And what’s the first thing everyone builds? Chrome extensions. Browser plugins. Even Anthropic’s own Chrome extension limits Claude to just the browser.

We’ve apparently learned nothing from decades of digital fragmentation. The same fragmentation that undermines AI readiness across enterprises.

What a real workday actually looks like

Watch any knowledge worker for a day and the pattern is, depressingly consistent.

The average employee has dozens of applications open. They switch between them constantly. Each switch breaks concentration. Each context change burns mental energy that doesn’t come back.

Your own workday: email, Slack, Excel, your CRM, a project management tool, the documentation wiki, your code editor, a banking portal, an analytics dashboard. You’re not working. You’re running a frantic relay race between tools that don’t talk to each other.

A browser extension is going to fix this?

That’s like putting a bandaid on a severed artery and calling it surgery.

Turns out, what most people miss is this: Claude’s Computer Use doesn’t just automate clicking. It understands the visual language of software itself. Every application you use, whether that’s Slack, Excel, Salesforce, your IDE, or that proprietary tool your company built in 2003, they all speak the same visual language. Buttons look like buttons. Text fields look like text fields. Menus behave like menus.

Claude can read this language across every application at once. The Model Context Protocol (MCP) standardizes how Claude integrates with external tools and data sources, and adoption has exploded from around 1,000 servers to tens of thousands in a matter of months.

Not a browser automation tool. A universal software translator.

When I was growing up in Kenya, I watched my music teacher struggle with something called a “symphony desk.” A massive piece of furniture designed to hold all the sheet music for different instruments. Violin parts here, brass there, percussion in another drawer. The conductor had to physically shuffle between sections, losing the flow of the entire piece.

That’s our desktops now. Each application is a different section of the orchestra, physically separated, requiring constant shuffling. Computer Use changes this. One view. All instruments. Real conducting.

Why everyone built browser tools first

I think I understand why this is happening, even if I find it frustrating.

Fear of scope is real. A browser is contained, predictable, safe. You can’t accidentally delete system files or expose sensitive data outside the browser sandbox. It’s the kiddie pool of automation.

Chrome’s extension API is also mature. Well-documented. Thousands of examples. Why think harder when you can think easier.

And then there’s venture capital theater. “We’re building the Chrome extension for Claude” fits on a slide. “We’re rebuilding how computers interface with human intention” doesn’t. Guess which one gets funded faster.

The deeper problem is that we’re measuring the wrong things. We’re tracking “time saved on browser tasks” when we should be tracking “cognitive load eliminated from workflow fragmentation.” Good luck finding that metric in any dashboard. This exact disconnect is what drives the process failures we see in AI incidents.

When the abstract becomes “do this on Monday morning,” Blue Sheen is who I’d call.

The productivity stack Chrome extensions can’t reach

Your work data isn’t in your browser. It’s scattered across what I think of as the seven-layer productivity stack:

  1. Communication layer: Slack, Teams, Discord, email
  2. Documentation layer: Notion, Confluence, Google Docs, wikis
  3. Data layer: Excel, Sheets, Airtable, databases
  4. Development layer: VS Code, GitHub, terminal, Docker
  5. Customer layer: CRM, support tickets, user analytics
  6. Financial layer: QuickBooks, Stripe, banking portals
  7. Proprietary layer: That custom tool only your company uses

A Chrome extension touches maybe one and a half of those layers. Computer Use coordinates all seven.

Think about what this actually unlocks.

The Monday morning ritual: instead of spending 45 minutes gathering data from six different tools for your weekly report, you describe what you need. The AI pulls from everywhere, assembles it, and presents it for your review.

The customer fire drill: a support ticket comes in. Instead of jumping between the CRM, codebase, logs, and documentation yourself, the AI instantly correlates the issue across all systems and gives you the full picture.

The proposal process: no more copy-pasting between pricing spreadsheets, document templates, and CRM data. One request, full coordination, complete proposal.

This isn’t about saving minutes. It’s about preserving cognitive flow.

What’s actually coming

Most companies aren’t ready for this. Not technically. Technically is the easy part. Culturally. As I’ve written when exploring how to communicate AI changes effectively, the human side is always harder than the technical side.

We’ve spent decades building walls between applications. Security walls. Process walls. Departmental walls. Computer Use makes those walls visible in ways that probably terrify the people who built their careers managing them. Will they welcome this visibility? No.

Building Tallyfy showed me something about this. The companies that break down these walls first don’t just get more efficient. They develop fundamentally different capabilities. They start solving problems that siloed companies can’t even see.

Update (June 2026 note): Two things moved since I wrote this, and the second one is fair to me. First, the workspace direction arrived. Anthropic released Claude Cowork, a general-purpose agent that coordinates work across your entire digital workspace, not just browsers. It reads, edits, and creates files while planning multi-step tasks that run for extended periods. Second, the Chrome plugin grew up. Claude in Chrome is now in beta for all paid plans, with a side panel that reads and clicks across sites, multi-tab coordination, and scheduled recurring tasks. So the browser agent did mature into a real product, which is more than I gave it credit for here. The wider point still holds: the browser was always the smaller half of the workday, and computer use as an API tool (still in beta) is the piece that aims at the whole desktop. The shift from browser-only automation to workspace orchestration is happening faster than I expected.

Why should you need to know which application contains which data? Why should you care whether something lives in Slack or email or Notion? You want answers, not treasure hunts.

The winners will be the ones who realize Computer Use isn’t about automating browsers. It’s about making the entire concept of application switching obsolete.

In five years, we’ll look back at Chrome extensions for AI the way we look at WAP browsers for mobile. A necessary stepping stone that missed the point.

The real shift isn’t in making browsers smarter. It’s in making the computer itself readable by AI. When that happens, when AI can see and coordinate everything we do digitally, the idea of manual application switching will seem as clunky and antiquated as hand-copying manuscripts.

But sure. Let’s build another Chrome extension.

About the Author

Amit Kothari is an experienced consultant, advisor, coach, and educator specializing in AI and operations for executives and their companies. With 25+ years of experience, he is the Co-Founder & CEO of Tallyfy® (raised $3.6m, the Workflow Made Easy® platform) and Partner at Blue Sheen, an AI advisory firm for mid-size companies. He helps companies identify, plan, and implement practical AI solutions that actually work. Originally British and now based in St. Louis, MO, Amit combines deep technical expertise with real-world business understanding. Read Amit's full bio →

Disclaimer: The content in this article represents personal opinions based on extensive research and practical experience. While every effort has been made to ensure accuracy through data analysis and source verification, this should not be considered professional advice. Always consult with qualified professionals for decisions specific to your situation.

Related Posts

View All Posts »
Revenue per employee is the only number that survives AI

Revenue per employee is the only number that survives AI

Most operating metrics get noisy or gamed once AI absorbs the task work. Revenue per employee stays hard to fake. When Facebook bought WhatsApp for about 19 billion dollars, the company had 55 people. That ratio, output per head, is the acid test of whether AI bought you a real gain in output.

You probably do not need a transfer agent: how we self-manage our cap table with AI

You probably do not need a transfer agent: how we self-manage our cap table with AI

Most early-stage startups are not legally required to use a stock transfer agent. Delaware law lets a company keep its own electronic stock ledger. Here is how we run our cap table at Tallyfy as a version-controlled JSON file, with AI doing the reconciliation and reports, plus what the law (DGCL 219, DGCL 224, Section 12(g)) really requires.

How to make AI emails actually sound like you

How to make AI emails actually sound like you

Making AI emails sound like you is not a prompting trick. A tone guide produces press-release sludge. The fix is a voice corpus built from your own sent folder, a style file you version like code, and a draft-only rule. Harper Reed trained Claude on roughly 200 sent emails and the gap closed.

What actually saves you cost on the Claude.ai web app

What actually saves you cost on the Claude.ai web app

Eight viral cost-saving tips for Claude.ai have been making the rounds. Six are sound. Two invented their specific numbers (40 percent saved, 50 times fewer tokens). And the list missed the biggest cost shift in the current Claude lineup.

How to reduce Claude Code costs on a subscription plan

How to reduce Claude Code costs on a subscription plan

Claude Code subscription plans hide real cost levers behind context management, model switching, and session hygiene. After months on the Max 20x tier, these specific techniques measurably extend what you get from every session - with terminal proof.

The consultant who fought to keep his client off AI

The consultant who fought to keep his client off AI

Some advisors resist letting a company connect AI to its own systems, dressed up as too risky. The Everlaw survey found 90% of legal professionals expect AI to change billing within two years. The real driver is an AI consultant protecting the gatekeeper role.

AI advisory services via Blue Sheen.
Contact me Follow 10k+