Anthropic’s Claude Opus 4 model can work autonomously for nearly a full workday

Anthropic’s Claude Opus 4 model can work autonomously for nearly a full workday

The coding bomb passed: Claude Opus 4! No baby steps here. This AI sprints through coding marathons. With unimaginable audacity, Anthropic proclaims it to be the world’s top coding model, like able to perform some of those thousand-step real-world tasks requiring hours of fully-continuous work. Imagine Claude Opus 4 juggling multiple software tools simultaneously all the while nailing instructions with shear precision. And that just puts the spotlight on the Code with Claude conference as ground zero for the new breed of AI WP developers.

Anthropic’s latest Opus 4 is certainly more than just an update; it is an engine for the Next Gen AI Agents- autonomous entities that perform complex tasks without any babysitting from humans. Imagine an AI working a full seven-hour day of strategizing and executing on its own, such as what Opus 4 recently exhibited in tests. This is not science fiction anymore but a step closer to real artificial general intelligence (AGI), and Anthropic is believing that Opus 4 is going to be first in that race.

Anthropic’s Claude Opus 4 model can work autonomously for nearly a full workday

Anthropic

Claude Opus 4 is an AI model from Anthropic that serves to penalize shortcuts and enhance memory; it is thus set to revolutionize AI agent creation. Opus 4 has been able to cut down by 65% any kind of exploitation of loopholes by AI agents. And with “memory capabilities” increased tenfold, especially in local file access, Claude can now learn and adapt with real knowledge.

Unlocking this potential, Anthropic will let everyone adopt Claude Code, its AI-powered coding agent. Think about a smooth integration into VSCode and JetBrains for developers to expedite their workflows and create the next set of smart applications.

Forget coding skills; Anthropic’s newest launch has offerings for everyone. Just days after the potent Opus 4, we have the release of an improved Sonnet model: a hybrid that is a marvel for very fast responses and deep, complex thinking. Imagine a chatbot that is very fast and can work through your most intricate problems-or both extremes in one. That, in essence, is Sonnet. Besides, it comes with all the latest advancements made in Opus 4 and capable of handling anything from parallel tool use to issuing instructions akin to a boss.

With the popularity of Sonnet 3.7, it became all but compulsory for Anthropic to plan the Max Plan: $100 per month for power users. Here’s the kicker, and we hope that surprises you: You shouldn’t have to crash out your bank account to access this next generation technology. Sonnet 4 is currently being rolled out to free users as a sample of that ultra-modern AI with none of the costs attached.

Claude 4 benchmarks

Anthropic

Sonnet 4 maintains the $3 and $15 per million input/output tokens pricing for its API. Microsoft is integrating Sonnet 4 as the default model for its new GitHub Copilot coding agent, making it available beyond Amazon Bedrock and Google Vertex AI. Opus 4 and Sonnet 4 are live and ready to accept requests.

Any more on relentless march of AI, still making the headlines? Put the first shot at Play during Tuesday’s Google I/O 2025, opening AI Mode to all US Search users-an opening for a new way of online exploration. The counter by OpenAI on Wednesday may be the announcement of $6.5 billion for Jony Ive’s secretive hardware venture; after all, if it is entering the real world, then most likely its cash register modality would be $6.5 billion. So that is speeding up the AI arms race, and the consequences of that race are now beginning to manifest.

Thanks for reading Anthropic’s Claude Opus 4 model can work autonomously for nearly a full workday

MataBlog
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.