Forget Chatbots. AI Agents Are the Future
This week a startup called Cognition AI caused a bit of a stir by releasing a demo showing an artificial intelligence program called Devin performing work usually done by well-paid software engineers. Devin’s creators brand it as an “AI software developer.” When asked to test how Meta’s open source language model Llama 2 performed when accessed via different companies hosting it, Devin generated a step-by-step plan for the project, generated code needed to access the APIs and run benchmarking tests, and created a website summarizing the results. Google DeepMind calls it a “generalist.” I suspect that Google has hopes that these agents will eventually go to work outside of video games, perhaps helping use the web on a user’s behalf or operate software for them. Demis Hassabis, the CEO of Google DeepMind, recently told me that he plans to combine large language models with the work his company has previously done training AI programs to play video games to develop more capable and reliable agents.

























Discover Related

'Prove You're Better Than AI': Shopify CEO Draws A Line In The Sand On Hiring

AGI Might Not Just Change The World, It Could 'Destroy Humanity': Google DeepMind

Letter from 2035: Did we give Agentic AI too much agency?

Forget ChatGPT? China’s DeepSeek is working on smarter, self-improving AI models

Microsoft To Soon Let Users Tailor Copilot to Their Needs

UN report says AI market to reach $4.8 trillion by 2033

Microsoft’s AI division head wants to create a lasting relationship between chatbots and their users

Two AI models pass benchmark Turing Test, blurring line between human, machine

AI will generate 95 per cent of all code in the next 5 years, says Microsoft CTO

How I realized AI was making me stupid—and what I do now

AI Wants To Be Your 'Love Guru': Tinder's 'The Game Game' Can Teach You How To Flirt

New AI benchmarks test speed of running AI applications

80% Of Indian Businesses Experimenting With Agentic AI – But Can They Scale It?

OpenAI raises $40 billion, valued at $300 billion in historic funding round

The Tools of Tomorrow: What Lies Ahead with the AI Revolution

What would it take for AI to operate robots? : Short Wave : NPR

AI agents are a moment of truth for tech

OpenAI does not expect to be cash-flow positive until 2029: Report

Roles for AI agents, rethinking EV charging and ransomware threats

DeepSeek rolls out V3 AI model updates in race against OpenAI. What's New?

AI To Take Over Most Coding Jobs? Zoho's Sridhar Vembu Weighs In

OpenAI reveals several lonely users are using ChatGPT
