Meta’s Next Llama AI Models Are Training on a GPU Cluster ‘Bigger Than Anything’ Else

Meta CEO Mark Zuckerberg laid down the newest marker in generative AI training on Wednesday, saying that the next major release of the company’s Llama model is being trained on a cluster of GPUs that’s “bigger than anything” else that’s been reported. “We’re training the Llama 4 models on a cluster that is bigger than 100,000 H100s, or bigger than anything that I’ve seen reported for what others are doing,” Zuckerberg said, referring to the Nvidia chips popular for training AI systems. Zuckerberg declined to offer details on Llama 4’s potential advanced capabilities but vaguely referred to “new modalities,” “stronger reasoning,” and “much faster” performance. Meta’s approach to AI is proving a wild card in the corporate race for dominance. Although touted as “open source” by Meta, the Llama license does impose some restrictions on the model’s commercial use.

Wired