3 months, 2 weeks ago

Chinese AI firm releases DeepSeek V3, a new leader in open-source AI models

Chinese firm DeepSeek has released a new open-source model, DeepSeek V3, which outperforms existing leading open-source models and closed models like OpenAI’s GPT-4o on several benchmarks. This reduces hardware costs since every time a prompt is entered, it activates just the related neural network and not the entire large language model. Notably, DeepSeek has said that the training of the AI model was done in about 2788K H800 GPU hours or an estimated $5.57 million price tag, if the rental price is $2 per GPU hour. According to a technical paper released along with the news, the company said that the model surpassed open-source models including the Llama-3.1-405B and Qwen 2.5-72B on most benchmarks.

The Hindu

Discover Related