Deepseek LLM Advanced Language Model

DeepSeek’s new model sees text differently, opening new possibilities for enterprise AI

Hello and welcome to Eye on AI. In this edition: DeepSeek defies AI convention (again)…Meta’s AI layoffs…More legal trouble for OpenAI…and what AI gets wrong about the news. Hi, Beatrice Nolan here, ...

InfoWorld

How DeepSeek innovated large language models

A glimpse at how DeepSeek achieved its V3 and R1 breakthroughs, and how organizations can take advantage of model innovations when they emerge so quickly. The release of DeepSeek roiled the world of ...

Hosted on MSN

How DeepSeek’s new training method could disrupt advanced AI again

DeepSeek has already shown that it prefers clever engineering over sheer size, and the new training method extends that philosophy. With DeepSeek R. 1.0, the company leaned heavily on reinforcement ...

CoinTelegraph

China’s DeepSeek launches new open-source AI after R1 took on OpenAI

Chinese artificial intelligence development company DeepSeek has released a new open-weight large language model (LLM). DeepSeek uploaded its newest model, Prover V2, to the hosting service Hugging ...

Nature

‘Another DeepSeek moment’: Chinese AI model Kimi K2 stirs excitement

Excitement is growing among researchers about another powerful artificial intelligence (AI) model to emerge from China, after DeepSeek shocked the world with its launch of R1 in January. The ...

VentureBeat

DeepSeek-V3.1-Terminus launches with improved agentic tool use and reduced language mixing errors

DeepSeek, the Chinese AI startup spun off of Hong Kong high-frequency trading firm High Flyer Capital Management (and which uses a whale icon for its logo), is back today with a new large language ...

Digi Times

Model wars escalate: Baidu, Alibaba, DeepSeek race to dominate China's LLM frontier

In the lead-up to China's Labor Day Golden Week, the country's AI sector is experiencing a flurry of large language model (LLM) upgrades. Baidu and Alibaba have rolled out new flagship models, while ...

Seeking Alpha

DeepSeek shows off new agentic AI model that surpasses R1 in key areas

On Thursday, Chinese AI startup DeepSeek (DEEPSEEK) officially launched its updated DeepSeek-V3.1 AI model, which surpasses its R1 model on key benchmarks. The company unveiled V3.1 earlier this week.

CNBC

How DeepSeek used distillation to train its artificial intelligence model, and what it means for companies such as OpenAI

Chinese artificial intelligence lab DeepSeek roiled markets in January, setting off a massive tech and semiconductor selloff after unveiling AI models that it said were cheaper and more efficient than ...

VentureBeat

Nvidia's new Llama-3.1 Nemotron Ultra outperforms DeepSeek R1 at half the size

Even as Meta fends off questions and criticisms of its new Llama 4 model family, graphics processing unit (GPU) master Nvidia has released a new, fully open source large language model (LLM) based on ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results