Hello and welcome to Eye on AI. In this edition: DeepSeek defies AI convention (again)…Meta’s AI layoffs…More legal trouble for OpenAI…and what AI gets wrong about the news. Hi, Beatrice Nolan here, ...
A glimpse at how DeepSeek achieved its V3 and R1 breakthroughs, and how organizations can take advantage of model innovations when they emerge so quickly. The release of DeepSeek roiled the world of ...
DeepSeek has already shown that it prefers clever engineering over sheer size, and the new training method extends that philosophy. With DeepSeek R1, the company leaned heavily on reinforcement ...
Chinese artificial intelligence development company DeepSeek has released a new open-weight large language model (LLM). DeepSeek uploaded its newest model, Prover V2, to the hosting service Hugging ...
Excitement is growing among researchers about another powerful artificial intelligence (AI) model to emerge from China, after DeepSeek shocked the world with its launch of R1 in January. The ...
DeepSeek, the Chinese AI startup spun off from Hong Kong high-frequency trading firm High-Flyer Capital Management (and which uses a whale icon for its logo), is back today with a new large language ...
In the lead-up to China's Labor Day Golden Week, the country's AI sector is experiencing a flurry of large language model (LLM) upgrades. Baidu and Alibaba have rolled out new flagship models, while ...
On Thursday, Chinese AI startup DeepSeek officially launched its updated DeepSeek-V3.1 AI model, which surpasses its R1 model on key benchmarks. The company unveiled V3.1 earlier this week.
Chinese artificial intelligence lab DeepSeek roiled markets in January, setting off a massive tech and semiconductor selloff after unveiling AI models that it said were cheaper and more efficient than ...
Even as Meta fends off questions and criticisms of its new Llama 4 model family, graphics processing unit (GPU) master Nvidia has released a new, fully open source large language model (LLM) based on ...