New AI memory method lets models think harder while avoiding costly high-bandwidth memory, which is the major driver for DRAM ...
Cryptopolitan on MSN
DeepSeek V4 rumored to outperform ChatGPT and Claude in long-context coding
DeepSeek V4, expected to launch in February, is rumored to outperform ChatGPT and Claude in long-context coding, targeting elite-level coding tasks.
Another Chinese quantitative trading firm has entered the race to develop large language models (LLMs), unveiling systems it ...
DeepSeek's latest technical paper, co-authored by the firm's founder and CEO Liang Wenfeng, has been cited as a potential game changer in developing artificial intelligence models, as it could ...
DeepSeek's new research enables retrieval using computational memory, not neural computation, freeing up GPUs.
DeepSeek has released new research showing that a promising but fragile neural network design can be stabilised at scale, delivering measurable performance gains in large language models without ...
Chinese artificial intelligence start-up DeepSeek has ushered in 2026 with a new technical paper, co-authored by founder ...
DeepSeek's upcoming V4 model could outperform Claude and ChatGPT in coding tasks, according to insiders—with its purported ...
DeepSeek founder Liang Wenfeng has published a new paper with a research team from Peking University, outlining key technical ...
Chinese AI startup DeepSeek is expected to launch its next-generation AI model that features strong coding capabilities in ...
January 2025 shook the AI landscape. The seemingly unstoppable OpenAI and the powerful American tech giants were blindsided by what can fairly be called an underdog in the field of large language models ...