Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory ...
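The snippet does not describe KVTC's actual pipeline, so the following is only a generic illustration of the idea behind transform-coding a KV cache: apply an orthonormal transform to each cached vector, then store coarsely quantized coefficients. All names, the DCT choice, and the 4-bit setting are illustrative assumptions, not Nvidia's implementation.

```python
import numpy as np

np.random.seed(0)

def dct_matrix(n):
    # Orthonormal DCT-II basis: M @ M.T == I
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    M = np.cos(np.pi * (2 * i + 1) * k / (2 * n)) * np.sqrt(2.0 / n)
    M[0] /= np.sqrt(2.0)
    return M

def compress(kv, bits=4):
    # Transform each cached vector, then uniformly quantize the coefficients.
    # Coefficients fit in a signed (bits)-bit range; stored here in int8 for simplicity.
    n = kv.shape[-1]
    D = dct_matrix(n)
    coeffs = kv @ D.T
    scale = np.abs(coeffs).max() / (2 ** (bits - 1) - 1)
    q = np.round(coeffs / scale).astype(np.int8)
    return q, scale, D

def decompress(q, scale, D):
    # Dequantize and apply the inverse (transposed) orthonormal transform.
    return (q.astype(np.float32) * scale) @ D

kv = np.random.randn(8, 64).astype(np.float32)  # mock KV rows (seq x head_dim)
q, scale, D = compress(kv)
rec = decompress(q, scale, D)
err = float(np.abs(kv - rec).max())
```

Because the transform is orthonormal, quantization error in the coefficient domain translates directly into bounded reconstruction error; a real system would add the entropy-coding and per-channel scaling needed to reach ratios like the 20x claimed above.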
New enterprise workbench helps organizations design, build, evaluate, and operate domain-specific language models using ...
The algorithm achieves up to an 8x performance boost over unquantized keys on Nvidia H100 GPUs.
Ceramic's Supervised Generation augments LLM outputs with search grounding, citations, and confidence signals, bringing verifiable, trustworthy AI to enterprise applications.
Upwind, the runtime-first cloud security platform leader, today unveiled the results of research from RSAC Conference demonstrating that malicious Large Language Model (LLM) prompts can be detected ...
Driving a shift to open-source-based agents with an Open, Inference-First Full-Stack AI Platform SAN JOSE, Calif., March 16, 2026 /PRNewswire/ -- Qubrid AI, a leading Open, Inference-First Full-Stack AI ...
Alongside new hardware developments, the 2026 edition of Nvidia’s GTC conference was all about agentic AI, as the company ...
Nemotron Coalition's first project is a base model co-developed with Mistral AI and open sourced on release.
An open standard for AI inference backed by Google Cloud, IBM, Red Hat, Nvidia and more was given to the Linux Foundation for ...
Fractal Analytics announced the launch of LLM Studio, an enterprise platform that helps organizations build and run language models tailored to their business. It is designed for teams that want more ...
Nvidia's NemoClaw installs Nemotron models and the OpenShell runtime onto the OpenClaw agent platform in a single command, ...
Nvidia's innovation does not stop with GPUs; the company will incorporate whatever technology CEO Jensen Huang needs to stay at the ...