Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
nvidia
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
CUDA 13.3 Lands, AI Writes Blackwell Kernels, & FP4 VRAM Optimization for LLMs
soy
soy
soy
Follow
May 27
CUDA 13.3 Lands, AI Writes Blackwell Kernels, & FP4 VRAM Optimization for LLMs
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
FlashAttention CUDA Kernel, Strix Halo MOE Boost, & NVIDIA DLSS 4.5 Driver Update
soy
soy
soy
Follow
May 26
FlashAttention CUDA Kernel, Strix Halo MOE Boost, & NVIDIA DLSS 4.5 Driver Update
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
From Chatbot to Agent — Tool Calling with NVIDIA NIM
Torkian
Torkian
Torkian
Follow
May 26
From Chatbot to Agent — Tool Calling with NVIDIA NIM
#
nvidia
#
ai
#
python
#
tutorial
1
 reaction
Comments
Add Comment
7 min read
PatentLLM: CUDA TileLang/Triton B200 5x Speedup, RTX 5090 Power, PTX Grammar
soy
soy
soy
Follow
May 25
PatentLLM: CUDA TileLang/Triton B200 5x Speedup, RTX 5090 Power, PTX Grammar
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
Tesla P40 in a Homelab: 24GB of Inference on a Budget
Guatu
Guatu
Guatu
Follow
May 25
Tesla P40 in a Homelab: 24GB of Inference on a Budget
#
teslap40
#
nvidia
#
proxmox
#
ollama
Comments
Add Comment
6 min read
RTX 5080 Undervolt Benchmarks, CGO-Free CUDA API Binding, & AMD GPU Compatibility Fix
soy
soy
soy
Follow
May 24
RTX 5080 Undervolt Benchmarks, CGO-Free CUDA API Binding, & AMD GPU Compatibility Fix
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
Diffusion Language Models Are Here: Deep Dive into NVIDIA's Nemotron-Labs DLM Architecture
Manoranjan Rajguru
Manoranjan Rajguru
Manoranjan Rajguru
Follow
May 24
Diffusion Language Models Are Here: Deep Dive into NVIDIA's Nemotron-Labs DLM Architecture
#
ai
#
machinelearning
#
llm
#
nvidia
Comments
Add Comment
15 min read
NVIDIA's Nemotron Diffusion: One Model, Three Generation Modes, 6 Faster
Andrew Kew
Andrew Kew
Andrew Kew
Follow
May 23
NVIDIA's Nemotron Diffusion: One Model, Three Generation Modes, 6 Faster
#
ai
#
machinelearning
#
llm
#
nvidia
Comments
Add Comment
3 min read
AMD GPU/AI Launches, Legacy Driver Update & CUDA Optimization Platform
soy
soy
soy
Follow
May 23
AMD GPU/AI Launches, Legacy Driver Update & CUDA Optimization Platform
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
RTX 5090 Cooling, BeeLlama VRAM Opts, Resizable BAR Performance Gains
soy
soy
soy
Follow
May 22
RTX 5090 Cooling, BeeLlama VRAM Opts, Resizable BAR Performance Gains
#
gpu
#
nvidia
#
hardware
1
 reaction
Comments
Add Comment
4 min read
LLM Compilers, GGUF Quantization, & Radeon RX 9060 Benchmarks
soy
soy
soy
Follow
May 20
LLM Compilers, GGUF Quantization, & Radeon RX 9060 Benchmarks
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
Go+CUDA Optimization, LLM VRAM Benchmarks & NVIDIA G-SYNC Firmware 1.1.6
soy
soy
soy
Follow
May 21
Go+CUDA Optimization, LLM VRAM Benchmarks & NVIDIA G-SYNC Firmware 1.1.6
#
gpu
#
nvidia
#
hardware
2
 reactions
Comments
Add Comment
3 min read
Who Wins the Future: Chips vs Frontier LLMs (2026)
Vektor Memory
Vektor Memory
Vektor Memory
Follow
May 20
Who Wins the Future: Chips vs Frontier LLMs (2026)
#
ai
#
cerebras
#
nvidia
#
llm
1
 reaction
Comments
Add Comment
17 min read
Intel Xe3P Leaks 160GB LPDDR5X; FlashAttention-2 in CuTe & Custom CUDA GPT-2 Engine
soy
soy
soy
Follow
May 19
Intel Xe3P Leaks 160GB LPDDR5X; FlashAttention-2 in CuTe & Custom CUDA GPT-2 Engine
#
gpu
#
nvidia
#
hardware
Comments
Add Comment
3 min read
GPU Bottleneck Analyzer, NVIDIA Rubin VRAM Demands, and Qwen VRAM Optimization
soy
soy
soy
Follow
May 18
GPU Bottleneck Analyzer, NVIDIA Rubin VRAM Demands, and Qwen VRAM Optimization
#
gpu
#
nvidia
#
hardware
1
 reaction
Comments
Add Comment
4 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account