DEV Community
#gpu
Posts
FlashAttention CUDA Kernel, Strix Halo MOE Boost, & NVIDIA DLSS 4.5 Driver Update
soy
May 26
#gpu #nvidia #hardware
3 min read
PatentLLM: CUDA TileLang/Triton B200 5x Speedup, RTX 5090 Power, PTX Grammar
soy
May 25
#gpu #nvidia #hardware
3 min read
How to Detect GPU Waste in a Kubernetes Cluster
Sam Hosseini
May 25
#kubernetes #gpu #mlops #devops
5 min read
Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)
Alan West
May 24
#pytorch #performance #machinelearning #gpu
5 min read
RTX 5080 Undervolt Benchmarks, CGO-Free CUDA API Binding, & AMD GPU Compatibility Fix
soy
May 24
#gpu #nvidia #hardware
3 min read
AMD GPU/AI Launches, Legacy Driver Update & CUDA Optimization Platform
soy
May 23
#gpu #nvidia #hardware
3 min read
Running LTX-2.3 Alongside TTS on a Single 96GB GPU with a Cold-Start Architecture
shinji shimizu
May 22
#gpu #python #machinelearning #ai
5 min read
RTX 5090 Cooling, BeeLlama VRAM Opts, Resizable BAR Performance Gains
soy
May 22
#gpu #nvidia #hardware
1 reaction
4 min read
Five Years Later, I Finally Have 96GB VRAM — What It Actually Unlocks for Agent Loops
shinji shimizu
May 22
#gpu #ai #machinelearning #python
8 min read
Turning a 1-Line Idea Into a 40-Second Short with a 10-Beat Local Video Pipeline
shinji shimizu
May 22
#python #ai #machinelearning #gpu
7 min read
HiDream-O1-Image 3–8x Faster: Benchmarking Steps, CFG, and Resolution
shinji shimizu
May 22
#ai #machinelearning #gpu #python
5 min read
Profiling a CUDA Python Program with GPUFlight
Myoungho Shin
May 22
#performance #python #cuda #gpu
10 min read
LLM Compilers, GGUF Quantization, & Radeon RX 9060 Benchmarks
soy
May 20
#gpu #nvidia #hardware
3 min read
Go+CUDA Optimization, LLM VRAM Benchmarks & NVIDIA G-SYNC Firmware 1.1.6
soy
May 21
#gpu #nvidia #hardware
2 reactions
3 min read
Construyendo la PC de Escritorio de tus Sueños (Building the Desktop PC of Your Dreams)
Cesil-Codex
May 21
#pc #gpu #ram #es
5 min read