DEV Community

# pytorch

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
LLM-as-judge variance broke our DPO training signal for 3 weeks

LLM-as-judge variance broke our DPO training signal for 3 weeks

Comments
4 min read
The bf16 grad accumulator that killed our SDXL LoRA training

The bf16 grad accumulator that killed our SDXL LoRA training

Comments
4 min read
I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail

I Built a Diagnostic Toolkit for PyTorch Because I Was Tired of Guessing Why Models Fail

Comments
2 min read
Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)

Why Your PyTorch Training Crawls on a Beefy GPU (And How to Fix It)

Comments
5 min read
Prefix caching in vLLM under multi-tenant agent traffic

Prefix caching in vLLM under multi-tenant agent traffic

Comments 1
4 min read
Why your diffusion model is slow at batch size 1 (and what actually helps)

Why your diffusion model is slow at batch size 1 (and what actually helps)

Comments
4 min read
Your PyTorch Model File Can Execute Arbitrary Code — Here's How I Built a Scanner to Detect It

Your PyTorch Model File Can Execute Arbitrary Code — Here's How I Built a Scanner to Detect It

Comments
3 min read
Building Smaller Graph Neural Networks for Edge Healthcare Systems

Building Smaller Graph Neural Networks for Edge Healthcare Systems

Comments
4 min read
How to Serve a PyTorch Model with FastAPI: A Complete Guide

How to Serve a PyTorch Model with FastAPI: A Complete Guide

Comments
4 min read
CUDA Out of Memory at 60% Utilization: Tracing PyTorch GPU Memory Fragmentation

CUDA Out of Memory at 60% Utilization: Tracing PyTorch GPU Memory Fragmentation

Comments
4 min read
What Google's New Chips Mean If You Train Your Own Models

Google Cloud NEXT '26 Challenge Submission

What Google's New Chips Mean If You Train Your Own Models

2
Comments
9 min read
Quantising event-camera networks to run under 1MB on a Cortex-M7

Quantising event-camera networks to run under 1MB on a Cortex-M7

Comments
4 min read
Why your diffusion model is slow at batch size 1 (and what actually helps)

Why your diffusion model is slow at batch size 1 (and what actually helps)

Comments
4 min read
Why Your Diffusion Model Is Slow at Inference (And It's Not the UNet)

Why Your Diffusion Model Is Slow at Inference (And It's Not the UNet)

Comments
4 min read
Diffusion Model Inference in Production: What the Benchmarks Leave Out

Diffusion Model Inference in Production: What the Benchmarks Leave Out

Comments
1 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.