DEV Community

# mlops

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Virtual keys per tenant: ditching our custom LLM billing layer

Virtual keys per tenant: ditching our custom LLM billing layer

Comments
4 min read
Semantic caching the VLM step in our product-photo pipeline

Semantic caching the VLM step in our product-photo pipeline

Comments
4 min read
AI Observability: Stop Flying Blind in Production

AI Observability: Stop Flying Blind in Production

Comments
4 min read
LLM-as-judge variance broke our DPO training signal for 3 weeks

LLM-as-judge variance broke our DPO training signal for 3 weeks

Comments
4 min read
The bf16 grad accumulator that killed our SDXL LoRA training

The bf16 grad accumulator that killed our SDXL LoRA training

Comments
4 min read
Token-level eval harness for tool-calling agents: what we wired up

Token-level eval harness for tool-calling agents: what we wired up

Comments
4 min read
Capping VLM spend per CV researcher: hierarchical budgets in practice

Capping VLM spend per CV researcher: hierarchical budgets in practice

1
Comments 2
4 min read
Part 2: Enterprise Decision Intelligence Architecture: AI Governance, Threshold Policy Engines, and Operational AI Systems

Part 2: Enterprise Decision Intelligence Architecture: AI Governance, Threshold Policy Engines, and Operational AI Systems

Comments
11 min read
Auto-labelling 1.2M robotics frames with VLMs: a failover story

Auto-labelling 1.2M robotics frames with VLMs: a failover story

Comments
4 min read
We Audited Our Agent Tool-Call Traces. Half Our Eval Data Was Garbage.

We Audited Our Agent Tool-Call Traces. Half Our Eval Data Was Garbage.

Comments
4 min read
How to Detect GPU Waste in a Kubernetes Cluster

How to Detect GPU Waste in a Kubernetes Cluster

Comments
5 min read
Cost accounting for diffusion image generation at $0.0008 per render

Cost accounting for diffusion image generation at $0.0008 per render

Comments
4 min read
I built a token-level debugger for comparing two LLMs

I built a token-level debugger for comparing two LLMs

Comments
1 min read
Quantising event-camera networks to run under 1MB on a Cortex-M7

Quantising event-camera networks to run under 1MB on a Cortex-M7

Comments
4 min read
Building a Production-Grade MLOps Home Lab on Windows — K8s, LLM, RAG & GitLab CI

Building a Production-Grade MLOps Home Lab on Windows — K8s, LLM, RAG & GitLab CI

Comments
8 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.