DEV Community

# observability

Gaining deep insights into system behavior through metrics, logs, and traces.

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Remetric: find waste in self-hosted Prometheus, Grafana, and Loki

Remetric: find waste in self-hosted Prometheus, Grafana, and Loki

Comments
6 min read
AI Observability: Stop Flying Blind in Production

AI Observability: Stop Flying Blind in Production

Comments
4 min read
Chronos vs Toto: Zero-Shot Forecasting Benchmark Results

Chronos vs Toto: Zero-Shot Forecasting Benchmark Results

1
Comments
14 min read
AI SRE and AI DevOps: different problems, one reliability stack

AI SRE and AI DevOps: different problems, one reliability stack

Comments
6 min read
Deploying Prometheus Metrics Collection Server on Ubuntu 24.04

Deploying Prometheus Metrics Collection Server on Ubuntu 24.04

5
Comments
2 min read
Deploying Grafana Metrics Visualization Platform on Ubuntu 24.04

Deploying Grafana Metrics Visualization Platform on Ubuntu 24.04

5
Comments
2 min read
5000 events, one worker, one bug: trace-filter for agent JSONL traces

Hermes Agent Challenge Submission: Write About Hermes Agent

5000 events, one worker, one bug: trace-filter for agent JSONL traces

Comments
4 min read
I logged 300 Hermes runs to one file. trace-session-split cut it into 300.

Hermes Agent Challenge Submission: Build With Hermes Agent

I logged 300 Hermes runs to one file. trace-session-split cut it into 300.

Comments
2 min read
Translating LLM Telemetry Between OpenInference and OTel GenAI with Rust

Translating LLM Telemetry Between OpenInference and OTel GenAI with Rust

Comments
5 min read
Bronto for Fastly: Real-Time CDN Logging That Actually Scales

Bronto for Fastly: Real-Time CDN Logging That Actually Scales

2
Comments
5 min read
What GitHub Uses eBPF For (and the Layer They Have Not Ported Yet)

What GitHub Uses eBPF For (and the Layer They Have Not Ported Yet)

Comments
5 min read
Distributed tracing across FastAPI and Celery with OpenTelemetry the part nobody shows you

Distributed tracing across FastAPI and Celery with OpenTelemetry the part nobody shows you

Comments
2 min read
Per-Customer LLM Cost Reports (Without Rearchitecting Your Billing Pipeline)

Per-Customer LLM Cost Reports (Without Rearchitecting Your Billing Pipeline)

Comments
8 min read
Hallucination Detection at the Trace Layer: 4 Detectors You Can Ship Today

Hallucination Detection at the Trace Layer: 4 Detectors You Can Ship Today

Comments
10 min read
Eval Set Drift: How to Know When Your Golden Set Went Stale

Eval Set Drift: How to Know When Your Golden Set Went Stale

Comments
8 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.