DEV Community

# mlops

Posts

Winograd convolutions cost us 2 mAP and we didn't notice for a month

4 min read
What DevOps Taught Me About AI Governance

4 min read
From ML Tooling to Analytical Governance: Recent Updates to KMDS

3 min read
Why multi-agent orchestration is harder than it looks

7 min read
RLAIF Is Eating RLHF — Here Are the Four Places Human Feedback Still Wins

6 min read
A 9-point eval gain vanished when we deduped train against test

4 min read
nvidia-smi Reports 97% Utilization While the GPU Sits Idle

6 min read
OpenAI Already Told Us the Kubernetes Scaling Story, Most People Just Did Not Read It Closely

10 min read
I Processed 2.4 Billion Tokens Across 52 AI Models for $0.52. Here's the Full Breakdown.

3 min read
Quantization formats compared: GGUF vs GPTQ vs AWQ vs NF4

7 min read
I Built a Production RAG System on My M1 Mac for $0

3 min read
I built a feature store in pure Python to finally understand the point-in-time join

6 min read
Four Models in One Training Loop: Architecting SDAR on AWS (Before Renting a Single GPU)

5 min read
Gemini Model Management: Ending Inefficiency! The Secret to 3x Faster Cost Tracking with Model Registry

3 min read
Per-project LLM cost attribution with OTel spans: the wiring

6 min read