Name: Datadog LLM Observability
Availability: InStock
Author: Datadog

Learning Objectives

Understand what LLM observability is and why it matters for production AI applications
Identify Datadog's key LLM monitoring features including tracing, cost tracking, and agentic AI monitoring
Compare Datadog LLM Observability to purpose-built alternatives like LangSmith and Helicone

What Is LLM Observability?

When you deploy an AI application in production, you need to know: Is it working? How much is it costing? Are responses accurate? How fast is it? LLM Observability answers these questions by monitoring every interaction between your application and AI models.

Datadog LLM Observability extends Datadog's industry-leading monitoring platform to cover AI workloads. It automatically traces every LLM call — capturing latency, token usage, estimated cost, error rates, and response quality — and correlates this data with your existing infrastructure metrics, application traces, and logs.

💡Key Concept

Observability vs. Monitoring: Monitoring tells you when something is wrong (an alert fires). Observability tells you why — by providing the detailed traces, metrics, and logs needed to diagnose problems. For AI applications, observability means seeing exactly which LLM call in a multi-step agent workflow caused a failure, how much each call cost, and how the AI's behavior changed after a prompt update.

Core Features

LLM Call Tracing

Automatic tracing and annotation of every LLM call — no code changes required. Each trace captures:

Latency — how long the model took to respond
Token usage — input and output tokens consumed
Estimated cost — calculated from provider pricing and token counts
Error rates — failed calls, timeouts, rate limits
Full request/response content — for debugging and evaluation

Execution Flow Charts

Visual diagrams showing agent decision paths, tool usage, and retrieval steps. See exactly how a multi-step AI agent navigated a complex task — which tools it called, what data it retrieved, and where it decided to branch.

AI Agents Console (June 2025)

A dedicated dashboard for monitoring AI agents in production:

Track actions, security posture, and performance of any AI agent
Monitor user engagement and business value metrics
Works with both custom-built and third-party agents
Visibility into agentic workflows spanning multiple models and tools

LLM Experiments (June 2025)

A structured experimentation framework for testing changes before shipping to production:

Compare prompt changes, model swaps, and configuration updates
Measure impact on quality, latency, and cost
Prove results before rolling out to users

Bits AI Copilot

Datadog's built-in AI assistant that queries across all your observability data using natural language:

Identifies root causes "90% faster" than manual investigation
Integrates into Slack incident response channels with automatic summaries
Can automate alert investigations, code fixes, and security triage

Supported LLM Providers

Language	Supported Providers
Python SDK	OpenAI; Anthropic; AWS Bedrock; LangChain; Google Vertex AI
Node.js SDK	OpenAI; Anthropic; Azure OpenAI; AWS Bedrock; Google Vertex AI; LangChain; Vercel AI SDK
OpenTelemetry	Vendor-neutral via GenAI Semantic Conventions (any provider)

Additional integrations include GitHub Copilot usage tracking, Microsoft Copilot monitoring, LiteLLM gateway tracing, and cloud cost management for Anthropic and GitHub spend.

Pricing

Datadog LLM Observability is billed per LLM span (each call to an LLM provider counts as one span; a single user request may generate multiple spans).

⚠️Warning

LLM Observability is an add-on to Datadog's platform — there is no standalone free tier. Pricing is not fully transparent on the public pricing page; enterprise customers typically negotiate custom rates. Third-party estimates suggest approximately $8 per 10,000 requests, but verify current rates at datadoghq.com/pricing.

For teams that only need LLM monitoring without full-stack observability, purpose-built tools like Helicone (open-source, generous free tier) or LangSmith ($39 per user per month) offer much lower entry points.