Learn how traces, spans, and telemetry work in Scala services, how to propagate context safely, and how to correlate traces with logs and metrics.
Distributed trace: A record of one request or workflow as it crosses service boundaries, composed of smaller spans that represent individual operations.
Distributed tracing matters because logs and metrics alone often cannot explain where a single request slowed down or failed inside a multi-service system. Scala teams feel this especially in applications built around asynchronous HTTP calls, queues, streams, and background jobs, where the control flow is real but not always obvious.
The basic concepts are small but important:

- A trace represents one request or workflow end to end.
- Spans are the individual operations that make up the trace.
- Context (the trace ID, span ID, and parent link) is what each service must propagate so spans join the same trace.
If any one of those breaks, the trace becomes incomplete. The most common operational problem is not a missing tracing library. It is broken context propagation through async boundaries.
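The relationship between these identifiers can be sketched as a minimal context model. `SpanContext`, `root`, and `child` here are hypothetical names for illustration, not any particular tracing library's API:

```scala
// Minimal sketch of how trace and span identifiers relate; `SpanContext`
// is a hypothetical model, not a specific tracing library's API.
import java.util.UUID

final case class SpanContext(traceId: String, spanId: String, parentSpanId: Option[String])

object SpanContext {
  private def newId(): String = UUID.randomUUID().toString.replace("-", "").take(16)

  // A root span starts a brand-new trace.
  def root(): SpanContext = SpanContext(newId(), newId(), None)

  // A child span keeps the trace ID, gets a fresh span ID, and records its parent.
  def child(parent: SpanContext): SpanContext =
    SpanContext(parent.traceId, newId(), Some(parent.spanId))
}

val rootSpan  = SpanContext.root()
val childSpan = SpanContext.child(rootSpan)
```

The key invariant is visible in `child`: the trace ID is shared across the whole tree, while each span gets its own ID plus a parent link. Losing any one of the three fields is what makes a trace incomplete.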
Tracing is strongest when spans map to meaningful boundaries:

- incoming HTTP requests and outgoing service calls
- queue and stream handoffs
- background jobs and other high-value operations
Instrumenting every helper method usually creates noise. Operators need to see boundary transitions and latency contributions, not a decorative call graph.
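A boundary span only needs to capture what operators actually use: the operation name, how long it took, and whether it failed. A minimal sketch, assuming a hypothetical `Span` record and `traced` wrapper rather than a real tracing API:

```scala
// Sketch of wrapping one meaningful boundary (e.g. an outbound call) in a
// span that records duration and error status; `Span` and `traced` are
// illustrative names, not a specific tracing API.
final case class Span(name: String, durationNanos: Long, error: Boolean)

def traced[A](name: String)(op: => A): (Either[Throwable, A], Span) = {
  val start = System.nanoTime()
  val result =
    try Right(op)
    catch { case e: Throwable => Left(e) }
  // The span captures what operators need: which operation, how long, failed or not.
  (result, Span(name, System.nanoTime() - start, error = result.isLeft))
}

val (outcome, span) = traced("http.inventory.lookup") { 40 + 2 }
```

Applied only at boundaries like this, each span represents a latency contribution worth looking at; applied to every helper method, the same mechanism produces the decorative call graph the text warns about.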
Context propagation is where many Scala tracing efforts become unreliable. Any runtime that schedules work asynchronously can lose the trace if the context is not carried forward intentionally.
That means you should review propagation across:

- Future chains
- streaming stages and queue handoffs
- service-to-service calls and background jobs

If trace IDs disappear during a handoff, the tracing UI may show partial trees that look plausible but hide the real bottleneck.
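One reliable way to survive Future handoffs is to carry the context as an ordinary value, because a Future callback may run on a different pool thread than the code that scheduled it, so thread-local state is not guaranteed to follow the work. A minimal sketch with a hypothetical `TraceContext`:

```scala
// Sketch: pass a hypothetical TraceContext explicitly through async work,
// because a Future callback may run on a different pool thread than the
// code that scheduled it, so implicit thread-local context can be lost.
import scala.concurrent.{Await, Future}
import scala.concurrent.duration._
import scala.concurrent.ExecutionContext.Implicits.global

final case class TraceContext(traceId: String, spanId: String)

// An ordinary parameter survives any scheduling decision the runtime makes.
def callDownstream(ctx: TraceContext, payload: String): Future[String] =
  Future {
    s"[trace_id=${ctx.traceId}] handled $payload"
  }

val ctx    = TraceContext("abc123", "span-1")
val result = Await.result(callDownstream(ctx, "order-42"), 5.seconds)
```

Explicit passing is verbose but unambiguous; libraries that propagate context automatically are doing a managed version of the same handoff, which is exactly what should be reviewed at each async boundary.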
Tracing is most valuable when it aligns with the rest of the observability stack. A span should make it easy to find:

- the log lines emitted while that operation ran
- the metrics for the same service and time window
- the service, operation, and error status involved
That is why teams usually add trace_id and span_id fields to structured logs rather than treating tracing as a separate observability island.
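Concretely, that correlation can be as simple as including the identifiers in every structured log line. The `logLine` helper below is a hypothetical stand-in for whatever structured logging backend the service already uses:

```scala
// Sketch of emitting trace_id and span_id in structured log lines; the
// logLine helper is hypothetical, standing in for whatever structured
// logging backend the service already uses.
final case class LogContext(traceId: String, spanId: String)

def logLine(ctx: LogContext, level: String, message: String): String =
  s"""{"level":"$level","trace_id":"${ctx.traceId}","span_id":"${ctx.spanId}","message":"$message"}"""

// Every log line now carries the identifiers needed to jump back to the trace.
val line = logLine(LogContext("abc123", "span-7"), "INFO", "payment authorized")
```

With those two fields present, an operator can pivot from any log line to the full trace and back, instead of diffing timestamps across two separate tools.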
Full tracing of every request can be expensive. Sampling helps, but sampling policy should match the diagnostic need:

- routine, healthy traffic can usually be sampled at a low fixed rate
- errors and unusually slow requests are the evidence needed during incidents, so they deserve a much higher (often full) rate
Blindly reducing sample rates can make the tracing system cheaper while also removing the very evidence needed during incidents.
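A policy that avoids this trap can be sketched in a few lines: sample routine traffic at a fixed rate, but keep every errored trace unconditionally. The names and the hash-based rate decision below are illustrative assumptions, not a specific sampler implementation:

```scala
// Sketch of a sampling policy that keeps a fixed fraction of routine
// traces but always retains errors; names and thresholds are illustrative.
final case class TraceSummary(traceId: String, hadError: Boolean)

def shouldSample(t: TraceSummary, rate: Double): Boolean =
  // Errors are kept unconditionally so incident evidence is never sampled away.
  t.hadError || (math.abs(t.traceId.hashCode) % 100) < (rate * 100).toInt

val errorKept   = shouldSample(TraceSummary("t-1", hadError = true), rate = 0.0)
val routineKept = shouldSample(TraceSummary("t-2", hadError = false), rate = 1.0)
```

Deciding by a hash of the trace ID (rather than per span) keeps the decision consistent across services, so a sampled trace is kept or dropped as a whole instead of arriving with holes.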
The diagram below highlights what tracing should make explicit: one request path, several service-level spans, and shared identifiers that connect the trace to logs and metrics.
A beautiful trace tree is not enough if it does not answer practical questions:

- Where did this request actually slow down?
- Which operation failed, and with what error?
- Which service contributed the most latency?
That is where span timing, error tags, service names, and correlated metrics become more valuable than simply seeing many boxes on a trace screen.
If propagation breaks at one service boundary, the trace may still look useful while hiding the most important downstream work.
Too many low-value spans make analysis slower and dilute attention away from meaningful latency boundaries.
Without shared identifiers, traces and logs become two separate diagnostic tools instead of one coherent story.
Trace service boundaries and high-value operations, verify context propagation anywhere asynchronous work changes execution context, and correlate traces with structured logs and metrics. In Scala systems, tracing succeeds when one request can still be followed clearly even after concurrency, streaming, and service-to-service hops complicate the control flow.