New Relic's Infinite Tracing Processor is an implementation of the OpenTelemetry Collector tailsamplingprocessor. In addition to upstream features, it supports scalable and durable distributed processing by using a distributed cache for shared state storage. This documentation describes how to configure it.
Supported caches
The processor supports any Redis-compatible cache implementation. It has been tested and validated with Redis and Valkey in both single-instance and cluster configurations. For production deployments, we recommend using cluster mode (sharded) to ensure high availability and scalability. To enable distributed caching, add the distributed_cache configuration to your tail_sampling processor section:
```yaml
tail_sampling:
  distributed_cache:
    connection:
      address: redis://localhost:6379/0
      password: 'local'
    trace_window_expiration: 30s # Default: how long to wait after last span before evaluating
    processor_name: "itc" # Name of the processor
    data_compression:
      format: lz4 # Optional: compression format (none, snappy, zstd, lz4); lz4 recommended
```

Important
Configuration behavior: When distributed_cache is configured, the processor automatically uses the distributed cache for state management. If distributed_cache is omitted entirely, the collector will use in-memory processing instead.
The address must specify a valid Redis-compatible server address in the standard format:
```
redis[s]://[[username][:password]@][host][:port][/db-number]
```

Alternatively, you can include the credentials directly in the address field:
```yaml
tail_sampling:
  distributed_cache:
    connection:
      address: redis://:yourpassword@localhost:6379/0
```

The processor is implemented in Go and uses the go-redis client library.
Configuration parameters
The distributed_cache section supports the following parameters.
Connection settings
| Parameter | Type | Default | Description |
|---|---|---|---|
| connection.address | string | Required | Redis connection string (format: redis://host:port/db). For cluster mode, use comma-separated addresses (e.g., redis://node1:6379,redis://node2:6379) |
| connection.password | string | "" | Redis password for authentication |
Redis client timeouts and connection pool
All settings are optional and have defaults aligned with the 10s ingestion_response_timeout.
| Parameter | Type | Default | Description |
|---|---|---|---|
| connection.dial_timeout | duration | 2s | Timeout for establishing new connections to Redis |
| connection.read_timeout | duration | 500ms | Timeout for socket reads. Commands fail with a timeout error if exceeded |
| connection.write_timeout | duration | 500ms | Timeout for socket writes. Commands fail with a timeout error if exceeded |
| connection.pool_timeout | duration | 3s | Time to wait for a connection from the pool if all connections are busy |
| connection.pool_size | integer | 20 | Maximum number of socket connections per Redis node |
| connection.min_idle_conns | integer | 5 | Minimum number of idle connections to maintain for quick reuse |
| connection.max_idle_conns | integer | 10 | Maximum number of idle connections to keep open |
| connection.conn_max_idle_time | duration | 1m | Maximum time a connection may be idle before being closed |
| connection.conn_max_lifetime | duration | 30m | Maximum time a connection may be reused before being closed |
| connection.max_retries | integer | 3 | Maximum number of command retries before giving up |
| connection.min_retry_backoff | duration | 100ms | Minimum backoff between retries |
| connection.max_retry_backoff | duration | 1.5s | Maximum backoff between retries (exponential backoff capped at this value) |
| connection.max_redirects | integer | 5 | Maximum number of redirects to follow in cluster mode |
Timeout alignment:
The default Redis client timeouts are aligned with the ingestion_response_timeout (default: 10s) to ensure Redis operations complete before workers timeout:
- Worst case calculation: `PoolTimeout (3s) + Operation (0.5s) + 3 retries × (0.5s + backoff) ≈ 7s`, which is less than 10s ✅
Tuning guidelines:
- High-latency Redis (cross-region, VPN): Increase timeouts to 2-3x defaults (e.g., 1-1.5s read/write) and reduce `max_retries` to 2 (see the sketch below)
- Very fast Redis (same host/rack): Can reduce timeouts further (e.g., 250ms) for faster failure detection
- High throughput: Increase `pool_size` to 30-50 to avoid connection pool exhaustion
- Unreliable network: Increase `max_retries` to 5-7 and adjust backoff settings
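As an illustration, here is a minimal sketch of the `connection` block tuned for a higher-latency Redis, using only the parameters documented above; the endpoint and values are placeholders to adapt to your environment:

```yaml
tail_sampling:
  distributed_cache:
    connection:
      address: redis://redis.internal:6379/0 # hypothetical cross-region endpoint
      dial_timeout: 4s      # ~2x default to absorb slower connection setup
      read_timeout: 1500ms  # ~3x default for higher round-trip times
      write_timeout: 1500ms
      pool_timeout: 5s
      max_retries: 2        # fewer retries so worst-case latency stays bounded
```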
Cluster replica options
The connection.replica section controls cluster replica routing (cluster mode only).
| Parameter | Type | Default | Description |
|---|---|---|---|
| connection.replica.read_only_replicas | boolean | true | Enable routing read commands to replica nodes. Default is true for improved scalability. Set to false if strong read consistency is required. |
| connection.replica.route_by_latency | boolean | false | Route commands to the closest node based on latency (automatically enables read_only_replicas) |
| connection.replica.route_randomly | boolean | false | Route commands to a random node (automatically enables read_only_replicas) |
Tip
Replica read benefits: When running with a Redis cluster that has replica nodes, enabling replica reads distributes read load across both primary and replica nodes, significantly improving read throughput (2-3x) and reducing load on primary nodes.
Important considerations:
- Replication lag: Replicas may lag behind the primary by milliseconds to seconds
- Cluster-only: These options only work with Redis cluster deployments
- Read operations (`Get`, `LRange`) may be served by replica nodes
- Write operations (`SetNX`, `Del`, Lua scripts) always route to primary nodes (see the sketch below)
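For reference, a minimal sketch that enables replica reads with latency-based routing, mirroring the settings used in the complete configuration example later on this page; the node addresses are placeholders:

```yaml
tail_sampling:
  distributed_cache:
    connection:
      address: redis://node1:6379,redis://node2:6379 # cluster mode: comma-separated nodes
      replica:
        read_only_replicas: true # allow reads to be served by replicas
        route_by_latency: true   # prefer the closest node; implies read_only_replicas
```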
Data compression
| Parameter | Type | Default | Description |
|---|---|---|---|
| data_compression | string | none | Compression algorithm for trace data. Options: none, snappy, zstd, lz4 |
Tip
Compression tradeoffs:
- `none`: No CPU overhead, highest Redis memory usage
- `snappy`: Fast compression/decompression, good compression ratio
- `zstd`: Best compression ratio, more CPU usage
- `lz4`: Very fast, moderate compression ratio

Choose based on where your bottleneck is: network bandwidth, Redis storage, or processor CPU availability (see the sketch below).
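A minimal sketch enabling lz4 compression, using the string form shown in the parameter table above and in the complete configuration example later on this page:

```yaml
tail_sampling:
  distributed_cache:
    data_compression: lz4 # one of: none, snappy, zstd, lz4
```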
Trace management
| Parameter | Type | Default | Description |
|---|---|---|---|
| trace_window_expiration | duration | 30s | How long to wait for spans before evaluating a trace |
| traces_ttl | duration | 5m | Time-to-live for trace data in Redis |
| cache_ttl | duration | 30m | Time-to-live for sampling decisions |
| processor_name | string | "" | Optional processor name for Redis keys and metrics (useful for multi-tenant deployments) |
TTL guidelines:
- `traces_ttl` should be long enough to handle retries and late spans
- `cache_ttl` should be much longer than `traces_ttl` to handle late-arriving spans
- Longer `cache_ttl` reduces duplicate evaluations but increases Redis memory usage
Retry and recovery
| Parameter | Type | Default | Description |
|---|---|---|---|
| max_retries | integer | 2 | Maximum retry attempts for failed trace evaluations |
| in_flight_timeout | duration | Same as trace_window_expiration | Timeout for in-flight batch processing before it is considered orphaned |
| recover_interval | duration | 5s | How often to check for orphaned batches |
Important
Orphan recovery: Orphaned batches occur when a collector crashes mid-evaluation. The orphan recovery process re-queues these traces for evaluation by another collector instance.
Evaluation settings
| Parameter | Type | Default | Description |
|---|---|---|---|
| evaluation_interval | duration | 1s | How often to check for traces ready for evaluation |
| max_traces_per_batch | integer | 1000 | Maximum number of traces to evaluate per batch |
| rate_limiter | boolean | false | Enable blocking rate limiter for concurrent trace processing |
Rate limiter:
The rate_limiter option controls backpressure behavior when the concurrent trace limit (num_traces) is reached:
- `false` (default): No rate limiting. The processor accepts traces without blocking, relying on Redis for storage. This is the recommended setting for most Redis deployments.
- `true`: Enables a blocking rate limiter that applies backpressure when `num_traces` concurrent traces are being processed. New traces block until a slot becomes available (see the sketch after the list below).
When to enable:
- High-memory environments where you want strict control over concurrent trace processing
- When Redis memory is constrained and you need to limit the rate of trace ingestion
- To prevent overwhelming downstream consumers with sudden traffic bursts
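A minimal sketch of enabling the rate limiter together with a concurrency cap, using the parameters documented above; the values are illustrative only:

```yaml
tail_sampling:
  num_traces: 100000   # concurrency cap the rate limiter enforces
  distributed_cache:
    rate_limiter: true # block new traces once num_traces are in flight
```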
Partitioning
| Parameter | Type | Default | Description |
|---|---|---|---|
| partitions | integer | 6 | Number of partitions for load distribution across Redis |
| partition_workers | integer | 6 | Number of concurrent evaluation workers |
| partition_buffer_max_traces | integer | 10000 | Maximum traces buffered per partition before flushing (2 workers per partition process in parallel) |
Partitioning benefits:
- Distributes load across multiple Redis key ranges
- Enables parallel evaluation across multiple workers
- Improves throughput in multi-collector deployments
Tip
Partition scaling: A partition is a logical shard of trace data in Redis that enables parallel processing and horizontal scaling. Traces are assigned to partitions using consistent hashing on the trace ID. Each partition can be processed independently and concurrently, enabling both vertical scaling (more CPU cores) and horizontal scaling (more collector instances).
Important:
- `partitions` should be at least 3x the number of Redis nodes needed for your workload.
- `partition_workers` should typically be less than or equal to the number of `partitions`.
Ingestion settings
| Parameter | Type | Default | Description |
|---|---|---|---|
| ingestion_workers | integer | 6 | Number of goroutines processing traces from the shared ingestion channel |
| ingestion_buffer_size | integer | 10000 | Capacity of the shared ingestion channel for buffering incoming traces |
| ingestion_channel_timeout | duration | 500ms | Maximum time to wait when sending traces to the ingestion channel. If exceeded, traces are dropped |
| ingestion_response_timeout | duration | 10s | Maximum time to wait for a worker to process and respond. Prevents indefinite blocking if workers are stuck |
| hashing_strategy | string | rendezvous | Hashing algorithm for partition selection. Options: rendezvous (recommended, 3x faster) or consistent |
Ingestion architecture:
The processor uses a shared channel with configurable workers for trace ingestion:
- Incoming traces are sent to a shared buffered channel
- Multiple workers pull from the channel and route traces to appropriate partitions
- Workers hash trace IDs using the configured hashing strategy to determine partition assignment
Configuration guidelines:
- Buffer Size: Should absorb traffic bursts. Recommended: 10k-60k traces
- Workers: Number of concurrent goroutines processing traces. Typically 1-2 workers per partition is optimal
- Channel Timeout: How long to wait if the buffer is full. A short timeout (500ms) fails fast on saturation
- Response Timeout: Protects against stuck workers. The default of 10s is appropriate for normal Redis operations
- Hashing Strategy: Algorithm for determining trace partition assignment
  - `rendezvous` (default): Provides superior load distribution for 2-99 partitions. Best choice for typical deployments.
  - `consistent`: Maintains performance when using 100+ partitions where rendezvous becomes slow. Trades slightly less optimal load distribution for better performance at scale.
  - Both strategies ensure the same trace always maps to the same partition (deterministic)
  - Choose rendezvous for better load distribution (up to 99 partitions), consistent for performance at scale (100+); see the combined sketch below
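For illustration, a minimal sketch of ingestion settings sized for a burstier workload, using only the parameters documented above; the numbers are placeholders to tune against your own traffic:

```yaml
tail_sampling:
  distributed_cache:
    ingestion_workers: 12            # ~1-2 workers per partition
    ingestion_buffer_size: 30000     # larger buffer to absorb bursts
    ingestion_channel_timeout: 500ms # fail fast when the buffer is saturated
    ingestion_response_timeout: 10s
    hashing_strategy: rendezvous     # default; best distribution below 100 partitions
```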
Core configuration (applies to Redis mode)
| Parameter | Type | Default | Description |
|---|---|---|---|
| num_traces | integer | 50000 | Maximum concurrent processing traces |
| policies | array | Required | Sampling policy definitions |
Complete configuration example
```yaml
processors:
  tail_sampling:
    num_traces: 5_000_000
    distributed_cache:
      # Connection
      connection:
        address: "redis://redis-cluster:6379/0"
        password: "your-redis-password"

        # Connection pool settings (optional - tune for your environment)
        pool_size: 30
        read_timeout: 2s
        write_timeout: 2s
        pool_timeout: 5s
        max_retries: 5

        # Replica read options (cluster mode only)
        replica:
          read_only_replicas: true # Default: enabled for improved scalability
          route_by_latency: true   # Route to closest node (recommended)

      # Compression
      data_compression: snappy

      # Trace Management
      trace_window_expiration: 30s
      traces_ttl: 2m # 120s (allow extra time for retries)
      cache_ttl: 1h  # 3600s (keep decisions longer)
      processor_name: "prod-cluster-1"

      # Retry and Recovery
      max_retries: 3
      in_flight_timeout: 45s
      recover_interval: 10s

      # Evaluation
      evaluation_interval: 1s
      max_traces_per_batch: 10000
      rate_limiter: false # Recommended for Redis mode

      # Partitioning
      partitions: 8
      partition_workers: 8
      partition_buffer_max_traces: 1000

      # Ingestion
      ingestion_workers: 12           # 1.5 workers per partition
      ingestion_buffer_size: 40000    # 40k trace buffer
      ingestion_channel_timeout: 500ms
      ingestion_response_timeout: 10s
      hashing_strategy: rendezvous    # default, best for less than 100 partitions

    # Sampling policies
    policies:
      - name: errors
        type: status_code
        status_code: {status_codes: [ERROR]}
      - name: slow-traces
        type: latency
        latency: {threshold_ms: 1000}
      - name: sample-10-percent
        type: probabilistic
        probabilistic: {sampling_percentage: 10}
```

Trace evaluation
This section covers the parameters that control when traces are evaluated and how long data persists in Redis.
Evaluation timing and frequency
These parameters control when and how often the processor evaluates traces for sampling decisions:
| Parameter | Type | Default | Description |
|---|---|---|---|
| evaluation_interval | duration | 1s | How often to check for traces ready for evaluation |
| max_traces_per_batch | integer | 1000 | Maximum number of traces to evaluate per batch |
| partition_workers | integer | 6 | Number of concurrent evaluation workers processing partitions |
How evaluation works:
- Every `evaluation_interval`, workers check for traces that have been idle for at least `trace_window_expiration`
- Up to `max_traces_per_batch` traces are pulled from Redis per evaluation cycle
- `partition_workers` evaluate batches concurrently across partitions
Tuning guidance:
- Faster decisions: Decrease `evaluation_interval` (e.g., 500ms) for lower latency, at the cost of higher Redis load
- Higher throughput: Increase `max_traces_per_batch` (e.g., 5000-10000) to process more traces per cycle
- More parallelism: Increase `partition_workers` to match available CPU cores (see the sketch below)
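A minimal sketch of an evaluation block tuned for throughput, using the parameters above; the values are illustrative and should be adjusted to your hardware and traffic:

```yaml
tail_sampling:
  distributed_cache:
    evaluation_interval: 1s    # keep the default cadence
    max_traces_per_batch: 5000 # process more traces per cycle
    partition_workers: 8       # roughly match available CPU cores
```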
TTL and expiration
The processor uses multiple TTL layers that work together to ensure traces are properly evaluated while managing Redis memory efficiently.
How TTL works in distributed mode
When using distributed_cache, the processor implements a multi-stage TTL system that differs from the in-memory processor:
Trace lifecycle stages:
- Collection phase: Spans arrive and are stored in Redis
- Evaluation phase: After `trace_window_expiration`, the trace is ready for a sampling decision
- Retention phase: Trace data persists for `traces_ttl` to handle retries and late spans
- Cache phase: Sampling decisions persist for `cache_ttl` to prevent duplicate evaluations
Important
Key difference from in-memory mode: The trace_window_expiration parameter replaces decision_wait and implements a sliding window approach:
- Each time new spans arrive for a trace, the evaluation timer resets
- Traces with ongoing activity stay active longer than traces that have stopped receiving spans
- This dynamic behavior better handles real-world span arrival patterns
Why cascading TTLs matter:
The TTL hierarchy ensures data availability throughout the trace lifecycle:
- `cache_ttl` (longest) handles late-arriving spans hours after evaluation
- `traces_ttl` (medium) provides buffer for retries and orphan recovery
- `trace_window_expiration` (shortest) controls when evaluation begins
Properly configured TTLs prevent data loss, duplicate evaluations, and incomplete traces while optimizing Redis memory usage.
Tip
Configuration principle: Each TTL should be significantly longer than the one before it (typically 5-10x). This creates safety buffers that account for processing delays, retries, and late-arriving data.
TTL hierarchy and defaults
The processor uses a cascading TTL structure where each layer provides protection and buffer time for the layer below. Understanding these relationships is critical for reliable operation:
```
trace_window_expiration (30s)
  ↓ [trace ready for evaluation]
in_flight_timeout (30s default)
  ↓ [evaluation completes or times out]
traces_ttl (5m)
  ↓ [trace data deleted from Redis]
cache_ttl (30m)
  ↓ [decision expires, late spans re-evaluated]
```

1. Trace collection window: trace_window_expiration
Default: 30s | Config: distributed_cache.trace_window_expiration
- Purpose: Controls when a trace is ready for sampling evaluation
- Behavior: Sliding window that resets each time new spans arrive for a trace
- Example: If a trace receives spans at t=0s, t=15s, and t=28s, evaluation begins at t=58s (28s + 30s window)
Tuning guidance:
- Shorter values (15-20s): Faster sampling decisions, but risk of incomplete traces if spans arrive slowly
- Longer values (45-60s): More complete traces, but higher latency and memory usage
- Typical range: 20-45 seconds depending on your span arrival patterns
2. Batch processing timeout: in_flight_timeout
Default: Same as trace_window_expiration | Config: distributed_cache.in_flight_timeout
- Purpose: Maximum time a batch can be in processing before being considered orphaned
- Behavior: Prevents data loss if a collector crashes during evaluation
- Orphan recovery: Batches exceeding this timeout are automatically re-queued for evaluation by another collector
Tuning guidance:
- Should be ≥ `trace_window_expiration`: Ensures enough time for normal evaluation
- Increase if: Your evaluation policies are computationally expensive (complex OTTL, regex)
- Monitor: `otelcol_processor_tail_sampling_sampling_decision_timer_latency` to ensure evaluations complete within this window

Tip

Relationship with trace_window_expiration: Setting `in_flight_timeout` equal to `trace_window_expiration` works well for most deployments. Only increase it if you observe frequent orphaned batch recoveries due to slow policy evaluation.
3. Trace data retention: traces_ttl
Default: 5m | Config: distributed_cache.traces_ttl
- Purpose: How long trace span data persists in Redis after initial storage
- Behavior: Provides buffer time for retries, late spans, and orphan recovery
- Critical constraint: Must be significantly longer than `trace_window_expiration` + `in_flight_timeout`
Recommended formula:
```
traces_ttl ≥ (trace_window_expiration + in_flight_timeout + max_retries × evaluation_interval) × 2
```

Example with defaults:

```
traces_ttl ≥ (30s + 30s + 2 retries × 1s) × 2 = 124s → the 5m default comfortably satisfies this ✅
```

Tuning guidance:
- Memory-constrained: Use a shorter TTL (2-3m) but risk losing data for very late spans
- Late span tolerance: Use a longer TTL (10-15m) to handle delayed span arrivals
- Standard production: 5-10 minutes provides a good balance
Important
Too short = data loss: If `traces_ttl` is too short, traces may be deleted before evaluation completes, especially during retries or orphan recovery. This results in partial or missing traces.
4. Decision cache retention: cache_ttl
Default: 30m | Config: distributed_cache.cache_ttl
- Purpose: How long sampling decisions (sampled/not-sampled) are cached
- Behavior: Prevents duplicate evaluation when late spans arrive after trace has been evaluated
- Critical constraint: Must be much longer than `traces_ttl`
Recommended formula:
```
cache_ttl ≥ traces_ttl × 6
```

Why much longer?
- Late-arriving spans can arrive minutes or hours after the trace completed
- Decision cache prevents re-evaluating traces when very late spans arrive
- Without cached decision, late spans would be evaluated as incomplete traces (incorrect sampling decision)
Tuning guidance:
- Standard production: 30m-2h balances memory usage and late span handling
- High late-span rate: 2-4h ensures decisions persist for very delayed data
- Memory-constrained: 15-30m minimum, but expect more duplicate evaluations
Memory impact:
- Each decision: ~50 bytes per trace ID
- At 10,000 spans/sec with 20 spans/trace → 500 traces/sec
- 30-minute cache: ~900,000 decisions × 50 bytes = ~45 MB
- 2-hour cache: ~3.6M decisions × 50 bytes = ~180 MB
Tip
Monitor cache effectiveness: Track the `otelcol_processor_tail_sampling_early_releases_from_cache_decision` metric. High values indicate the cache is preventing duplicate evaluations effectively.
TTL configuration examples
Low-latency, memory-constrained:
```yaml
distributed_cache:
  trace_window_expiration: 20s
  in_flight_timeout: 20s
  traces_ttl: 2m
  cache_ttl: 15m
  evaluation_interval: 500ms
  max_traces_per_batch: 2000
```

High-throughput, late-span tolerant:

```yaml
distributed_cache:
  trace_window_expiration: 45s
  in_flight_timeout: 60s
  traces_ttl: 10m
  cache_ttl: 2h
  evaluation_interval: 1s
  max_traces_per_batch: 10000
```

Balanced production (recommended):

```yaml
distributed_cache:
  trace_window_expiration: 30s
  in_flight_timeout: 45s # Extra buffer for complex policies
  traces_ttl: 5m
  cache_ttl: 30m
  evaluation_interval: 1s
  max_traces_per_batch: 5000
```

Retry and recovery
| Parameter | Type | Default | Description |
|---|---|---|---|
| max_retries | integer | 2 | Maximum retry attempts for failed trace evaluations |
| recover_interval | duration | 5s | How often to check for orphaned batches |
Orphan recovery:
Orphaned batches occur when a collector crashes mid-evaluation. The orphan recovery process runs every recover_interval and:
- Identifies batches that have exceeded `in_flight_timeout`
- Re-queues these traces for evaluation by another collector instance
- Ensures no traces are lost due to collector failures
Tuning guidance:
- Increase `max_retries` (3-5) if experiencing transient Redis errors (see the sketch below)
- Decrease `recover_interval` (2-3s) for faster recovery in high-availability environments
- Monitor recovery metrics to identify if collectors are crashing frequently
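As a sketch, a retry-and-recovery block tuned for a less reliable environment, using the parameters above with illustrative values:

```yaml
tail_sampling:
  distributed_cache:
    max_retries: 4         # tolerate transient Redis errors
    in_flight_timeout: 45s # allow slower policy evaluation before re-queueing
    recover_interval: 3s   # detect orphaned batches faster
```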
Partitioning and scaling
Partitions are the key to achieving high throughput and horizontal scalability in Redis-based tail sampling. This section explains how partitions work and how to configure them for optimal performance.
What is a partition?
A partition is a logical shard of trace data in Redis that enables parallel processing and horizontal scaling. Think of partitions as separate queues where traces are distributed based on their trace ID.
Key concepts:
- Each partition maintains its own pending traces queue in Redis
- Traces are assigned to partitions using a configurable hashing strategy (rendezvous or consistent) on the trace ID
- Each partition can be processed independently and concurrently
- Partitions enable both vertical scaling (more CPU cores) and horizontal scaling (more collector instances)
Caution
Important: Changing the number of partitions while a cluster is already running causes data loss, because existing traces can no longer be located with a different partition count.
How partitioning works
```
                 Incoming Traces
                        |
                        v
          ┌─────────────────────────────┐
          │      Hashing Strategy       │  trace_id → rendezvous or consistent hash
          │   (rendezvous by default)   │
          └─────────────────────────────┘
                        |
         ├──────────┬──────────┬──────────┐
         v          v          v          v
    ┌─────────┐ ┌─────────┐ ┌─────────┐ ┌─────────┐
    │Partition│ │Partition│ │Partition│ │Partition│
    │    0    │ │    1    │ │    2    │ │    3    │
    │ (Redis) │ │ (Redis) │ │ (Redis) │ │ (Redis) │
    └─────────┘ └─────────┘ └─────────┘ └─────────┘
         |          |          |          |
         v          v          v          v
    ┌─────────┐ ┌─────────┐ ┌─────────┐ ┌─────────┐
    │ Worker  │ │ Worker  │ │ Worker  │ │ Worker  │
    │    0    │ │    1    │ │    2    │ │    3    │
    │(Goroutine)│(Goroutine)│(Goroutine)│(Goroutine)│
    └─────────┘ └─────────┘ └─────────┘ └─────────┘
         |          |          |          |
         └──────────┴──────────┴──────────┘
                        |
                        v
                 Sampled Traces
```

Flow:
- Ingestion: Trace ID is hashed using the configured hashing strategy to determine partition assignment
- Storage: Trace data stored in Redis under partition-specific keys
- Evaluation: Worker assigned to that partition pulls and evaluates traces
- Concurrency: All partition workers run in parallel, processing different traces simultaneously
Hashing strategy
The processor supports two hashing algorithms for partition selection. The choice depends on the number of partitions:
| Strategy | Load Distribution | Performance | Best For |
|---|---|---|---|
| rendezvous (default) | Superior load balancing | Fast for up to 99 partitions | Standard deployments (2-99 partitions) - best load distribution for typical production workloads |
| consistent | Good distribution | Maintains performance with 100+ partitions | Very large scale (100+ partitions) - preserves performance when rendezvous becomes slow |
Key characteristics:
- Both strategies are deterministic - the same trace always maps to the same partition
- Rendezvous provides better load distribution but becomes slow with 100+ partitions
- Consistent hashing maintains performance at high partition counts (100+)
- Choose based on partition count: rendezvous for better distribution (2-99), consistent for performance at scale (100+)
Standard configuration (most deployments):
```yaml
distributed_cache:
  hashing_strategy: rendezvous # default, best load distribution for 2-99 partitions
  partitions: 8
```

Very large scale configuration (100+ partitions):

```yaml
distributed_cache:
  hashing_strategy: consistent # maintains performance with 100+ partitions
  partitions: 200
```

Important
Choosing the right strategy:
- Rendezvous (default): Use for deployments with up to 99 partitions. Provides superior load distribution for the vast majority of production workloads.
- Consistent: Use when scaling to 100+ partitions where rendezvous becomes slow. Trades slightly less optimal distribution for maintained performance at scale.
- Important: Once chosen, changing strategies requires clearing existing data as traces will map to different partitions.
Partition configuration parameters
Use partitions to control how many logical shards you have and partition_workers to set how many workers process them:
```yaml
distributed_cache:
  partitions: 8        # Number of logical shards in Redis
  partition_workers: 8 # Number of workers processing partitions
```

Worker behavior:
- 8 partitions + 8 workers: Each worker processes one partition every `evaluation_interval` ✅ Balanced
- 8 partitions + 16 workers: Each partition evaluated twice per interval (redundant, wastes resources)
- 8 partitions + 4 workers: Only half the partitions evaluated per interval (slower, but less CPU/Redis load)
Tip
Tuning tip: Setting fewer workers per instance (partition_workers < partitions) reduces stress on Redis and the collector, useful when running many collector instances.
Partition sizing guidelines
| Scenario | Partitions | Partition Workers | Reasoning |
|---|---|---|---|
| Development | 2-4 | 2-4 | Minimal overhead, easy debugging |
| Standard Production (15k spans/sec) | 4-8 | 4-8 | Balanced parallelism and Redis key count |
| High Volume (more than 100k spans/sec) | 12-24 | 12-24 | Maximize throughput |
Important
Important sizing rules:
- `partitions` should be at least 3x the number of Redis nodes needed for your workload
- `partition_workers` should typically be ≤ `partitions`
- Changing the partition count loses existing data - traces cannot be located after the partition count changes
Partition configuration examples
Single collector (4-core machine):
```yaml
distributed_cache:
  partitions: 4
  partition_workers: 4
  partition_buffer_max_traces: 5000
```

Multi-collector (3 instances, 8-core each):

```yaml
distributed_cache:
  partitions: 12       # 3x more than single collector
  partition_workers: 6 # Each collector processes 6 partitions
  partition_buffer_max_traces: 10000
```

High-volume (10+ collectors):

```yaml
distributed_cache:
  partitions: 24
  partition_workers: 4 # Fewer per collector to share load
  partition_buffer_max_traces: 20000
```

Sizing and performance
Caution
Critical bottlenecks: Redis performance for tail sampling is primarily constrained by CPU and network, not memory. Focus your sizing and optimization efforts on:
- Network throughput and latency between collectors and Redis
- CPU capacity for compression/decompression and Lua script execution
- Memory capacity (typically sufficient if CPU and network are properly sized)
Proper Redis instance sizing requires understanding your workload characteristics:
- Spans per second: Example assumes 10,000 spans/sec throughput
- Average span size: Example assumes 900 bytes (marshaled protobuf format)
1. CPU requirements
CPU is the primary bottleneck for Redis in tail sampling workloads due to:
Compression/decompression overhead:
- Every span is compressed before storage and decompressed on retrieval
- `snappy` or `lz4`: ~5-15% CPU overhead per operation
- `zstd`: ~15-30% CPU overhead (higher compression ratio but more CPU intensive)
- For 10,000 spans/sec, expect 1-2 CPU cores dedicated to compression alone
Lua script execution:
- Atomic batch operations use Lua scripts for consistency
- Scripts execute on a single Redis core (Redis is single-threaded per operation)
- High evaluation rates can saturate a single core
- Recommendation: Use Redis cluster mode to distribute Lua execution across multiple nodes
CPU sizing guidelines:
- Single Redis instance: Minimum 4 vCPUs for 10,000 spans/sec with compression
- Redis cluster: 3+ nodes with 4 vCPUs each for high availability and load distribution
- Without compression: Reduce CPU requirements by ~30-40%, but expect higher network and memory needs
Tip
Monitoring CPU: Watch for CPU saturation (more than 80% utilization) as the first indicator of scaling needs. If CPU-bound, either add cluster nodes or reduce compression overhead.
2. Network requirements
Network bandwidth and latency directly impact sampling throughput:
Bandwidth calculations:
For 10,000 spans/sec at 900 bytes per span:
- Ingestion traffic (collectors → Redis): `10,000 × 900 bytes = 9 MB/sec = ~72 Mbps`
- Evaluation traffic (Redis → collectors): `~9 MB/sec = ~72 Mbps` (reading traces for evaluation)
- Total bidirectional: `~18 MB/sec = ~144 Mbps`
With 25% compression (snappy/lz4):
- Compressed traffic: `~108 Mbps` bidirectional
Network sizing guidelines:
- Co-located (same datacenter/VPC): 1 Gbps network interfaces are sufficient for most workloads
- Cross-region: Expect 10-50ms latency - increase timeouts and use compression to reduce bandwidth
- Connection pooling: The default `pool_size: 20` supports ~5,000-10,000 spans/sec. Increase to 30-50 for higher throughput

Important
Network is critical: Round-trip time between collectors and Redis directly impacts end-to-end sampling latency. Deploy Redis with low-latency network connectivity (less than 5ms) to collectors. Use cluster mode with replica reads to distribute network load.
3. Memory requirements
While memory is less constrained than CPU and network, proper sizing prevents evictions and ensures data availability.
Memory estimation formula
```
Total Memory = (Trace Data) + (Decision Caches) + (Overhead)
```

Trace data storage
Trace data is stored in Redis for the full `traces_ttl` period to support late-arriving spans and trace recovery.
- Storage per span: `~900 bytes` (marshaled protobuf)
- Retention period: Controlled by `traces_ttl` (1 hour in this sizing example)
- Active collection window: Controlled by `trace_window_expiration` (default: 30s)

Formula:

```
Memory ≈ spans_per_second × traces_ttl × 900 bytes
```

Important
Active window vs. full retention: Traces are collected during the `~30-second` active window (`trace_window_expiration`), but they persist in Redis for the full 1-hour `traces_ttl` period. This lets the processor handle late-arriving spans and recover orphaned traces. Memory sizing must account for the full retention period, not just the active window.
Example calculation: At 10,000 spans per second with a 1-hour `traces_ttl`:

```
10,000 spans/sec × 3600 sec × 900 bytes = 32.4 GB
```

With lz4 compression (a 25% reduction has been observed):

```
32.4 GB × 0.75 = 24.3 GB
```

Note: This calculation represents the primary memory consumer. Actual Redis memory may be slightly higher due to decision caches and internal data structures.
Decision cache storage
When using distributed_cache, decision caches are stored in Redis without an explicit size limit. Instead, Redis manages memory with its native LRU eviction policy (configured via maxmemory-policy). Each trace ID requires approximately 50 bytes of storage.
- Sampled cache: Managed by Redis LRU eviction
- Not-sampled cache: Managed by Redis LRU eviction
- Typical overhead per trace ID: `~50 bytes`

Tip
Memory management: Configure Redis with `maxmemory-policy allkeys-lru` so that old decision cache entries are evicted automatically when the memory limit is reached. Decision cache keys use TTL-based expiration (controlled by `cache_ttl`) rather than a fixed size limit.
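For example, a minimal Redis configuration sketch for this eviction behavior; the memory limit is a placeholder to align with your own sizing:

```
# redis.conf (illustrative values)
maxmemory 25gb                # per-node limit; align with your sizing
maxmemory-policy allkeys-lru  # evict least-recently-used keys when the limit is reached
```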
Batch processing overhead
- Current batch queue: Minimal (trace IDs plus scores in a sorted set)
- In-flight batches: `max_traces_per_batch × average_spans_per_trace × 900 bytes`

Example calculation: 500 traces per batch with an average of 20 spans per trace:

```
500 × 20 × 900 bytes = 9 MB per batch
```

Batch size affects memory usage during evaluation. In-flight batch memory is temporary and is released once processing completes.
Complete sizing example
Workload parameters:
- Throughput: 10,000 spans/sec
- Average span size: 900 bytes
- Retention period: 1 hour (`traces_ttl`)
- Deployment: Redis cluster with 3 nodes
Resource requirements:
| Resource | Without Compression | With lz4 Compression (25% reduction) |
|---|---|---|
| CPU per node | 2-3 vCPUs | 3-4 vCPUs (compression overhead) |
| Network bandwidth | ~144 Mbps bidirectional | ~108 Mbps bidirectional |
| Memory (total) | ~40.5 GB + decision cache | ~30.4 GB + decision cache |
Memory breakdown with compression:
| Component | Memory required |
|---|---|
| Trace data (1-hour retention) | 24.3 GB |
| Decision cache | Variable (LRU-managed) |
| Batch processing | ~7 MB |
| Redis overhead (25%) | ~6.1 GB |
| Total (minimum) | ~30.4 GB + decision cache |
Recommended Redis cluster configuration:
```
# 3-node Redis cluster (e.g., AWS cache.r6g.xlarge)
Nodes: 3
vCPUs per node: 4
Memory per node: 25 GB (75 GB total cluster)
Network: 1 Gbps or better
Region: Co-located with collectors (less than 5ms latency)
```

Important
Sizing guidance:
- CPU-first approach: Size for CPU requirements first, then verify memory and network adequacy
- Cluster mode strongly recommended: Distributes CPU, network, and memory load across nodes
- Monitoring: Track CPU utilization, network throughput, and memory usage to identify bottlenecks
- Scaling: If CPU-bound (more than 70% utilization), add cluster nodes. If network-bound, enable compression or add nodes
- Buffer for spikes: Provision 20-30% additional capacity beyond steady-state requirements
Default configuration architecture
The default configuration values are designed for a reference deployment supporting 1 million spans per minute (~16,000 spans/sec):
Collector deployment:
- 3 collector instances
- 4 vCPUs per instance
- 8 GB RAM per instance
Redis cluster:
- 3 Redis instances (AWS cache.r6g.xlarge: 4 vCPUs, 25.01 GiB memory each)
- Configured as a cluster for high availability and load distribution
- Co-located with collectors for low-latency access
This reference architecture provides a starting point for production deployments. Adjust based on your actual throughput and latency requirements.
Metrics reference
The tail sampling processor emits the following metrics in Redis-distributed mode to help you monitor performance and diagnose issues.
Available metrics
| Metric name | Dimensions | Description | Use Case |
|---|---|---|---|
| otelcol_processor_tail_sampling_batches | partition, processor | Number of batch operations | Monitor batch processing rate across partitions |
| otelcol_processor_tail_sampling_sampling_decision_timer_latency | partition, processor | Sampling decision timer latency (ms) | Track overall evaluation performance per partition |
| otelcol_processor_tail_sampling_sampling_policy_evaluation_error | partition, processor | Policy evaluation error count | Detect policy configuration issues |
| otelcol_processor_tail_sampling_count_traces_sampled | policy, decision, partition, processor | Count of traces sampled/not sampled per policy | Track per-policy sampling decisions |
| otelcol_processor_tail_sampling_count_spans_sampled | policy, decision, partition, processor | Count of spans sampled/not sampled per policy | Span-level sampling statistics |
| otelcol_processor_tail_sampling_global_count_traces_sampled | decision, partition, processor | Global count of traces sampled by at least one policy | Overall sampling rate monitoring |
| otelcol_processor_tail_sampling_early_releases_from_cache_decision | sampled | Spans immediately released due to cache hit | Decision cache effectiveness |
| otelcol_processor_tail_sampling_new_trace_id_received | partition, processor | Count of new traces received | Trace ingestion rate per partition |
| otelcol_processor_tail_sampling_new_span_received | partition, processor | Count of new spans received | Span ingestion rate per partition |
| otelcol_processor_tail_sampling_traces_dropped | partition, processor | Traces dropped due to saving errors | Error detection and troubleshooting |
| otelcol_processor_tail_sampling_spans_dropped | partition, processor | Spans dropped due to saving errors | Error detection and troubleshooting |
| otelcol_processor_tail_sampling_count_traces_deleted | deleted, partition, processor | Count of traces deleted from storage | Cleanup monitoring |
Dimension details
- `policy`: Name of the sampling policy that made the decision
- `sampled`: Whether the decision was to sample (true/false)
- `decision`: The sampling decision type (`sampled`, `not_sampled`, `dropped`)
- `deleted`: Whether deletion was successful (true/false)
- `partition`: Partition identifier (hex-encoded hash, e.g., `{a1b2c3d4...}`) - ensures Redis Cluster hash tag compatibility
- `processor`: Processor instance identifier (from the `distributed_cache.processor_name` config)
Tip
Partition identifiers: Partition values are deterministic SHA256 hashes of the partition index combined with the processor name. Check collector logs at startup to see the mapping of partition indices to hash values.
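To collect these metrics, the collector's own telemetry must be enabled. A minimal sketch using the collector's standard service telemetry section is shown below; the metrics level is an assumption to adjust for your deployment:

```yaml
service:
  telemetry:
    metrics:
      level: detailed # emit detailed internal metrics, including the tail sampling metrics above
```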
Redis-compatible cache requirements
The processor uses the cache as distributed storage for the following trace data:
- Trace and span attributes
- Active trace data
- Sampling decision caches
The processor executes Lua scripts to interact with the Redis cache atomically. Lua script support is typically enabled by default in Redis-compatible caches, so no additional setup is required unless you have explicitly disabled this feature.