Thrift module: zipkinCore

ModuleServicesData typesConstants
zipkinCore Annotation
AnnotationType
BinaryAnnotation
Endpoint
Span
CLIENT_ADDR
CLIENT_RECV
CLIENT_RECV_FRAGMENT
CLIENT_SEND
CLIENT_SEND_FRAGMENT
ERROR
HTTP_HOST
HTTP_METHOD
HTTP_PATH
HTTP_REQUEST_SIZE
HTTP_RESPONSE_SIZE
HTTP_ROUTE
HTTP_STATUS_CODE
HTTP_URL
LOCAL_COMPONENT
MESSAGE_ADDR
MESSAGE_RECV
MESSAGE_SEND
SERVER_ADDR
SERVER_RECV
SERVER_RECV_FRAGMENT
SERVER_SEND
SERVER_SEND_FRAGMENT
WIRE_RECV
WIRE_SEND

Constants

ConstantTypeValue
CLIENT_SENDstring"cs"
The client sent ("cs") a request to a server. There is only one send per
span. For example, if there's a transport error, each attempt can be logged
as a WIRE_SEND annotation.

If chunking is involved, each chunk could be logged as a separate
CLIENT_SEND_FRAGMENT in the same span.

Annotation.host is not the server. It is the host which logged the send
event, almost always the client. When logging CLIENT_SEND, instrumentation
should also log the SERVER_ADDR.

CLIENT_RECVstring"cr"
The client received ("cr") a response from a server. There is only one
receive per span. For example, if duplicate responses were received, each
can be logged as a WIRE_RECV annotation.

If chunking is involved, each chunk could be logged as a separate
CLIENT_RECV_FRAGMENT in the same span.

Annotation.host is not the server. It is the host which logged the receive
event, almost always the client. The actual endpoint of the server is
recorded separately as SERVER_ADDR when CLIENT_SEND is logged.

SERVER_SENDstring"ss"
The server sent ("ss") a response to a client. There is only one response
per span. If there's a transport error, each attempt can be logged as a
WIRE_SEND annotation.

Typically, a trace ends with a server send, so the last timestamp of a trace
is often the timestamp of the root span's server send.

If chunking is involved, each chunk could be logged as a separate
SERVER_SEND_FRAGMENT in the same span.

Annotation.host is not the client. It is the host which logged the send
event, almost always the server. The actual endpoint of the client is
recorded separately as CLIENT_ADDR when SERVER_RECV is logged.

SERVER_RECVstring"sr"
The server received ("sr") a request from a client. There is only one
request per span.  For example, if duplicate responses were received, each
can be logged as a WIRE_RECV annotation.

Typically, a trace starts with a server receive, so the first timestamp of a
trace is often the timestamp of the root span's server receive.

If chunking is involved, each chunk could be logged as a separate
SERVER_RECV_FRAGMENT in the same span.

Annotation.host is not the client. It is the host which logged the receive
event, almost always the server. When logging SERVER_RECV, instrumentation
should also log the CLIENT_ADDR.

MESSAGE_SENDstring"ms"
Message send ("ms") is a request to send a message to a destination, usually
a broker. This may be the only annotation in a messaging span. If WIRE_SEND
exists in the same span, it follows this moment and clarifies delays sending
the message, such as batching.

Unlike RPC annotations like CLIENT_SEND, messaging spans never share a span
ID. For example, "ms" should always be the parent of "mr".

Annotation.host is not the destination, it is the host which logged the send
event: the producer. When annotating MESSAGE_SEND, instrumentation should
also tag the MESSAGE_ADDR.

MESSAGE_RECVstring"mr"
A consumer received ("mr") a message from a broker. This may be the only
annotation in a messaging span. If WIRE_RECV exists in the same span, it
precedes this moment and clarifies any local queuing delay.

Unlike RPC annotations like SERVER_RECV, messaging spans never share a span
ID. For example, "mr" should always be a child of "ms" unless it is a root
span.

Annotation.host is not the broker, it is the host which logged the receive
event: the consumer.  When annotating MESSAGE_RECV, instrumentation should
also tag the MESSAGE_ADDR.

WIRE_SENDstring"ws"
Optionally logs an attempt to send a message on the wire. Multiple wire send
events could indicate network retries. A lag between client or server send
and wire send might indicate queuing or processing delay.

WIRE_RECVstring"wr"
Optionally logs an attempt to receive a message from the wire. Multiple wire
receive events could indicate network retries. A lag between wire receive
and client or server receive might indicate queuing or processing delay.

CLIENT_SEND_FRAGMENTstring"csf"
Optionally logs progress of a (CLIENT_SEND, WIRE_SEND). For example, this
could be one chunk in a chunked request.

CLIENT_RECV_FRAGMENTstring"crf"
Optionally logs progress of a (CLIENT_RECV, WIRE_RECV). For example, this
could be one chunk in a chunked response.

SERVER_SEND_FRAGMENTstring"ssf"
Optionally logs progress of a (SERVER_SEND, WIRE_SEND). For example, this
could be one chunk in a chunked response.

SERVER_RECV_FRAGMENTstring"srf"
Optionally logs progress of a (SERVER_RECV, WIRE_RECV). For example, this
could be one chunk in a chunked request.

HTTP_HOSTstring"http.host"
The domain portion of the URL or host header. Ex. "mybucket.s3.amazonaws.com"

Used to filter by host as opposed to ip address.

HTTP_METHODstring"http.method"
The HTTP method, or verb, such as "GET" or "POST".

Used to filter against an http route.

HTTP_PATHstring"http.path"
The absolute http path, without any query parameters. Ex. "/objects/abcd-ff"

Used as a filter or to clarify the request path for a given route. For example, the path for
a route "/objects/:objectId" could be "/objects/abdc-ff". This does not limit cardinality like
HTTP_ROUTE("http.route") can, so is not a good input to a span name.

The Zipkin query api only supports equals filters. Dropping query parameters makes the number
of distinct URIs less. For example, one can query for the same resource, regardless of signing
parameters encoded in the query line. Dropping query parameters also limits the security impact
of this tag.

Historical note: This was commonly expressed as "http.uri" in zipkin, even though it was most

HTTP_ROUTEstring"http.route"
The route which a request matched or "" (empty string) if routing is supported, but there was no
match. Ex "/users/{userId}"

Unlike HTTP_PATH("http.path"), this value is fixed cardinality, so is a safe input to a span
name function or a metrics dimension. Different formats are possible. For example, the following
are all valid route templates: "/users" "/users/:userId" "/users/*"

Route-based span name generation often uses other tags, such as HTTP_METHOD("http.method") and
HTTP_STATUS_CODE("http.status_code"). Route-based names can look like "get /users/{userId}",
"post /users", "get not_found" or "get redirected".

HTTP_URLstring"http.url"
The entire URL, including the scheme, host and query parameters if available. Ex.
"https://mybucket.s3.amazonaws.com/objects/abcd-ff?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Algorithm=AWS4-HMAC-SHA256..."

Combined with HTTP_METHOD, you can understand the fully-qualified request line.

This is optional as it may include private data or be of considerable length.

HTTP_STATUS_CODEstring"http.status_code"
The HTTP status code, when not in 2xx range. Ex. "503"

Used to filter for error status.

HTTP_REQUEST_SIZEstring"http.request.size"
The size of the non-empty HTTP request body, in bytes. Ex. "16384"

Large uploads can exceed limits or contribute directly to latency.

HTTP_RESPONSE_SIZEstring"http.response.size"
The size of the non-empty HTTP response body, in bytes. Ex. "16384"

Large downloads can exceed limits or contribute directly to latency.

LOCAL_COMPONENTstring"lc"
The value of "lc" is the component or namespace of a local span.

BinaryAnnotation.host adds service context needed to support queries.

Local Component("lc") supports three key features: flagging, query by
service and filtering Span.name by namespace.

While structurally the same, local spans are fundamentally different than
RPC spans in how they should be interpreted. For example, zipkin v1 tools
center on RPC latency and service graphs. Root local-spans are neither
indicative of critical path RPC latency, nor have impact on the shape of a
service graph. By flagging with "lc", tools can special-case local spans.

Zipkin v1 Spans are unqueryable unless they can be indexed by service name.
The only path to a service name is by (Binary)?Annotation.host.serviceName.
By logging "lc", a local span can be queried even if no other annotations
are logged.

The value of "lc" is the namespace of Span.name. For example, it might be
"finatra2", for a span named "bootstrap". "lc" allows you to resolves
conflicts for the same Span.name, for example "finatra/bootstrap" vs
"finch/bootstrap". Using local component, you'd search for spans named
"bootstrap" where "lc=finch"

ERRORstring"error"
When an annotation value, this indicates when an error occurred. When a
binary annotation key, the value is a human readable message associated
with an error.

Due to transient errors, an ERROR annotation should not be interpreted
as a span failure, even the annotation might explain additional latency.
Instrumentation should add the ERROR binary annotation when the operation
failed and couldn't be recovered.

Here's an example: A span has an ERROR annotation, added when a WIRE_SEND
failed. Another WIRE_SEND succeeded, so there's no ERROR binary annotation
on the span because the overall operation succeeded.

Note that RPC spans often include both client and server hosts: It is
possible that only one side perceived the error.

CLIENT_ADDRstring"ca"
Indicates a client address ("ca") in a span. Most likely, there's only one.
Multiple addresses are possible when a client changes its ip or port within
a span.

SERVER_ADDRstring"sa"
Indicates a server address ("sa") in a span. Most likely, there's only one.
Multiple addresses are possible when a client is redirected, or fails to a
different server ip or port.

MESSAGE_ADDRstring"ma"
Indicates the remote address of a messaging span, usually the broker.


Enumerations

Enumeration: AnnotationType

A subset of thrift base types, except BYTES.


BOOL0
Set to 0x01 when key is CLIENT_ADDR or SERVER_ADDR

BYTES1
No encoding, or type is unknown.

I162
I323
I644
DOUBLE5
STRING6
the only type zipkin v1 supports search against.


Data structures

Struct: Endpoint

KeyFieldTypeDescriptionRequirednessDefault value
1ipv4i32IPv4 host address packed into 4 bytes. Ex for the ip 1.2.3.4, it would be (1 << 24) | (2 << 16) | (3 << 8) | 4 >default
2porti16IPv4 port or 0, if unknown. Note: this is to be treated as an unsigned integer, so watch for negatives. default
3service_namestringClassifier of a source or destination in lowercase, such as "zipkin-web". This is the primary parameter for trace lookup, so should be intuitive as possible, for example, matching names in service discovery. Conventionally, when the service name isn't known, service_name = "unknown". However, it is also permissible to set service_name = "" (empty string). The difference in the latter usage is that the span will not be queryable by service name unless more information is added to the span with non-empty service name, e.g. an additional annotation from the server. Particularly clients may not have a reliable service name at ingest. One approach is to set service_name to "" at ingest, and later assign a better label based on binary annotations, such as user agent. default
4ipv6binaryIPv6 host address packed into 16 bytes. Ex Inet6Address.getBytes() optional

Indicates the network context of a service recording an annotation with two
exceptions.

When a BinaryAnnotation, and key is CLIENT_ADDR or SERVER_ADDR,
the endpoint indicates the source or destination of an RPC. This exception
allows zipkin to display network context of uninstrumented services, or
clients such as web browsers.

Struct: Annotation

KeyFieldTypeDescriptionRequirednessDefault value
1timestampi64Microseconds from epoch. This value should use the most precise value possible. For example, gettimeofday or multiplying currentTimeMillis by 1000. default
2valuestringUsually a short tag indicating an event, like "sr" or "finagle.retry". default
3hostEndpointThe host that recorded the value, primarily for query by service name. optional

Associates an event that explains latency with a timestamp.

Unlike log statements, annotations are often codes: for example "sr".

Struct: BinaryAnnotation

KeyFieldTypeDescriptionRequirednessDefault value
1keystringName used to lookup spans, such as "http.path" or "finagle.version". default
2valuebinarySerialized thrift bytes, in TBinaryProtocol format. For legacy reasons, byte order is big-endian. See THRIFT-3217. default
3annotation_typeAnnotationTypeThe thrift type of value, most often STRING. annotation_type shouldn't vary for the same key. default
4hostEndpointThe host that recorded value, allowing query by service name or address. There are two exceptions: when key is "ca" or "sa", this is the source or destination of an RPC. This exception allows zipkin to display network context of uninstrumented services, such as browsers or databases. optional

Binary annotations are tags applied to a Span to give it context. For
example, a binary annotation of HTTP_PATH ("http.path") could the path
to a resource in a RPC call.

Binary annotations of type STRING are always queryable, though more a
historical implementation detail than a structural concern.

Binary annotations can repeat, and vary on the host. Similar to Annotation,
the host indicates who logged the event. This allows you to tell the
difference between the client and server side of the same key. For example,
the key "http.path" might be different on the client and server side due to
rewriting, like "/api/v1/myresource" vs "/myresource. Via the host field,
you can see the different points of view, which often help in debugging.

Struct: Span

KeyFieldTypeDescriptionRequirednessDefault value
1trace_idi64Unique 8-byte identifier for a trace, set on all spans within it. default
3namestringSpan name in lowercase, rpc method for example. Conventionally, when the span name isn't known, name = "unknown". default
4idi64Unique 8-byte identifier of this span within a trace. A span is uniquely identified in storage by (trace_id, id). default
5parent_idi64The parent's Span.id; absent if this the root span in a trace. optional
6annotationslist<Annotation>Associates events that explain latency with a timestamp. Unlike log statements, annotations are often codes: for example SERVER_RECV("sr"). Annotations are sorted ascending by timestamp. default
8binary_annotationslist<BinaryAnnotation>Tags a span with context, usually to support query or aggregation. For example, a binary annotation key could be "http.path". default
9debugboolTrue is a request to store this span even if it overrides sampling policy. optionalfalse
10timestampi64Epoch microseconds of the start of this span, absent if this an incomplete span. This value should be set directly by instrumentation, using the most precise value possible. For example, gettimeofday or syncing nanoTime against a tick of currentTimeMillis. For compatibility with instrumentation that precede this field, collectors or span stores can derive this via Annotation.timestamp. For example, SERVER_RECV.timestamp or CLIENT_SEND.timestamp. Timestamp is nullable for input only. Spans without a timestamp cannot be presented in a timeline: Span stores should not output spans missing a timestamp. There are two known edge-cases where this could be absent: both cases exist when a collector receives a span in parts and a binary annotation precedes a timestamp. This is possible when.. - The span is in-flight (ex not yet received a timestamp) - The span's start event was lost optional
11durationi64Measurement in microseconds of the critical path, if known. Durations of less than one microsecond must be rounded up to 1 microsecond. This value should be set directly, as opposed to implicitly via annotation timestamps. Doing so encourages precision decoupled from problems of clocks, such as skew or NTP updates causing time to move backwards. For compatibility with instrumentation that precede this field, collectors or span stores can derive this by subtracting Annotation.timestamp. For example, SERVER_SEND.timestamp - SERVER_RECV.timestamp. If this field is persisted as unset, zipkin will continue to work, except duration query support will be implementation-specific. Similarly, setting this field non-atomically is implementation-specific. This field is i64 vs i32 to support spans longer than 35 minutes. optional
12trace_id_highi64Optional unique 8-byte additional identifier for a trace. If non zero, this means the trace uses 128 bit traceIds instead of 64 bit. optional

A trace is a series of spans (often RPC calls) which form a latency tree.

Spans are usually created by instrumentation in RPC clients or servers, but
can also represent in-process activity. Annotations in spans are similar to
log statements, and are sometimes created directly by application developers
to indicate events of interest, such as a cache miss.

The root span is where parent_id = Nil; it usually has the longest duration
in the trace.

Span identifiers are packed into i64s, but should be treated opaquely.
String encoding is fixed-width lower-hex, to avoid signed interpretation.