Should language SDK generate service.instance.id? #3136

jack-berg · 2023-01-23T19:45:43Z

The resource semantic conventions say the following about service.instance.id:

If the service has no inherent unique ID that can be used as the value of this attribute it is recommended to generate a random Version 1 or Version 4 RFC 4122 UUID (services aiming for reproducible UUIDs may also use Version 5, see RFC 4122 for more recommendations).

The language is ambiguous as it's not clear "what" / "who" is supposed to generate the random Version 1 or Version 4 UUID. Is it the application owner? Is it the language SDK?

There's a request in opentelemetry-java that we generate service.instance.id. We discussed adding it in the Java SIG and there was debate about whether this was actually the intent of the spec, and if it is, whether its a good idea. The argument against including it can probably be summarized as:

Adding a random service.instance.id that is not persistent across restarts increases cardinality. While this is true I disagree that it is a problem, since I think the number of users negatively impacted would be small and they have options available to solve the problem.
Adding a random service.instance.id that doesn't reflect the identifier of the instance in its environment (i.e. is different than the pod uid) may be confusing to users. I disagree that this is a problem because users should set service.instance.id to an inherent unique ID if available.

Maybe I'm missing part of the debate but in any case it would be good if the spec could clear up this ambiguity.

This may be a duplicate of #1034, but that issue's description appears to be asking about alternates for service.instance.id where I'm asking about what is specifically responsible for generating service.instance.id in the current wording of the spec.

The text was updated successfully, but these errors were encountered:

svrnm · 2023-01-24T08:10:00Z

Adding my 2 cents to that:

I would prefer a solution where the SDK is providing the service.instance.id if it is not already defined by the end user (via environment variable, config or code). It's not something most end-users care about until it comes to observability where they want to say "give me all data for that one particular service instance" and then they figure out they have no unique identifier...

I also don't think it should be persistent across restarts, here's my rational for that: Right now if I have a highly scaled service with lots of nodes I can not identify which service is the troublemaker, except through other IDs that might be available if running on K8s (pod.id), some container runtime (container.id*) or a physical host (host.id), etc., but it's not their purpose to identify a service, which brings us back to a restart: if I (soft) restart a service within a container, the pod&container&host ID stays the same: So in the worst case I have a service that restarted a few times, but I can not uniquely identify an instance.

Oberon00 · 2023-01-24T13:03:57Z

This may be a duplicate of #1034, but that issue's description appears to be asking about alternates for service.instance.id where I'm asking about what is specifically responsible for generating service.instance.id in the current wording of the spec.

I think that is mostly what the discussion on #1034 is about. Repeating my comment from there: #1034 (comment):

IMHO this attribute is poorly defined right now as it may or may not be the same across service restarts, which IMHO can make quite a difference. It would be easiest if it MUST be the different for each restart, that way it could be used as primary key for all resources (not only service.*) sent by the same telemetry instance. On the other hand, maybe such an attribute would better be named telemetry.sdk.instance.id.

tsloughter · 2023-01-24T17:46:40Z

As a datapoint, in the Erlang SDK we use the node name as the instance id resource attribute. If the node is run without distributed erlang enabled (so no nodename) we simply use a random integer as the id -- same as using the default otel trace id generator.

nodename is somename@IP or somename@hostname, so stays the same between restarts if the node is on the same ip/hostname.

sirzooro · 2023-01-24T18:12:37Z

I also don't think it should be persistent across restarts, here's my rational for that: Right now if I have a highly scaled service with lots of nodes I can not identify which service is the troublemaker, except through other IDs that might be available if running on K8s (pod.id), some container runtime (container.id*) or a physical host (host.id), etc., but it's not their purpose to identify a service, which brings us back to a restart: if I (soft) restart a service within a container, the pod&container&host ID stays the same: So in the worst case I have a service that restarted a few times, but I can not uniquely identify an instance.

I have opposite use case: I have exactly 2 instances of service X and 10 of Y. These instances are numbered (1-2, 1-10) and their numbers are statically assigned to them during deployment. It makes sense to put them in service.instance.id as-is, and use resource attributes from Kubernetes semantic conventions to store other details.

Oberon00 · 2023-01-27T08:04:28Z

I think both an ID that persists over restarts and one that doesn't can make sense to have, for different use cases. Also, the former is harder to impalement while the latter is trivial. There should probably be two different attributes here.

svrnm · 2023-01-30T08:17:40Z

I think both an ID that persists over restarts and one that doesn't can make sense to have, for different use cases.
+1

Agreed, there should be 2 different attributes, the questions remains which one is then "service.instance.id" and which one is something else?

jsuereth · 2023-02-10T17:29:17Z

I'm actually in favor of having the SDK generate this. The main concern, and others have raised this, is that we don't have a good specification for what to generate. I'm actually looking to tackle that and would be happy to brainstorm with you on what we could do.

jsuereth · 2023-02-13T13:54:21Z

Copying this here from #3202.

My requirements for a solution to this bug:

Require service.instance.id to be generated by every SDK.
Outline an algorithm/logic for service.instance.id synthesis in a majority of cases. Allow UUID for unknown scenarios.
Allow users to override service.instance.id via configuration. Today that means OTEL_RESOURCE_ATTRIBUTES.
Ensure folks using Prometheus scrape/pull-based metrics of any sort w/ OTEL have an opportunity to see consistent service.instance.id. This may actually not be technically feasible

jsuereth · 2023-02-13T18:32:12Z

Ok, my proposal - Looking for feedback:

https://docs.google.com/document/d/1BenPf9vsZHCf4JpHWGQBydKZAA4XdH38wuMD7JQnz9A/edit?usp=sharing

jmacd · 2023-02-15T20:12:58Z

@jsuereth I think your proposal looks good.

jsuereth · 2023-02-15T21:36:05Z

Thanks @jmacd. I'll PR-ify it today/tomorrow.

…to generate it.

spedersen-emailage · 2023-03-24T21:35:20Z

Wanted to revive this. A colleague found this issue as we were researching the use of service.instance.id and the best way to generate one. Has there been any other movement on this question?

tigrannajaryan · 2023-07-11T16:51:18Z

Additionally we need to make it clear that later elements of collection pipeline may override the service.instance.id if they have a better value for it.

This is for example is the case in resourcedetection processor in the Collector.

Oberon00 · 2023-07-12T07:13:54Z

How do the later components know their value is better? Would there be any way to distinguish a user-defined instance.id from an SDK-generated one? Should there be?

tigrannajaryan · 2023-07-12T17:17:18Z

How do the later components know their value is better?

Judgement call by the component developer or by the end user. If you are certain the data you have is more "correct" than what is coming from the previous component then you overwrite it. In the Collector this is end-user configurable (see for example).

Would there be any way to distinguish a user-defined instance.id from an SDK-generated one? Should there be?

I don't expect there to be a way.

dyladan · 2024-05-21T20:27:15Z

@jack-berg is this resolved by open-telemetry/semantic-conventions#312?

jack-berg · 2024-05-21T22:10:08Z

Yes! Closing.

jack-berg added the spec:resource Related to the specification/resource directory label Jan 23, 2023

github-actions bot assigned arminru Jan 23, 2023

jack-berg mentioned this issue Jan 23, 2023

Generate a service.instance.id if it is not present? open-telemetry/opentelemetry-java#5103

Closed

svrnm mentioned this issue Jan 24, 2023

Generate a service.instance.id if it is not present? open-telemetry/opentelemetry-cpp#1908

Open

srikanthccv mentioned this issue Feb 4, 2023

Generate a service.instance.id resource attribute if it is not present open-telemetry/opentelemetry-python#2113

Open

jack-berg mentioned this issue Feb 10, 2023

Mark service and telemetry.sdk resource attributes as stable. #3202

Merged

jsuereth added a commit to jsuereth/opentelemetry-specification that referenced this issue Feb 16, 2023

Fix open-telemetry#3136 - Require service.instance.id and define how …

d77b1f6

…to generate it.

mateuszrzeszutek mentioned this issue Jul 28, 2023

add service.instance.id provider open-telemetry/opentelemetry-java-instrumentation#9062

Closed

jaredjenkins mentioned this issue Aug 4, 2023

Unify and improve GCP resource detection, second attempt open-telemetry/opentelemetry-go-contrib#2310

Merged

trentm mentioned this issue Apr 11, 2024

feat(resources): implements service.instance.id open-telemetry/opentelemetry-js#4608

Merged

4 tasks

dyladan added the triage:deciding:needs-info Not enough information. Left open to provide the author with time to add more details label May 21, 2024

dyladan unassigned arminru May 21, 2024

jack-berg closed this as completed May 21, 2024

svrnm removed the triage:deciding:needs-info Not enough information. Left open to provide the author with time to add more details label Jul 8, 2024

jsuereth mentioned this issue Jun 6, 2025

Create an entity modelling guide open-telemetry/semantic-conventions#2328

Open

3 tasks

Should language SDK generate service.instance.id? #3136

Should language SDK generate service.instance.id? #3136

Comments

jack-berg commented Jan 23, 2023

svrnm commented Jan 24, 2023

Uh oh!

Oberon00 commented Jan 24, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tsloughter commented Jan 24, 2023

Uh oh!

sirzooro commented Jan 24, 2023

Uh oh!

Oberon00 commented Jan 27, 2023

Uh oh!

svrnm commented Jan 30, 2023

Uh oh!

jsuereth commented Feb 10, 2023

Uh oh!

jsuereth commented Feb 13, 2023

Uh oh!

jsuereth commented Feb 13, 2023

Uh oh!

jmacd commented Feb 15, 2023

Uh oh!

jsuereth commented Feb 15, 2023

Uh oh!

spedersen-emailage commented Mar 24, 2023

Uh oh!

tigrannajaryan commented Jul 11, 2023

Uh oh!

Oberon00 commented Jul 12, 2023

Uh oh!

tigrannajaryan commented Jul 12, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dyladan commented May 21, 2024

Uh oh!

jack-berg commented May 21, 2024

Uh oh!

Oberon00 commented Jan 24, 2023 •

edited

Loading

tigrannajaryan commented Jul 12, 2023 •

edited

Loading