Dependency: Update to Prometheus 0.300.* #36873

Merged: 20 commits merged into open-telemetry:main on Feb 13, 2025

Conversation

@ArthurSens (Member) commented Dec 17, 2024

Description

Supersedes #36642

This PR updates the prometheus/prometheus library in our go.mod files to 0.300.* (which represents Prometheus 3.0).

It touches many go.mod files, but the Prometheus Receiver is the only component heavily affected by breaking changes.

// This includes the ScrapeManager lib that is used by the Prometheus receiver.
// We need to set the validation scheme to _something_ to avoid panics, and
// UTF8 is the default in Prometheus.
model.NameValidationScheme = model.UTF8Validation

Contributor

Note for after this merges (non-blocking): Is this actually necessary? Seems like it should always be set to this by default? https://github.com/prometheus/common/blob/cc17dab08e7c33f70e3b5ab865d6789f0ff95761/model/metric.go#L37

Member Author

Hmmm, I think that change in prometheus/common was made after this PR was opened.

I haven't tested this with the newest common version; definitely worth trying as a follow-up to this PR.
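
For readers following the thread, a minimal, self-contained sketch of the assignment under discussion. The standalone main wrapper and the IsValidMetricName check are purely illustrative additions (the receiver only performs the single assignment shown in the diff), and it assumes the pinned prometheus/common version still exposes these package-level symbols:

```go
package main

import (
	"fmt"

	"github.com/prometheus/common/model"
)

func main() {
	// Make the validation scheme explicit instead of relying on the library
	// default, which has shifted between prometheus/common releases.
	model.NameValidationScheme = model.UTF8Validation

	// Under UTF-8 validation, dotted names such as "my.metric" are accepted;
	// under model.LegacyValidation they would be rejected.
	fmt.Println(model.IsValidMetricName(model.LabelValue("my.metric")))
}
```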

Contributor

What would happen if someone is still running Prometheus 2.x but upgrades their OTEL Collector pipelines first? I'd imagine that could be a common scenario since people are probably not trying to do these upgrades in lock-step.

Member Author

Great question!

Even though the default validation scheme changes, we still use our own library to translate OTLP into Prometheus. That package still translates everything the same way as before, so we don't expect any changes here.

@@ -56,6 +63,7 @@ func createMetricsReceiver(
nextConsumer consumer.Metrics,
) (receiver.Metrics, error) {
configWarnings(set.Logger, cfg.(*Config))
addDefaultFallbackScrapeProtocol(cfg.(*Config))

Contributor

Note for after this merges (non-blocking): If this is setting the default scrape protocol, why do we still need to set it in our configuration everywhere?

Member Author

By everywhere, do you mean all those test changes I've made?

Contributor

That, and also in the compliance repo
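
For context, a rough sketch of the shape of a helper like addDefaultFallbackScrapeProtocol: fill in a fallback protocol only for scrape configs where the user left it unset. The struct and field names below are simplified stand-ins rather than the receiver's real types, and the default value is a placeholder, not necessarily the one the receiver picks:

```go
package main

import "fmt"

// Simplified stand-ins for the receiver's wrapped Prometheus configuration;
// the real receiver operates on Prometheus's own config structs.
type scrapeConfig struct {
	JobName          string
	FallbackProtocol string // corresponds to fallback_scrape_protocol in YAML
}

type promConfig struct {
	ScrapeConfigs []*scrapeConfig
}

// addDefaultFallbackScrapeProtocol sets a fallback protocol only where the
// user has not set one, so explicit settings are never overridden.
func addDefaultFallbackScrapeProtocol(cfg *promConfig) {
	const defaultFallback = "PrometheusText0.0.4" // placeholder, not necessarily the receiver's choice
	for _, sc := range cfg.ScrapeConfigs {
		if sc.FallbackProtocol == "" {
			sc.FallbackProtocol = defaultFallback
		}
	}
}

func main() {
	cfg := &promConfig{ScrapeConfigs: []*scrapeConfig{
		{JobName: "explicit", FallbackProtocol: "PrometheusText1.0.0"},
		{JobName: "defaulted"},
	}}
	addDefaultFallbackScrapeProtocol(cfg)
	for _, sc := range cfg.ScrapeConfigs {
		fmt.Printf("%s -> %s\n", sc.JobName, sc.FallbackProtocol)
	}
}
```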

@dashpole (Contributor)

Some questions, but let's get this merged and fix them afterwards.

@dashpole (Contributor)

Can you re-open the compliance PR?

@ArthurSens (Member Author)

Here you go: prometheus/compliance#143

@dashpole added the ready to merge label (Code review completed; ready to merge by maintainers) on Feb 13, 2025
@songy23 merged commit af6b17e into open-telemetry:main on Feb 13, 2025
173 checks passed
@github-actions bot added this to the next release milestone on Feb 13, 2025

@ArthurSens (Member Author)

That was some work, thank you everyone who helped here ❤️

@basti1302

@ArthurSens I think this breaks existing configs.

Note: I'm building a custom collector via https://github.com/open-telemetry/opentelemetry-collector/tree/main/cmd/builder, so YMMV. But as far as I can tell, this will also affect users that do not build their own image.

After updating to github.com/open-telemetry/opentelemetry-collector-contrib/receiver/prometheusreceiver v0.120.0 and starting the collector (on K8s, as a daemonset) with a previously working config, the collector pods go into a CrashLoopBackOff, with this error in the logs:

Error: invalid configuration: receivers::prometheus::config::scrapeconfigs::0::scrapefallbackprotocol: unknown scrape protocol , supported: [OpenMetricsText0.0.1 OpenMetricsText1.0.0 PrometheusProto PrometheusText0.0.4 PrometheusText1.0.0]
receivers::prometheus::config::scrapeconfigs::1::scrapefallbackprotocol: unknown scrape protocol , supported: [OpenMetricsText0.0.1 OpenMetricsText1.0.0 PrometheusProto PrometheusText0.0.4 PrometheusText1.0.0]
2025/02/18 16:02:18 collector server run finished with error: invalid configuration: receivers::prometheus::config::scrapeconfigs::0::scrapefallbackprotocol: unknown scrape protocol , supported: [OpenMetricsText0.0.1 OpenMetricsText1.0.0 PrometheusProto PrometheusText0.0.4 PrometheusText1.0.0]
receivers::prometheus::config::scrapeconfigs::1::scrapefallbackprotocol: unknown scrape protocol , supported: [OpenMetricsText0.0.1 OpenMetricsText1.0.0 PrometheusProto PrometheusText0.0.4 PrometheusText1.0.0]
stream closed EOF for dash0-system/dash0-operator-opentelemetry-collector-agent-daemonset-jwpwm (opentelemetry-collector)

Setting fallback_scrape_protocol: "PrometheusText1.0.0" on all scrape jobs solves the issue. However, it seems the intention of this PR was to update Prometheus as a non-breaking change? It appears addDefaultFallbackScrapeProtocol is not effective? My guess is that the validation kicks in before the factory has a chance to set the default value.

@ArthurSens (Member Author)

> @ArthurSens I think this breaks existing configs. […]

You're correct! We're trying to fix the problem here: #38018

basti1302 added two commits to dash0hq/dash0-operator that referenced this pull request on Feb 18, 2025
@basti1302

> You're correct! We're trying to fix the problem here: #38018

Ah, there is already an issue for it. Sorry about the ping, then. I searched the repo for existing issues, but I searched for scrapefallbackprotocol and unknown scrape protocol (parts of the error message) and didn't find that existing issue. Thanks for the link to the issue 👍

jpkrohling pushed a commit that referenced this pull request Feb 19, 2025
…dation (#38018)

#### Description
During the release process, we noticed a breaking change in the
Prometheus receiver caused by
#36873.

In that PR, I tried adding a fallback scrape protocol by default every time the PrometheusReceiver was built, but it turned out that config validation happens even before the component is created, so the collector fails startup with an invalid configuration.

This PR moves the addition of the fallback scrape protocol to the validation step.

#### Link to tracking issue
Fixes #37902

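To illustrate why the factory-time default came too late and what moving it into validation changes, here is a minimal sketch of the startup ordering: the collector validates configuration before any factory runs, so a default applied inside Validate is visible to validation, while one applied in the factory is not. The type and function names are simplified stand-ins, not the receiver's actual code:

```go
package main

import "fmt"

// Simplified stand-in for the receiver configuration; real names differ.
type config struct {
	FallbackScrapeProtocol string
}

// Validate runs during collector startup, before any component is created.
// Applying the default here means the value is already set when the wrapped
// Prometheus validation inspects it.
func (c *config) Validate() error {
	if c.FallbackScrapeProtocol == "" {
		c.FallbackScrapeProtocol = "PrometheusText0.0.4" // placeholder default
	}
	return nil
}

// createMetricsReceiver runs only after validation has succeeded, which is why
// defaulting here (the original approach in this PR) came too late.
func createMetricsReceiver(cfg *config) {
	fmt.Println("building receiver with fallback protocol:", cfg.FallbackScrapeProtocol)
}

func main() {
	cfg := &config{}
	if err := cfg.Validate(); err != nil { // startup validates first
		fmt.Println("invalid configuration:", err)
		return
	}
	createMetricsReceiver(cfg) // factories run only afterwards
}
```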
ArthurSens added a commit to ArthurSens/opentelemetry-collector-contrib that referenced this pull request Feb 20, 2025
…dation (open-telemetry#38018)
songy23 added a commit that referenced this pull request Feb 21, 2025
#### Description

With #36873, the Prometheus receiver can now keep dots in metric names rather than
converting them to underscores. E.g., for a metric `my.metric` scraped by the
Prometheus receiver, its name was `my_metric` before 0.120.0 vs. `my.metric` now.
This should have broken some Datadog integration tests, but those are skipped
under the race detector (which is always on in CI), so the failures did not show
up in CI.
songy23 added a commit that referenced this pull request Feb 21, 2025
#### Description
Add a changelog amending #36873. There are indeed several breaking
changes associated with the 3.0 version update:
https://prometheus.io/docs/prometheus/latest/migration/

#### Link to tracking issue
related to
http://github.com/open-telemetry/opentelemetry-collector-contrib/issues/38097
cmacknz added a commit to cmacknz/opentelemetry-collector-contrib that referenced this pull request Mar 13, 2025
cmacknz added a commit to elastic/opentelemetry-collector-contrib that referenced this pull request Mar 17, 2025