Skip to content

Conversation

@NachoEchevarria
Copy link
Collaborator

@NachoEchevarria NachoEchevarria commented Dec 24, 2025

Summary of changes

The following jobs have been found flaky, mostly due to a already reported crash condition:

integration_tests_windows.Win x86_net9.0_ASM  7 failures / last month
integration_tests_linux.Test alpine_net7.0_ASM  5 failures / last month
integration_tests_linux.Test alpine_net9.0_ASM  5 failures / last month
integration_tests_linux.Test debian_net9.0_ASM  3 failures / last month

The complete flakiness report can be found here:
https://docs.google.com/spreadsheets/d/1Gftmhb-66Dag4qFEXw9tyXp7U0fOQsdCE2gI-1voWyA/edit?gid=1708590243#gid=1708590243

The affected jobs will automatically be retried once to avoid CI flakiness.

For other teams, the flaky tests have been marked as flaky, which causes automatic retry. In the case of ASM, the failure does not occur on especific tests, so marking some of them would not solve the issue. This change can be reverted once the jobs are more stable.

This PR is part of an initiative of marking the most flaky tests or jobs of all the teams.

Reason for change

Implementation details

Test coverage

Other details

@github-actions github-actions bot added the area:builds project files, build scripts, pipelines, versioning, releases, packages label Dec 24, 2025
@dd-trace-dotnet-ci-bot
Copy link

dd-trace-dotnet-ci-bot bot commented Dec 24, 2025

Execution-Time Benchmarks Report ⏱️

Execution-time results for samples comparing This PR (8011) and master.

✅ No regressions detected - check the details below

Full Metrics Comparison

FakeDbCommand

Metric Master (Mean ± 95% CI) Current (Mean ± 95% CI) Change Status
.NET Framework 4.8 - Baseline
duration68.39 ± (68.36 - 68.63) ms68.53 ± (68.54 - 68.76) ms+0.2%✅⬆️
.NET Framework 4.8 - Bailout
duration72.14 ± (72.10 - 72.38) ms72.24 ± (72.15 - 72.40) ms+0.1%✅⬆️
.NET Framework 4.8 - CallTarget+Inlining+NGEN
duration1000.15 ± (1002.77 - 1008.87) ms1005.41 ± (1007.20 - 1012.90) ms+0.5%✅⬆️
.NET Core 3.1 - Baseline
process.internal_duration_ms21.91 ± (21.88 - 21.95) ms22.00 ± (21.95 - 22.04) ms+0.4%✅⬆️
process.time_to_main_ms78.58 ± (78.44 - 78.72) ms79.06 ± (78.89 - 79.22) ms+0.6%✅⬆️
runtime.dotnet.exceptions.count0 ± (0 - 0)0 ± (0 - 0)+0.0%
runtime.dotnet.mem.committed10.91 ± (10.90 - 10.91) MB10.94 ± (10.93 - 10.95) MB+0.3%✅⬆️
runtime.dotnet.threads.count12 ± (12 - 12)12 ± (12 - 12)+0.0%
.NET Core 3.1 - Bailout
process.internal_duration_ms21.82 ± (21.81 - 21.84) ms21.82 ± (21.79 - 21.84) ms-0.0%
process.time_to_main_ms79.92 ± (79.83 - 80.01) ms80.33 ± (80.23 - 80.43) ms+0.5%✅⬆️
runtime.dotnet.exceptions.count0 ± (0 - 0)0 ± (0 - 0)+0.0%
runtime.dotnet.mem.committed10.96 ± (10.96 - 10.97) MB10.96 ± (10.95 - 10.96) MB-0.1%
runtime.dotnet.threads.count13 ± (13 - 13)13 ± (13 - 13)+0.0%
.NET Core 3.1 - CallTarget+Inlining+NGEN
process.internal_duration_ms248.95 ± (245.05 - 252.85) ms246.19 ± (242.35 - 250.03) ms-1.1%
process.time_to_main_ms469.67 ± (469.18 - 470.17) ms471.04 ± (470.54 - 471.55) ms+0.3%✅⬆️
runtime.dotnet.exceptions.count0 ± (0 - 0)0 ± (0 - 0)+0.0%
runtime.dotnet.mem.committed48.18 ± (48.15 - 48.20) MB48.21 ± (48.19 - 48.23) MB+0.1%✅⬆️
runtime.dotnet.threads.count28 ± (28 - 28)28 ± (28 - 28)-0.9%
.NET 6 - Baseline
process.internal_duration_ms20.56 ± (20.53 - 20.59) ms20.75 ± (20.71 - 20.79) ms+0.9%✅⬆️
process.time_to_main_ms67.97 ± (67.85 - 68.10) ms68.59 ± (68.43 - 68.76) ms+0.9%✅⬆️
runtime.dotnet.exceptions.count0 ± (0 - 0)0 ± (0 - 0)+0.0%
runtime.dotnet.mem.committed10.59 ± (10.59 - 10.60) MB10.63 ± (10.62 - 10.63) MB+0.3%✅⬆️
runtime.dotnet.threads.count10 ± (10 - 10)10 ± (10 - 10)+0.0%
.NET 6 - Bailout
process.internal_duration_ms20.57 ± (20.55 - 20.59) ms20.64 ± (20.61 - 20.67) ms+0.4%✅⬆️
process.time_to_main_ms68.98 ± (68.92 - 69.03) ms69.33 ± (69.27 - 69.40) ms+0.5%✅⬆️
runtime.dotnet.exceptions.count0 ± (0 - 0)0 ± (0 - 0)+0.0%
runtime.dotnet.mem.committed10.66 ± (10.65 - 10.66) MB10.73 ± (10.72 - 10.74) MB+0.7%✅⬆️
runtime.dotnet.threads.count11 ± (11 - 11)11 ± (11 - 11)+0.0%
.NET 6 - CallTarget+Inlining+NGEN
process.internal_duration_ms241.61 ± (238.90 - 244.31) ms249.13 ± (248.19 - 250.08) ms+3.1%✅⬆️
process.time_to_main_ms438.41 ± (438.04 - 438.79) ms441.56 ± (441.14 - 441.99) ms+0.7%✅⬆️
runtime.dotnet.exceptions.count0 ± (0 - 0)0 ± (0 - 0)+0.0%
runtime.dotnet.mem.committed48.65 ± (48.62 - 48.68) MB48.66 ± (48.63 - 48.69) MB+0.0%✅⬆️
runtime.dotnet.threads.count28 ± (28 - 28)28 ± (28 - 28)+0.4%✅⬆️
.NET 8 - Baseline
process.internal_duration_ms18.85 ± (18.82 - 18.88) ms18.94 ± (18.91 - 18.97) ms+0.5%✅⬆️
process.time_to_main_ms67.08 ± (66.98 - 67.17) ms67.41 ± (67.30 - 67.53) ms+0.5%✅⬆️
runtime.dotnet.exceptions.count0 ± (0 - 0)0 ± (0 - 0)+0.0%
runtime.dotnet.mem.committed7.67 ± (7.66 - 7.68) MB7.68 ± (7.67 - 7.69) MB+0.2%✅⬆️
runtime.dotnet.threads.count10 ± (10 - 10)10 ± (10 - 10)+0.0%
.NET 8 - Bailout
process.internal_duration_ms18.81 ± (18.79 - 18.83) ms18.95 ± (18.92 - 18.98) ms+0.7%✅⬆️
process.time_to_main_ms68.14 ± (68.08 - 68.19) ms68.63 ± (68.56 - 68.69) ms+0.7%✅⬆️
runtime.dotnet.exceptions.count0 ± (0 - 0)0 ± (0 - 0)+0.0%
runtime.dotnet.mem.committed7.74 ± (7.73 - 7.75) MB7.72 ± (7.71 - 7.73) MB-0.2%
runtime.dotnet.threads.count11 ± (11 - 11)11 ± (11 - 11)+0.0%
.NET 8 - CallTarget+Inlining+NGEN
process.internal_duration_ms178.74 ± (177.64 - 179.84) ms178.16 ± (177.21 - 179.10) ms-0.3%
process.time_to_main_ms425.04 ± (424.15 - 425.93) ms425.75 ± (425.19 - 426.32) ms+0.2%✅⬆️
runtime.dotnet.exceptions.count0 ± (0 - 0)0 ± (0 - 0)+0.0%
runtime.dotnet.mem.committed36.31 ± (36.29 - 36.34) MB36.28 ± (36.24 - 36.32) MB-0.1%
runtime.dotnet.threads.count27 ± (27 - 27)27 ± (27 - 27)-0.1%

HttpMessageHandler

Metric Master (Mean ± 95% CI) Current (Mean ± 95% CI) Change Status
.NET Framework 4.8 - Baseline
duration194.31 ± (194.09 - 195.09) ms192.65 ± (192.72 - 193.32) ms-0.9%
.NET Framework 4.8 - Bailout
duration196.73 ± (196.52 - 197.10) ms196.77 ± (196.59 - 196.96) ms+0.0%✅⬆️
.NET Framework 4.8 - CallTarget+Inlining+NGEN
duration1106.57 ± (1110.07 - 1119.33) ms1110.92 ± (1115.31 - 1124.32) ms+0.4%✅⬆️
.NET Core 3.1 - Baseline
process.internal_duration_ms187.30 ± (186.92 - 187.68) ms187.58 ± (187.27 - 187.88) ms+0.1%✅⬆️
process.time_to_main_ms80.59 ± (80.34 - 80.84) ms81.00 ± (80.81 - 81.18) ms+0.5%✅⬆️
runtime.dotnet.exceptions.count3 ± (3 - 3)3 ± (3 - 3)+0.0%
runtime.dotnet.mem.committed16.14 ± (16.11 - 16.18) MB16.11 ± (16.08 - 16.14) MB-0.2%
runtime.dotnet.threads.count20 ± (20 - 20)20 ± (19 - 20)-0.8%
.NET Core 3.1 - Bailout
process.internal_duration_ms187.27 ± (186.93 - 187.60) ms187.20 ± (186.93 - 187.46) ms-0.0%
process.time_to_main_ms82.10 ± (81.93 - 82.26) ms82.19 ± (82.08 - 82.30) ms+0.1%✅⬆️
runtime.dotnet.exceptions.count3 ± (3 - 3)3 ± (3 - 3)+0.0%
runtime.dotnet.mem.committed16.20 ± (16.16 - 16.23) MB16.15 ± (16.13 - 16.18) MB-0.3%
runtime.dotnet.threads.count21 ± (21 - 21)21 ± (21 - 21)-0.7%
.NET Core 3.1 - CallTarget+Inlining+NGEN
process.internal_duration_ms415.45 ± (411.93 - 418.97) ms422.53 ± (419.11 - 425.95) ms+1.7%✅⬆️
process.time_to_main_ms472.96 ± (472.36 - 473.57) ms472.88 ± (472.29 - 473.48) ms-0.0%
runtime.dotnet.exceptions.count3 ± (3 - 3)3 ± (3 - 3)+0.0%
runtime.dotnet.mem.committed58.69 ± (58.57 - 58.81) MB58.79 ± (58.69 - 58.90) MB+0.2%✅⬆️
runtime.dotnet.threads.count29 ± (29 - 30)30 ± (29 - 30)+0.0%✅⬆️
.NET 6 - Baseline
process.internal_duration_ms191.96 ± (191.65 - 192.27) ms192.35 ± (191.95 - 192.74) ms+0.2%✅⬆️
process.time_to_main_ms69.99 ± (69.80 - 70.17) ms70.37 ± (70.18 - 70.56) ms+0.5%✅⬆️
runtime.dotnet.exceptions.count4 ± (4 - 4)4 ± (4 - 4)+0.0%
runtime.dotnet.mem.committed16.17 ± (16.04 - 16.29) MB16.21 ± (16.09 - 16.33) MB+0.3%✅⬆️
runtime.dotnet.threads.count19 ± (18 - 19)19 ± (18 - 19)-0.2%
.NET 6 - Bailout
process.internal_duration_ms191.30 ± (190.99 - 191.62) ms191.08 ± (190.87 - 191.29) ms-0.1%
process.time_to_main_ms70.88 ± (70.77 - 71.00) ms70.99 ± (70.90 - 71.08) ms+0.2%✅⬆️
runtime.dotnet.exceptions.count4 ± (4 - 4)4 ± (4 - 4)+0.0%
runtime.dotnet.mem.committed16.24 ± (16.12 - 16.37) MB16.01 ± (15.86 - 16.17) MB-1.4%
runtime.dotnet.threads.count20 ± (19 - 20)19 ± (19 - 19)-2.6%
.NET 6 - CallTarget+Inlining+NGEN
process.internal_duration_ms455.87 ± (453.99 - 457.75) ms455.22 ± (453.24 - 457.19) ms-0.1%
process.time_to_main_ms444.36 ± (443.89 - 444.82) ms445.51 ± (445.02 - 445.99) ms+0.3%✅⬆️
runtime.dotnet.exceptions.count4 ± (4 - 4)4 ± (4 - 4)+0.0%
runtime.dotnet.mem.committed58.23 ± (58.13 - 58.33) MB58.31 ± (58.20 - 58.43) MB+0.1%✅⬆️
runtime.dotnet.threads.count30 ± (29 - 30)30 ± (29 - 30)-0.1%
.NET 8 - Baseline
process.internal_duration_ms190.16 ± (189.83 - 190.49) ms191.03 ± (190.62 - 191.44) ms+0.5%✅⬆️
process.time_to_main_ms69.45 ± (69.25 - 69.66) ms69.71 ± (69.53 - 69.90) ms+0.4%✅⬆️
runtime.dotnet.exceptions.count4 ± (4 - 4)4 ± (4 - 4)+0.0%
runtime.dotnet.mem.committed11.77 ± (11.73 - 11.80) MB11.81 ± (11.78 - 11.83) MB+0.3%✅⬆️
runtime.dotnet.threads.count18 ± (18 - 18)18 ± (18 - 18)-0.2%
.NET 8 - Bailout
process.internal_duration_ms189.45 ± (189.18 - 189.72) ms190.26 ± (189.81 - 190.72) ms+0.4%✅⬆️
process.time_to_main_ms70.27 ± (70.17 - 70.38) ms70.72 ± (70.57 - 70.87) ms+0.6%✅⬆️
runtime.dotnet.exceptions.count4 ± (4 - 4)4 ± (4 - 4)+0.0%
runtime.dotnet.mem.committed11.82 ± (11.79 - 11.84) MB11.81 ± (11.78 - 11.83) MB-0.1%
runtime.dotnet.threads.count19 ± (19 - 19)19 ± (19 - 19)+0.1%✅⬆️
.NET 8 - CallTarget+Inlining+NGEN
process.internal_duration_ms362.82 ± (361.13 - 364.51) ms363.34 ± (361.82 - 364.86) ms+0.1%✅⬆️
process.time_to_main_ms428.15 ± (427.55 - 428.74) ms428.17 ± (427.43 - 428.92) ms+0.0%✅⬆️
runtime.dotnet.exceptions.count4 ± (4 - 4)4 ± (4 - 4)+0.0%
runtime.dotnet.mem.committed47.91 ± (47.87 - 47.95) MB47.97 ± (47.94 - 48.01) MB+0.1%✅⬆️
runtime.dotnet.threads.count29 ± (29 - 29)29 ± (29 - 29)-0.0%
Comparison explanation

Execution-time benchmarks measure the whole time it takes to execute a program, and are intended to measure the one-off costs. Cases where the execution time results for the PR are worse than latest master results are highlighted in **red**. The following thresholds were used for comparing the execution times:

  • Welch test with statistical test for significance of 5%
  • Only results indicating a difference greater than 5% and 5 ms are considered.

Note that these results are based on a single point-in-time result for each branch. For full results, see the dashboard.

Graphs show the p99 interval based on the mean and StdDev of the test run, as well as the mean value of the run (shown as a diamond below the graph).

Duration charts
FakeDbCommand (.NET Framework 4.8)
gantt
    title Execution time (ms) FakeDbCommand (.NET Framework 4.8)
    dateFormat  x
    axisFormat %Q
    todayMarker off
    section Baseline
    This PR (8011) - mean (69ms)  : 67, 70
    master - mean (68ms)  : 67, 70

    section Bailout
    This PR (8011) - mean (72ms)  : 71, 73
    master - mean (72ms)  : 71, 74

    section CallTarget+Inlining+NGEN
    This PR (8011) - mean (1,010ms)  : 969, 1051
    master - mean (1,006ms)  : 963, 1049

Loading
FakeDbCommand (.NET Core 3.1)
gantt
    title Execution time (ms) FakeDbCommand (.NET Core 3.1)
    dateFormat  x
    axisFormat %Q
    todayMarker off
    section Baseline
    This PR (8011) - mean (106ms)  : 104, 109
    master - mean (106ms)  : 103, 108

    section Bailout
    This PR (8011) - mean (107ms)  : 106, 109
    master - mean (107ms)  : 106, 108

    section CallTarget+Inlining+NGEN
    This PR (8011) - mean (740ms)  : 672, 807
    master - mean (738ms)  : 669, 808

Loading
FakeDbCommand (.NET 6)
gantt
    title Execution time (ms) FakeDbCommand (.NET 6)
    dateFormat  x
    axisFormat %Q
    todayMarker off
    section Baseline
    This PR (8011) - mean (94ms)  : 92, 96
    master - mean (93ms)  : 91, 95

    section Bailout
    This PR (8011) - mean (95ms)  : 94, 96
    master - mean (94ms)  : 93, 95

    section CallTarget+Inlining+NGEN
    This PR (8011) - mean (714ms)  : 686, 743
    master - mean (707ms)  : 668, 745

Loading
FakeDbCommand (.NET 8)
gantt
    title Execution time (ms) FakeDbCommand (.NET 8)
    dateFormat  x
    axisFormat %Q
    todayMarker off
    section Baseline
    This PR (8011) - mean (92ms)  : 90, 95
    master - mean (92ms)  : 90, 94

    section Bailout
    This PR (8011) - mean (94ms)  : 93, 95
    master - mean (93ms)  : 92, 94

    section CallTarget+Inlining+NGEN
    This PR (8011) - mean (632ms)  : 618, 645
    master - mean (631ms)  : 618, 644

Loading
HttpMessageHandler (.NET Framework 4.8)
gantt
    title Execution time (ms) HttpMessageHandler (.NET Framework 4.8)
    dateFormat  x
    axisFormat %Q
    todayMarker off
    section Baseline
    This PR (8011) - mean (193ms)  : 189, 197
    master - mean (195ms)  : 189, 200

    section Bailout
    This PR (8011) - mean (197ms)  : 195, 199
    master - mean (197ms)  : 194, 200

    section CallTarget+Inlining+NGEN
    This PR (8011) - mean (1,120ms)  : 1052, 1188
    master - mean (1,115ms)  : 1048, 1182

Loading
HttpMessageHandler (.NET Core 3.1)
gantt
    title Execution time (ms) HttpMessageHandler (.NET Core 3.1)
    dateFormat  x
    axisFormat %Q
    todayMarker off
    section Baseline
    This PR (8011) - mean (277ms)  : 273, 282
    master - mean (277ms)  : 271, 283

    section Bailout
    This PR (8011) - mean (278ms)  : 274, 281
    master - mean (277ms)  : 272, 283

    section CallTarget+Inlining+NGEN
    This PR (8011) - mean (926ms)  : 873, 979
    master - mean (918ms)  : 858, 977

Loading
HttpMessageHandler (.NET 6)
gantt
    title Execution time (ms) HttpMessageHandler (.NET 6)
    dateFormat  x
    axisFormat %Q
    todayMarker off
    section Baseline
    This PR (8011) - mean (271ms)  : 265, 277
    master - mean (270ms)  : 266, 275

    section Bailout
    This PR (8011) - mean (270ms)  : 268, 273
    master - mean (270ms)  : 266, 274

    section CallTarget+Inlining+NGEN
    This PR (8011) - mean (931ms)  : 896, 967
    master - mean (927ms)  : 897, 958

Loading
HttpMessageHandler (.NET 8)
gantt
    title Execution time (ms) HttpMessageHandler (.NET 8)
    dateFormat  x
    axisFormat %Q
    todayMarker off
    section Baseline
    This PR (8011) - mean (270ms)  : 264, 277
    master - mean (269ms)  : 264, 275

    section Bailout
    This PR (8011) - mean (270ms)  : 264, 277
    master - mean (269ms)  : 265, 274

    section CallTarget+Inlining+NGEN
    This PR (8011) - mean (823ms)  : 805, 841
    master - mean (823ms)  : 802, 844

Loading

@pr-commenter
Copy link

pr-commenter bot commented Dec 24, 2025

Benchmarks

Benchmark execution time: 2025-12-26 12:28:31

Comparing candidate commit 48127d7 in PR branch nacho/RetryASMIntegrationTests with baseline commit ed3fa0f in branch master.

Found 7 performance improvements and 10 performance regressions! Performance is the same for 152 metrics, 17 unstable metrics.

scenario:Benchmarks.Trace.ActivityBenchmark.StartStopWithChild net6.0

  • 🟥 throughput [-7291.492op/s; -4795.011op/s] or [-7.862%; -5.170%]

scenario:Benchmarks.Trace.ActivityBenchmark.StartStopWithChild netcoreapp3.1

  • 🟥 execution_time [+110.636ms; +114.799ms] or [+116.681%; +121.071%]

scenario:Benchmarks.Trace.Asm.AppSecBodyBenchmark.AllCycleSimpleBody net472

  • 🟥 throughput [-71780.908op/s; -68819.424op/s] or [-7.278%; -6.978%]

scenario:Benchmarks.Trace.Asm.AppSecBodyBenchmark.AllCycleSimpleBody net6.0

  • 🟥 execution_time [+18.781ms; +25.259ms] or [+9.454%; +12.715%]

scenario:Benchmarks.Trace.Asm.AppSecBodyBenchmark.AllCycleSimpleBody netcoreapp3.1

  • 🟩 execution_time [-21.497ms; -15.125ms] or [-10.012%; -7.044%]

scenario:Benchmarks.Trace.Asm.AppSecEncoderBenchmark.EncodeLegacyArgs net6.0

  • 🟩 execution_time [-21.155ms; -20.974ms] or [-10.596%; -10.506%]

scenario:Benchmarks.Trace.Asm.AppSecEncoderBenchmark.EncodeLegacyArgs netcoreapp3.1

  • 🟩 execution_time [-23.572ms; -23.315ms] or [-11.686%; -11.558%]

scenario:Benchmarks.Trace.CIVisibilityProtocolWriterBenchmark.WriteAndFlushEnrichedTraces net472

  • 🟥 throughput [-87.174op/s; -65.960op/s] or [-7.780%; -5.886%]

scenario:Benchmarks.Trace.CIVisibilityProtocolWriterBenchmark.WriteAndFlushEnrichedTraces net6.0

  • 🟩 execution_time [-21.466ms; -20.037ms] or [-12.554%; -11.718%]
  • 🟩 throughput [+186.620op/s; +201.298op/s] or [+13.296%; +14.342%]

scenario:Benchmarks.Trace.CharSliceBenchmark.OriginalCharSlice net6.0

  • 🟥 execution_time [+112.987µs; +119.066µs] or [+5.910%; +6.228%]
  • 🟥 throughput [-30.727op/s; -29.138op/s] or [-5.874%; -5.570%]

scenario:Benchmarks.Trace.Iast.StringAspectsBenchmark.StringConcatAspectBenchmark netcoreapp3.1

  • 🟥 throughput [-557.278op/s; -380.796op/s] or [-24.742%; -16.907%]

scenario:Benchmarks.Trace.Log4netBenchmark.EnrichedLog netcoreapp3.1

  • 🟩 execution_time [-34.529ms; -31.182ms] or [-17.182%; -15.517%]

scenario:Benchmarks.Trace.RedisBenchmark.SendReceive net6.0

  • 🟥 execution_time [+15.652ms; +20.300ms] or [+8.134%; +10.550%]

scenario:Benchmarks.Trace.SpanBenchmark.StartFinishSpan net6.0

  • 🟩 throughput [+86919.289op/s; +114757.619op/s] or [+7.093%; +9.365%]

scenario:Benchmarks.Trace.TraceAnnotationsBenchmark.RunOnMethodBegin net6.0

  • 🟥 execution_time [+19.302ms; +22.813ms] or [+9.858%; +11.652%]

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

area:builds project files, build scripts, pipelines, versioning, releases, packages

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants