Skip to content

Conversation

@tarun-google
Copy link
Contributor

@tarun-google tarun-google commented Sep 12, 2025

Migrating Bigquery java transforms to managedIO. There should not be any change for Read and Streaming write. But for Batch write we are migrating to FILE_LOADS method, which stages files in temp path before writing. This is not a breaking change for the existing users.

Batch write Test:
Tested with Fix #36209 on sandbox

http://go/pantheon-hourly/dataflow/jobs/us-central1/2025-09-23_13_57_20-6164423741318359762?project=apache-beam-testing&mods=dataflow_dev&dfSandbox=tannapareddy

Read Test:

https://dataflow-console.corp.google.com/?justification=#/jobmanagement/2025-09-23_13_30_58-6526397567899460230

Streaming write Test:
https://dataflow-console.corp.google.com/?justification=#/jobmanagement/2025-09-23_10_46_32-18235306192448492682

GitHub Actions Tests Status (on master branch)

Build python source distribution and wheels
Python tests
Java tests
Go tests

See CI.md for more information about GitHub Actions CI or the workflows README to see a list of phrases to trigger workflows.

@tarun-google
Copy link
Contributor Author

Run Python_Examples PreCommit 3.11

@tarun-google tarun-google marked this pull request as ready for review September 12, 2025 20:32
@github-actions
Copy link
Contributor

Assigning reviewers:

R: @liferoad for label python.

Note: If you would like to opt out of this review, comment assign to next reviewer.

Available commands:

  • stop reviewer notifications - opt out of the automated review tooling
  • remind me after tests pass - tag the comment author after tests pass
  • waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

The PR bot will only process comments in the main thread (not review comments).

@chamikaramj
Copy link
Contributor

Did your end-to-end test runs confirm that the managed transforms upgrade is occurring ?

You should notice "Managed Transform(s)" logs mentioned here: https://cloud.google.com/dataflow/docs/guides/managed-io

@github-actions
Copy link
Contributor

Reminder, please take a look at this pr: @liferoad

@tarun-google
Copy link
Contributor Author

Blocking this on #33074 issue and fix #36209

@kennknowles
Copy link
Member

waiting on author

@kennknowles
Copy link
Member

noting that the two blockers are closed

@tarun-google
Copy link
Contributor Author

Re-tested with fix and updated main description with results

@chamikaramj
Copy link
Contributor

LGTM. Thanks.

@tarun-google
Copy link
Contributor Author

Run Python_Examples PreCommit 3.9

@chamikaramj chamikaramj merged commit 3a419b6 into apache:master Sep 24, 2025
102 of 103 checks passed
@chamikaramj chamikaramj changed the title Managed Bigquery Java Read&Write Transforms Use Managed BigQuery Java Read&Write Transforms for Beam YAML Oct 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants