Add the ability to backfill a DAG based on past Asset Events #59886
Unanswered
hnomichith asked this question in Ideas
Replies: 1 comment
This seems like a very specific need, and I think you should just use a Python script with the Python client to trigger such assets the way you want via the API. The thing about backfills is that their whole "management" is based on data intervals, so it's easy to specify "between this and that interval", for example. I do not see an easy UX where you could "declaratively" define which assets to regenerate. It's way easier and way more flexible to write a simple Python script that generates the set of API calls you want to "backfill".
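A minimal sketch of such a script, under stated assumptions: it only builds the list of API calls and does not send them. `build_backfill_requests` is a hypothetical helper, `BASE_URL` is a placeholder, and the `id` and `timestamp` keys on each event, as well as the `logical_date` and `conf` body fields of the DAG run trigger call, are assumptions about the Airflow 3 REST API that should be checked against your deployment.

```python
from typing import Any

# Assumption: address of the Airflow API server; adjust for your deployment.
BASE_URL = "http://localhost:8080"


def build_backfill_requests(
    dag_id: str, asset_events: list[dict[str, Any]]
) -> list[dict[str, Any]]:
    """Build one DAG-run trigger request per past asset event (hypothetical helper).

    Each event is assumed to be a dict with "id" and "timestamp" keys, where
    timestamps are ISO 8601 strings in a uniform format so they sort as text.
    """
    requests_to_send = []
    # Replay events in chronological order.
    for event in sorted(asset_events, key=lambda e: e["timestamp"]):
        requests_to_send.append(
            {
                "method": "POST",
                "url": f"{BASE_URL}/api/v2/dags/{dag_id}/dagRuns",
                "json": {
                    # Assumption: a null logical date is accepted for manual runs.
                    "logical_date": None,
                    # Pass the originating event to the DAG through conf.
                    "conf": {
                        "asset_event_id": event["id"],
                        "asset_event_timestamp": event["timestamp"],
                    },
                },
            }
        )
    return requests_to_send
```

Each returned dict can then be sent with whatever HTTP client and authentication your setup uses. Note that runs created this way would presumably be ordinary manual runs rather than asset-triggered ones, so the DAG would have to read the event details from `conf`.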
Hello!
Airflow version
Airflow 3.1.4
Context
I'm currently working with DAG scheduling on Asset Events. I saw that if you create a DAG which triggers on an Asset Event, there's no usable way to backfill it based on past Asset Events, for example when those Asset Events were created before the DAG existed or while the DAG was paused.
I saw two workarounds, both with their caveats:
I identified a more usable solution: a script that automates the second workaround. This makes it scalable, but it still has the caveat of running all downstream DAGs.
Questions
I'm opening the discussion to ask some questions:
Potential solution
Instinctively, the solution I see to address my need is:
/api/v2/dags/:dag_id/assets/queuedEvents endpoint
/api/v2/assets/events to add a name filter (and everything that could be used to schedule a DAG based on an Asset)
What do you think?
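To illustrate what the name filter would select, here is a client-side approximation. It is only a sketch: `filter_events_by_name` is a hypothetical helper, and it assumes each event returned by /api/v2/assets/events is a dict carrying a `name` key, which is an assumption about the response shape rather than something confirmed in this thread.

```python
from typing import Any


def filter_events_by_name(
    asset_events: list[dict[str, Any]], name: str
) -> list[dict[str, Any]]:
    """Keep only the events whose asset name matches exactly (hypothetical helper)."""
    # Events without a "name" key are skipped rather than raising.
    return [event for event in asset_events if event.get("name") == name]
```

A server-side filter would avoid fetching every event first, which is the point of the proposal; this version only shows the intended selection.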