Render ephemeral type DBT model & no freshness DBT source with test as EmptyOperator by okayhooni · Pull Request #1625 · astronomer/astronomer-cosmos

okayhooni · 2025-03-18T16:52:47Z

Description

dbt models with ephemeral materialization type that serve only as CTEs should not be rendered as DbtRunOperator tasks, as they unnecessarily occupy Airflow worker slots, even for a short period.
Updated the logic to render these ephemeral models as EmptyOperator tasks instead, ensuring they are processed quickly by the Airflow scheduler without being assigned to an Airflow worker.
Similarly, DBT source nodes without a freshness check but with downstream tests are rendered as EmptyOperator to avoid unnecessary worker slot resource consumption.

Breaking Change?

No

Checklist

I have made corresponding changes to the documentation (if required)
I have added tests that prove my fix is effective or that my feature works

netlify · 2025-03-18T16:53:34Z

✅ Deploy Preview for sunny-pastelito-5ecb04 canceled.

Name	Link
🔨 Latest commit	`32b15b9`
🔍 Latest deploy log	https://app.netlify.com/sites/sunny-pastelito-5ecb04/deploys/67dab105e5ffc20008e00547

codecov · 2025-03-18T19:00:09Z

Codecov Report

❌ Patch coverage is 92.30769% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 97.41%. Comparing base (43b5400) to head (32b15b9).
⚠️ Report is 500 commits behind head on main.

Files with missing lines	Patch %	Lines
cosmos/airflow/graph.py	88.88%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1625      +/-   ##
==========================================
- Coverage   97.43%   97.41%   -0.02%     
==========================================
  Files          80       80              
  Lines        4950     4957       +7     
==========================================
+ Hits         4823     4829       +6     
- Misses        127      128       +1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

github-actions · 2025-04-19T11:03:49Z

This PR is stale because it has been open for 30 days with no activity.

github-actions · 2025-05-30T11:04:28Z

This PR is stale because it has been open for 30 days with no activity.

pankajkoti · 2025-07-11T10:02:15Z

hi @okayhooni, thanks a lot for this PR. Really sorry, we missed reviewing this earlier. Planning to review it now, would it be possible to resolve the conflicts on this PR, please?

jroachgolf84 · 2025-07-14T20:00:08Z

@okayhooni, were you able to add unit/integration tests for these changes?

#2279) Avoid consumer tasks that hang indefinitely when using `ExecutionMode.WATCHER` when the associated dbt models are either ephemeral or consist of empty SQL models that are not run by dbt. ## Context There are circumstances when there is a discrepant number of nodes in the output when we run `dbt ls` and `dbt build`, using the same selectors. In the following example (`tests/sample/dbt_project_with_empty_model`), we can observe that `dbt ls` returned two models, while the `dbt build` returned a single one: ``` $ dbt ls 10:48:32 Running with dbt=1.11.2 10:48:32 Registered adapter: postgres=1.10.0 10:48:32 Unable to do partial parsing because saved manifest not found. Starting full parse. 10:48:32 Found 2 models, 464 macros micro_dbt_project.add_row micro_dbt_project.empty_model $ dbt build 10:50:21 Running with dbt=1.11.2 10:50:21 Registered adapter: postgres=1.10.0 10:50:21 Found 2 models, 464 macros 10:50:21 10:50:21 Concurrency: 4 threads (target='dev') 10:50:21 10:50:21 1 of 1 START sql view model public.add_row ..................................... [RUN] 10:50:21 1 of 1 OK created sql view model public.add_row ................................ [CREATE VIEW in 0.06s] 10:50:21 10:50:21 Finished running 1 view model in 0 hours 0 minutes and 0.20 seconds (0.20s). 10:50:21 10:50:21 Completed successfully 10:50:21 10:50:21 Done. PASS=1 WARN=0 ERROR=0 SKIP=0 NO-OP=0 TOTAL=1 ``` So far, we observed this happening in two scenarios: 1. Ephemeral nodes (#2266) 2. If the dbt model is not executable (e.g. it is an empty SQL file), both the `dbt build` and the `dbt run` will not display it in their info logs. Until Cosmos 1.12.1, Cosmos assumed these two commands would return the same number of nodes, and we implemented the `LoadMode.MANIFEST` assuming the same. In the case of `ExecutionMode.LOCAL`, this was not a big issue, because dbt does not run when we select the particular model it's excluding: ``` $ dbt build --select empty_model 10:53:03 Running with dbt=1.11.2 10:53:03 Registered adapter: postgres=1.10.0 10:53:03 Found 2 models, 464 macros 10:53:03 Nothing to do. Try checking your model configs and model specification args ``` The downside in the case of `ExecutionMode.LOCAL` is that we waste Airflow resources by potentially parsing a dbt project that wouldn't need to be parsed in those particular tasks. The PR #1625 aims to address this. However, in the case of `ExecutionMode.WATCHER`, this became a big problem, as the behaviour caused consumer nodes representing ephemeral nodes or empty models to hang indefinitely after the producer task completed successfully. The producer task was not aware of them and would not populate XCom, whereas the consumer tasks would keep checking for updates. Closes: #2266 Closes: astronomer/oss-integrations-private#315 Closes: https://astronomer.zendesk.com/agent/tickets/87180 ## About the solution Ideally, probably, we would know upfront which nodes `dbt build` decides to execute, and we would not render them as Airflow tasks. However, I do not believe this is a simple problem, since there may be other circumstances when `dbt build` skips nodes from being executed - and any custom logic we implement in Cosmos will be affected by changes dbt Core/Fusion implements upstream. Therefore, it feels - for now - the safest solution is: - Continue adding those nodes to Cosmos - Mark them as successful, logging a specific message, if they were not actually run by the dbt command. This is identified by checking the `run_results.json` file.

make ephemeral type node as empty operator

1b6c828

dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Mar 18, 2025

okayhooni temporarily deployed to external March 18, 2025 16:52 — with GitHub Actions Inactive

dosubot bot added the area:rendering Related to rendering, like Jinja, Airflow tasks, etc label Mar 18, 2025

make no freshness source with test be EmptyOperator task

32b15b9

okayhooni temporarily deployed to external March 19, 2025 11:56 — with GitHub Actions Inactive

okayhooni changed the title ~~Render ephemeral type DBT model as EmptyOperator~~ Render ephemeral type DBT model & no freshness DBT source with test as EmptyOperator Mar 19, 2025

github-actions bot added the stale Issue has not had recent activity or appears to be solved. Stale issues will be automatically closed label Apr 19, 2025

tatiana assigned pankajkoti Apr 22, 2025

github-actions bot removed the stale Issue has not had recent activity or appears to be solved. Stale issues will be automatically closed label Apr 23, 2025

tatiana added this to the Cosmos 1.11.0 milestone Apr 30, 2025

github-actions bot added stale Issue has not had recent activity or appears to be solved. Stale issues will be automatically closed and removed stale Issue has not had recent activity or appears to be solved. Stale issues will be automatically closed labels May 30, 2025

tatiana modified the milestones: Cosmos 1.11.0, Cosmos 1.12.0 Oct 21, 2025

tatiana modified the milestones: Cosmos 1.12.0, Cosmos 1.13.0 Dec 18, 2025

faridnsh mentioned this pull request Jan 14, 2026

[Bug] Watcher mode sensor tasks timeout for ephemeral models without --debug #2266

Closed

1 task

tatiana mentioned this pull request Jan 22, 2026

Fix running empty models or ephemeral nodes in ExecutionMode.WATCHER #2279

Merged

pankajkoti modified the milestones: Cosmos 1.13.0, Cosmos 1.14.0 Jan 28, 2026

pankajkoti modified the milestones: Cosmos 1.14.0, Cosmos 1.15.0 Mar 19, 2026

pankajkoti removed their assignment Apr 6, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Render ephemeral type DBT model & no freshness DBT source with test as EmptyOperator#1625

Render ephemeral type DBT model & no freshness DBT source with test as EmptyOperator#1625
okayhooni wants to merge 2 commits intoastronomer:mainfrom
okayhooni:feat/render_ephemeral_as_empty_operator

okayhooni commented Mar 18, 2025 •

edited

Loading

Uh oh!

netlify bot commented Mar 18, 2025 •

edited

Loading

Uh oh!

codecov bot commented Mar 18, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Apr 19, 2025

Uh oh!

github-actions bot commented May 30, 2025

Uh oh!

pankajkoti commented Jul 11, 2025

Uh oh!

jroachgolf84 commented Jul 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

okayhooni commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Breaking Change?

Checklist

Uh oh!

netlify bot commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

✅ Deploy Preview for sunny-pastelito-5ecb04 canceled.

Uh oh!

codecov bot commented Mar 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

github-actions bot commented Apr 19, 2025

Uh oh!

github-actions bot commented May 30, 2025

Uh oh!

pankajkoti commented Jul 11, 2025

Uh oh!

jroachgolf84 commented Jul 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

okayhooni commented Mar 18, 2025 •

edited

Loading

netlify bot commented Mar 18, 2025 •

edited

Loading

codecov bot commented Mar 18, 2025 •

edited

Loading