[general] including targets as dependencies without loading them into memory (for use with DuckDB/parquet) #1505
Closed
adamaltmejd
started this conversation in
General
Replies: 2 comments
-
You could try
-
Jesus, was it that easy? Here I was trying to write my own custom targets and stuff, and all I needed was to read the docs more carefully. Thanks!
-
Help
Description
What is the best way to include targets as dependencies without loading them into memory?
More specifically, I have a pipeline that deals with some rather large data sets, and I'm running out of memory. I was thinking of using format="parquet" and using DuckDB to process some parts of the pipeline. I would then include a target with a dbExecute() call that reads the parquet files with read_parquet, writes a parquet file, and returns its path (to be stored in targets with format="file"). But how do I ensure that the code runs when the upstream dependencies change without actually loading the targets?
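To make the setup described above concrete, here is a minimal sketch of one possible arrangement, in which the upstream target is itself a format="file" target, so that downstream targets receive only its path. This is an illustration under assumptions and not necessarily the approach suggested in the replies: the helper `summarise_parquet()`, the target names, and the `data/` paths are all hypothetical, and the `data/` directory with `raw.parquet` is assumed to exist already.

```r
# _targets.R: an illustrative sketch, not a tested pipeline.
library(targets)

# Hypothetical helper: runs a DuckDB query over an input parquet file,
# writes the result to a new parquet file, and returns only the output path.
summarise_parquet <- function(in_path, out_path) {
  con <- DBI::dbConnect(duckdb::duckdb())
  on.exit(DBI::dbDisconnect(con, shutdown = TRUE), add = TRUE)
  sql <- sprintf(
    "COPY (SELECT * FROM read_parquet('%s')) TO '%s' (FORMAT PARQUET)",
    in_path, out_path
  )
  DBI::dbExecute(con, sql)
  out_path
}

list(
  # The value of a format = "file" target is the path itself (assumed path).
  tar_target(raw_path, "data/raw.parquet", format = "file"),
  # Depends on raw_path, so it reruns when that file changes, but only the
  # path string is loaded into R; DuckDB reads the data from disk.
  tar_target(
    summary_path,
    summarise_parquet(raw_path, "data/summary.parquet"),
    format = "file"
  )
)
```

With this layout, the data never passes through R's memory: each target's stored value is a short character vector, and the file contents are what get hashed to detect changes.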