Scan Delete Support Part 4: Delete File Loading; Skeleton for Processing #982

sdd · 2025-02-21T09:01:51Z

Extends the DeleteFileManager introduced in #950 To include loading of delete files, storage and retrieval of parsed delete files from shared state, and the outline for how parsing will connect up to this new work.

Issue: #630

jonathanc-n

This is nice, will look at the parsed records next.

crates/iceberg/src/arrow/delete_file_manager.rs

sdd · 2025-04-03T07:09:02Z

@liurenjie1024, @Xuanwo, @Fokko - this is ready for re-review, if you could take a look that would be great!

liurenjie1024

Thanks @sdd for this pr. There are some missing points in current design. Also I would suggest not putting too much in DeleteFilterManager. I suppose DeleterFilterManager acting more like a delete loader, which manages the io and caching of record batch. The actual filtering part, could delegate to DeleteFilter, WDYT? I think a good reference implementation is java's DeleteFilter, see https://github.com/apache/iceberg/blob/af8e3f5a40f4f36bbe1d868146749e2341471586/data/src/main/java/org/apache/iceberg/data/DeleteFilter.java#L50

crates/iceberg/src/delete_vector.rs

crates/iceberg/src/arrow/delete_file_manager.rs

sdd · 2025-04-14T06:38:39Z

Thanks for the review @liurenjie1024 - much appreciated. Will come back with a revised design.

liurenjie1024 · 2025-05-16T07:34:33Z

Let's wait for a moment to merge it after 0.5.0 release

crates/iceberg/src/delete_file_index.rs

sdd · 2025-05-23T06:18:14Z

Hi @liurenjie1024 / @Xuanwo / @xxchan.

This is now ready again for review after a refactor taking into account @xxchan's great feedback. I'll be on holiday for a week after today so it would be great if you guys could take a look. Thanks!

…DeleteVector::intersect_assign pub(crate)

…our of it being always on

liurenjie1024

Thanks @sdd for this pr, I think it's fine to move forward for now.

sdd force-pushed the feat/delete-fila-manager-loading branch 5 times, most recently from edb1d27 to 8e90bdd Compare February 23, 2025 14:55

This was referenced Feb 23, 2025

Delete Files in Table Scans #630

Open

Scan Delete Support Part 5: Positional Delete Parsing #1011

Merged

sdd marked this pull request as ready for review February 26, 2025 09:20

sdd mentioned this pull request Mar 1, 2025

Scan Delete Support Part 6: Equality Delete Parsing #1017

Open

sdd force-pushed the feat/delete-fila-manager-loading branch 4 times, most recently from ec8e7c1 to 06f0df5 Compare March 5, 2025 19:53

jonathanc-n reviewed Mar 14, 2025

View reviewed changes

crates/iceberg/src/arrow/delete_file_manager.rs Outdated Show resolved Hide resolved

crates/iceberg/src/arrow/delete_file_manager.rs Outdated Show resolved Hide resolved

crates/iceberg/src/arrow/delete_file_manager.rs Outdated Show resolved Hide resolved

sdd force-pushed the feat/delete-fila-manager-loading branch 6 times, most recently from 5530bc3 to e997fc6 Compare March 31, 2025 17:27

sdd force-pushed the feat/delete-fila-manager-loading branch from e997fc6 to 056e73f Compare April 3, 2025 07:28

liurenjie1024 reviewed Apr 9, 2025

View reviewed changes

sdd force-pushed the feat/delete-fila-manager-loading branch 2 times, most recently from bd33aa5 to 39a26ab Compare April 17, 2025 06:39

sdd mentioned this pull request Apr 23, 2025

Add equality_ids to FileScanTaskDeleteFile #1235

Merged

sdd force-pushed the feat/delete-fila-manager-loading branch 3 times, most recently from 5739a46 to 52cf8b9 Compare April 23, 2025 21:07

sdd dismissed liurenjie1024’s stale review via f9bff57 May 17, 2025 20:21

sdd force-pushed the feat/delete-fila-manager-loading branch 3 times, most recently from 078bba7 to fc696ef Compare May 17, 2025 20:31

xxchan mentioned this pull request May 19, 2025

fix: delete file lost wake #1323

Closed

xxchan reviewed May 19, 2025

View reviewed changes

crates/iceberg/src/delete_file_index.rs Outdated Show resolved Hide resolved

sdd force-pushed the feat/delete-fila-manager-loading branch 6 times, most recently from 20b44ab to acd7ab8 Compare May 22, 2025 05:39

sdd added 12 commits May 23, 2025 07:18

feat: delete file manager loading

3776aeb

feat: changes suggested in review

0078be5

feat: return Err for unimplemented delete vec parse methods and make …

e2903e1

…DeleteVector::intersect_assign pub(crate)

feat: schema evolution of equality delete file record batches

65dd638

refactor: split DeleteFileManager into DeleteFileLoader and DeleteFilter

79415ac

fix: add waker for DeleteFileIndex

2207257

refactor: remove DeleteFileFilter from CachingDeleteFileLoader

2cf7692

refactor: extract BasicDeleteFileLoader from CachingDeleteFileLoader

839232f

feat: remove flag to selectively enable delete file processing in vav…

d6a4a4d

…our of it being always on

changes required after rebase on main

462b2f7

fix: handle WouldBlock correctly in DeleteFileIndex

ef28809

refactor: use Notify and oneshot channel rather than custom Future

b147098

sdd force-pushed the feat/delete-fila-manager-loading branch from acd7ab8 to b147098 Compare May 23, 2025 06:18

liurenjie1024 approved these changes Jun 9, 2025

View reviewed changes

liurenjie1024 merged commit 7d794fa into apache:main Jun 9, 2025
18 checks passed

xxchan mentioned this pull request Jun 25, 2025

Tracking: sync changes from upstream and drop commits in our fork risingwavelabs/iceberg-rust#31

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Scan Delete Support Part 4: Delete File Loading; Skeleton for Processing #982

Scan Delete Support Part 4: Delete File Loading; Skeleton for Processing #982

Uh oh!

sdd commented Feb 21, 2025 •

edited

Loading

Uh oh!

jonathanc-n left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sdd commented Apr 3, 2025

Uh oh!

liurenjie1024 left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sdd commented Apr 14, 2025

Uh oh!

liurenjie1024 commented May 16, 2025

Uh oh!

Uh oh!

sdd commented May 23, 2025

Uh oh!

liurenjie1024 left a comment

Uh oh!

Uh oh!

Uh oh!

Scan Delete Support Part 4: Delete File Loading; Skeleton for Processing #982

Scan Delete Support Part 4: Delete File Loading; Skeleton for Processing #982

Uh oh!

Conversation

sdd commented Feb 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jonathanc-n left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sdd commented Apr 3, 2025

Uh oh!

liurenjie1024 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sdd commented Apr 14, 2025

Uh oh!

liurenjie1024 commented May 16, 2025

Uh oh!

Uh oh!

sdd commented May 23, 2025

Uh oh!

liurenjie1024 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

sdd commented Feb 21, 2025 •

edited

Loading