Skip to content

virtual reference error: "connection closed before message completed" #1431

@nbren12

Description

@nbren12

What happened?

I am using icechunk to read data from an s3 compatible object store in a distributed context with ~100 workers spread across a few compute nodes. I will intermittently get errors like this

[rank12]: icechunk.IcechunkError:   x error fetching virtual reference
[rank12]:   |
[rank12]:   | context:
[rank12]:   |    0: icechunk::store::get
[rank12]:   |            with key="arr/c/3/3476/58/0/0" byte_range=From(0)
[rank12]:   |              at icechunk/src/store.rs:198
[rank12]:   |
[rank12]:   |-> error fetching virtual reference
[rank12]:   |-> dispatch failure
[rank12]:   |-> other
[rank12]:   `-> connection closed before message completed

Is this case handled by icechunks retry mechanism?

I'm not sure what the default settings of icechunk retry are since the docs just list None.

What did you expect to happen?

Error message saying failed after "x number of retries".

Anything else we need to know?

No response

Environment

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions