Feature request: Rotation based on maximum file size on hdfs. #365

Open
@TomLous

I'd like an option to specify the maximum file size the HDFS connector writes before rotating.
As I understand it, the only way to do this today is to approximate it by setting flushSize (a record count) or a time interval.
The reason is that it's very useful to keep files at approximately the HDFS block size, but no larger. A size limit would give us more fine-grained control and the assurance that we won't end up with files that are too large or, worse, with many small files.
I'd like to add this feature to the codebase myself if possible. Are there any restrictions or guidelines I should take into account if I want these changes to be merged back into the codebase?
Or was this feature previously explored and abandoned for some reason?
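
To make the request concrete, here is a rough sketch of the rotation rule I have in mind. It is written in Python only for brevity. The names `SizeRotationPolicy`, `max_file_bytes`, and `plan_files` are hypothetical and do not come from the connector's codebase. It also assumes the writer can cheaply track how many bytes it has appended to the open file, which may not hold for compressed or columnar formats where the on-disk size differs from the serialized record size.

```python
from dataclasses import dataclass


@dataclass
class SizeRotationPolicy:
    """Hypothetical policy: rotate before a write would push the file past the cap."""

    max_file_bytes: int      # assumed setting, e.g. the HDFS block size
    bytes_written: int = 0   # bytes appended to the currently open file

    def should_rotate_before(self, record_bytes: int) -> bool:
        # Never rotate an empty file: a single oversized record still has to
        # land somewhere, so it gets a file of its own.
        return (
            self.bytes_written > 0
            and self.bytes_written + record_bytes > self.max_file_bytes
        )

    def on_write(self, record_bytes: int) -> None:
        self.bytes_written += record_bytes

    def on_rotate(self) -> None:
        self.bytes_written = 0


def plan_files(record_sizes, max_file_bytes):
    """Group record sizes into files the way the policy would rotate them."""
    policy = SizeRotationPolicy(max_file_bytes)
    files = [[]]
    for size in record_sizes:
        if policy.should_rotate_before(size):
            files.append([])
            policy.on_rotate()
        files[-1].append(size)
        policy.on_write(size)
    return files


if __name__ == "__main__":
    # Five 40-byte records with a 100-byte cap.
    print(plan_files([40, 40, 40, 40, 40], 100))
```

Checking before the write rather than after keeps every file at or below the limit, except when a single record is itself larger than the limit. A real implementation would presumably sit next to the existing record-count and time-based rotation checks rather than replace them.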
