-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add primitive arrays column wrappers #30
Comments
Our current idea is to use Arrow as a backend for primitive types. |
It is a great idea, but it will be worth it only in terms of interop with other platforms. For JVM-only, Arrow will give nothing new. |
Arrow should give significant performance increase for JVM due to nullable values types support. Current implementation generates quite a lot of boxing/unboxing. It can be solved without Arrow, but I expect Arrow implementation to be faster. We will do performance benchmarks before implementation. And we need to support Arrow I/O anyway. |
I was experimenting with |
I recommend considering Immutable Arrays as they have some surprising memory and performance benefits. Since they are immutable and their operations produce Immutable Arrays to maintain safety, this enables many optimizations that make them even faster and more efficient than regular primitive arrays. |
Primitive array columns are required for optimized big-data applications. It is also possible to add numerical DataFrame integration with MultiK or KMath.
The text was updated successfully, but these errors were encountered: