ndarray development discussion and dashboard #293

bluss · 2017-03-31T23:16:52Z

Issues with label: breaking-change
- These are anticipated breaking changes in soon to come versions
Meta Issues:
- Meta Issue: Support for parallelized/blocked algorithms #89 Support for parallelized/blocked algorithms
- Create an Array interop crate anowell/are-we-learning-yet#14 Array interop crate

Luthaf · 2017-04-01T13:08:28Z

Is it possible to reconsider #152 (range dimension and negative indexing)?

As indexing and slicing uses different syntax, this should not be a breaking change, and it would really be useful for some kind of code. I think of FFT-like algorithms, where the problem space is naturally slitted over a symmetric -k .. k range.

If this is not acceptable to be in ndarray I think I'll try to implement it on top of standard arrays, but this means that we will be paying an additional overhead. I started a PoC for range-based dimensions in mudi, using a Vec<T> as storage.

bluss · 2017-04-01T13:40:04Z

We can consider the change when we know how to implement it. Do you have any links that detail how such a thing is implemented efficiently?

I don't understand why your implementation would have different performance implications than ndarray's?

Luthaf · 2017-04-01T14:42:26Z

I know two implementation of this: the Fortran standard and Boost.MultiArray.

gfortran is implementing this directly in the compiler front-end, but I could not find high level documentation about this. I know a gfortran developer, I can ask him about this.
Boost allow this by using extend_range type, which allow to use generic ranges as index. I did not find implementation description, but the initial boost review is here. Other than that, the boost implementation can be nice to read, but looks very convoluted.

The performances implication might come from the needed translation from (n..m) to (0..n+m) for all the indexes before passing them to ndarray.

Another solution would be to do the index linearisation separately and call array::as_slice to access the underlying data, but then there would be call to is_standard_layout every time. Which also means that this could not be used for non-standard layout array (which I don't really understand).

Another solution again would be to store a pointer + dimensions and build a ndarray from this when needed to access the functionalities with a as_ndarray function. Which would correspond to re-implementing a lot of code for difference storage types.

bluss · 2017-04-01T15:13:00Z

It's intriguing, thanks for the links. Need to investigate how compatible it is with custom stride arrays. (To give a simple motivation for strided arrays: We want to be able to cut an array into array views that are rectangular pieces.)

(The overhead of is_standard_layout should go away at some point, arrays should carry their layout information as an additional field. For a low-dimensional array, it's not much of an overhead, it's just comparing a pair.)

bluss · 2017-04-07T11:09:09Z

Ok, there will be a ndarray 0.9 shortly after 0.8. I don't expect any actual breakage or difficulty with the upgrades, and the library is not really used for “interchange” much, so I think it's unproblematic. We're evidently still in exploration and development.

jonathanstrong · 2017-05-20T03:40:12Z

I'm using ndarray heavily for a project right now for the first time and the biggest difficulty I'm encountering is how to write functions that can handle arrays of different shapes/forms. For instance, is there a way to take an Array or an ArrayView? I'm assuming there is - but haven't been able to figure it out as of yet. Perhaps you could provide additional guidance on how to use arrays in code outside the library. Thanks!

SuperFluffy · 2017-05-20T06:24:34Z

@jonathanstrong If you want to have a function that takes any sort of Array, you have to work with ArrayBase. To know what that looks like, have a look at the definition of zip_mut_with. If it was a free-standing function outside of an impl block, it would look like this:

fn zip_mut_with<A, B, S1, S2, E, F>(&mut ArrayBase<S1, E>, rhs: &ArrayBase<S2, E>, f: F) where
    S1: DataMut<Elem = A>,
    S2: Data<Elem = B>,
    E: Dimension,
    F: FnMut(&mut A, &B),

Let's say you wanted to have a function that took a matrix and a vector, then you would use Ix1 and Ix2 instead of the two E above.

As you mentioned arrays of different shapes/forms: unfortunately, we don't have compile-time integer generics yet. So if you want to ensure that, say, a function takes two arrays of exactly the same number of elements/shape, you need to assert that at runtime.

bluss · 2017-05-20T13:31:13Z

hi @jonathanstrong, I think accepting arrays of varying ndim is the least nice part of the library right now. Is that part of what you are doing?

jonathanstrong · 2017-05-21T00:11:16Z

thanks for the example @SuperFluffy.

@bluss that's one pain point. The one that prompted me to write was when I looped through the rows of a matrix with iter_axis and couldn't pass the "vector" view to a function that accepted an Array1. Another was an Ix1 can dot an Ix2 but not vice versa. I also had a hard time figuring out Zip::fold_while without any examples in the docs (did pay attention to the FoldWhile enum initially). These aren't criticisms - just sharing as I know it's helpful to know how someone who isn't familiar with it experiences using it.

However, coming from a numpy/theano background this is definitely the rust math lib I feel most comfortable/productive working in. Thanks for all your hard work on it.

SuperFluffy · 2017-05-21T08:57:03Z

@jonathanstrong

The one that prompted me to write was when I looped through the rows of a matrix with iter_axis and couldn't pass the "vector" view to a function that accepted an Array1.

Did my example with the ArrayBase to have a function that can take ArrayViews help?

I recently realized why @bluss chose to introduce ArrayViews rather than working with shared Arrays: it enforces invariance of the data structures! Let's say you pass a mut_a: &mut Array to some function, then you could simply replace the mut_a by a new Array with, e.g., a different shape, causing an error somewhere down the line. However, if you pass an ArrayViewMut, then all you can do is manipulate the underlying &mut [T], keeping the original shape intact.

I suppose once compile-time integer generics land, you might be able to pass around &mut Arrays, with the type system ensuring immutability of the shape.

@bluss

Another was an Ix1 can dot an Ix2 but not vice versa.

Maybe it makes sense to do it like numpy, and implicitly assume that a Ix1-array is a row (column) vector depending on whether the M is dotted on the left (right) and implement a fn dot(ArrayBase<S, Ix1>, ArrayBase<S, Ix2>) -> Array<S, Ix1>) (dotted on the left, which we don't have) along with the fn dot(ArrayBase<S, Ix2>, ArrayBase<S, Ix1>) -> Array<S, Ix1> (dotted on the right, which we have)?

bluss · 2017-05-21T20:04:51Z

Right, it's convenient to write functions in terms of concrete types (like Array1<T>) and I do that too, and a problem that one can't then pass just a view. The function could then be written in terms of ArrayView1 instead then.

@SuperFluffy

Array views have their own shape, which is mutable, which I think is cool. And that one can have an array view of a different shape or dimension than the data it is a view of.
Yes that makes sense

bluss · 2017-05-21T20:31:47Z

I guess it's important for ndarray to explain that Array, ArrayView, ArrayViewMut are meant to mirror the ownership and borrowing semantics of Vec<T>, &[T], &mut [T] with only minor differences that come from the fact that [T] is a dynamically sized type.

bluss · 2021-03-29T21:17:24Z

This issue is superseded by discussion board (github) and matrix.

bluss mentioned this issue Apr 1, 2017

Update dependencies lumol-org/lumol#130

Merged

bluss closed this as completed Mar 29, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ndarray development discussion and dashboard #293

ndarray development discussion and dashboard #293

bluss commented Mar 31, 2017 •

edited

Loading

Luthaf commented Apr 1, 2017

bluss commented Apr 1, 2017

Luthaf commented Apr 1, 2017

bluss commented Apr 1, 2017 •

edited

Loading

bluss commented Apr 7, 2017

jonathanstrong commented May 20, 2017

SuperFluffy commented May 20, 2017

bluss commented May 20, 2017

jonathanstrong commented May 21, 2017

SuperFluffy commented May 21, 2017 •

edited

Loading

bluss commented May 21, 2017

bluss commented May 21, 2017 •

edited

Loading

bluss commented Mar 29, 2021

ndarray development discussion and dashboard #293

ndarray development discussion and dashboard #293

Comments

bluss commented Mar 31, 2017 • edited Loading

Luthaf commented Apr 1, 2017

bluss commented Apr 1, 2017

Luthaf commented Apr 1, 2017

bluss commented Apr 1, 2017 • edited Loading

bluss commented Apr 7, 2017

jonathanstrong commented May 20, 2017

SuperFluffy commented May 20, 2017

bluss commented May 20, 2017

jonathanstrong commented May 21, 2017

SuperFluffy commented May 21, 2017 • edited Loading

bluss commented May 21, 2017

bluss commented May 21, 2017 • edited Loading

bluss commented Mar 29, 2021

bluss commented Mar 31, 2017 •

edited

Loading

bluss commented Apr 1, 2017 •

edited

Loading

SuperFluffy commented May 21, 2017 •

edited

Loading

bluss commented May 21, 2017 •

edited

Loading