RFC: Comprehensive Plan for Restructuring Dimension Types

# Summary

This RFC proposes a long-term plan for restructuring the library's core types and approach for handling shapes, strides, and dimensionalities. A successful implementation of new dimension types should accomplish the following goals:

1. Make the library friendlier to newcomers and more ergonomic for power users by reducing confusion around dimension types
2. Store strides as `isize`, not `usize`
3. Maintain the ability to capture the dimensionality of an array at the type level
4. Allow for capturing axis lengths at the type level, and therefore allow compile-time constant optimizations
5. Support the move towards a "trait-based" library
6. Reduce the number of "sealed" traits, thereby making the library more extensible
7. Incorporate, as best as possible, const generics for dimensionality and fixed-length axes
8. Provide a strong basis for other feature enhancements, such as optimized iteration orders and named axes

This RFC is _not_ meant to propose all the changes at once. Rather, this RFC lays out the overall vision and how we will build a smooth transition path. Each step in that path will require its own set of issues and pull requests.

An experimental implementation of this RFC (but not its integration into `ndarray`) can be found [here](https://github.com/akern40/ndarray-design/blob/dimensions/src/core/layout.rs) (for the `Layout` trait and module definition) and [here](https://github.com/akern40/ndarray-design/tree/dimensions/src/core/layout) for the rest of the traits and types. It includes types that implement the traits, additional design notes and rationale, and examples of advanced usage (such as a completely-constant two-by-two matrix shape).

cc @bluss @jturner314 @NewBornRustacean @lucascolley 

# Motivation

## Clarity

The current handling of shape and strides in the library is a point of confusion for users (#489, #367, [this comment](https://github.com/rust-ndarray/ndarray/issues/1272#issuecomment-2240875720)) and a point of complexity for maintainers (#283) since at least 2017. The dimension traits and types in `ndarray` are, at least for me, the most difficult part of the library to understand. I managed to write an entire RFC to revamp the array types themselves without truly understanding the current setup for dimension types. This is a testament to both the complexity of the implementation details and to the power of the current solution.

## Features

In addition to complexity, the current approach limits some very promising and important optimizations and extensions.
As @grothesque points out in [this comment](https://github.com/rust-ndarray/ndarray/issues/1272#issuecomment-2241818760) on #1272, fixed-length axes are a critical step for many powerful optimizations. This is an issue that actually led me working on `ndarray` in the first place: I wanted to write code that dealt with `Nx3` and `Nx3x3` state vectors and covariance matrices, I wanted that code to be as fast as possible, and I wanted it to be ergonomic to write.

Non-strided layouts (blocked, sparse, diagonal, Morton, etc) are also of limited interest; see #321, #1500, and [this comment](https://github.com/rust-ndarray/ndarray/issues/571#issuecomment-513527962).

# Conceptual Explanation

I believe that the core driver of confusion around `Dimension` and its related types and traits is not complexity, but vocabulary. The word "dimension" describes a single size, extent, or measurement. The type `Dimension` describes the number of axes an array has, the "dimension" / length of each axis, the stride of each axis, and (sometimes) the index used to access the array; it's even used for helping with iteration. I believe this trait is simply overloaded, both conceptually and practically.

Users of the library also often deal with `raw_dim()`, a method which returns the shape as stored within the array. I believe this is emblematic of the vocabulary problem: "dim" here is ambiguous, and not descriptive of the methods actual functionality.

In #519, @jturner314 lays out a plan for altering the `Dimension` trait to resolve the shape/strides usize/isize problem. However, discussion at the time suggested that the solution did not go far enough, and that const generics (which have since landed in stable Rust, albeit in a limited form) would likely change the design.

So this RFC takes a clean slate approach, inspired partially by the work done for C++ `mdspan` and by the excellent work by @fre-hu in [`mdarray`](https://github.com/fre-hu/mdarray). However, this proposal is not a direct copy of either approach and was developed independently to support `ndarray` specifically.

## New Vocabulary

The basic conceit of this design is breaking up `Dimension` to give each concept its own trait:

-   `Dimensionality`, for the number of axes an array has
-   `Shape`, for the lengths of each axis
-   `Strides`, for the strides of each axis
-   `Layout`, for abstracting the memory order of an array
-   `Strided: Layout`, a layout specifically for strided arrays

The idea of an "index" - or multiple kinds of indices - likely deserves its own traits or types, but that is a question that can be resolved as the design matures, step-by-step.

# Technical Details

## `Dimensionality`

We start by breaking out the concept for "number of axes". I propose a trait, `Dimensionality`, that serves to unify both compile-time numbers of dimensions and runtime numbers of dimensions. Two alternatives deserve mentioning: const generics and [typenum](https://docs.rs/typenum/latest/typenum/index.html). We cannot use const generics universally because they are not yet powerful enough to express the dimensionality for operations like inserting an axis (as that would require `N + 1` to be evaluated in a const context). We cannot use typenum universally because it does not support the dynamic-dimensional "escape hatch" that we need for runtime-only array dimensionalities; plus, its errors are very verbose and may impose extra difficulty in the use of `ndarray`. This leaves us with a third option: implement our own type-level numeric system, very akin to typenum, with good const generic support and a dynamic-dimensional escape hatch after a fixed number (e.g., 7 as it is right now or 12, to match the maximum tuple size).

```rust
pub trait Dimensionality:
    Copy
    + Eq
    + Debug
    + Send
    + Sync
    + DMax<D0, Output = Self>
    // ... other max relationship
    + DAdd<Self>
    // ... other add relationships
{
    /// The dimensionality as a constant usize, if it's not dynamic.
    const N: Option<usize>;

    type Smaller: Dimensionality;

    type Larger: Dimensionality;

    // And more associated types and methods, as needed
}
```

A nice helper trait is `Dimensioned`, implemented on types that have a dimensionality (e.g., arrays, shapes, strides, vecs, etc). It provides an associated type, `Dimality` (or maybe just `NDim`?), that describes how many dimensions it has.
This is mostly useful for not always writing `type Dimality: Dimensionality` for all of the following traits.

## Shape

The next step is to break out the concept of "length of each axis". Here is our first opportunity for an exciting feature enhancement: fixed-length axes, captured at compile time. A basic `Shape` trait can lay the foundation; for example

```rust
pub trait Shape:
    Dimensioned + Index<usize, Output = usize> + Eq + Clone + Send + Sync + Debug
{
    /// Get the shape as a (possibly-borrowed) slice of `usize`.
    fn as_slice(&self) -> Cow<'_, [usize]>;

    /// Try to index the shape mutably, if `index` is a non-`const` axis.
    ///
    /// Types that implement `Shape` do not guarantee any sort of mutability; this method
    /// allows a non-panicking way to discover whether a given axis of a shape is mutable.
    fn try_index_mut(&mut self, index: usize) -> Result<&mut usize, ShapeStrideError<Self>>;

    /// Try to create an `ndim`-dimensional shape filled with `value`.
    ///
    /// This may fail if either the number of dimensions does not match the dimensionality
    /// of the shape, or if the shape has any `const` axis lengths.
    fn try_full(ndim: usize, value: usize) -> Result<Self, ShapeStrideError<Self>>;

    /// Convert this shape into a dynamic-dimensional shape.
    fn to_dyn(&self) -> DShape {
        self.as_slice().into()
    }

    /// Get the runtime dimensionality of the shape.
    ///
    /// Implementors of `Shape` must guarantee that this value will match [`Shape::Dimality`].
    /// Does that mean that this should be an unsafe trait?
    fn ndim(&self) -> usize {
        self.as_slice().len()
    }

    /// Get the number of elements that the array contains.
    fn size(&self) -> usize {
        self.as_slice().iter().product()
    }

    /// Get the number of elements that the array contains, checking for overflow.
    fn size_checked(&self) -> Option<usize> {
        self.as_slice()
            .iter()
            .try_fold(1_usize, |acc, &i| acc.checked_mul(i))
    }

    /// Try to turn this shape into a constant `N`-dimensional shape.
    fn try_to_nshape<const N: usize>(&self) -> Result<NShape<N>, ShapeStrideError<NShape<N>>> {
        if self.ndim() == N {
            let mut values = [0; N];
            values.split_at_mut(N).0.copy_from_slice(&self.as_slice());
            Ok(values.into())
        } else {
            Err(ShapeStrideError::BadDimality(PhantomData, self.ndim()))
        }
    }

    // Additional methods...
}
```

I've included some familiar methods (`size`, `ndim`, `size_checked`) but also some that require explaining:

-   `as_slice` returns `Cow` because, critically, `Shape` does not guarantee that there is an underlying slice to be borrowed, since some axes may have fixed lengths and not be stored at runtime
-   `try_index_mut`, rather than just `: IndexMut<...>`, because `Shape` does not guarantee the mutability of any of its axis lengths
-   `try_full`, rather than just `full`, because some `Shape`s may not allow creation with arbitrary axis lengths (because their sizes are fixed)

The natural extension is a `ShapeMut` trait that brings us back to our familiar world of completely mutable shapes.

A fun note here: with the `Dimensionality` design, we could even create a trait (`AxisMut<N>`) that captures at the type-level that the `N`th axis is mutable. I'm not sure this is particularly useful, but I think it's cool that it can be done.

### Error Handling

I also included a `ShapeStrideError` in the above `Shape` definition; see the reference implementation (linked at the start) for more information, but it's just an enum that implements `Display`.

## Stride

Of course, we'll also be needing a `Strides` trait; this is essentially an exact copy of `Shape`, except that the `Index` bound will return an `isize`, not a `usize`. There would also be an accompanying `StridesMut` for strides that are entirely mutable. Because stride manipulation is pretty central to libraries like `ndarray` or NumPy, I expect that the bare, immutable `Strides` will get a lot less usage than the immutable `Shape`.

I'm not sure whether `Strides` needs to support mixing `const`-valued strides with runtime strides; my initial prototypes don't bother. I can't think of many applications where fixing some subset of strides while allowing others to vary is helpful; but maybe we should! It avoids some complexity, though.

## Layout

This is where my design proposal really starts to depart from what `ndarray` does right now, and the second opportunity for new features. I'd like to propose a `Layout` trait that would serve as the main interface between an array's memory and its "layout".

Right now, `ndarray` exclusively supports dense, strided arrays, as the `shape` and `strides` are fields within the core types (as of the array reference PR, they sit within `LayoutRef`). However, moving to trait-based shapes and strides means that we'd have to go from `Array<A, D>` to `Array<A, D, Sh, St>` for `Sh: Shape` and `St: Strides`, and that wouldn't allow us to move into capabilities like block-organized matrices, sparse matrices, etc. Here's where the `Layout` trait comes in: like the C++ `mdspan` implementation, `Layout` is the gatekeeper between logical indices and memory access:

```rust
/// A trait capturing how an array is laid out in memory.
pub trait Layout: Dimensioned {
    /// The type of shape that the array uses.
    ///
    /// Must implement [`Shape`] and have the same dimensionality.
    type Shape: Shape<Dimality = Self::Dimality>;

    /// The index type that this layout uses; e.g., `[usize; N]`.
    ///
    /// Must have the same dimensionality.
    type Index: Dimensioned<Dimality = Self::Dimality>;

    /// Get the shape of the layout.
    ///
    /// If the implementing type does not carry a shape directly,
    /// one should be constructed and passed as [`Cow::Owned`].
    fn shape(&self) -> Cow<'_, Self::Shape>;

    /// Index into this layout with a multidimensional index.
    fn index(&self, idx: Self::Index) -> isize;

    // Shortcut methods, we could add more of these
    fn ndim(&self) -> usize {
        self.shape().ndim()
    }

    fn size(&self) -> usize {
        self.shape().size()
    }

    fn size_checked(&self) -> Option<usize> {
        self.shape().size_checked()
    }

    // Other methods, which will need to support iteration and other kinds of indexing...
}
```

A few things to note:

-   Like `Shape::as_slice`, `Layout::shape` returns `Cow` since it does not require the storage of a type that implements `Shape` (hyper-optimizations, like fixed-size matrices, may have their shapes and strides defined completely at compile time)
-   `Layout` can't implement `Index`, as it requires _computing_ the offset rather than just accessing it in memory; so we define `Layout::index` instead
-   I notionally include an `Index` associated type, but I'm not sure that's the way to go; or maybe it should have a default?

Finally, the `Layout` trait provides a unified interface and vocabulary for users and developers to talk and think about times when an array's shape and strides should be bundled together. The clearest example of this, in my opinion, is the creation of arrays that "look like" another array, in terms of memory order, e.g., a `zeros_like` function. I believe that doing something like `zeros_layout(arr.layout())` gives users a clear way to indicate the shape and strides of a desired array.

### Strided Layouts

If you're still reading this RFC, you may be so invested as to have noticed that `Layout` doesn't have anything about strides.
Instead, I propose we define an additional trait, `Strided: Layout`, as so:

```rust
pub trait Strided: Layout {
    type Strides: Strides<Dimality = Self::Dimality>;

    fn strides(&self) -> Cow<'_, Self::Strides>;

    // Probably other methods...
}
```

This leaves us a path forward for those non-dense and non-strided layouts in the future. We may want to consider restricting `ArrayBase` to `L: Strided` either temporarily or permanently as we make the transition to this new design; more on that later.

### New Generics

Rolling up into `Layout` means we can now go back to our core `ArrayBase` (and co.) types using `<A, L: Layout>`. This would be a huge breaking change, and we should consider if it's a) worth it and b) how we could make it as smooth as possible. However, it comes with a number of advantages:

-   The fewer generics the better, and keeping it at 2 would be nice
-   Trait bounds on associated types can be implicitly added, meaning that `fn function(arr: &ArrayRef<A, L>) where L: Strided<Dimality = NDim<3>> {}` specifies a 3-dimensional strided array
-   Type aliases can be easily constructed, such as `type Array3<A> = Array<A, NLayout<3>>` or `type Matrix3x3<A> = Array<A, MatLayout<3, 3>>`

# Drawbacks

As best I can tell, there are three obvious drawbacks to this approach:

1. Backwards Compatibility: I am proposing a framework that I believe is both more ergonomic and more powerful, but it will eventually come with major breaking changes.
2. Complexity: There are significantly more traits here, with more flexibility. If we're not careful and/or we don't communicate clearly, this could end up being or looking more complex than the current `Dimension` approach.
3. Performance: Ensuring that these are zero-cost abstractions will require careful benchmarking as we go; otherwise, we may end up erasing critical information or using pointer indirection where we weren't before. Similarly, concerns may come up around binary sizes due to monomorphization, LLVM optimizations, etc.

# Rationale and Alternatives

I think the clearest alternative to this entire plan is to stick with `Dimension` and alter it along the lines of what was suggested in #519. As I said before, while this might help the `usize`/`isize` problem of shape and strides, I don't think it solves the larger issue of vocabulary, nor does it provide a path for new features. I strongly believe that the pain of breaking changes would be worth the value in ergonomics and features that this new proposal provides.

# Transition Path

This is a giant proposal that includes major breaking changes. I wanted to get it all down to create a sort of "north star" that library developers can work together to achieve. To actually implement it, I think we'd have to do something like the following:

-   Prototype the core traits suggested here in a separate branch
-   Create compatibility / bridge traits that allow for smoother transitions between the current API and the new one
-   Introduce new APIs and deprecating old ones
-   Establish plans for versioning - when we want to introduce which breaking changes
-   Provide very clear documentation - see #1499 for how we might do that effectively

The transition path itself needs to be fleshed out. However, I think we can start introducing these new traits and types, and start adapting the library. As we go, we'll be able to experiment with the best ways to transition the codebase.

# Unresolved Questions

The largest unresolved question is clearly the transition path. However, the interplay of this design with indexing and iteration is still an unknown. I think more experimentation is needed to figure out how `Layout` can ergonomically support iteration, in particular.

# Summary

This is already a long RFC, so I'll be brief: I propose a design that I think is both more ergonomic and more powerful than our current `Dimension`-based approach and provides users with "industry"-standard vocabulary (i.e., like NumPy or PyTorch or others). Smooth transition will be a challenge, and breaking changes will be involved, but I think it will be worth it.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC: Comprehensive Plan for Restructuring Dimension Types #1506

Summary

Motivation

Clarity

Features

Conceptual Explanation

New Vocabulary

Technical Details

`Dimensionality`

Shape

Error Handling

Stride

Layout

Strided Layouts

New Generics

Drawbacks

Rationale and Alternatives

Transition Path

Unresolved Questions

Summary

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

RFC: Comprehensive Plan for Restructuring Dimension Types #1506

Description

Summary

Motivation

Clarity

Features

Conceptual Explanation

New Vocabulary

Technical Details

Dimensionality

Shape

Error Handling

Stride

Layout

Strided Layouts

New Generics

Drawbacks

Rationale and Alternatives

Transition Path

Unresolved Questions

Summary

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

`Dimensionality`