General idea for nd
to nd
transformation
The idea with such nd
to nd
transformation is using just two things -
Permute axes : To get the order such that the flattened version corresponds to the flattened version of output. So, if you somehow end up using it twice, look again because you shouldn't.
Reshape : To split the axes or bring the final output to the desired shape. Splitting axes is needed mostly at the start, when the input is of lower-dim and we are needed to split into blocks. Again, you shouldn't need this more than twice.
Hence, generally we would have three steps :
[ Reshape ] ---> [ Permute axes ] ---> [ Reshape ]
Create more axes Bring axes Merge axes
into correct order
Back-tracking method
The safest way to solve, given the input and output is through, what one could call as the back-tracking method, i.e. split the axes of the input (when going from smaller nd
to bigger nd
) or split the axes of the output (when going from bigger nd
to smaller nd
). The idea with the splitting is to bring the number of dims of the smaller nd
one same as the bigger nd
one. Then, study the strides of the output and match it up against the input to get the required permute order. Finally, a reshape (default way or C order) might be needed at the end, if the final one is a smaller nd
one, to merge axes.
If both input and output are of same number of dims, then we would need to split both and break into blocks and study their strides against each other. In such cases, we should have the additional input parameter of block sizes, but that's probably off-topic.
Example
Let's use this specific case to demonstrate how to apply those strategies. In here, the input is 4D
, while output is 2D
. So, most probably, we won't need reshape to split. So, we need to start with permuting axes. Since, the final output is not 4D
, but a 2D
one, we would need a reshape at the end.
Now, the input here is :
In [270]: a
Out[270]:
array([[[[ 0, 0],
[ 0, 0]],
[[ 5, 10],
[15, 20]]],
[[[ 6, 12],
[18, 24]],
[[ 7, 14],
[21, 28]]]])
The expected output is :
In [271]: out
Out[271]:
array([[ 0, 5, 0, 10],
[ 6, 7, 12, 14],
[ 0, 15, 0, 20],
[18, 21, 24, 28]])
Also, this is a bigger nd
to smaller nd
transformation, so the back-tracking method would involve, splitting the output and studying its strides and matching up against the corresponding values in input :
axis = 3
--- -->
axis = 1
------>
axis=2| axis=0| [ 0, 5, 0, 10],
| [ 6, 7, 12, 14],
v
| [ 0, 15, 0, 20],
v
[18, 21, 24, 28]])
Hence, the permuted order needed is (2,0,3,1)
:
In [275]: a.transpose((2, 0, 3, 1))
Out[275]:
array([[[[ 0, 5],
[ 0, 10]],
[[ 6, 7],
[12, 14]]],
[[[ 0, 15],
[ 0, 20]],
[[18, 21],
[24, 28]]]])
Then, simply reshape to the expected shape :
In [276]: a.transpose((2, 0, 3, 1)).reshape(4,4)
Out[276]:
array([[ 0, 5, 0, 10],
[ 6, 7, 12, 14],
[ 0, 15, 0, 20],
[18, 21, 24, 28]])
More examples
I dug up my history and found few Q&As
based on nd
to nd
transformations. These could serve as other example cases, albeit with lesser explanation (mostly). As mentioned earlier, at most two reshapes
and at most one swapaxes
/transpose
did the job everywhere. They are listed below :