cuTENSOR: A High-Performance CUDA Library For Tensor Primitives.
2.1.0.9
aarch64-linux
x86_64-linux
x86_64-windows