Open
Description
This article https://romanpoya.medium.com/a-look-at-the-performance-of-expression-templates-in-c-eigen-vs-blaze-vs-fastor-vs-armadillo-vs-2474ed38d982 shows more than order of magnitude slower performance of xtensor compared to Eigen, Blaze and Fastor. It seems to be caused by xt::linalg::norm()
. Can the code they used be improved to match the performance of other libraries?