Benchmarking Matrix Multiplication on CPUs

  • I am working on reinforcement learning (with evolutionary algorithms) and artificial life, and I am trying to find the right software and hardware setup to accelerate my experiments. This benchmark is a step toward illuminating that search space for me. Happy to hear your thoughts.