Currently, a paradigm shift is taking place from human interpretation of results towards machine interpretation. The long-term vision here is to automate the benchmarking process further to enable machines to automatically learn the properties of algorithms. This requires a more rigid benchmarking process with all assumptions and design decisions made explicit and accessible for machine interpretation.