We want to combine the per-dataset results into a single summary of each
algorithm's performance across the five datasets. This is analogous to
comparing students by their GPA (Grade Point Average): algorithms are to
datasets as students are to courses.
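
One simple reading of the GPA analogy is sketched below. The text does
not say which aggregation is used, so the unweighted mean here is an
assumption, and the algorithm names and scores are hypothetical
placeholders rather than results from this work.

```python
from statistics import mean

# Hypothetical per-dataset scores (higher is better), one value per dataset.
# These are placeholder numbers for illustration only.
scores = {
    "algo_A": [0.5, 0.75, 1.0, 0.25, 0.5],
    "algo_B": [1.0, 0.5, 0.5, 0.75, 0.5],
}


def summarize(scores_by_algorithm):
    """Return the unweighted mean score of each algorithm across datasets.

    This mirrors a GPA in which every course (dataset) carries equal weight.
    """
    return {name: mean(vals) for name, vals in scores_by_algorithm.items()}


summary = summarize(scores)

# Rank algorithms from best to worst by their summary score.
ranking = sorted(summary, key=summary.get, reverse=True)
```

A plain mean only makes sense if the scores are on a comparable scale
across datasets; otherwise some normalization or a rank-based summary
may be more appropriate.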