We want to combine the results in order to summarize the performance of the algorithms on the five datasets. This is analogous to comparing students by calculating the GPA (Grade Point Average), where students are to courses as algorithms are to datasets.