In the model the application and the case under consideration are
described by a set of characteristic numbers. These are the number of
cache loads, stores, main memory accesses and flops for a single
processor run. For a parallel run the size of the boundary partitions
per process, the amount of data to be communicated and the number of
communication steps have to be added. These numbers are combined with
the results of the low-level benchmarks and performance metrics. These
are the theoretical peak performance, the Cachebench read and write
bandwidths on different memory levels and the network bandwidth and
latency from the PMB benchmarks. These results are taken from the
repository directly, making the performance prediction depending
automatically on the measured benchmark data.
Cachebench Information
Read:
6040.1458
Write:
12552.1785
RMW:
5971.9115
Pingpong Information
Max Bandwidth:
1524.69
Bandwidth at 4 MB:
0.90519
Latency:
4.84
*Warning: This value must fit to the indicated processor architecture. If this is not the case, the value can be changed here. If the value do not fit to the respective processor architecture, the results of the performance prediction will be falsified.