The performance numbers are indeed best when data is in memory. However in reality sum() goes over memory mapped file, lazy loading data as required.