-
Notifications
You must be signed in to change notification settings - Fork 122
Open
Description
Currently, some of the benchmarks allocate memory that's accessed from the kernels using:
malloc
aligned_alloc(2MiB, ...)
- C++
new
- vendor specific apis (cudaMallocManaged, etc.)
How the memory is allocated does impact benchmark performance a bit.
I think we should extract these into their own file, to ensure all benchmarks may pick the exact same defaults.
Metadata
Metadata
Assignees
Labels
No labels