Add an option to reduce pagefault when available #430

hankluo6 · 2021-06-30T14:15:21Z

This PR adds the MAP_POPULATE flag to mi_unix_mmap and lets users control it through a new option (MIMALLOC_PREFAULT).

MAP_POPULATE prefaults the page tables for a mapping, which reduces the number of page faults at runtime and can improve performance.
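As a rough illustration of the idea (not the code in this PR), the flag can be added to an anonymous mapping roughly as follows. The helper name `map_region` and its `prefault` parameter are hypothetical and stand in for mi_unix_mmap and the new option; the sketch assumes Linux with glibc.

```c
#define _GNU_SOURCE   /* exposes MAP_ANONYMOUS and MAP_POPULATE on glibc */
#include <stdbool.h>
#include <stddef.h>
#include <sys/mman.h>

/* Hypothetical helper, not mimalloc's actual mi_unix_mmap: map `size` bytes
   of anonymous memory and optionally ask the kernel to prefault it. */
static void* map_region(size_t size, bool prefault) {
  int flags = MAP_PRIVATE | MAP_ANONYMOUS;
#if defined(MAP_POPULATE)
  /* MAP_POPULATE is Linux-specific; on other systems the request is ignored. */
  if (prefault) flags |= MAP_POPULATE;
#else
  (void)prefault;
#endif
  void* p = mmap(NULL, size, PROT_READ | PROT_WRITE, flags, -1, 0);
  return (p == MAP_FAILED ? NULL : p);
}
```

Guarding on `MAP_POPULATE` keeps the sketch compiling on platforms that do not define the flag, where the mapping is simply created without prefaulting.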

On my system (Linux x86_64, i5-8300H CPU, 1 NUMA node), I tested with ./mimalloc-test-stress 32 100 50 in a debug build.

Results:

Without MAP_POPULATE: around 2,943,560 page faults, 205 seconds
With MAP_POPULATE: around 94,554 page faults, 203 seconds

I also tested with mimalloc-bench. With eager_commit = 0, page faults in most benchmarks (cfrac, espresso, larsonN) increase by about 65,000, and the bigger tests (leanN, mstressN) increase even more. With eager_commit = 1, setting MAP_POPULATE has no significant effect and the page fault count is the same as in the original version.

I think this is because, with eager_commit set to 0, MAP_POPULATE ends up prefaulting memory that is never used. The new option therefore lets users tune this behavior themselves.

By the way, the page fault counts above include both major and minor page faults.
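For reference, one way to collect such counts from inside a process is getrusage(2), which reports minor and major faults separately. This is only a sketch of a possible measurement method and not necessarily how the numbers above were obtained; the type and function names are hypothetical.

```c
#include <stddef.h>
#include <sys/resource.h>

/* Hypothetical container for a snapshot of the process's fault counters. */
typedef struct fault_counts_s {
  long minor;   /* faults served without I/O (ru_minflt) */
  long major;   /* faults that required I/O (ru_majflt)  */
} fault_counts_t;

/* Snapshot the calling process's page-fault counters; returns 0 on success. */
static int read_fault_counts(fault_counts_t* out) {
  struct rusage ru;
  if (getrusage(RUSAGE_SELF, &ru) != 0) return -1;
  out->minor = ru.ru_minflt;
  out->major = ru.ru_majflt;
  return 0;
}

/* Total faults (minor + major) that occurred between two snapshots. */
static long fault_delta(const fault_counts_t* before, const fault_counts_t* after) {
  return (after->minor - before->minor) + (after->major - before->major);
}
```

Taking one snapshot before the workload and one after, then calling `fault_delta`, gives the combined count for that run.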

jserv · 2021-07-01T01:05:00Z

In my system, Linux x86_64 with 6 CPU cores and 1 numa node, test with ./mimalloc-test-stress 32 100 50,

If you would like to share benchmark results, you should describe the hardware configuration. "6 CPU cores" is too vague; mention at least the microarchitecture.

jserv · 2021-07-01T01:06:52Z

Besides the reduction in page fault counts, the elapsed time should be listed for each run.

jserv

The git commit message is less informative than the text that was added to "readme.md". You should improve the message.

jserv

Wrap the body of git commit messages at 72 characters, as the article "How to Write a Git Commit Message" suggests.
You should also mention that this option is specific to Linux.

jserv · 2021-07-01T16:11:52Z

I also test in mimalloc-bench but I found that the page faults would become larger compared to initial version.

It would be great if comprehensive experimental results from mimalloc-bench under different configurations could be shown along with the analysis.

This option instructs the kernel to synchronously load the entire mapped region into active memory by specifying `MAP_POPULATE` in `mmap`. It causes read-ahead on that memory, so subsequent accesses to the memory can proceed without page faults, which can improve performance.

daanx · 2021-10-19T17:46:48Z

Hi @hankluo6; thanks for your PR. At this moment I am hesitant though: as jserv remarks, we need to measure more. It sounds great of course to reduce page-faults by pre-populating, and it seems indeed that the (smallish) benchmarks get faster. But generally with large (real-world) workloads we usually try to avoid touching any pages that may not be needed after all. For example, for each block size mimalloc reserves mimalloc "pages" that are usually about 64k, but at first it only touches the initial OS page (of 4k) -- in case just a few objects are needed, all the other OS pages (60k) remain just virtual address space without needing real physical memory. That can be a big fraction due to memory fragmentation. So, generally, I would say it is not a good idea to do this.
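The effect described above can be observed with a small experiment, sketched here under the assumption of Linux with glibc: map a 64k region, touch only its first byte, and count how many OS pages are resident using mincore(2). The function names are hypothetical and this is not mimalloc code.

```c
#define _GNU_SOURCE   /* exposes mincore, MAP_ANONYMOUS and MAP_POPULATE on glibc */
#include <stddef.h>
#include <stdlib.h>
#include <sys/mman.h>
#include <unistd.h>

/* Count the OS pages of [addr, addr+size) that are resident in memory.
   `addr` must be page aligned. Returns -1 on error. */
static long resident_pages(void* addr, size_t size) {
  long psize = sysconf(_SC_PAGESIZE);
  if (psize <= 0) return -1;
  size_t n = (size + (size_t)psize - 1) / (size_t)psize;
  unsigned char* vec = malloc(n);
  if (vec == NULL) return -1;
  long count = -1;
  if (mincore(addr, size, vec) == 0) {
    count = 0;
    /* The least significant bit of each entry is set if the page is resident. */
    for (size_t i = 0; i < n; i++) count += (vec[i] & 1);
  }
  free(vec);
  return count;
}

/* Map `size` bytes, touch only the first byte, and report how many OS pages
   are resident afterwards. `populate` requests MAP_POPULATE when available. */
static long resident_after_first_touch(size_t size, int populate) {
  int flags = MAP_PRIVATE | MAP_ANONYMOUS;
#if defined(MAP_POPULATE)
  if (populate) flags |= MAP_POPULATE;
#else
  (void)populate;
#endif
  char* p = mmap(NULL, size, PROT_READ | PROT_WRITE, flags, -1, 0);
  if (p == MAP_FAILED) return -1;
  p[0] = 1;   /* touch only the initial OS page */
  long n = resident_pages(p, size);
  munmap(p, size);
  return n;
}
```

Comparing `resident_after_first_touch(64 * 1024, 0)` with `resident_after_first_touch(64 * 1024, 1)` should show whether the untouched part of the region consumes physical memory; the exact counts depend on the kernel and the page size, and prefaulting is performed on a best-effort basis.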

This is also why the eager_commit setting is there: in general you don't really want to do this; but indeed, for benchmarks it is always better to enable it (but I try to avoid optimizing for benchmarks as for real world workloads like browsers or long running servers, being good about memory fragmentation is much more important)

Anyways, I need some more thinking on this and better understand the impact. Best, Daan

jserv suggested changes Jul 1, 2021

hankluo6 changed the base branch from master to dev July 1, 2021 05:29

hankluo6 force-pushed the dev_populate branch from 9bbe190 to 2b4b669 July 1, 2021 14:54

jserv suggested changes Jul 1, 2021

hankluo6 force-pushed the dev_populate branch from 2b4b669 to 9a66d37 July 5, 2021 11:30
