ca28cee2cb35fd5c6395e7dc98d603b5061c5672
[openwrt/staging/hauke.git] / target / linux / generic / backport-6.1 / 020-v6.3-13-UPSTREAM-mm-multi-gen-LRU-section-for-rmap-PT-walk-f.patch
1 From 5ddf9d53d375e42af49b744bd7c2f8247c6bce15 Mon Sep 17 00:00:00 2001
2 From: "T.J. Alumbaugh" <talumbau@google.com>
3 Date: Wed, 18 Jan 2023 00:18:22 +0000
4 Subject: [PATCH 13/19] UPSTREAM: mm: multi-gen LRU: section for rmap/PT walk
5 feedback
6
7 Add a section for lru_gen_look_around() in the code and the design doc.
8
9 Link: https://lkml.kernel.org/r/20230118001827.1040870-3-talumbau@google.com
10 Change-Id: I5097af63f61b3b69ec2abee6cdbdc33c296df213
11 Signed-off-by: T.J. Alumbaugh <talumbau@google.com>
12 Cc: Yu Zhao <yuzhao@google.com>
13 Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
14 (cherry picked from commit db19a43d9b3a8876552f00f656008206ef9a5efa)
15 Bug: 274865848
16 Signed-off-by: T.J. Mercier <tjmercier@google.com>
17 ---
18 Documentation/mm/multigen_lru.rst | 14 ++++++++++++++
19 mm/vmscan.c | 4 ++++
20 2 files changed, 18 insertions(+)
21
22 --- a/Documentation/mm/multigen_lru.rst
23 +++ b/Documentation/mm/multigen_lru.rst
24 @@ -156,6 +156,20 @@ This time-based approach has the followi
25 and memory sizes.
26 2. It is more reliable because it is directly wired to the OOM killer.
27
28 +Rmap/PT walk feedback
29 +---------------------
30 +Searching the rmap for PTEs mapping each page on an LRU list (to test
31 +and clear the accessed bit) can be expensive because pages from
32 +different VMAs (PA space) are not cache friendly to the rmap (VA
33 +space). For workloads mostly using mapped pages, searching the rmap
34 +can incur the highest CPU cost in the reclaim path.
35 +
36 +``lru_gen_look_around()`` exploits spatial locality to reduce the
37 +trips into the rmap. It scans the adjacent PTEs of a young PTE and
38 +promotes hot pages. If the scan was done cacheline efficiently, it
39 +adds the PMD entry pointing to the PTE table to the Bloom filter. This
40 +forms a feedback loop between the eviction and the aging.
41 +
42 Summary
43 -------
44 The multi-gen LRU can be disassembled into the following parts:
45 --- a/mm/vmscan.c
46 +++ b/mm/vmscan.c
47 @@ -4553,6 +4553,10 @@ static void lru_gen_age_node(struct pgli
48 }
49 }
50
51 +/******************************************************************************
52 + * rmap/PT walk feedback
53 + ******************************************************************************/
54 +
55 /*
56 * This function exploits spatial locality when shrink_folio_list() walks the
57 * rmap. It scans the adjacent PTEs of a young PTE and promotes hot pages. If