Re: [PATCH v6 1/5] mm: rmap: support batched checks of the references for large folios
From: David Hildenbrand (Arm)
Date: Mon Mar 16 2026 - 10:17:07 EST
On 3/16/26 07:25, Baolin Wang wrote:
>
>
> On 3/10/26 4:17 PM, David Hildenbrand (Arm) wrote:
>> On 3/10/26 02:37, Baolin Wang wrote:
>>>
>>>
>>>
>>> I understand your point. I’m concerned that I can’t test this patch on
>>> every architecture to validate the benefits. Anyway, let me try this on
>>> my X86 machine first.
>>
>> In any case, please make that a follow-up patch :)
>
> Sure. However, after investigating RISC‑V and x86, I found that
> ptep_clear_flush_young() does not flush the TLB on these architectures:
>
> int ptep_clear_flush_young(struct vm_area_struct *vma,
> unsigned long address, pte_t *ptep)
> {
> /*
> * On x86 CPUs, clearing the accessed bit without a TLB flush
> * doesn't cause data corruption. [ It could cause incorrect
> * page aging and the (mistaken) reclaim of hot pages, but the
> * chance of that should be relatively low. ]
> *
> * So as a performance optimization don't flush the TLB when
> * clearing the accessed bit, it will eventually be flushed by
> * a context switch or a VM operation anyway. [ In the rare
> * event of it not getting flushed for a long time the delay
> * shouldn't really matter because there's no real memory
> * pressure for swapout to react to. ]
> */
> return ptep_test_and_clear_young(vma, address, ptep);
> }
You'd probably want an arch helper then, that tells you whether
a flush_tlb_range() after ptep_test_and_clear_young() is required.
Or some special flush_tlb_range() helper.
I agree that it requires more work.
--
Cheers,
David