Re: [PATCH v4] mm: introduce a new page type for page pool in page type

From: Pedro Falcato

Date: Wed May 13 2026 - 05:44:15 EST


On Wed, May 13, 2026 at 11:12:43AM +0200, Vlastimil Babka (SUSE) wrote:
> On 5/13/26 11:00, Dragos Tatulea wrote:
> >
> >
> > On 24.02.26 06:13, Byungchul Park wrote:
> >> Currently, the condition 'page->pp_magic == PP_SIGNATURE' is used to
> >> determine if a page belongs to a page pool. However, with the planned
> >> removal of @pp_magic, we should instead leverage the page_type in struct
> >> page, such as PGTY_netpp, for this purpose.
> >>
> >> Introduce and use the page type APIs e.g. PageNetpp(), __SetPageNetpp(),
> >> and __ClearPageNetpp() instead, and remove the existing APIs accessing
> >> @pp_magic e.g. page_pool_page_is_pp(), netmem_or_pp_magic(), and
> >> netmem_clear_pp_magic().
> >>
> >> Plus, add @page_type to struct net_iov at the same offset as struct page
> >> so as to use the page_type APIs for struct net_iov as well. While at it,
> >> reorder @type and @owner in struct net_iov to avoid a hole and
> >> increasing the struct size.
> >>
> >> This work was inspired by the following link:
> >>
> >> https://lore.kernel.org/all/582f41c0-2742-4400-9c81-0d46bf4e8314@xxxxxxxxx/
> >>
> >> While at it, move the sanity check for page pool to the free path.
> >>
> >> Suggested-by: David Hildenbrand <david@xxxxxxxxxx>
> >> Co-developed-by: Pavel Begunkov <asml.silence@xxxxxxxxx>
> >> Signed-off-by: Pavel Begunkov <asml.silence@xxxxxxxxx>
> >> Signed-off-by: Byungchul Park <byungchul@xxxxxx>
> >> Acked-by: David Hildenbrand <david@xxxxxxxxxx>
> >> Acked-by: Zi Yan <ziy@xxxxxxxxxx>
> >> Acked-by: Vlastimil Babka <vbabka@xxxxxxx>
> >> Reviewed-by: Toke Høiland-Jørgensen <toke@xxxxxxxxxx>
> >> ---
> >
> > Seems like this patch broke tcp_mmap because
> > validate_page_before_insert() returns -EINVAL due
> > to a page having a type. Here's the full flow:
> >
> > getsockopt(TCP_ZEROCOPY_RECEIVE) returns -EINVAL because of the
> > below flow in the kernel:
> >
> > tcp_zerocopy_receive()
> > -> tcp_zerocopy_vm_insert_batch()
> > -> vm_insert_pages()
> > -> insert_pages()
> > -> insert_page_in_batch_locked()
> > -> validate_page_before_insert() returns -EINVAL
> > because page_has_type(page) is now true.
> >
> > The patch below fixes the issue. But is this a valid fix?
>
> Hmm the check traces back to commit 0ee930e6cafa0 "mm/memory.c: prevent
> mapping typed pages to userspace"
>
> > Pages which use page_type must never be mapped to userspace as it would
> > destroy their page type. Add an explicit check for this instead of
> > assuming that kernel drivers always get this right.
>
> So uh, this doesn't look good I think.

Yep, you fundamentally can't map a page that has a type, because page_type
aliases _mapcount in struct page. Even with the given diff, just mapping the page
will increment the mapcount and wreak havoc. I think we need to revert this patch
for now.
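
To make the aliasing concrete, here is a rough userspace sketch. It is a model
only: struct fake_page, the fake_* helpers, the top-byte encoding and the 0xf9
value are all made up for illustration and are not the kernel's definitions.
It just shows what happens when one word is used both as a type tag and as a
counter that starts at -1.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Userspace model only. All names here are hypothetical; this is not the
 * kernel's struct page or its page_type encoding.
 */
struct fake_page {
	union {
		int32_t mapcount;	/* stands in for _mapcount; -1 means "not mapped" */
		uint32_t page_type;	/* stands in for page_type; same storage */
	};
};

/* Assumed encoding for this sketch: the type value lives in the top byte. */
#define FAKE_TYPE_SHIFT	24
#define FAKE_TYPE_NETPP	0xf9u	/* made-up value, for illustration only */

static void fake_page_init(struct fake_page *p)
{
	p->mapcount = -1;
}

static void fake_set_type(struct fake_page *p, uint32_t type)
{
	p->page_type = type << FAKE_TYPE_SHIFT;
}

/* Models what mapping the page would do to the shared word. */
static void fake_map(struct fake_page *p)
{
	p->mapcount++;
}

static void fake_demo(void)
{
	struct fake_page plain, typed;

	/* Untyped page: -1 -> 0, i.e. "mapped once", as expected. */
	fake_page_init(&plain);
	fake_map(&plain);
	assert(plain.mapcount == 0);

	/* Typed page: the "mapcount" is really the type bits. */
	fake_page_init(&typed);
	fake_set_type(&typed, FAKE_TYPE_NETPP);
	assert(typed.mapcount != -1);	/* no longer looks unmapped */

	fake_map(&typed);
	/* The word is now neither the clean type value nor a sane count. */
	assert(typed.page_type != (FAKE_TYPE_NETPP << FAKE_TYPE_SHIFT));
	assert(typed.mapcount < 0);
}
```

Calling fake_demo() runs through both cases. In this model, relaxing the check in
validate_page_before_insert() only gets the typed page past the gate; the
increment on the shared word still happens afterwards.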

I'm not sure what the long term plan for this would be. If page types are moved
to memdesc types, then the two stop colliding and that could work. I don't know
if that's Willy's plan, however.

(then there's the other question: are page pool pages really folios? not really.
they are mappable, but they aren't part of the page cache, or anon, nor are
they in the LRU or have rmap capabilities. perhaps we need a different memdesc
for those. we're one step away from reinventing class polymorphism from first
principles ;)

>
> > diff --git a/mm/memory.c b/mm/memory.c
> > index ea6568571131..4cb12673f450 100644
> > --- a/mm/memory.c
> > +++ b/mm/memory.c
> > @@ -2326,7 +2326,7 @@ static int validate_page_before_insert(struct vm_area_struct *vma,
> > return -EINVAL;
> > return 0;
> > }
> > - if (folio_test_anon(folio) || page_has_type(page))
> > + if (folio_test_anon(folio) || (page_has_type(page) && !PageNetpp(page)))
> > return -EINVAL;
> > flush_dcache_folio(folio);
> > return 0;
> >
> > Thanks,
> > Dragos
> >
> >
>
>

--
Pedro