Re: [PATCH 09/79] block: rust: introduce `kernel::block::bio` module

From: Alice Ryhl

Date: Mon Mar 16 2026 - 06:30:35 EST


On Mon, Feb 16, 2026 at 12:34:56AM +0100, Andreas Hindborg wrote:
> Add Rust abstractions for working with `struct bio`, the core IO command
> descriptor for the block layer.
>
> The `Bio` type wraps `struct bio` and provides safe access to the IO
> vector describing the data buffers associated with the IO command. The
> data buffers are represented as a vector of `Segment`s, where each
> segment is a contiguous region of physical memory backed by `Page`.
>
> The `BioSegmentIterator` provides iteration over segments in a single
> bio, while `BioIterator` allows traversing a chain of bios. The
> `Segment` type offers methods for copying data to and from pages, as
> well as zeroing page contents, which are the fundamental operations
> needed by block device drivers to process IO requests.
>
> The `Request` type is extended with methods to access the bio chain
> associated with a request, allowing drivers to iterate over all data
> buffers that need to be processed.
>
> Signed-off-by: Andreas Hindborg <a.hindborg@xxxxxxxxxx>
> ---
> rust/helpers/blk.c | 8 +
> rust/kernel/block.rs | 1 +
> rust/kernel/block/bio.rs | 143 +++++++++++++++
> rust/kernel/block/bio/vec.rs | 389 ++++++++++++++++++++++++++++++++++++++++
> rust/kernel/block/mq/request.rs | 46 +++++
> rust/kernel/lib.rs | 2 +
> rust/kernel/page.rs | 2 +-
> 7 files changed, 590 insertions(+), 1 deletion(-)
>
> diff --git a/rust/helpers/blk.c b/rust/helpers/blk.c
> index cc9f4e6a2d234..53beba8c7782d 100644
> --- a/rust/helpers/blk.c
> +++ b/rust/helpers/blk.c
> @@ -1,5 +1,6 @@
> // SPDX-License-Identifier: GPL-2.0
>
> +#include <linux/bio.h>
> #include <linux/blk-mq.h>
> #include <linux/blkdev.h>
>
> @@ -12,3 +13,10 @@ struct request *rust_helper_blk_mq_rq_from_pdu(void *pdu)
> {
> return blk_mq_rq_from_pdu(pdu);
> }
> +
> +void rust_helper_bio_advance_iter_single(const struct bio *bio,
> + struct bvec_iter *iter,
> + unsigned int bytes)
> +{
> + bio_advance_iter_single(bio, iter, bytes);
> +}

__rust_helper

> diff --git a/rust/kernel/block/bio.rs b/rust/kernel/block/bio.rs
> new file mode 100644
> index 0000000000000..94062ea5281e6
> --- /dev/null
> +++ b/rust/kernel/block/bio.rs
> @@ -0,0 +1,143 @@
> +// SPDX-License-Identifier: GPL-2.0
> +
> +//! Types for working with the bio layer.
> +//!
> +//! C header: [`include/linux/blk_types.h`](../../include/linux/blk_types.h)

srctree/

> +/// A block device IO descriptor (`struct bio`).
> +///
> +/// A `Bio` is the main unit of IO for the block layer. It describes an IO command and associated
> +/// data buffers.
> +///
> +/// The data buffers associated with a `Bio` are represented by a vector of [`Segment`]s. These
> +/// segments represent physically contiguous regions of memory. The memory is represented by
> +/// [`Page`] descriptors internally.
> +///
> +/// The vector of [`Segment`]s can be iterated by obtaining a [`SegmentIterator`].
> +///
> +/// # Invariants
> +///
> +/// Instances of this type is always reference counted. A call to
> +/// `bindings::bio_get()` ensures that the instance is valid for read at least
> +/// until a matching call to `bindings :bio_put()`.

Refcounted? None of these methods are called anywhere, and you do not
implement AlwaysRefcounted.

> +#[repr(transparent)]
> +pub struct Bio(Opaque<bindings::bio>);
> +
> +impl Bio {
> + /// Returns an iterator over segments in this `Bio`. Does not consider
> + /// segments of other bios in this bio chain.
> + #[inline(always)]
> + pub fn segment_iter(&mut self) -> BioSegmentIterator<'_> {

Not `self: Pin<&mut Self>` here?

> + /// Create an instance of `Bio` from a raw pointer.
> + ///
> + /// # Safety
> + ///
> + /// Caller must ensure that the `ptr` is valid for use as a reference to
> + /// `Bio` for the duration of `'a`.
> + #[inline(always)]
> + pub(crate) unsafe fn from_raw<'a>(ptr: *mut bindings::bio) -> Option<&'a Self> {
> + Some(
> + // SAFETY: by the safety requirement of this funciton, `ptr` is
> + // valid for read for the duration of the returned lifetime
> + unsafe { &*NonNull::new(ptr)?.as_ptr().cast::<Bio>() },
> + )
> + }
> +
> + /// Create an instance of `Bio` from a raw pointer.
> + ///
> + /// # Safety
> + ///
> + /// Caller must ensure that the `ptr` is valid for use as a unique reference
> + /// to `Bio` for the duration of `'a`.
> + #[inline(always)]
> + pub(crate) unsafe fn from_raw_mut<'a>(ptr: *mut bindings::bio) -> Option<&'a mut Self> {
> + Some(
> + // SAFETY: by the safety requirement of this funciton, `ptr` is
> + // valid for read for the duration of the returned lifetime
> + unsafe { &mut *NonNull::new(ptr)?.as_ptr().cast::<Bio>() },

Why the Option? I imagine every caller has a non-null pointer?

> + )
> + }
> +}
> +
> +impl core::fmt::Display for Bio {

We have our own fmt trait now, right?

> + fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
> + write!(
> + f,
> + "Bio({:?}, vcnt: {}, idx: {}, size: 0x{:x}, completed: 0x{:x})",
> + self.0.get(),
> + self.io_vec_count(),
> + self.raw_iter().bi_idx,
> + self.raw_iter().bi_size,
> + self.raw_iter().bi_bvec_done

This reads the entire `bi_iter` field three separate times. A local
variable may be a good idea.

> +/// An iterator over `Segment`
> +///
> +/// # Invariants
> +///
> +/// If `iter.bi_size` > 0, `iter` must always index a valid `bio_vec` in `bio.io_vec()`.
> +pub struct BioSegmentIterator<'a> {
> + bio: &'a mut Bio,
> + iter: bindings::bvec_iter,
> +}
> +
> +impl<'a> BioSegmentIterator<'a> {
> + /// Creeate a new segemnt iterator for iterating the segments of `bio`. The

typo

> +impl<'a> core::iter::Iterator for BioSegmentIterator<'a> {
> + type Item = Segment<'a>;
> +
> + #[inline(always)]
> + fn next(&mut self) -> Option<Self::Item> {
> + if self.iter.bi_size == 0 {
> + return None;
> + }
> +
> + // SAFETY: We checked that `self.iter.bi_size` > 0 above.
> + let bio_vec_ret = unsafe { self.io_vec() };
> +
> + // SAFETY: By existence of reference `&bio`, `bio.0` contains a valid
> + // `struct bio`. By type invariant of `BioSegmentItarator` `self.iter`
> + // indexes into a valid `bio_vec` entry. By C API contracit, `bv_len`
> + // does not exceed the size of the bio.
> + unsafe {
> + bindings::bio_advance_iter_single(
> + self.bio.0.get(),
> + core::ptr::from_mut(&mut self.iter),

Creating this BioSegmentItarator copies the bvec_iter from the Bio, and
then here you modify the copy. Is that the intent? Is the C type such
that copying it is always okay?

Also, is the C type such that moves are ok? It's playsible that the
answer is yes - the same applies in rust/kernel/iov.rs but it could be
clearer in e.g. "Invariants" that this is the case.

Nit: core::ptr::from_mut(&mut self.iter) -> &raw mut self.iter

> + /// Get a mutable reference to the first [`Bio`] in this request.
> + #[inline(always)]
> + pub fn bio_mut(&mut self) -> Option<&mut Bio> {
> + // SAFETY: By type invariant of `Self`, `self.0` is valid and the deref
> + // is safe.
> + let ptr = unsafe { (*self.0.get()).bio };
> + // SAFETY: By C API contract, if `bio` is not null it will have a
> + // positive refcount at least for the duration of the lifetime of
> + // `&self`.
> + unsafe { Bio::from_raw_mut(ptr) }

Surely &mut requires refcount == 1, not just positive refcount?

Alice