Re: [PATCH] gpio: shared: call gpio_chip::of_xlate() if set

From: Bartosz Golaszewski

Date: Thu Mar 19 2026 - 06:11:03 EST

On Wed, 18 Mar 2026 20:09:08 +0100, Jon Hunter <jonathanh@xxxxxxxxxx> said:
>
>>>> Does this patch fix the real problem on the tegra board that you
>>>> reported initially? I doubt two separate GPIO keys, share the same pin
>>>> in real life.
>>>
>>> Yes it fixes the initial issue. However, now I am seeing a different
>>> error on the actual platform that is having the issue to begin with ...
>>>
>>
>> This is *with* the fix?
>
> Yes.
>
>>> ------------[ cut here ]------------
>>> WARNING: kernel/rcu/srcutree.c:757 at cleanup_srcu_struct+0xc0/0x1e0, CPU#2: kworker/u49:1/114
>>> Modules linked in:
>>> CPU: 2 UID: 0 PID: 114 Comm: kworker/u49:1 Not tainted 6.19.0-tegra #1 PREEMPT
>>> Hardware name: NVIDIA NVIDIA Jetson AGX Orin Developer Kit/Jetson, BIOS buildbrain-gcid-44496888 03/15/2026
>>> Workqueue: events_unbound deferred_probe_work_func
>>> pstate: 60400009 (nZCv daif +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
>>> pc : cleanup_srcu_struct+0xc0/0x1e0
>>> lr : cleanup_srcu_struct+0xb4/0x1e0
>>> sp : ffff800081cbb930
>>> x29: ffff800081cbb930 x28: ffffd79ff96d0c40 x27: ffff000086059000
>>> x26: 00000000fffffff0 x25: ffff000086571200 x24: ffffd79ff94adb10
>>> x23: ffffd79ff86400c0 x22: ffff000086059390 x21: ffffd79ff94aa040
>>> x20: 0000000000000000 x19: fffffdffbf669d40 x18: 00000000ffffffff
>>> x17: 0000000000000000 x16: ffffd79ff62dc8a0 x15: 0081cf5fe0409838
>>> x14: 0000000000000000 x13: 0000000000000272 x12: 0000000000000000
>>> x11: 00000000000000c0 x10: f7c5d06d757a4b3a x9 : 15ccf89dfeffb5e1
>>> x8 : ffff800081cbb8c8 x7 : 0000000000000000 x6 : 000000000151e960
>>> x5 : 0800000000000000 x4 : 0000000000000000 x3 : 0000000000000000
>>> x2 : 0000000000000001 x1 : 0000000000000000 x0 : 0000000000000004
>>> Call trace:
>>> cleanup_srcu_struct+0xc0/0x1e0 (P)
>>> gpiochip_add_data_with_key+0x3dc/0xf68
>>> devm_gpiochip_add_data_with_key+0x30/0x84
>>> tegra186_gpio_probe+0x5e4/0x808
>>> platform_probe+0x5c/0xb0
>>> really_probe+0xbc/0x2b4
>>> __driver_probe_device+0x78/0x134
>>> driver_probe_device+0x3c/0x164
>>> __device_attach_driver+0xc8/0x15c
>>> bus_for_each_drv+0x88/0x100
>>> __device_attach+0xa0/0x198
>>> device_initial_probe+0x58/0x5c
>>> bus_probe_device+0x38/0xbc
>>> deferred_probe_work_func+0x88/0xc8
>>> process_one_work+0x16c/0x3fc
>>> worker_thread+0x2d8/0x3ec
>>> kthread+0x144/0x22c
>>> ret_from_fork+0x10/0x20
>>> ---[ end trace 0000000000000000 ]---
>
> It seems that when the gpiochip_add_data_with_key(), then to avoid the
> above warning I needed to ...
>
> diff --git a/drivers/gpio/gpiolib.c b/drivers/gpio/gpiolib.c
> index 27ea5bc9ed8a..3130acfeeb66 100644
> --- a/drivers/gpio/gpiolib.c
> +++ b/drivers/gpio/gpiolib.c
> @@ -1277,6 +1277,7 @@ int gpiochip_add_data_with_key(struct gpio_chip *gc, void *data,
> goto err_print_message;
> }
> err_cleanup_desc_srcu:
> + synchronize_srcu(&gdev->desc_srcu);
> cleanup_srcu_struct(&gdev->desc_srcu);
> err_cleanup_gdev_srcu:
> cleanup_srcu_struct(&gdev->srcu);
>

Hi Tzung-Bi, allow me to Cc you. It looks like someone takes the SRCU lock
during the call to gpiochip_add_data_with_key() and this is why the cleanup
path complains. Does it make sense to add this synchronize_srcu() call here?

>
>>> gpiochip_add_data_with_key: GPIOs 512..675 (tegra234-gpio) failed to register, -16
>>> tegra186-gpio 2200000.gpio: probe with driver tegra186-gpio failed with error -16
>
> Which leaves the above.
>
>> There's a change to how gpiochip_add_data_with_key() error path works in
>> linux-next at the moment but it's not in any stable branch yet.
>>
>
> This commit?
>
> 16fdabe143fc ("gpio: Fix resource leaks on errors in gpiochip_add_data_with_key()")
>

Yes, I Cc'ed the author above.

>
>> -EBUSY can typically only happen if gpiod_request_commit() is called twice on
>> the same descriptor. Is that the case here?
>
> I have been looking at this today and now I can see that we have a
> 'gpio-hog' set for the same pins that are shared and hence it is
> getting request twice. If I drop the hog it goes away. This is a
> produce device-tree, not upstream, for some camera modules so I am
> wondering if we are doing something here we should not be. I am
> taking a closer look.
>

Ah, yes that definitely will not work. Hogs are taken first during the chip's
bringup and hogged lines will not be available to users. The error returned is
-EBUSY so it makes perfect sense and is expected.

Bart