Message ID | 20241101193412.1390391-1-gpiccoli@igalia.com (mailing list archive) |
---|---|
State | Accepted |
Delegated to: | Ping-Ke Shih |
Series | [V4] wifi: rtlwifi: Drastically reduce the attempts to read efuse in case of failures |
"Guilherme G. Piccoli" <gpiccoli@igalia.com> wrote: > Syzkaller reported a hung task with uevent_show() on stack trace. That > specific issue was addressed by another commit [0], but even with that > fix applied (for example, running v6.12-rc5) we face another type of hung > task that comes from the same reproducer [1]. By investigating that, we > could narrow it to the following path: > > (a) Syzkaller emulates a Realtek USB WiFi adapter using raw-gadget and > dummy_hcd infrastructure. > > (b) During the probe of rtl8192cu, the driver ends-up performing an efuse > read procedure (which is related to EEPROM load IIUC), and here lies the > issue: the function read_efuse() calls read_efuse_byte() many times, as > loop iterations depending on the efuse size (in our example, 512 in total). > > This procedure for reading efuse bytes relies in a loop that performs an > I/O read up to *10k* times in case of failures. We measured the time of > the loop inside read_efuse_byte() alone, and in this reproducer (which > involves the dummy_hcd emulation layer), it takes 15 seconds each. As a > consequence, we have the driver stuck in its probe routine for big time, > exposing a stack trace like below if we attempt to reboot the system, for > example: > > task:kworker/0:3 state:D stack:0 pid:662 tgid:662 ppid:2 flags:0x00004000 > Workqueue: usb_hub_wq hub_event > Call Trace: > __schedule+0xe22/0xeb6 > schedule_timeout+0xe7/0x132 > __wait_for_common+0xb5/0x12e > usb_start_wait_urb+0xc5/0x1ef > ? usb_alloc_urb+0x95/0xa4 > usb_control_msg+0xff/0x184 > _usbctrl_vendorreq_sync+0xa0/0x161 > _usb_read_sync+0xb3/0xc5 > read_efuse_byte+0x13c/0x146 > read_efuse+0x351/0x5f0 > efuse_read_all_map+0x42/0x52 > rtl_efuse_shadow_map_update+0x60/0xef > rtl_get_hwinfo+0x5d/0x1c2 > rtl92cu_read_eeprom_info+0x10a/0x8d5 > ? 
rtl92c_read_chip_version+0x14f/0x17e > rtl_usb_probe+0x323/0x851 > usb_probe_interface+0x278/0x34b > really_probe+0x202/0x4a4 > __driver_probe_device+0x166/0x1b2 > driver_probe_device+0x2f/0xd8 > [...] > > We propose hereby to drastically reduce the attempts of doing the I/O > reads in case of failures, restricted to USB devices (given that > they're inherently slower than PCIe ones). By retrying up to 10 times > (instead of 10000), we got reponsiveness in the reproducer, while seems > reasonable to believe that there's no sane USB device implementation in > the field requiring this amount of retries at every I/O read in order > to properly work. Based on that assumption, it'd be good to have it > backported to stable but maybe not since driver implementation (the 10k > number comes from day 0), perhaps up to 6.x series makes sense. > > [0] Commit 15fffc6a5624 ("driver core: Fix uevent_show() vs driver detach race") > > [1] A note about that: this syzkaller report presents multiple reproducers > that differs by the type of emulated USB device. For this specific case, > check the entry from 2024/08/08 06:23 in the list of crashes; the C repro > is available at https://syzkaller.appspot.com/text?tag=ReproC&x=1521fc83980000. > > Cc: stable@vger.kernel.org # v6.1+ > Reported-by: syzbot+edd9fe0d3a65b14588d5@syzkaller.appspotmail.com > Tested-by: Bitterblue Smith <rtl8821cerfe2@gmail.com> > Signed-off-by: Guilherme G. Piccoli <gpiccoli@igalia.com> 1 patch(es) applied to rtw-next branch of rtw.git, thanks. 5c1b54456300 wifi: rtlwifi: Drastically reduce the attempts to read efuse in case of failures --- https://github.com/pkshih/rtw.git
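
To put the figures from the commit message in perspective, here is a back-of-the-envelope estimate in C. It assumes two things the message does not state: that every one of the 512 read_efuse_byte() calls exhausts its retries, and that the time per attempt is constant, so the 15 seconds measured for 10000 attempts scales linearly. The macro and function names are made up for this sketch.

```c
#include <assert.h>
#include <stdint.h>

/* Figures quoted in the commit message. */
#define EFUSE_BYTES       512u    /* read_efuse_byte() calls in the example */
#define OLD_MAX_ATTEMPTS  10000u  /* retry bound before the patch */
#define NEW_MAX_ATTEMPTS  10u     /* retry bound for USB after the patch */
#define OLD_LOOP_MS       15000u  /* ~15 s measured for one exhausted loop */

/*
 * Worst-case time (in ms) spent polling when every byte read fails.
 * ASSUMPTION: cost per attempt is constant (15000 ms / 10000 attempts),
 * which the commit message does not claim; treat the result as a rough
 * order of magnitude only.
 */
static uint64_t worst_case_ms(uint32_t bytes, uint32_t attempts)
{
	/* With attempts bounded like this, the product fits in 64 bits. */
	assert(attempts <= OLD_MAX_ATTEMPTS);

	/* Multiply before dividing so the 1.5 ms per attempt is not truncated. */
	return (uint64_t)bytes * attempts * OLD_LOOP_MS / OLD_MAX_ATTEMPTS;
}
```

Under those assumptions, worst_case_ms(EFUSE_BYTES, OLD_MAX_ATTEMPTS) comes to 7,680,000 ms (about 128 minutes), while worst_case_ms(EFUSE_BYTES, NEW_MAX_ATTEMPTS) comes to 7,680 ms.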
diff --git a/drivers/net/wireless/realtek/rtlwifi/efuse.c b/drivers/net/wireless/realtek/rtlwifi/efuse.c
index 82cf5fb5175f..6518e77b89f5 100644
--- a/drivers/net/wireless/realtek/rtlwifi/efuse.c
+++ b/drivers/net/wireless/realtek/rtlwifi/efuse.c
@@ -162,10 +162,19 @@ void efuse_write_1byte(struct ieee80211_hw *hw, u16 address, u8 value)
 void read_efuse_byte(struct ieee80211_hw *hw, u16 _offset, u8 *pbuf)
 {
 	struct rtl_priv *rtlpriv = rtl_priv(hw);
+	u16 max_attempts = 10000;
 	u32 value32;
 	u8 readbyte;
 	u16 retry;
 
+	/*
+	 * In case of USB devices, transfer speeds are limited, hence
+	 * efuse I/O reads could be (way) slower. So, decrease (a lot)
+	 * the read attempts in case of failures.
+	 */
+	if (rtlpriv->rtlhal.interface == INTF_USB)
+		max_attempts = 10;
+
 	rtl_write_byte(rtlpriv, rtlpriv->cfg->maps[EFUSE_CTRL] + 1,
 		       (_offset & 0xff));
 	readbyte = rtl_read_byte(rtlpriv, rtlpriv->cfg->maps[EFUSE_CTRL] + 2);
@@ -178,7 +187,7 @@ void read_efuse_byte(struct ieee80211_hw *hw, u16 _offset, u8 *pbuf)
 
 	retry = 0;
 	value32 = rtl_read_dword(rtlpriv, rtlpriv->cfg->maps[EFUSE_CTRL]);
-	while (!(((value32 >> 24) & 0xff) & 0x80) && (retry < 10000)) {
+	while (!(((value32 >> 24) & 0xff) & 0x80) && (retry < max_attempts)) {
 		value32 = rtl_read_dword(rtlpriv,
 					 rtlpriv->cfg->maps[EFUSE_CTRL]);
 		retry++;
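
The effect of the patch on the polling loop can be tried out in user space. The sketch below mirrors the loop condition from the diff but replaces the hardware with a fake register. `struct fake_dev`, `fake_read_ctrl()`, `poll_efuse_ready()` and the local `enum intf_type` are hypothetical stand-ins for illustration, not rtlwifi APIs, and unlike the real read_efuse_byte() the sketch returns whether the ready bit was ever seen.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define EFUSE_READY_BIT 0x80u

/* Hypothetical stand-in for the driver's interface type. */
enum intf_type { INTF_PCI, INTF_USB };

/* Fake device: the ready bit shows up after 'ready_after' reads (0 = never). */
struct fake_dev {
	enum intf_type interface;
	unsigned int ready_after;
	unsigned int reads_done;	/* counts simulated register reads */
};

/* Simulated 32-bit read of the efuse control register. */
static uint32_t fake_read_ctrl(struct fake_dev *dev)
{
	dev->reads_done++;
	if (dev->ready_after != 0 && dev->reads_done >= dev->ready_after)
		return (uint32_t)EFUSE_READY_BIT << 24;
	return 0;
}

/* Poll for the ready bit with the interface-dependent bound from the patch. */
static bool poll_efuse_ready(struct fake_dev *dev)
{
	uint16_t max_attempts = 10000;
	uint16_t retry = 0;
	uint32_t value32;

	assert(dev != NULL);

	/* USB reads are slow, so give up much earlier on failure. */
	if (dev->interface == INTF_USB)
		max_attempts = 10;

	value32 = fake_read_ctrl(dev);
	while (!(((value32 >> 24) & 0xff) & EFUSE_READY_BIT) &&
	       retry < max_attempts) {
		value32 = fake_read_ctrl(dev);
		retry++;
	}

	return (((value32 >> 24) & 0xff) & EFUSE_READY_BIT) != 0;
}
```

With a device that never sets the ready bit, the USB case stops after 11 simulated reads (the initial read plus 10 retries), whereas the PCI case performs 10001.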