Message ID | 20250217110623.2383142-1-ziniu.wang_1@nxp.com (mailing list archive) |
---|---|
State | New |
Headers | show |
Series | mmc: sdhci-esdhc-imx: improve imx8mq emmc/sd read performance | expand |
> -----Original Message----- > From: Luke Wang <ziniu.wang_1@nxp.com> > Sent: 2025年2月17日 19:06 > To: adrian.hunter@intel.com; ulf.hansson@linaro.org > Cc: Bough Chen <haibo.chen@nxp.com>; shawnguo@kernel.org; > s.hauer@pengutronix.de; kernel@pengutronix.de; festevam@gmail.com; > imx@lists.linux.dev; linux-mmc@vger.kernel.org; dl-S32 <S32@nxp.com>; > linux-arm-kernel@lists.infradead.org; linux-kernel@vger.kernel.org > Subject: [PATCH] mmc: sdhci-esdhc-imx: improve imx8mq emmc/sd read > performance > > From: Luke Wang <ziniu.wang_1@nxp.com> > > Compared with kernel 6.1, imx8mq eMMC/SD read performance drops by about > 30% with kernel 6.6. > > The eMMC/SD read thread will be put to sleep until the hardware completes > data transfer. Normally, the read thread will be woken up immediately when the > data transfer is completed. However, due to a known ic bug, if imx8mq is in > cpuidle, it will take a long time (about 500us) to exit cpuidle. As a result, the > read thread cannot immediately read the next data block, affecting the read > performance. > > Kernel 6.6 uses EEVDF as the new scheduler, which affects cpu scheduling and > cpuidle behavior. With kernel 6.6, the cpu which the read thread resides has a > greater probability in cpuidle (about 80%), while with kernel 6.1, the probability > is only about 20-30%. For other platforms, this does not have a significant impact > on read performance because the cpuidle exit time is very short (for example, > imx93 is about 60us). But for imx8mq, this results in longer waits for the thread > to be woken up while reading eMMC/SD, which drops performance. > > So for imx8mq, use the ESDHC_FLAG_PMQOS flag to request the cpu latency > QoS constraint. This can prevent entering cpuidle during data transfer. Reviewed-by: Haibo Chen <haibo.chen@nxp.com> Regards Haibo Chen > > Signed-off-by: Luke Wang <ziniu.wang_1@nxp.com> > --- > drivers/mmc/host/sdhci-esdhc-imx.c | 10 ++++++++++ > 1 file changed, 10 insertions(+) > > diff --git a/drivers/mmc/host/sdhci-esdhc-imx.c > b/drivers/mmc/host/sdhci-esdhc-imx.c > index ff78a7c6a04c..b3bf9c171d46 100644 > --- a/drivers/mmc/host/sdhci-esdhc-imx.c > +++ b/drivers/mmc/host/sdhci-esdhc-imx.c > @@ -337,6 +337,15 @@ static struct esdhc_soc_data usdhc_imx8mm_data = { > .quirks = SDHCI_QUIRK_NO_LED, > }; > > +static struct esdhc_soc_data usdhc_imx8mq_data = { > + .flags = ESDHC_FLAG_USDHC | ESDHC_FLAG_STD_TUNING > + | ESDHC_FLAG_HAVE_CAP1 | ESDHC_FLAG_HS200 > + | ESDHC_FLAG_HS400 | ESDHC_FLAG_PMQOS > + | ESDHC_FLAG_STATE_LOST_IN_LPMODE > + | ESDHC_FLAG_BROKEN_AUTO_CMD23, > + .quirks = SDHCI_QUIRK_NO_LED, > +}; > + > struct pltfm_imx_data { > u32 scratchpad; > struct pinctrl *pinctrl; > @@ -381,6 +390,7 @@ static const struct of_device_id imx_esdhc_dt_ids[] = { > { .compatible = "fsl,imx7ulp-usdhc", .data = &usdhc_imx7ulp_data, }, > { .compatible = "fsl,imx8qxp-usdhc", .data = &usdhc_imx8qxp_data, }, > { .compatible = "fsl,imx8mm-usdhc", .data = &usdhc_imx8mm_data, }, > + { .compatible = "fsl,imx8mq-usdhc", .data = &usdhc_imx8mq_data, }, > { .compatible = "fsl,imxrt1050-usdhc", .data = &usdhc_imxrt1050_data, }, > { .compatible = "nxp,s32g2-usdhc", .data = &usdhc_s32g2_data, }, > { /* sentinel */ } > -- > 2.34.1
On Mon, Feb 17, 2025 at 8:05 AM <ziniu.wang_1@nxp.com> wrote: > > From: Luke Wang <ziniu.wang_1@nxp.com> > > Compared with kernel 6.1, imx8mq eMMC/SD read performance drops by about > 30% with kernel 6.6. Since this fixes a significant performance regression, what about adding a Fixes tag?
Hi Luke, Am Montag, dem 17.02.2025 um 19:06 +0800 schrieb ziniu.wang_1@nxp.com: > From: Luke Wang <ziniu.wang_1@nxp.com> > > Compared with kernel 6.1, imx8mq eMMC/SD read performance drops by about > 30% with kernel 6.6. > > The eMMC/SD read thread will be put to sleep until the hardware completes > data transfer. Normally, the read thread will be woken up immediately > when the data transfer is completed. However, due to a known ic bug, if > imx8mq is in cpuidle, it will take a long time (about 500us) to exit > cpuidle. As a result, the read thread cannot immediately read the next > data block, affecting the read performance. > Is this really a problem with the upstream kernel? i.MX8MQ upstream does not use the deeper PSCI idle states, but only uses WFI, so I doubt that upstream is affected by this issue. Regards, Lucas > Kernel 6.6 uses EEVDF as the new scheduler, which affects cpu scheduling > and cpuidle behavior. With kernel 6.6, the cpu which the read thread > resides has a greater probability in cpuidle (about 80%), while with > kernel 6.1, the probability is only about 20-30%. For other platforms, > this does not have a significant impact on read performance because the > cpuidle exit time is very short (for example, imx93 is about 60us). But > for imx8mq, this results in longer waits for the thread to be woken up > while reading eMMC/SD, which drops performance. > > So for imx8mq, use the ESDHC_FLAG_PMQOS flag to request the cpu latency > QoS constraint. This can prevent entering cpuidle during data transfer. > > Signed-off-by: Luke Wang <ziniu.wang_1@nxp.com> > --- > drivers/mmc/host/sdhci-esdhc-imx.c | 10 ++++++++++ > 1 file changed, 10 insertions(+) > > diff --git a/drivers/mmc/host/sdhci-esdhc-imx.c b/drivers/mmc/host/sdhci-esdhc-imx.c > index ff78a7c6a04c..b3bf9c171d46 100644 > --- a/drivers/mmc/host/sdhci-esdhc-imx.c > +++ b/drivers/mmc/host/sdhci-esdhc-imx.c > @@ -337,6 +337,15 @@ static struct esdhc_soc_data usdhc_imx8mm_data = { > .quirks = SDHCI_QUIRK_NO_LED, > }; > > +static struct esdhc_soc_data usdhc_imx8mq_data = { > + .flags = ESDHC_FLAG_USDHC | ESDHC_FLAG_STD_TUNING > + | ESDHC_FLAG_HAVE_CAP1 | ESDHC_FLAG_HS200 > + | ESDHC_FLAG_HS400 | ESDHC_FLAG_PMQOS > + | ESDHC_FLAG_STATE_LOST_IN_LPMODE > + | ESDHC_FLAG_BROKEN_AUTO_CMD23, > + .quirks = SDHCI_QUIRK_NO_LED, > +}; > + > struct pltfm_imx_data { > u32 scratchpad; > struct pinctrl *pinctrl; > @@ -381,6 +390,7 @@ static const struct of_device_id imx_esdhc_dt_ids[] = { > { .compatible = "fsl,imx7ulp-usdhc", .data = &usdhc_imx7ulp_data, }, > { .compatible = "fsl,imx8qxp-usdhc", .data = &usdhc_imx8qxp_data, }, > { .compatible = "fsl,imx8mm-usdhc", .data = &usdhc_imx8mm_data, }, > + { .compatible = "fsl,imx8mq-usdhc", .data = &usdhc_imx8mq_data, }, > { .compatible = "fsl,imxrt1050-usdhc", .data = &usdhc_imxrt1050_data, }, > { .compatible = "nxp,s32g2-usdhc", .data = &usdhc_s32g2_data, }, > { /* sentinel */ }
On 2/17/25 11:58, Lucas Stach wrote: > Hi Luke, > > Am Montag, dem 17.02.2025 um 19:06 +0800 schrieb ziniu.wang_1@nxp.com: >> From: Luke Wang <ziniu.wang_1@nxp.com> >> >> Compared with kernel 6.1, imx8mq eMMC/SD read performance drops by about >> 30% with kernel 6.6. >> >> The eMMC/SD read thread will be put to sleep until the hardware completes >> data transfer. Normally, the read thread will be woken up immediately >> when the data transfer is completed. However, due to a known ic bug, if >> imx8mq is in cpuidle, it will take a long time (about 500us) to exit >> cpuidle. As a result, the read thread cannot immediately read the next >> data block, affecting the read performance. >> > Is this really a problem with the upstream kernel? i.MX8MQ upstream > does not use the deeper PSCI idle states, but only uses WFI, so I doubt > that upstream is affected by this issue. > > Regards, > Lucas Furthermore if that were to be the case the correct solution would probably to have that reflected in the dts, too?
Hi Lucas, You are right. This issue is observed on local kernel. I checked that the local kernel does use a deeper PSCI idle state (power off). So, the upstream kernel will not be affected by this. Thank you, Luke > -----Original Message----- > From: Lucas Stach <l.stach@pengutronix.de> > Sent: Monday, February 17, 2025 7:58 PM > To: Luke Wang <ziniu.wang_1@nxp.com>; adrian.hunter@intel.com; > ulf.hansson@linaro.org > Cc: imx@lists.linux.dev; dl-S32 <S32@nxp.com>; shawnguo@kernel.org; > s.hauer@pengutronix.de; linux-mmc@vger.kernel.org; linux- > kernel@vger.kernel.org; Bough Chen <haibo.chen@nxp.com>; > kernel@pengutronix.de; festevam@gmail.com; linux-arm- > kernel@lists.infradead.org > Subject: [EXT] Re: [PATCH] mmc: sdhci-esdhc-imx: improve imx8mq emmc/sd > read performance > > [You don't often get email from l.stach@pengutronix.de. Learn why this is > important at https://aka.ms/LearnAboutSenderIdentification ] > > Caution: This is an external email. Please take care when clicking links or > opening attachments. When in doubt, report the message using the 'Report > this email' button > > > Hi Luke, > > Am Montag, dem 17.02.2025 um 19:06 +0800 schrieb > ziniu.wang_1@nxp.com: > > From: Luke Wang <ziniu.wang_1@nxp.com> > > > > Compared with kernel 6.1, imx8mq eMMC/SD read performance drops by > about > > 30% with kernel 6.6. > > > > The eMMC/SD read thread will be put to sleep until the hardware > completes > > data transfer. Normally, the read thread will be woken up immediately > > when the data transfer is completed. However, due to a known ic bug, if > > imx8mq is in cpuidle, it will take a long time (about 500us) to exit > > cpuidle. As a result, the read thread cannot immediately read the next > > data block, affecting the read performance. > > > Is this really a problem with the upstream kernel? i.MX8MQ upstream > does not use the deeper PSCI idle states, but only uses WFI, so I doubt > that upstream is affected by this issue. > > Regards, > Lucas > > > Kernel 6.6 uses EEVDF as the new scheduler, which affects cpu scheduling > > and cpuidle behavior. With kernel 6.6, the cpu which the read thread > > resides has a greater probability in cpuidle (about 80%), while with > > kernel 6.1, the probability is only about 20-30%. For other platforms, > > this does not have a significant impact on read performance because the > > cpuidle exit time is very short (for example, imx93 is about 60us). But > > for imx8mq, this results in longer waits for the thread to be woken up > > while reading eMMC/SD, which drops performance. > > > > So for imx8mq, use the ESDHC_FLAG_PMQOS flag to request the cpu > latency > > QoS constraint. This can prevent entering cpuidle during data transfer. > > > > Signed-off-by: Luke Wang <ziniu.wang_1@nxp.com> > > --- > > drivers/mmc/host/sdhci-esdhc-imx.c | 10 ++++++++++ > > 1 file changed, 10 insertions(+) > > > > diff --git a/drivers/mmc/host/sdhci-esdhc-imx.c b/drivers/mmc/host/sdhci- > esdhc-imx.c > > index ff78a7c6a04c..b3bf9c171d46 100644 > > --- a/drivers/mmc/host/sdhci-esdhc-imx.c > > +++ b/drivers/mmc/host/sdhci-esdhc-imx.c > > @@ -337,6 +337,15 @@ static struct esdhc_soc_data usdhc_imx8mm_data > = { > > .quirks = SDHCI_QUIRK_NO_LED, > > }; > > > > +static struct esdhc_soc_data usdhc_imx8mq_data = { > > + .flags = ESDHC_FLAG_USDHC | ESDHC_FLAG_STD_TUNING > > + | ESDHC_FLAG_HAVE_CAP1 | ESDHC_FLAG_HS200 > > + | ESDHC_FLAG_HS400 | ESDHC_FLAG_PMQOS > > + | ESDHC_FLAG_STATE_LOST_IN_LPMODE > > + | ESDHC_FLAG_BROKEN_AUTO_CMD23, > > + .quirks = SDHCI_QUIRK_NO_LED, > > +}; > > + > > struct pltfm_imx_data { > > u32 scratchpad; > > struct pinctrl *pinctrl; > > @@ -381,6 +390,7 @@ static const struct of_device_id imx_esdhc_dt_ids[] > = { > > { .compatible = "fsl,imx7ulp-usdhc", .data = &usdhc_imx7ulp_data, }, > > { .compatible = "fsl,imx8qxp-usdhc", .data = &usdhc_imx8qxp_data, }, > > { .compatible = "fsl,imx8mm-usdhc", .data = &usdhc_imx8mm_data, }, > > + { .compatible = "fsl,imx8mq-usdhc", .data = &usdhc_imx8mq_data, }, > > { .compatible = "fsl,imxrt1050-usdhc", .data = > &usdhc_imxrt1050_data, }, > > { .compatible = "nxp,s32g2-usdhc", .data = &usdhc_s32g2_data, }, > > { /* sentinel */ }
diff --git a/drivers/mmc/host/sdhci-esdhc-imx.c b/drivers/mmc/host/sdhci-esdhc-imx.c index ff78a7c6a04c..b3bf9c171d46 100644 --- a/drivers/mmc/host/sdhci-esdhc-imx.c +++ b/drivers/mmc/host/sdhci-esdhc-imx.c @@ -337,6 +337,15 @@ static struct esdhc_soc_data usdhc_imx8mm_data = { .quirks = SDHCI_QUIRK_NO_LED, }; +static struct esdhc_soc_data usdhc_imx8mq_data = { + .flags = ESDHC_FLAG_USDHC | ESDHC_FLAG_STD_TUNING + | ESDHC_FLAG_HAVE_CAP1 | ESDHC_FLAG_HS200 + | ESDHC_FLAG_HS400 | ESDHC_FLAG_PMQOS + | ESDHC_FLAG_STATE_LOST_IN_LPMODE + | ESDHC_FLAG_BROKEN_AUTO_CMD23, + .quirks = SDHCI_QUIRK_NO_LED, +}; + struct pltfm_imx_data { u32 scratchpad; struct pinctrl *pinctrl; @@ -381,6 +390,7 @@ static const struct of_device_id imx_esdhc_dt_ids[] = { { .compatible = "fsl,imx7ulp-usdhc", .data = &usdhc_imx7ulp_data, }, { .compatible = "fsl,imx8qxp-usdhc", .data = &usdhc_imx8qxp_data, }, { .compatible = "fsl,imx8mm-usdhc", .data = &usdhc_imx8mm_data, }, + { .compatible = "fsl,imx8mq-usdhc", .data = &usdhc_imx8mq_data, }, { .compatible = "fsl,imxrt1050-usdhc", .data = &usdhc_imxrt1050_data, }, { .compatible = "nxp,s32g2-usdhc", .data = &usdhc_s32g2_data, }, { /* sentinel */ }