Message ID | 9a365cffe5af9ec5a1f79638968c3a2efa979b65.1634622716.git.mchehab+huawei@kernel.org (mailing list archive) |
---|---|
State | Superseded |
Delegated to: | Lorenzo Pieralisi |
Headers | show |
Series | Add support for Hikey 970 PCIe | expand |
On Tue, Oct 19, 2021 at 07:06:42AM +0100, Mauro Carvalho Chehab wrote: > Before code refactor, the PERST# signals were sent at the > end of the power_on logic. Then, the PCI core would probe for > the buses and add them. > > The new logic changed it to send PERST# signals during > add_bus operation. That altered the timings. > > Also, HiKey 970 require a little more waiting time for > the PCI bridge - which is outside the SoC - to finish > the PERST# reset, and then initialize the eye diagram. > Ok, now you explained it and we should move this explanation in the commit log that this change is affecting (I mean we should squash this patch with the patch that actually requires it - I am not sure whether it is patch 6 or another one). I can do it for you; I thought it would be a standalone change but it actually isn't, because it is brought about by the changes you are making and therefore there it belongs. Thanks for explaining it and apologies for the churn. Lorenzo > So, increase the waiting time for the PERST# signals to > what's required for it to also work with HiKey 970. > > Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org> > --- > > See [PATCH v14 00/11] at: https://lore.kernel.org/all/cover.1634622716.git.mchehab+huawei@kernel.org/ > > drivers/pci/controller/dwc/pcie-kirin.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > diff --git a/drivers/pci/controller/dwc/pcie-kirin.c b/drivers/pci/controller/dwc/pcie-kirin.c > index de375795a3b8..bc329673632a 100644 > --- a/drivers/pci/controller/dwc/pcie-kirin.c > +++ b/drivers/pci/controller/dwc/pcie-kirin.c > @@ -113,7 +113,7 @@ struct kirin_pcie { > #define CRGCTRL_PCIE_ASSERT_BIT 0x8c000000 > > /* Time for delay */ > -#define REF_2_PERST_MIN 20000 > +#define REF_2_PERST_MIN 21000 > #define REF_2_PERST_MAX 25000 > #define PERST_2_ACCESS_MIN 10000 > #define PERST_2_ACCESS_MAX 12000 > -- > 2.31.1 >
Em Thu, 21 Oct 2021 13:27:29 +0100 Lorenzo Pieralisi <lorenzo.pieralisi@arm.com> escreveu: > On Tue, Oct 19, 2021 at 07:06:42AM +0100, Mauro Carvalho Chehab wrote: > > Before code refactor, the PERST# signals were sent at the > > end of the power_on logic. Then, the PCI core would probe for > > the buses and add them. > > > > The new logic changed it to send PERST# signals during > > add_bus operation. That altered the timings. > > > > Also, HiKey 970 require a little more waiting time for > > the PCI bridge - which is outside the SoC - to finish > > the PERST# reset, and then initialize the eye diagram. > > > > Ok, now you explained it and we should move this explanation > in the commit log that this change is affecting (I mean we > should squash this patch with the patch that actually requires it > - I am not sure whether it is patch 6 or another one). IMO, having it on a separate patch has the advantage of better documenting this single line change, but yeah, this is part of the change needed to handle PERST# on a more portable way to work for both chipsets. > I can do it for you; I thought it would be a standalone change > but it actually isn't, because it is brought about by the > changes you are making and therefore there it belongs. Feel free to squash this patch if you prefer so. whatever works best for you. > Thanks for explaining it and apologies for the churn. No problem. > > Lorenzo > > > So, increase the waiting time for the PERST# signals to > > what's required for it to also work with HiKey 970. > > > > Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org> > > --- > > > > See [PATCH v14 00/11] at: https://lore.kernel.org/all/cover.1634622716.git.mchehab+huawei@kernel.org/ > > > > drivers/pci/controller/dwc/pcie-kirin.c | 2 +- > > 1 file changed, 1 insertion(+), 1 deletion(-) > > > > diff --git a/drivers/pci/controller/dwc/pcie-kirin.c b/drivers/pci/controller/dwc/pcie-kirin.c > > index de375795a3b8..bc329673632a 100644 > > --- a/drivers/pci/controller/dwc/pcie-kirin.c > > +++ b/drivers/pci/controller/dwc/pcie-kirin.c > > @@ -113,7 +113,7 @@ struct kirin_pcie { > > #define CRGCTRL_PCIE_ASSERT_BIT 0x8c000000 > > > > /* Time for delay */ > > -#define REF_2_PERST_MIN 20000 > > +#define REF_2_PERST_MIN 21000 > > #define REF_2_PERST_MAX 25000 > > #define PERST_2_ACCESS_MIN 10000 > > #define PERST_2_ACCESS_MAX 12000 > > -- > > 2.31.1 > >
On Tuesday 19 October 2021 07:06:42 Mauro Carvalho Chehab wrote: > Before code refactor, the PERST# signals were sent at the > end of the power_on logic. Then, the PCI core would probe for > the buses and add them. > > The new logic changed it to send PERST# signals during > add_bus operation. That altered the timings. > > Also, HiKey 970 require a little more waiting time for > the PCI bridge - which is outside the SoC - to finish > the PERST# reset, and then initialize the eye diagram. Hello! Which PCIe port do you mean by PCI bridge device? Do you mean PCIe Root Port? Or upstream port on some external PCIe switch connected via PCIe bus to the PCIe Root Port? Because all of these (virtual) PCIe devices are presented as PCI bridge devices, so it is not clear to which device it refers. Normally PERST# signal is used to reset endpoint card, other end of PCIe link and so PERST# signal should not affect PCIe Root Port at all. > So, increase the waiting time for the PERST# signals to > what's required for it to also work with HiKey 970. Because PERST# signal resets endpoint card, this reset timeout should not be driver or controller specific. Mauro, if you understand this issue more deeply, could you look at my email? https://lore.kernel.org/linux-pci/20210310110535.zh4pnn4vpmvzwl5q@pali/ I think that kernel PCI subsystem does not properly handle PCIe Warm Reset and correct initialization of endpoint cards. Because similar "random PERST# timeout patches" were applied to lot of native controller drivers. PS: I'm not opposing this patch, I'm just trying to understand what is happening here and why particular number "21000" was chosen. It is defined in some standard? Or was it just randomly chosen and measures that with this number is initialization working fine? > Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org> > --- > > See [PATCH v14 00/11] at: https://lore.kernel.org/all/cover.1634622716.git.mchehab+huawei@kernel.org/ > > drivers/pci/controller/dwc/pcie-kirin.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > diff --git a/drivers/pci/controller/dwc/pcie-kirin.c b/drivers/pci/controller/dwc/pcie-kirin.c > index de375795a3b8..bc329673632a 100644 > --- a/drivers/pci/controller/dwc/pcie-kirin.c > +++ b/drivers/pci/controller/dwc/pcie-kirin.c > @@ -113,7 +113,7 @@ struct kirin_pcie { > #define CRGCTRL_PCIE_ASSERT_BIT 0x8c000000 > > /* Time for delay */ > -#define REF_2_PERST_MIN 20000 > +#define REF_2_PERST_MIN 21000 > #define REF_2_PERST_MAX 25000 > #define PERST_2_ACCESS_MIN 10000 > #define PERST_2_ACCESS_MAX 12000 > -- > 2.31.1 >
Hi Pali, Em Fri, 22 Oct 2021 17:16:24 +0200 Pali Rohár <pali@kernel.org> escreveu: > On Tuesday 19 October 2021 07:06:42 Mauro Carvalho Chehab wrote: > > Before code refactor, the PERST# signals were sent at the > > end of the power_on logic. Then, the PCI core would probe for > > the buses and add them. > > > > The new logic changed it to send PERST# signals during > > add_bus operation. That altered the timings. > > > > Also, HiKey 970 require a little more waiting time for > > the PCI bridge - which is outside the SoC - to finish > > the PERST# reset, and then initialize the eye diagram. > > Hello! Which PCIe port do you mean by PCI bridge device? Do you mean > PCIe Root Port? Or upstream port on some external PCIe switch connected > via PCIe bus to the PCIe Root Port? Because all of these (virtual) PCIe > devices are presented as PCI bridge devices, so it is not clear to which > device it refers. HiKey 970 uses an external PCI bridge chipset (a Broadcom PEX 8606[1]), with 3 elements connected to the bus: an Ethernet card, a M.2 slot and a mini PCIe slot. It seems HiKey 970 is unique with regards to PERST# signal, as there are 4 independent PERST# signals there: - one for PEX 8606 (the PCIe root port); - one for Ethernet; - one for M.2; - one for mini-PCIe. After sending the PCIe PERST# signals, the device has to wait for 21 ms before adjusting the eye diagram. [1] https://docs.broadcom.com/docs/PEX_8606_AIC_RDK_HRM_v1.3_06Aug10.pdf > Normally PERST# signal is used to reset endpoint card, other end of PCIe > link and so PERST# signal should not affect PCIe Root Port at all. That's not the case, as PEX 8606 needs to complete its reset sequence for the rest of the devices to be visible. If the wait time is reduced or removed, the devices behind it won't be detected. > > So, increase the waiting time for the PERST# signals to > > what's required for it to also work with HiKey 970. > > Because PERST# signal resets endpoint card, this reset timeout should > not be driver or controller specific. Not sure if it would be possible to implement it at the core without breaking devices like this one where there's a separate chip to actually implement the PCIe bus. > Mauro, if you understand this issue more deeply, could you look at my > email? https://lore.kernel.org/linux-pci/20210310110535.zh4pnn4vpmvzwl5q@pali/ > > I think that kernel PCI subsystem does not properly handle PCIe Warm > Reset and correct initialization of endpoint cards. Because similar > "random PERST# timeout patches" were applied to lot of native controller > drivers. I don't know enough about PCIe documentation in order to help with that. Yet, if the PCI/PCIe specs doesn't define a maximum time for PERST# to finish, hardware manufacturers will do whatever they please. So, finding a common value is impossible. Well, even if specs define it, vendors may still violate that. So, whatever implementation is done, some quirks may be needed. Sending PERST# signals to the devices connected to the bridge too early will cause the bridge to not detect the devices behind it. That's what happens with HiKey 970: lower reset values cause it to miss devices. Looking from harware perspective, I'd say that the reset time pretty much depends on how the PCIe bridges are implemented: if it is FPGA, it is probably slower than if it is a dedicated hardware. It can be even slower if the bridge uses a microcontroller and needs to read the firmware from some place. > PS: I'm not opposing this patch, I'm just trying to understand what is > happening here and why particular number "21000" was chosen. It is > defined in some standard? Or was it just randomly chosen and measures > that with this number is initialization working fine? It is the value used by the HiKey 970 PCIe out-of-tree driver. The patch which added support for it at the pcie-kirin increased the time out there. I tried to preserve the previous value, but that cause some devices to be missed during PCI probe time. Btw, PEX 8606 datasheet says: `Reset Circuit The PEX 8606BA-AIC1U1D RDK accepts a PERST# from the host PC via card edge connector P1. This signal is OR’d with a manual reset circuit. The manual reset circuit consists of a pushbutton (SW7, upper left corner) that feeds into a reset timer. The reset timer monitors its power rail and reset input. If the reset input is low or the supply rail is out of range, the reset output is held. Once both conditions no longer exist, the reset output will de- assert after a programmable reset timeout period (capacitor adjustable, default value 128 msec). The OR’d reset signal goes to the PEX 8606 device’s PEX_PERST# input pin, and the downstream slots’ PERST# connector pins. PERST# to Slot J1 can be controlled by the PEX 8606 device’s Hot-Plug interface.' If I understood it well, the PERST# time is hardware-configurable, by changing the value of a capacitor. Regards, Mauro > > > Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org> > > --- > > > > See [PATCH v14 00/11] at: https://lore.kernel.org/all/cover.1634622716.git.mchehab+huawei@kernel.org/ > > > > drivers/pci/controller/dwc/pcie-kirin.c | 2 +- > > 1 file changed, 1 insertion(+), 1 deletion(-) > > > > diff --git a/drivers/pci/controller/dwc/pcie-kirin.c b/drivers/pci/controller/dwc/pcie-kirin.c > > index de375795a3b8..bc329673632a 100644 > > --- a/drivers/pci/controller/dwc/pcie-kirin.c > > +++ b/drivers/pci/controller/dwc/pcie-kirin.c > > @@ -113,7 +113,7 @@ struct kirin_pcie { > > #define CRGCTRL_PCIE_ASSERT_BIT 0x8c000000 > > > > /* Time for delay */ > > -#define REF_2_PERST_MIN 20000 > > +#define REF_2_PERST_MIN 21000 > > #define REF_2_PERST_MAX 25000 > > #define PERST_2_ACCESS_MIN 10000 > > #define PERST_2_ACCESS_MAX 12000 > > -- > > 2.31.1 > >
Hello! On Saturday 23 October 2021 10:30:59 Mauro Carvalho Chehab wrote: > Hi Pali, > > Em Fri, 22 Oct 2021 17:16:24 +0200 > Pali Rohár <pali@kernel.org> escreveu: > > > On Tuesday 19 October 2021 07:06:42 Mauro Carvalho Chehab wrote: > > > Before code refactor, the PERST# signals were sent at the > > > end of the power_on logic. Then, the PCI core would probe for > > > the buses and add them. > > > > > > The new logic changed it to send PERST# signals during > > > add_bus operation. That altered the timings. > > > > > > Also, HiKey 970 require a little more waiting time for > > > the PCI bridge - which is outside the SoC - to finish > > > the PERST# reset, and then initialize the eye diagram. > > > > Hello! Which PCIe port do you mean by PCI bridge device? Do you mean > > PCIe Root Port? Or upstream port on some external PCIe switch connected > > via PCIe bus to the PCIe Root Port? Because all of these (virtual) PCIe > > devices are presented as PCI bridge devices, so it is not clear to which > > device it refers. > > HiKey 970 uses an external PCI bridge chipset (a Broadcom PEX 8606[1]), Ok! Now I understood. You have probably one PCIe Root Port on your board and to this port you have connected (external) PCIe switch card from Broadcom to increase number of PCIe ports for endpoint cards. It is classic setup for boards with just one PCIe port. > with 3 elements connected to the bus: an Ethernet card, a M.2 slot and > a mini PCIe slot. It seems HiKey 970 is unique with regards to PERST# signal, > as there are 4 independent PERST# signals there: > > - one for PEX 8606 (the PCIe root port); > - one for Ethernet; > - one for M.2; > - one for mini-PCIe. This is not unique setup, its pretty normal. Every PCIe card has (own) PERST# pin and obviously you want to control each pin separately via SW. And because PCIe switch is also (upstream) PCIe device it has also PERST# pin. > After sending the PCIe PERST# signals, the device has to wait for 21 ms > before adjusting the eye diagram. "the device" which has to wait is HiKey970 or PEX8606? > [1] https://docs.broadcom.com/docs/PEX_8606_AIC_RDK_HRM_v1.3_06Aug10.pdf > > > Normally PERST# signal is used to reset endpoint card, other end of PCIe > > link and so PERST# signal should not affect PCIe Root Port at all. > > That's not the case, as PEX 8606 needs to complete its reset sequence > for the rest of the devices to be visible. If the wait time is reduced > or removed, the devices behind it won't be detected. Well, "endpoint card" for HiKey970 PCIe link is here PEX8606. And if you connect PEX8606 to any other board (which could have totally different PCIe controller), it means that same wait timeouts are required for that other board. So this wait timeout 21 ms is not HiKey970 specific, but rather PEX8606 specific, right? > > > So, increase the waiting time for the PERST# signals to > > > what's required for it to also work with HiKey 970. > > > > Because PERST# signal resets endpoint card, this reset timeout should > > not be driver or controller specific. > > Not sure if it would be possible to implement it at the core without > breaking devices like this one where there's a separate chip to actually > implement the PCIe bus. I think it should be possible. Probably not so easy, would need more testing, etc... But as I wrote above, this setup is not unique, it is really normal and kernel is prepared to work PCI and PCIe topologies when one or more PCIe switches, PCIe-to-PCI bridges or even more PCI-to-PCI bridges are used and connected to system board. I send email with proposal / idea how could be PCI subsystem extended to handle initialization of native PCIe controller drivers: https://lore.kernel.org/linux-pci/20211022183808.jdeo7vntnagqkg7g@pali/ (if you have some more points, feel free to reply) > > Mauro, if you understand this issue more deeply, could you look at my > > email? https://lore.kernel.org/linux-pci/20210310110535.zh4pnn4vpmvzwl5q@pali/ > > > > I think that kernel PCI subsystem does not properly handle PCIe Warm > > Reset and correct initialization of endpoint cards. Because similar > > "random PERST# timeout patches" were applied to lot of native controller > > drivers. > > I don't know enough about PCIe documentation in order to help with that. > Yet, if the PCI/PCIe specs doesn't define a maximum time for PERST# to > finish, hardware manufacturers will do whatever they please. So, finding > a common value is impossible. Well, it is possible that just I was not able to "find and decode" this timeout from specifications. So I'm just asking if somebody else was able to do it :-) > Well, even if specs define it, vendors may still violate that. So, whatever > implementation is done, some quirks may be needed. Of course, we know it and kernel has hooks and corrections for such situation. Fixes are in most cases in one place: drivers/pci/quirks.c > Sending PERST# signals to the devices connected to the bridge too early > will cause the bridge to not detect the devices behind it. That's what > happens with HiKey 970: lower reset values cause it to miss devices. Just to make sure, that I understand your problem. Is your setup looks like this? +-------------------------------------PERST#--+---+ | |eth| | +------PERST#--+ +--PCIe---+---+ | | | | +-------------+ +-------------+ +-------+ +---+ |GPIO-HiKey970| |PCIe-HiKey970|--PCIe--|PEX8606|-PCIe-|m.2| +-------------+ +-------------+ +-------+ +---+ | | +-----+ | | | +--PERST#--|mPCIe|--PCIe--+ | | +-----+ | +---------------------------------------PERST#--+ And if yes, in which order you need to assert individual PERST# signals and in which order to de-assert them? > Looking from harware perspective, I'd say that the reset time pretty > much depends on how the PCIe bridges are implemented: if it is FPGA, it is > probably slower than if it is a dedicated hardware. It can be even slower > if the bridge uses a microcontroller and needs to read the firmware from > some place. > > > PS: I'm not opposing this patch, I'm just trying to understand what is > > happening here and why particular number "21000" was chosen. It is > > defined in some standard? Or was it just randomly chosen and measures > > that with this number is initialization working fine? > > It is the value used by the HiKey 970 PCIe out-of-tree driver. The patch > which added support for it at the pcie-kirin increased the time out there. > > I tried to preserve the previous value, but that cause some devices to > be missed during PCI probe time. > > Btw, PEX 8606 datasheet says: > > `Reset Circuit > > The PEX 8606BA-AIC1U1D RDK accepts a PERST# from the host PC via card edge connector P1. This signal is > OR’d with a manual reset circuit. The manual reset circuit consists of a pushbutton (SW7, upper left corner) that > feeds into a reset timer. The reset timer monitors its power rail and reset input. If the reset input is low or the > supply rail is out of range, the reset output is held. Once both conditions no longer exist, the reset output will de- > assert after a programmable reset timeout period (capacitor adjustable, default value 128 msec). The OR’d reset > signal goes to the PEX 8606 device’s PEX_PERST# input pin, and the downstream slots’ PERST# connector > pins. PERST# to Slot J1 can be controlled by the PEX 8606 device’s Hot-Plug interface.' > > If I understood it well, the PERST# time is hardware-configurable, by > changing the value of a capacitor. Hm... this is something different. It says: "Once both conditions no longer exist, the reset output will de-assert after a programmable reset timeout period (capacitor adjustable, default value 128 msec)." I understand this part that if signal is no longer in reset that then this capacitor cause that reset is held for another 128 ms. So if host stops reset signal then it has to wait 128 ms prior doing something (to ensure that reset finished), right? > Regards, > Mauro > > > > > Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org> > > > --- > > > > > > See [PATCH v14 00/11] at: https://lore.kernel.org/all/cover.1634622716.git.mchehab+huawei@kernel.org/ > > > > > > drivers/pci/controller/dwc/pcie-kirin.c | 2 +- > > > 1 file changed, 1 insertion(+), 1 deletion(-) > > > > > > diff --git a/drivers/pci/controller/dwc/pcie-kirin.c b/drivers/pci/controller/dwc/pcie-kirin.c > > > index de375795a3b8..bc329673632a 100644 > > > --- a/drivers/pci/controller/dwc/pcie-kirin.c > > > +++ b/drivers/pci/controller/dwc/pcie-kirin.c > > > @@ -113,7 +113,7 @@ struct kirin_pcie { > > > #define CRGCTRL_PCIE_ASSERT_BIT 0x8c000000 > > > > > > /* Time for delay */ > > > -#define REF_2_PERST_MIN 20000 > > > +#define REF_2_PERST_MIN 21000 > > > #define REF_2_PERST_MAX 25000 > > > #define PERST_2_ACCESS_MIN 10000 > > > #define PERST_2_ACCESS_MAX 12000 > > > -- > > > 2.31.1 > > >
Em Sat, 23 Oct 2021 12:40:11 +0200 Pali Rohár <pali@kernel.org> escreveu: > Hello! > > On Saturday 23 October 2021 10:30:59 Mauro Carvalho Chehab wrote: > > Hi Pali, > > > > Em Fri, 22 Oct 2021 17:16:24 +0200 > > Pali Rohár <pali@kernel.org> escreveu: > > > > > On Tuesday 19 October 2021 07:06:42 Mauro Carvalho Chehab wrote: > > > > Before code refactor, the PERST# signals were sent at the > > > > end of the power_on logic. Then, the PCI core would probe for > > > > the buses and add them. > > > > > > > > The new logic changed it to send PERST# signals during > > > > add_bus operation. That altered the timings. > > > > > > > > Also, HiKey 970 require a little more waiting time for > > > > the PCI bridge - which is outside the SoC - to finish > > > > the PERST# reset, and then initialize the eye diagram. > > > > > > Hello! Which PCIe port do you mean by PCI bridge device? Do you mean > > > PCIe Root Port? Or upstream port on some external PCIe switch connected > > > via PCIe bus to the PCIe Root Port? Because all of these (virtual) PCIe > > > devices are presented as PCI bridge devices, so it is not clear to which > > > device it refers. > > > > HiKey 970 uses an external PCI bridge chipset (a Broadcom PEX 8606[1]), > > Ok! Now I understood. You have probably one PCIe Root Port on your board > and to this port you have connected (external) PCIe switch card from > Broadcom to increase number of PCIe ports for endpoint cards. Yes. > It is classic setup for boards with just one PCIe port. > > > with 3 elements connected to the bus: an Ethernet card, a M.2 slot and > > a mini PCIe slot. It seems HiKey 970 is unique with regards to PERST# signal, > > as there are 4 independent PERST# signals there: > > > > - one for PEX 8606 (the PCIe root port); > > - one for Ethernet; > > - one for M.2; > > - one for mini-PCIe. > > This is not unique setup, its pretty normal. Every PCIe card has (own) > PERST# pin and obviously you want to control each pin separately via SW. > And because PCIe switch is also (upstream) PCIe device it has also > PERST# pin. Based on the discussions we had to add per-port DT PERST# gpios, it sounded to me that this is was not a typical setup ;-) It seems that the typical setup is to have a single PERST# connected to all devices inside the bus. > > > After sending the PCIe PERST# signals, the device has to wait for 21 ms > > before adjusting the eye diagram. > > "the device" which has to wait is HiKey970 or PEX8606? I guess both need, but not really sure. Kirin 970 (the SoC used on HiKey 970 board) needs to wait for PEX8606 to reset, in order to adjust the eye diagram. The bus probing code needs to wait after sending the PERST# signals to the devices behind PEX8606, as otherwise they aren't detected. > > [1] https://docs.broadcom.com/docs/PEX_8606_AIC_RDK_HRM_v1.3_06Aug10.pdf > > > > > Normally PERST# signal is used to reset endpoint card, other end of PCIe > > > link and so PERST# signal should not affect PCIe Root Port at all. > > > > That's not the case, as PEX 8606 needs to complete its reset sequence > > for the rest of the devices to be visible. If the wait time is reduced > > or removed, the devices behind it won't be detected. > > Well, "endpoint card" for HiKey970 PCIe link is here PEX8606. And if you > connect PEX8606 to any other board (which could have totally different > PCIe controller), it means that same wait timeouts are required for that > other board. > > So this wait timeout 21 ms is not HiKey970 specific, but rather PEX8606 > specific, right? I guess so. > > > > So, increase the waiting time for the PERST# signals to > > > > what's required for it to also work with HiKey 970. > > > > > > Because PERST# signal resets endpoint card, this reset timeout should > > > not be driver or controller specific. > > > > Not sure if it would be possible to implement it at the core without > > breaking devices like this one where there's a separate chip to actually > > implement the PCIe bus. > > I think it should be possible. Probably not so easy, would need more > testing, etc... But as I wrote above, this setup is not unique, it is > really normal and kernel is prepared to work PCI and PCIe topologies > when one or more PCIe switches, PCIe-to-PCI bridges or even more > PCI-to-PCI bridges are used and connected to system board. Yeah, technically, it is doable, but applying change like that requires testing the code with all affected devices in order to avoid regressions. > I send email with proposal / idea how could be PCI subsystem extended to > handle initialization of native PCIe controller drivers: > https://lore.kernel.org/linux-pci/20211022183808.jdeo7vntnagqkg7g@pali/ > (if you have some more points, feel free to reply) Ok. Will try to reply to it later. > > > Mauro, if you understand this issue more deeply, could you look at my > > > email? https://lore.kernel.org/linux-pci/20210310110535.zh4pnn4vpmvzwl5q@pali/ > > > > > > I think that kernel PCI subsystem does not properly handle PCIe Warm > > > Reset and correct initialization of endpoint cards. Because similar > > > "random PERST# timeout patches" were applied to lot of native controller > > > drivers. > > > > I don't know enough about PCIe documentation in order to help with that. > > Yet, if the PCI/PCIe specs doesn't define a maximum time for PERST# to > > finish, hardware manufacturers will do whatever they please. So, finding > > a common value is impossible. > > Well, it is possible that just I was not able to "find and decode" this > timeout from specifications. So I'm just asking if somebody else was > able to do it :-) My past experiences with other drivers/devices show that, even for things that are clearly documented at the specs, hardware vendors end missing some things. That's why there are lots of quicks at USB and PCI code all over the drivers. > > Well, even if specs define it, vendors may still violate that. So, whatever > > implementation is done, some quirks may be needed. > > Of course, we know it and kernel has hooks and corrections for such > situation. Fixes are in most cases in one place: drivers/pci/quirks.c There are PCI quirks outside PCI core. For instance, on media, we had to add several quicks to avoid PCI2PCI data transfers with some broken hardware, where enabing it would cause disk data corruption (see bt8xx, saa7134 and cx88 drivers, for instance). > > Sending PERST# signals to the devices connected to the bridge too early > > will cause the bridge to not detect the devices behind it. That's what > > happens with HiKey 970: lower reset values cause it to miss devices. > > Just to make sure, that I understand your problem. Is your setup looks > like this? > > +-------------------------------------PERST#--+---+ > | |eth| > | +------PERST#--+ +--PCIe---+---+ > | | | | > +-------------+ +-------------+ +-------+ +---+ > |GPIO-HiKey970| |PCIe-HiKey970|--PCIe--|PEX8606|-PCIe-|m.2| > +-------------+ +-------------+ +-------+ +---+ > | | +-----+ | | > | +--PERST#--|mPCIe|--PCIe--+ | > | +-----+ | > +---------------------------------------PERST#--+ It is: +-------------------------------------PERST#--+---+ | |eth| | +--------------------PERST#--+ +--PCIe---+---+ | | | | +-------------+ +------------+ +-------+ +---+ |Kirin970 GPIO| |Kirin970 |--PCIe--|PEX8606|-PCIe-|m.2| | | |DWC and PHY | +-------+ +---+ +-------------+ +------------+ | | | | +-----+ | | | +---PERST#----------|mPCIe|--PCIe--+ | | +-----+ | +--------------------------------------PERST#---+ You can see more details by looking at the schematics of the board. It is available at: https://www.96boards.org/documentation/consumer/hikey/hikey970/hardware-docs/files/hikey970-schematics.pdf > And if yes, in which order you need to assert individual PERST# signals > and in which order to de-assert them? As requested by Rob, the current code triggers a PERST# signal to PEX8606 at the end of the power-on sequence: if (!gpio_request(kirin_pcie->gpio_id_dwc_perst, "pcie_perst_bridge")) { ret = gpio_direction_output(kirin_pcie->gpio_id_dwc_perst, 1); if (ret) goto err; } usleep_range(PERST_2_ACCESS_MIN, PERST_2_ACCESS_MAX); Then, it sends a per-slot reset during add_bus ops: static int kirin_pcie_add_bus(struct pci_bus *bus) { struct dw_pcie *pci = to_dw_pcie_from_pp(bus->sysdata); struct kirin_pcie *kirin_pcie = to_kirin_pcie(pci); int i, ret; if (!kirin_pcie->num_slots) return 0; /* Send PERST# to each slot */ for (i = 0; i < kirin_pcie->num_slots; i++) { ret = gpio_direction_output(kirin_pcie->gpio_id_reset[i], 1); if (ret) { dev_err(pci->dev, "PERST# %s error: %d\n", kirin_pcie->reset_names[i], ret); } } usleep_range(PERST_2_ACCESS_MIN, PERST_2_ACCESS_MAX); return 0; } > > Looking from harware perspective, I'd say that the reset time pretty > > much depends on how the PCIe bridges are implemented: if it is FPGA, it is > > probably slower than if it is a dedicated hardware. It can be even slower > > if the bridge uses a microcontroller and needs to read the firmware from > > some place. > > > > > PS: I'm not opposing this patch, I'm just trying to understand what is > > > happening here and why particular number "21000" was chosen. It is > > > defined in some standard? Or was it just randomly chosen and measures > > > that with this number is initialization working fine? > > > > It is the value used by the HiKey 970 PCIe out-of-tree driver. The patch > > which added support for it at the pcie-kirin increased the time out there. > > > > I tried to preserve the previous value, but that cause some devices to > > be missed during PCI probe time. > > > > Btw, PEX 8606 datasheet says: > > > > `Reset Circuit > > > > The PEX 8606BA-AIC1U1D RDK accepts a PERST# from the host PC via card edge connector P1. This signal is > > OR’d with a manual reset circuit. The manual reset circuit consists of a pushbutton (SW7, upper left corner) that > > feeds into a reset timer. The reset timer monitors its power rail and reset input. If the reset input is low or the > > supply rail is out of range, the reset output is held. Once both conditions no longer exist, the reset output will de- > > assert after a programmable reset timeout period (capacitor adjustable, default value 128 msec). The OR’d reset > > signal goes to the PEX 8606 device’s PEX_PERST# input pin, and the downstream slots’ PERST# connector > > pins. PERST# to Slot J1 can be controlled by the PEX 8606 device’s Hot-Plug interface.' > > > > If I understood it well, the PERST# time is hardware-configurable, by > > changing the value of a capacitor. > > Hm... this is something different. It says: "Once both conditions no > longer exist, the reset output will de-assert after a programmable reset > timeout period (capacitor adjustable, default value 128 msec)." > > I understand this part that if signal is no longer in reset that then > this capacitor cause that reset is held for another 128 ms. So if host > stops reset signal then it has to wait 128 ms prior doing something > (to ensure that reset finished), right? It seems that a 0 -> 1 transition at the GPIO triggers PEX8606 code to send the PERST# signal to the time configured via the capacitor (128 ms by default). There's no need to change the GPIO back to 0. Regards, Mauro
On Saturday 23 October 2021 14:45:34 Mauro Carvalho Chehab wrote: > Em Sat, 23 Oct 2021 12:40:11 +0200 > Pali Rohár <pali@kernel.org> escreveu: > > It is classic setup for boards with just one PCIe port. > > > > > with 3 elements connected to the bus: an Ethernet card, a M.2 slot and > > > a mini PCIe slot. It seems HiKey 970 is unique with regards to PERST# signal, > > > as there are 4 independent PERST# signals there: > > > > > > - one for PEX 8606 (the PCIe root port); > > > - one for Ethernet; > > > - one for M.2; > > > - one for mini-PCIe. > > > > This is not unique setup, its pretty normal. Every PCIe card has (own) > > PERST# pin and obviously you want to control each pin separately via SW. > > And because PCIe switch is also (upstream) PCIe device it has also > > PERST# pin. > > Based on the discussions we had to add per-port DT PERST# gpios, it > sounded to me that this is was not a typical setup ;-) > > It seems that the typical setup is to have a single PERST# connected > to all devices inside the bus. Hello! I'm sure it is not unique :) Just seems that these boards either do not use device tree (x86-based) or do not have specified reset-gpios in DTS at all. Sometimes there is no need to touch PERST# gpio as either firmware during boot handles it (x86 BIOS/UEFI case, or U-Boot for arm case) or because board/cpu reset toggle PERST# in a way that is compatible for cards init. Looks like you have non-x86 board, which does not have PCIe init code in firmware, needs special handling of PERST# and you are doing it with upstream kernel :-) So maybe all these conditions are unique... But HW design not. As this setup with reset-gpios per card in DTS nodes is something which I will need too, I sent email to Rob with proposal how to universally declare it in DTS, independently of PCIe controller (you are on CC): https://lore.kernel.org/linux-pci/20211023144252.z7ou2l2tvm6cvtf7@pali/ Due to how PCIe cards are broken, PERST# signal is sometimes the only one option how to reset card at runtime. So requirement for separate PERST# per card configurable at runtime by OS will be requirement for more and more boards.
On Sat, Oct 23, 2021 at 10:30:59AM +0100, Mauro Carvalho Chehab wrote: > Hi Pali, > > Em Fri, 22 Oct 2021 17:16:24 +0200 > Pali Rohár <pali@kernel.org> escreveu: > > > On Tuesday 19 October 2021 07:06:42 Mauro Carvalho Chehab wrote: > > > Before code refactor, the PERST# signals were sent at the > > > end of the power_on logic. Then, the PCI core would probe for > > > the buses and add them. > > > > > > The new logic changed it to send PERST# signals during > > > add_bus operation. That altered the timings. > > > > > > Also, HiKey 970 require a little more waiting time for > > > the PCI bridge - which is outside the SoC - to finish > > > the PERST# reset, and then initialize the eye diagram. > > > > Hello! Which PCIe port do you mean by PCI bridge device? Do you mean > > PCIe Root Port? Or upstream port on some external PCIe switch connected > > via PCIe bus to the PCIe Root Port? Because all of these (virtual) PCIe > > devices are presented as PCI bridge devices, so it is not clear to which > > device it refers. > > HiKey 970 uses an external PCI bridge chipset (a Broadcom PEX 8606[1]), > with 3 elements connected to the bus: an Ethernet card, a M.2 slot and > a mini PCIe slot. It seems HiKey 970 is unique with regards to PERST# signal, > as there are 4 independent PERST# signals there: > > - one for PEX 8606 (the PCIe root port); > - one for Ethernet; > - one for M.2; > - one for mini-PCIe. > > After sending the PCIe PERST# signals, the device has to wait for 21 ms > before adjusting the eye diagram. > > [1] https://docs.broadcom.com/docs/PEX_8606_AIC_RDK_HRM_v1.3_06Aug10.pdf > > > Normally PERST# signal is used to reset endpoint card, other end of PCIe > > link and so PERST# signal should not affect PCIe Root Port at all. > > That's not the case, as PEX 8606 needs to complete its reset sequence > for the rest of the devices to be visible. If the wait time is reduced > or removed, the devices behind it won't be detected. These pieces of information should go into the commit log (or I can add a Link: tag to this discussion) - it is fundamental to understand these changes. I believe we can merge this series but we have to document this discussion appropriately. Lorenzo
Em Mon, 25 Oct 2021 11:25:11 +0100 Lorenzo Pieralisi <lorenzo.pieralisi@arm.com> escreveu: > On Sat, Oct 23, 2021 at 10:30:59AM +0100, Mauro Carvalho Chehab wrote: > > Hi Pali, > > > > Em Fri, 22 Oct 2021 17:16:24 +0200 > > Pali Rohár <pali@kernel.org> escreveu: > > > > > On Tuesday 19 October 2021 07:06:42 Mauro Carvalho Chehab wrote: > > > > Before code refactor, the PERST# signals were sent at the > > > > end of the power_on logic. Then, the PCI core would probe for > > > > the buses and add them. > > > > > > > > The new logic changed it to send PERST# signals during > > > > add_bus operation. That altered the timings. > > > > > > > > Also, HiKey 970 require a little more waiting time for > > > > the PCI bridge - which is outside the SoC - to finish > > > > the PERST# reset, and then initialize the eye diagram. > > > > > > Hello! Which PCIe port do you mean by PCI bridge device? Do you mean > > > PCIe Root Port? Or upstream port on some external PCIe switch connected > > > via PCIe bus to the PCIe Root Port? Because all of these (virtual) PCIe > > > devices are presented as PCI bridge devices, so it is not clear to which > > > device it refers. > > > > HiKey 970 uses an external PCI bridge chipset (a Broadcom PEX 8606[1]), > > with 3 elements connected to the bus: an Ethernet card, a M.2 slot and > > a mini PCIe slot. It seems HiKey 970 is unique with regards to PERST# signal, > > as there are 4 independent PERST# signals there: > > > > - one for PEX 8606 (the PCIe root port); > > - one for Ethernet; > > - one for M.2; > > - one for mini-PCIe. > > > > After sending the PCIe PERST# signals, the device has to wait for 21 ms > > before adjusting the eye diagram. > > > > [1] https://docs.broadcom.com/docs/PEX_8606_AIC_RDK_HRM_v1.3_06Aug10.pdf > > > > > Normally PERST# signal is used to reset endpoint card, other end of PCIe > > > link and so PERST# signal should not affect PCIe Root Port at all. > > > > That's not the case, as PEX 8606 needs to complete its reset sequence > > for the rest of the devices to be visible. If the wait time is reduced > > or removed, the devices behind it won't be detected. > > These pieces of information should go into the commit log (or I can add > a Link: tag to this discussion) - it is fundamental to understand these > changes. > > I believe we can merge this series but we have to document this > discussion appropriately. IMO, the best is to add a Link: to the discussion: Link: https://lore.kernel.org/all/9a365cffe5af9ec5a1f79638968c3a2efa979b65.1634622716.git.mchehab+huawei@kernel.org/ But if you prefer otherwise and want me to re-submit the series, please let me know. Regards, Mauro
On Mon, Oct 25, 2021 at 11:40:11AM +0100, Mauro Carvalho Chehab wrote: > Em Mon, 25 Oct 2021 11:25:11 +0100 > Lorenzo Pieralisi <lorenzo.pieralisi@arm.com> escreveu: > > > On Sat, Oct 23, 2021 at 10:30:59AM +0100, Mauro Carvalho Chehab wrote: > > > Hi Pali, > > > > > > Em Fri, 22 Oct 2021 17:16:24 +0200 > > > Pali Rohár <pali@kernel.org> escreveu: > > > > > > > On Tuesday 19 October 2021 07:06:42 Mauro Carvalho Chehab wrote: > > > > > Before code refactor, the PERST# signals were sent at the > > > > > end of the power_on logic. Then, the PCI core would probe for > > > > > the buses and add them. > > > > > > > > > > The new logic changed it to send PERST# signals during > > > > > add_bus operation. That altered the timings. > > > > > > > > > > Also, HiKey 970 require a little more waiting time for > > > > > the PCI bridge - which is outside the SoC - to finish > > > > > the PERST# reset, and then initialize the eye diagram. > > > > > > > > Hello! Which PCIe port do you mean by PCI bridge device? Do you mean > > > > PCIe Root Port? Or upstream port on some external PCIe switch connected > > > > via PCIe bus to the PCIe Root Port? Because all of these (virtual) PCIe > > > > devices are presented as PCI bridge devices, so it is not clear to which > > > > device it refers. > > > > > > HiKey 970 uses an external PCI bridge chipset (a Broadcom PEX 8606[1]), > > > with 3 elements connected to the bus: an Ethernet card, a M.2 slot and > > > a mini PCIe slot. It seems HiKey 970 is unique with regards to PERST# signal, > > > as there are 4 independent PERST# signals there: > > > > > > - one for PEX 8606 (the PCIe root port); > > > - one for Ethernet; > > > - one for M.2; > > > - one for mini-PCIe. > > > > > > After sending the PCIe PERST# signals, the device has to wait for 21 ms > > > before adjusting the eye diagram. > > > > > > [1] https://docs.broadcom.com/docs/PEX_8606_AIC_RDK_HRM_v1.3_06Aug10.pdf > > > > > > > Normally PERST# signal is used to reset endpoint card, other end of PCIe > > > > link and so PERST# signal should not affect PCIe Root Port at all. > > > > > > That's not the case, as PEX 8606 needs to complete its reset sequence > > > for the rest of the devices to be visible. If the wait time is reduced > > > or removed, the devices behind it won't be detected. > > > > These pieces of information should go into the commit log (or I can add > > a Link: tag to this discussion) - it is fundamental to understand these > > changes. > > > > I believe we can merge this series but we have to document this > > discussion appropriately. > > IMO, the best is to add a Link: to the discussion: > > Link: https://lore.kernel.org/all/9a365cffe5af9ec5a1f79638968c3a2efa979b65.1634622716.git.mchehab+huawei@kernel.org/ > > But if you prefer otherwise and want me to re-submit the series, please > let me know. I will squash this patch with the previous one (that describes the bridge PERST# requirements) and add the Link above to the commit log. Lorenzo
diff --git a/drivers/pci/controller/dwc/pcie-kirin.c b/drivers/pci/controller/dwc/pcie-kirin.c index de375795a3b8..bc329673632a 100644 --- a/drivers/pci/controller/dwc/pcie-kirin.c +++ b/drivers/pci/controller/dwc/pcie-kirin.c @@ -113,7 +113,7 @@ struct kirin_pcie { #define CRGCTRL_PCIE_ASSERT_BIT 0x8c000000 /* Time for delay */ -#define REF_2_PERST_MIN 20000 +#define REF_2_PERST_MIN 21000 #define REF_2_PERST_MAX 25000 #define PERST_2_ACCESS_MIN 10000 #define PERST_2_ACCESS_MAX 12000
Before code refactor, the PERST# signals were sent at the end of the power_on logic. Then, the PCI core would probe for the buses and add them. The new logic changed it to send PERST# signals during add_bus operation. That altered the timings. Also, HiKey 970 require a little more waiting time for the PCI bridge - which is outside the SoC - to finish the PERST# reset, and then initialize the eye diagram. So, increase the waiting time for the PERST# signals to what's required for it to also work with HiKey 970. Signed-off-by: Mauro Carvalho Chehab <mchehab+huawei@kernel.org> --- See [PATCH v14 00/11] at: https://lore.kernel.org/all/cover.1634622716.git.mchehab+huawei@kernel.org/ drivers/pci/controller/dwc/pcie-kirin.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-)