Message ID | 20210107093340.15279-1-ajaykumar.rs@samsung.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
Series | iommu/arm-smmu-v3: Handle duplicated Stream IDs from other masters | expand |
On Thu, Jan 07, 2021 at 03:03:40PM +0530, Ajay Kumar wrote: > When PCI function drivers(ex:pci-endpoint-test) are probed for already > initialized PCIe-RC(Root Complex), and PCIe-RC is already bound to SMMU, > then we encounter a situation where the function driver tries to attach > itself to the smmu with the same stream-id as PCIe-RC and re-initialize > an already initialized STE. This causes ste_live BUG_ON() in the driver. I don't understand why the endpoint is using the same stream ID as the root complex in this case. Why is that? Is the grouping logic not working properly? > There is an already existing check in the driver to manage duplicated ids > if duplicated ids are added in same master device, but there can be > scenarios like above where we need to extend the check for other masters > using the same stream-id. > > Signed-off-by: Ajay Kumar <ajaykumar.rs@samsung.com> > --- > drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 33 +++++++++++++++++++++ > 1 file changed, 33 insertions(+) It doesn't feel like the driver is the right place to fix this, as the same issue could surely occur for other IOMMUs too, right? In which case, I think we should avoid getting into the situation where different groups have overlapping stream IDs. Will
On 2021-01-07 13:03, Will Deacon wrote: > On Thu, Jan 07, 2021 at 03:03:40PM +0530, Ajay Kumar wrote: >> When PCI function drivers(ex:pci-endpoint-test) are probed for already >> initialized PCIe-RC(Root Complex), and PCIe-RC is already bound to SMMU, >> then we encounter a situation where the function driver tries to attach >> itself to the smmu with the same stream-id as PCIe-RC and re-initialize >> an already initialized STE. This causes ste_live BUG_ON() in the driver. Note that this is actually expected behaviour, since Stream ID aliasing has remained officially not supported until a sufficiently compelling reason to do so appears. I always thought the most likely scenario would be a legacy PCI bridge with multiple devices behind it, but even that seems increasingly improbable for a modern SMMUv3-based system to ever see. > I don't understand why the endpoint is using the same stream ID as the root > complex in this case. Why is that? Is the grouping logic not working > properly? It's not so much that it isn't working properly, it's more that it needs to be implemented at all ;) >> There is an already existing check in the driver to manage duplicated ids >> if duplicated ids are added in same master device, but there can be >> scenarios like above where we need to extend the check for other masters >> using the same stream-id. >> >> Signed-off-by: Ajay Kumar <ajaykumar.rs@samsung.com> >> --- >> drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 33 +++++++++++++++++++++ >> 1 file changed, 33 insertions(+) > > It doesn't feel like the driver is the right place to fix this, as the same > issue could surely occur for other IOMMUs too, right? In which case, I think > we should avoid getting into the situation where different groups have > overlapping stream IDs. Yes, this patch does not represent the correct thing to do either way. The main reason that Stream ID aliasing hasn't been supported so far is that the required Stream ID to group lookup is rather awkward, and adding all of that complexity just for the sake of a rather unlikely possibility seemed dubious. However, PRI support has always had a more pressing need to implement almost the same thing (Stream ID to device), so once that lands we can finally get round to adding the rest of proper group support relatively easily. Robin.
On Tue, Jan 12, 2021 at 12:57 AM Robin Murphy <robin.murphy@arm.com> wrote: > > On 2021-01-07 13:03, Will Deacon wrote: > > On Thu, Jan 07, 2021 at 03:03:40PM +0530, Ajay Kumar wrote: > >> When PCI function drivers(ex:pci-endpoint-test) are probed for already > >> initialized PCIe-RC(Root Complex), and PCIe-RC is already bound to SMMU, > >> then we encounter a situation where the function driver tries to attach > >> itself to the smmu with the same stream-id as PCIe-RC and re-initialize > >> an already initialized STE. This causes ste_live BUG_ON() in the driver. > > Note that this is actually expected behaviour, since Stream ID aliasing > has remained officially not supported until a sufficiently compelling > reason to do so appears. I always thought the most likely scenario would > be a legacy PCI bridge with multiple devices behind it, but even that > seems increasingly improbable for a modern SMMUv3-based system to ever see. Thanks to Will and Robin for reviewing this. I am pretty new to PCI, sorry about that. I assumed that the support for stream-id alias is already handled as part of this patch: https://www.spinics.net/lists/arm-kernel/msg626087.html which prevents STE re-initialization. But, what I do not understand is why the path taken by the arm-smmu-v3 driver misses the aforementioned check for my usecase. > > I don't understand why the endpoint is using the same stream ID as the root > > complex in this case. Why is that? Is the grouping logic not working > > properly? > > It's not so much that it isn't working properly, it's more that it needs > to be implemented at all ;) The pci_endpoint_test picks up the same of_ DMA config node as the PCI RC because they sit on the same PCI bus [via pci_dma_configure( )] While in the arm-smmu-v3 driver, I can see that the pci_device_group( ) hands over the same iommu group as the Root Complex to the newly added master device (pci_endpoint_test in our case) because they share the same stream ID. Shouldn't they? > >> There is an already existing check in the driver to manage duplicated ids > >> if duplicated ids are added in same master device, but there can be > >> scenarios like above where we need to extend the check for other masters > >> using the same stream-id. > >> > >> Signed-off-by: Ajay Kumar <ajaykumar.rs@samsung.com> > >> --- > >> drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 33 +++++++++++++++++++++ > >> 1 file changed, 33 insertions(+) > > > > It doesn't feel like the driver is the right place to fix this, as the same > > issue could surely occur for other IOMMUs too, right? In which case, I think > > we should avoid getting into the situation where different groups have > > overlapping stream IDs. > > Yes, this patch does not represent the correct thing to do either way. > The main reason that Stream ID aliasing hasn't been supported so far is > that the required Stream ID to group lookup is rather awkward, and > adding all of that complexity just for the sake of a rather unlikely > possibility seemed dubious. However, PRI support has always had a more > pressing need to implement almost the same thing (Stream ID to device), > so once that lands we can finally get round to adding the rest of proper > group support relatively easily. I hope the support will be added soon. Also, can you point me to few drivers which already handle this type of stream-ID aliasing? Thanks, Ajay Kumar > ______________________________________________ > iommu mailing list > iommu@lists.linux-foundation.org > https://lists.linuxfoundation.org/mailman/listinfo/iommu
On 2021-01-12 20:29, Ajay Kumar wrote: > On Tue, Jan 12, 2021 at 12:57 AM Robin Murphy <robin.murphy@arm.com> wrote: >> >> On 2021-01-07 13:03, Will Deacon wrote: >>> On Thu, Jan 07, 2021 at 03:03:40PM +0530, Ajay Kumar wrote: >>>> When PCI function drivers(ex:pci-endpoint-test) are probed for already >>>> initialized PCIe-RC(Root Complex), and PCIe-RC is already bound to SMMU, >>>> then we encounter a situation where the function driver tries to attach >>>> itself to the smmu with the same stream-id as PCIe-RC and re-initialize >>>> an already initialized STE. This causes ste_live BUG_ON() in the driver. >> >> Note that this is actually expected behaviour, since Stream ID aliasing >> has remained officially not supported until a sufficiently compelling >> reason to do so appears. I always thought the most likely scenario would >> be a legacy PCI bridge with multiple devices behind it, but even that >> seems increasingly improbable for a modern SMMUv3-based system to ever see. > Thanks to Will and Robin for reviewing this. I am pretty new to PCI, > sorry about that. > I assumed that the support for stream-id alias is already handled as > part of this patch: > https://www.spinics.net/lists/arm-kernel/msg626087.html > which prevents STE re-initialization. But, what I do not understand is > why the path > taken by the arm-smmu-v3 driver misses the aforementioned check for my usecase. That case is where a single device, due to combinations of PCI DMA aliasing conditions, has multiple IDs of its own, and two or more of those IDs also happen to end up the same as each other. What you have is two distinct devices both claiming the same ID, since apparently they both represent the same underlying physical device (I don't know how the endpoint and pcieport drivers interoperate and/or coexist, but I can easily imagine that some liberties may be taken that the IOMMU layer doesn't really anticipate). >>> I don't understand why the endpoint is using the same stream ID as the root >>> complex in this case. Why is that? Is the grouping logic not working >>> properly? >> >> It's not so much that it isn't working properly, it's more that it needs >> to be implemented at all ;) > The pci_endpoint_test picks up the same of_ DMA config node as the PCI RC > because they sit on the same PCI bus [via pci_dma_configure( )] > While in the arm-smmu-v3 driver, I can see that the pci_device_group( ) hands > over the same iommu group as the Root Complex to the newly added master > device (pci_endpoint_test in our case) because they share the same stream ID. > Shouldn't they? I'd imagine it's most likely that the PCI grouping rules are just putting everything together due to a lack of ACS. Either way, I'm pretty sure the PCI logic *doesn't* consider actual PCI devices having overlapping Requester IDs, because that isn't a real thing (how would config accesses work?). You can consider yourself lucky that the devices do happen to be grouped already in your particular case, but that doesn't change the fact that there's basically no point in trying to handle Stream ID aliasing within groups without properly implementing the grouping-by-Stream-ID logic in the first place (note that even distinct ACS-isolated PCI endpoints could still alias beyond the host bridge at the SMMU input if the system's translation from Requester ID space to Stream ID space isn't 1:1) >>>> There is an already existing check in the driver to manage duplicated ids >>>> if duplicated ids are added in same master device, but there can be >>>> scenarios like above where we need to extend the check for other masters >>>> using the same stream-id. >>>> >>>> Signed-off-by: Ajay Kumar <ajaykumar.rs@samsung.com> >>>> --- >>>> drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 33 +++++++++++++++++++++ >>>> 1 file changed, 33 insertions(+) >>> >>> It doesn't feel like the driver is the right place to fix this, as the same >>> issue could surely occur for other IOMMUs too, right? In which case, I think >>> we should avoid getting into the situation where different groups have >>> overlapping stream IDs. >> >> Yes, this patch does not represent the correct thing to do either way. >> The main reason that Stream ID aliasing hasn't been supported so far is >> that the required Stream ID to group lookup is rather awkward, and >> adding all of that complexity just for the sake of a rather unlikely >> possibility seemed dubious. However, PRI support has always had a more >> pressing need to implement almost the same thing (Stream ID to device), >> so once that lands we can finally get round to adding the rest of proper >> group support relatively easily. > I hope the support will be added soon. Also, can you point me to few drivers > which already handle this type of stream-ID aliasing? We handle it in the SMMUv2 driver - the way that architecture does stream mapping makes it really easy to hang group pointers off the S2CRs and have arm_smmu_master_alloc_smes() and arm_smmu_device_group() work nicely together. Unfortunately it's not feasible to take the same approach for SMMUv3, since the Stream Table there may be up to 2^32 entries (vs. up to 128 S2CRs), and there just isn't enough room to encode a group pointer directly in an STE itself (you don't want to imagine how much time I've spent trying to think up schemes for that...) Robin.
diff --git a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c index e634bbe60573..a91c3c0e9ee8 100644 --- a/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c +++ b/drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c @@ -2022,10 +2022,26 @@ static __le64 *arm_smmu_get_step_for_sid(struct arm_smmu_device *smmu, u32 sid) return step; } +static bool arm_smmu_check_duplicated_sid(struct arm_smmu_master *master, + int sid) +{ + int i; + + for (i = 0; i < master->num_sids; ++i) + if (master->sids[i] == sid) + return true; + + return false; +} + static void arm_smmu_install_ste_for_dev(struct arm_smmu_master *master) { + bool sid_in_other_masters; int i, j; struct arm_smmu_device *smmu = master->smmu; + unsigned long flags; + struct arm_smmu_domain *smmu_domain = master->domain; + struct arm_smmu_master *other_masters; for (i = 0; i < master->num_sids; ++i) { u32 sid = master->sids[i]; @@ -2038,6 +2054,23 @@ static void arm_smmu_install_ste_for_dev(struct arm_smmu_master *master) if (j < i) continue; + /* Check for stream-ID duplication in masters in given domain */ + sid_in_other_masters = false; + spin_lock_irqsave(&smmu_domain->devices_lock, flags); + list_for_each_entry(other_masters, &smmu_domain->devices, + domain_head) { + sid_in_other_masters = + arm_smmu_check_duplicated_sid(other_masters, + sid); + if (sid_in_other_masters) + break; + } + spin_unlock_irqrestore(&smmu_domain->devices_lock, flags); + + /* Skip STE re-init if stream-id found in other masters */ + if (sid_in_other_masters) + continue; + arm_smmu_write_strtab_ent(master, sid, step); } }
When PCI function drivers(ex:pci-endpoint-test) are probed for already initialized PCIe-RC(Root Complex), and PCIe-RC is already bound to SMMU, then we encounter a situation where the function driver tries to attach itself to the smmu with the same stream-id as PCIe-RC and re-initialize an already initialized STE. This causes ste_live BUG_ON() in the driver. There is an already existing check in the driver to manage duplicated ids if duplicated ids are added in same master device, but there can be scenarios like above where we need to extend the check for other masters using the same stream-id. Signed-off-by: Ajay Kumar <ajaykumar.rs@samsung.com> --- drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c | 33 +++++++++++++++++++++ 1 file changed, 33 insertions(+)