Message ID | 1422052359-12384-4-git-send-email-m-karicheri2@ti.com (mailing list archive) |
---|---|
State | New, archived |
Delegated to: | Bjorn Helgaas |
Headers | show |
Hi Murali, On 23/01/15 22:32, Murali Karicheri wrote: > Fix the dma-range size when the DT attribute is missing. i.e set size to > dev->coherent_dma_mask + 1 instead of dev->coherent_dma_mask. To detect > overflow when mask is set to max of u64, add a check, log error and return. > Some platform use mask format for size in DTS. So add a work around to > catch this and fix. > > Cc: Joerg Roedel <joro@8bytes.org> > Cc: Grant Likely <grant.likely@linaro.org> > Cc: Rob Herring <robh+dt@kernel.org> > Cc: Bjorn Helgaas <bhelgaas@google.com> > Cc: Will Deacon <will.deacon@arm.com> > Cc: Russell King <linux@arm.linux.org.uk> > Cc: Arnd Bergmann <arnd@arndb.de> > Cc: Suravee Suthikulpanit <Suravee.Suthikulpanit@amd.com> > > Signed-off-by: Murali Karicheri <m-karicheri2@ti.com> > --- > drivers/of/device.c | 14 +++++++++++++- > 1 file changed, 13 insertions(+), 1 deletion(-) > > diff --git a/drivers/of/device.c b/drivers/of/device.c > index 2de320d..0a5ff54 100644 > --- a/drivers/of/device.c > +++ b/drivers/of/device.c > @@ -105,12 +105,24 @@ void of_dma_configure(struct device *dev, struct device_node *np) > ret = of_dma_get_range(np, &dma_addr, &paddr, &size); > if (ret < 0) { > dma_addr = offset = 0; > - size = dev->coherent_dma_mask; > + size = dev->coherent_dma_mask + 1; > } else { > offset = PFN_DOWN(paddr - dma_addr); > + /* > + * Add a work around to treat the size as mask + 1 in case > + * it is defined in DT as a mask. > + */ > + if (size & 1) > + size = size + 1; > dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); > } > > + /* if size is 0, we have an overflow of u64 */ > + if (!size) { > + dev_err(dev, "invalid size\n"); > + return; > + } > + This seems potentially fragile to dodgy DTs given that we might also be using size to make a mask later. Would it make sense to double-up a sanity check as mask-format detection? Something like: if is_power_of_2(size) // use size else if is_power_of_2(size + 1) // use size + 1 else // cry Robin. > dev->dma_pfn_offset = offset; > > coherent = of_dma_is_coherent(np); > -- To unsubscribe from this list: send the line "unsubscribe linux-pci" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On 01/27/2015 06:27 AM, Robin Murphy wrote: > Hi Murali, > > On 23/01/15 22:32, Murali Karicheri wrote: >> Fix the dma-range size when the DT attribute is missing. i.e set size to >> dev->coherent_dma_mask + 1 instead of dev->coherent_dma_mask. To detect >> overflow when mask is set to max of u64, add a check, log error and >> return. >> Some platform use mask format for size in DTS. So add a work around to >> catch this and fix. >> >> Cc: Joerg Roedel <joro@8bytes.org> >> Cc: Grant Likely <grant.likely@linaro.org> >> Cc: Rob Herring <robh+dt@kernel.org> >> Cc: Bjorn Helgaas <bhelgaas@google.com> >> Cc: Will Deacon <will.deacon@arm.com> >> Cc: Russell King <linux@arm.linux.org.uk> >> Cc: Arnd Bergmann <arnd@arndb.de> >> Cc: Suravee Suthikulpanit <Suravee.Suthikulpanit@amd.com> >> >> Signed-off-by: Murali Karicheri <m-karicheri2@ti.com> >> --- >> drivers/of/device.c | 14 +++++++++++++- >> 1 file changed, 13 insertions(+), 1 deletion(-) >> >> diff --git a/drivers/of/device.c b/drivers/of/device.c >> index 2de320d..0a5ff54 100644 >> --- a/drivers/of/device.c >> +++ b/drivers/of/device.c >> @@ -105,12 +105,24 @@ void of_dma_configure(struct device *dev, struct >> device_node *np) >> ret = of_dma_get_range(np, &dma_addr, &paddr, &size); >> if (ret < 0) { >> dma_addr = offset = 0; >> - size = dev->coherent_dma_mask; >> + size = dev->coherent_dma_mask + 1; >> } else { >> offset = PFN_DOWN(paddr - dma_addr); >> + /* >> + * Add a work around to treat the size as mask + 1 in case >> + * it is defined in DT as a mask. >> + */ >> + if (size & 1) >> + size = size + 1; >> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); >> } >> >> + /* if size is 0, we have an overflow of u64 */ >> + if (!size) { >> + dev_err(dev, "invalid size\n"); >> + return; >> + } >> + > > This seems potentially fragile to dodgy DTs given that we might also be > using size to make a mask later. Would it make sense to double-up a > sanity check as mask-format detection? Something like: > > if is_power_of_2(size) > // use size > else if is_power_of_2(size + 1) > // use size + 1 > else > // cry > Robin, I think this is better. I will wait for some more time for anyone to respond and re-send my patch with this change. Thanks Murali > > Robin. > >> dev->dma_pfn_offset = offset; >> >> coherent = of_dma_is_coherent(np); >> > >
On 01/27/2015 06:27 AM, Robin Murphy wrote: > Hi Murali, > > On 23/01/15 22:32, Murali Karicheri wrote: >> Fix the dma-range size when the DT attribute is missing. i.e set size to >> dev->coherent_dma_mask + 1 instead of dev->coherent_dma_mask. To detect >> overflow when mask is set to max of u64, add a check, log error and >> return. >> Some platform use mask format for size in DTS. So add a work around to >> catch this and fix. >> >> Cc: Joerg Roedel <joro@8bytes.org> >> Cc: Grant Likely <grant.likely@linaro.org> >> Cc: Rob Herring <robh+dt@kernel.org> >> Cc: Bjorn Helgaas <bhelgaas@google.com> >> Cc: Will Deacon <will.deacon@arm.com> >> Cc: Russell King <linux@arm.linux.org.uk> >> Cc: Arnd Bergmann <arnd@arndb.de> >> Cc: Suravee Suthikulpanit <Suravee.Suthikulpanit@amd.com> >> >> Signed-off-by: Murali Karicheri <m-karicheri2@ti.com> >> --- >> drivers/of/device.c | 14 +++++++++++++- >> 1 file changed, 13 insertions(+), 1 deletion(-) >> >> diff --git a/drivers/of/device.c b/drivers/of/device.c >> index 2de320d..0a5ff54 100644 >> --- a/drivers/of/device.c >> +++ b/drivers/of/device.c >> @@ -105,12 +105,24 @@ void of_dma_configure(struct device *dev, struct >> device_node *np) >> ret = of_dma_get_range(np, &dma_addr, &paddr, &size); >> if (ret < 0) { >> dma_addr = offset = 0; >> - size = dev->coherent_dma_mask; >> + size = dev->coherent_dma_mask + 1; >> } else { >> offset = PFN_DOWN(paddr - dma_addr); >> + /* >> + * Add a work around to treat the size as mask + 1 in case >> + * it is defined in DT as a mask. >> + */ >> + if (size & 1) >> + size = size + 1; >> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); >> } >> >> + /* if size is 0, we have an overflow of u64 */ >> + if (!size) { >> + dev_err(dev, "invalid size\n"); >> + return; >> + } >> + > > This seems potentially fragile to dodgy DTs given that we might also be > using size to make a mask later. Would it make sense to double-up a > sanity check as mask-format detection? Something like: > > if is_power_of_2(size) > // use size > else if is_power_of_2(size + 1) > // use size + 1 > else > // cry Robin, How about having the logic like this? ret = of_dma_get_range(np, &dma_addr, &paddr, &size); if (ret < 0) { dma_addr = offset = 0; size = dev->coherent_dma_mask + 1; } else { offset = PFN_DOWN(paddr - dma_addr); dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); } if (is_power_of_2(size + 1)) size = size + 1; else if (!is_power_of_2(size)) { dev_err(dev, "invalid size\n"); return; } Murali > > > Robin. > >> dev->dma_pfn_offset = offset; >> >> coherent = of_dma_is_coherent(np); >> > >
On 28/01/15 11:05, Catalin Marinas wrote: > On Tue, Jan 27, 2015 at 06:55:15PM +0000, Murali Karicheri wrote: >> On 01/27/2015 06:27 AM, Robin Murphy wrote: >>> On 23/01/15 22:32, Murali Karicheri wrote: >>>> Fix the dma-range size when the DT attribute is missing. i.e set size to >>>> dev->coherent_dma_mask + 1 instead of dev->coherent_dma_mask. To detect >>>> overflow when mask is set to max of u64, add a check, log error and >>>> return. >>>> Some platform use mask format for size in DTS. So add a work around to >>>> catch this and fix. >>>> >>>> Cc: Joerg Roedel <joro@8bytes.org> >>>> Cc: Grant Likely <grant.likely@linaro.org> >>>> Cc: Rob Herring <robh+dt@kernel.org> >>>> Cc: Bjorn Helgaas <bhelgaas@google.com> >>>> Cc: Will Deacon <will.deacon@arm.com> >>>> Cc: Russell King <linux@arm.linux.org.uk> >>>> Cc: Arnd Bergmann <arnd@arndb.de> >>>> Cc: Suravee Suthikulpanit <Suravee.Suthikulpanit@amd.com> >>>> >>>> Signed-off-by: Murali Karicheri <m-karicheri2@ti.com> >>>> --- >>>> drivers/of/device.c | 14 +++++++++++++- >>>> 1 file changed, 13 insertions(+), 1 deletion(-) >>>> >>>> diff --git a/drivers/of/device.c b/drivers/of/device.c >>>> index 2de320d..0a5ff54 100644 >>>> --- a/drivers/of/device.c >>>> +++ b/drivers/of/device.c >>>> @@ -105,12 +105,24 @@ void of_dma_configure(struct device *dev, struct >>>> device_node *np) >>>> ret = of_dma_get_range(np, &dma_addr, &paddr, &size); >>>> if (ret < 0) { >>>> dma_addr = offset = 0; >>>> - size = dev->coherent_dma_mask; >>>> + size = dev->coherent_dma_mask + 1; >>>> } else { >>>> offset = PFN_DOWN(paddr - dma_addr); >>>> + /* >>>> + * Add a work around to treat the size as mask + 1 in case >>>> + * it is defined in DT as a mask. >>>> + */ >>>> + if (size & 1) >>>> + size = size + 1; >>>> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); >>>> } >>>> >>>> + /* if size is 0, we have an overflow of u64 */ >>>> + if (!size) { >>>> + dev_err(dev, "invalid size\n"); >>>> + return; >>>> + } >>>> + >>> >>> This seems potentially fragile to dodgy DTs given that we might also be >>> using size to make a mask later. Would it make sense to double-up a >>> sanity check as mask-format detection? Something like: >>> >>> if is_power_of_2(size) >>> // use size >>> else if is_power_of_2(size + 1) >>> // use size + 1 >>> else >>> // cry >> >> How about having the logic like this? >> >> ret = of_dma_get_range(np, &dma_addr, &paddr, &size); >> if (ret < 0) { >> dma_addr = offset = 0; >> size = dev->coherent_dma_mask + 1; >> } else { >> offset = PFN_DOWN(paddr - dma_addr); >> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); >> } >> >> if (is_power_of_2(size + 1)) >> size = size + 1; >> else if (!is_power_of_2(size)) >> { >> dev_err(dev, "invalid size\n"); >> return; >> } > > In of_dma_configure(), we currently assume that the default coherent > mask is 32-bit. In this thread: > > http://article.gmane.org/gmane.linux.kernel/1835096 > > we talked about setting the coherent mask based on size automatically. > I'm not sure about the size but I think we can assume is 32-bit mask + 1 > if it is not specified in the DT. Instead of just assuming a default > mask, let's assume a default size and create the mask based on this > (untested): > > diff --git a/drivers/of/platform.c b/drivers/of/platform.c > index 5b33c6a21807..9ff8d1286b44 100644 > --- a/drivers/of/platform.c > +++ b/drivers/of/platform.c > @@ -170,10 +170,10 @@ static void of_dma_configure(struct device *dev) > struct iommu_ops *iommu; > > /* > - * Set default dma-mask to 32 bit. Drivers are expected to setup > - * the correct supported dma_mask. > + * Set default size to cover the 32-bit. Drivers are expected to setup > + * the correct size and dma_mask. > */ > - dev->coherent_dma_mask = DMA_BIT_MASK(32); > + size = 1ULL << 32; > > /* > * Set it to coherent_dma_mask by default if the architecture > @@ -185,13 +185,24 @@ static void of_dma_configure(struct device *dev) > ret = of_dma_get_range(dev->of_node, &dma_addr, &paddr, &size); > if (ret < 0) { > dma_addr = offset = 0; > - size = dev->coherent_dma_mask; > } else { > offset = PFN_DOWN(paddr - dma_addr); > dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", dev->dma_pfn_offset); > } > dev->dma_pfn_offset = offset; > > + /* > + * Workaround for DTs setting the size to a mask or 0. > + */ > + if (is_power_of_2(size + 1)) > + size += 1; In fact, since the ilog2 below ends up effectively rounding down, we might as well do away with this check as well and just add 1 unconditionally. The only time it makes any difference is when we want it to anyway! I like this approach, it ends up looking a lot neater. Robin. > + > + /* > + * Coherent DMA masks larger than 32-bit must be explicitly set by the > + * driver. > + */ > + dev->coherent_dma_mask = min(DMA_BIT_MASK(32), DMA_BIT_MASK(ilog2(size))); > + > coherent = of_dma_is_coherent(dev->of_node); > dev_dbg(dev, "device is%sdma coherent\n", > coherent ? " " : " not "); > -- To unsubscribe from this list: send the line "unsubscribe linux-pci" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Wed, Jan 28, 2015 at 03:55:57PM +0000, Robin Murphy wrote: > On 28/01/15 11:05, Catalin Marinas wrote: > > On Tue, Jan 27, 2015 at 06:55:15PM +0000, Murali Karicheri wrote: > >> How about having the logic like this? > >> > >> ret = of_dma_get_range(np, &dma_addr, &paddr, &size); > >> if (ret < 0) { > >> dma_addr = offset = 0; > >> size = dev->coherent_dma_mask + 1; > >> } else { > >> offset = PFN_DOWN(paddr - dma_addr); > >> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); > >> } > >> > >> if (is_power_of_2(size + 1)) > >> size = size + 1; > >> else if (!is_power_of_2(size)) > >> { > >> dev_err(dev, "invalid size\n"); > >> return; > >> } > > > > In of_dma_configure(), we currently assume that the default coherent > > mask is 32-bit. In this thread: > > > > http://article.gmane.org/gmane.linux.kernel/1835096 > > > > we talked about setting the coherent mask based on size automatically. > > I'm not sure about the size but I think we can assume is 32-bit mask + 1 > > if it is not specified in the DT. Instead of just assuming a default > > mask, let's assume a default size and create the mask based on this > > (untested): > > > > diff --git a/drivers/of/platform.c b/drivers/of/platform.c > > index 5b33c6a21807..9ff8d1286b44 100644 > > --- a/drivers/of/platform.c > > +++ b/drivers/of/platform.c > > @@ -170,10 +170,10 @@ static void of_dma_configure(struct device *dev) > > struct iommu_ops *iommu; > > > > /* > > - * Set default dma-mask to 32 bit. Drivers are expected to setup > > - * the correct supported dma_mask. > > + * Set default size to cover the 32-bit. Drivers are expected to setup > > + * the correct size and dma_mask. > > */ > > - dev->coherent_dma_mask = DMA_BIT_MASK(32); > > + size = 1ULL << 32; > > > > /* > > * Set it to coherent_dma_mask by default if the architecture > > @@ -185,13 +185,24 @@ static void of_dma_configure(struct device *dev) > > ret = of_dma_get_range(dev->of_node, &dma_addr, &paddr, &size); > > if (ret < 0) { > > dma_addr = offset = 0; > > - size = dev->coherent_dma_mask; > > } else { > > offset = PFN_DOWN(paddr - dma_addr); > > dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", dev->dma_pfn_offset); > > } > > dev->dma_pfn_offset = offset; > > > > + /* > > + * Workaround for DTs setting the size to a mask or 0. > > + */ > > + if (is_power_of_2(size + 1)) > > + size += 1; > > In fact, since the ilog2 below ends up effectively rounding down, we > might as well do away with this check as well and just add 1 > unconditionally. The only time it makes any difference is when we want > it to anyway! Well, we could simply ignore the is_power_of_2() check but without incrementing size as we don't know what arch_setup_dma_ops() does with it. I think we can remove this check altogether (we leaved without it for a while) but we need to add 1 when calculating the mask: dev->coherent_dma_mask = min(DMA_BIT_MASK(32), DMA_BIT_MASK(ilog2(size + 1)));
On 01/28/2015 12:30 PM, Catalin Marinas wrote: > On Wed, Jan 28, 2015 at 03:55:57PM +0000, Robin Murphy wrote: >> On 28/01/15 11:05, Catalin Marinas wrote: >>> On Tue, Jan 27, 2015 at 06:55:15PM +0000, Murali Karicheri wrote: >>>> How about having the logic like this? >>>> >>>> ret = of_dma_get_range(np,&dma_addr,&paddr,&size); >>>> if (ret< 0) { >>>> dma_addr = offset = 0; >>>> size = dev->coherent_dma_mask + 1; >>>> } else { >>>> offset = PFN_DOWN(paddr - dma_addr); >>>> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); >>>> } >>>> >>>> if (is_power_of_2(size + 1)) >>>> size = size + 1; >>>> else if (!is_power_of_2(size)) >>>> { >>>> dev_err(dev, "invalid size\n"); >>>> return; >>>> } >>> >>> In of_dma_configure(), we currently assume that the default coherent >>> mask is 32-bit. In this thread: >>> >>> http://article.gmane.org/gmane.linux.kernel/1835096 >>> >>> we talked about setting the coherent mask based on size automatically. >>> I'm not sure about the size but I think we can assume is 32-bit mask + 1 >>> if it is not specified in the DT. Instead of just assuming a default >>> mask, let's assume a default size and create the mask based on this >>> (untested): >>> >>> diff --git a/drivers/of/platform.c b/drivers/of/platform.c >>> index 5b33c6a21807..9ff8d1286b44 100644 >>> --- a/drivers/of/platform.c >>> +++ b/drivers/of/platform.c >>> @@ -170,10 +170,10 @@ static void of_dma_configure(struct device *dev) >>> struct iommu_ops *iommu; >>> >>> /* >>> - * Set default dma-mask to 32 bit. Drivers are expected to setup >>> - * the correct supported dma_mask. >>> + * Set default size to cover the 32-bit. Drivers are expected to setup >>> + * the correct size and dma_mask. >>> */ >>> - dev->coherent_dma_mask = DMA_BIT_MASK(32); >>> + size = 1ULL<< 32; >>> >>> /* >>> * Set it to coherent_dma_mask by default if the architecture >>> @@ -185,13 +185,24 @@ static void of_dma_configure(struct device *dev) >>> ret = of_dma_get_range(dev->of_node,&dma_addr,&paddr,&size); >>> if (ret< 0) { >>> dma_addr = offset = 0; >>> - size = dev->coherent_dma_mask; >>> } else { >>> offset = PFN_DOWN(paddr - dma_addr); >>> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", dev->dma_pfn_offset); >>> } >>> dev->dma_pfn_offset = offset; >>> >>> + /* >>> + * Workaround for DTs setting the size to a mask or 0. >>> + */ >>> + if (is_power_of_2(size + 1)) >>> + size += 1; >> >> In fact, since the ilog2 below ends up effectively rounding down, we >> might as well do away with this check as well and just add 1 >> unconditionally. The only time it makes any difference is when we want >> it to anyway! > > Well, we could simply ignore the is_power_of_2() check but without > incrementing size as we don't know what arch_setup_dma_ops() does with > it. > > I think we can remove this check altogether (we leaved without it for a > while) but we need to add 1 when calculating the mask: > > dev->coherent_dma_mask = min(DMA_BIT_MASK(32), > DMA_BIT_MASK(ilog2(size + 1))); > For Keystone, the dma_addr is to be taken care as well to determine the mask. The above will not work. Based on the discussion so far, this is the function I have come up with incorporating the suggestions. Please review this and see if I have missed out any. This works fine on Keystone. void of_dma_configure(struct device *dev, struct device_node *np) { u64 dma_addr = 0, paddr, size; int ret; bool coherent; unsigned long offset = 0; struct iommu_ops *iommu; /* * Set default size to cover the 32-bit. Drivers are expected to setup * the correct size and dma_mask. */ size = 1ULL << 32; ret = of_dma_get_range(np, &dma_addr, &paddr, &size); if (!ret) { offset = PFN_DOWN(paddr - dma_addr); if (!size) { dev_err(dev, "Invalid size (%llx)\n", size); return; } if (size & 1) { size = size + 1; dev_warn(dev, "Incorrect usage of size (%llx)\n", size); } dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); } dev->dma_pfn_offset = offset; /* * Coherent DMA masks larger than 32-bit must be explicitly set by the * driver. */ dev->coherent_dma_mask = min(DMA_BIT_MASK(32), DMA_BIT_MASK(ilog2(dma_addr + size))); /* * Set dma_mask to coherent_dma_mask by default if the architecture * code has not set it. */ if (!dev->dma_mask) dev->dma_mask = &dev->coherent_dma_mask; coherent = of_dma_is_coherent(np); dev_dbg(dev, "device is%sdma coherent\n", coherent ? " " : " not "); iommu = of_iommu_configure(dev, np); dev_dbg(dev, "device is%sbehind an iommu\n", iommu ? " " : " not "); arch_setup_dma_ops(dev, dma_addr, size, iommu, coherent); }
On Fri, Jan 30, 2015 at 06:06:27PM +0000, Murali Karicheri wrote: > On 01/28/2015 12:30 PM, Catalin Marinas wrote: > > I think we can remove this check altogether (we leaved without it for a > > while) but we need to add 1 when calculating the mask: > > > > dev->coherent_dma_mask = min(DMA_BIT_MASK(32), > > DMA_BIT_MASK(ilog2(size + 1))); > > > For Keystone, the dma_addr is to be taken care as well to determine the > mask. The above will not work. This was discussed before (not on this thread) and dma_addr should not affect the mask, it only affects the pfn offset. > Based on the discussion so far, this is the function I have come up with > incorporating the suggestions. Please review this and see if I have > missed out any. This works fine on Keystone. > > void of_dma_configure(struct device *dev, struct device_node *np) > { > u64 dma_addr = 0, paddr, size; > int ret; > bool coherent; > unsigned long offset = 0; > struct iommu_ops *iommu; > > /* > * Set default size to cover the 32-bit. Drivers are expected to setup > * the correct size and dma_mask. > */ > size = 1ULL << 32; > > ret = of_dma_get_range(np, &dma_addr, &paddr, &size); > if (!ret) { > offset = PFN_DOWN(paddr - dma_addr); > if (!size) { > dev_err(dev, "Invalid size (%llx)\n", > size); > return; > } > if (size & 1) { > size = size + 1; > dev_warn(dev, "Incorrect usage of size (%llx)\n", > size); > } > dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); > } > dev->dma_pfn_offset = offset; > > /* > * Coherent DMA masks larger than 32-bit must be explicitly set by the > * driver. > */ > dev->coherent_dma_mask = min(DMA_BIT_MASK(32), > DMA_BIT_MASK(ilog2(dma_addr + size))); That's not correct, coherent_dma_mask should still be calculated solely based on size, not dma_addr. Functions like swiotlb_dma_supported() use phys_to_dma() which on ARM (32-bit) subtracts the dma_pfn_offset, so the mask based on size works fine. In the arm64 tree, we haven't taken dma_pfn_offset into account for phys_to_dma() yet but if needed for a SoC, we'll add it.
On 02/02/2015 07:18 AM, Catalin Marinas wrote: > On Fri, Jan 30, 2015 at 06:06:27PM +0000, Murali Karicheri wrote: >> On 01/28/2015 12:30 PM, Catalin Marinas wrote: >>> I think we can remove this check altogether (we leaved without it for a >>> while) but we need to add 1 when calculating the mask: >>> >>> dev->coherent_dma_mask = min(DMA_BIT_MASK(32), >>> DMA_BIT_MASK(ilog2(size + 1))); >>> >> For Keystone, the dma_addr is to be taken care as well to determine the >> mask. The above will not work. > > This was discussed before (not on this thread) and dma_addr should not > affect the mask, it only affects the pfn offset. > >> Based on the discussion so far, this is the function I have come up with >> incorporating the suggestions. Please review this and see if I have >> missed out any. This works fine on Keystone. >> >> void of_dma_configure(struct device *dev, struct device_node *np) >> { >> u64 dma_addr = 0, paddr, size; >> int ret; >> bool coherent; >> unsigned long offset = 0; >> struct iommu_ops *iommu; >> >> /* >> * Set default size to cover the 32-bit. Drivers are expected to setup >> * the correct size and dma_mask. >> */ >> size = 1ULL<< 32; >> >> ret = of_dma_get_range(np,&dma_addr,&paddr,&size); >> if (!ret) { >> offset = PFN_DOWN(paddr - dma_addr); >> if (!size) { >> dev_err(dev, "Invalid size (%llx)\n", >> size); >> return; >> } >> if (size& 1) { >> size = size + 1; >> dev_warn(dev, "Incorrect usage of size (%llx)\n", >> size); >> } >> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); >> } >> dev->dma_pfn_offset = offset; >> >> /* >> * Coherent DMA masks larger than 32-bit must be explicitly set by the >> * driver. >> */ >> dev->coherent_dma_mask = min(DMA_BIT_MASK(32), >> DMA_BIT_MASK(ilog2(dma_addr + size))); > > That's not correct, coherent_dma_mask should still be calculated solely > based on size, not dma_addr. > > Functions like swiotlb_dma_supported() use phys_to_dma() which on ARM > (32-bit) subtracts the dma_pfn_offset, so the mask based on size works > fine. > > In the arm64 tree, we haven't taken dma_pfn_offset into account for > phys_to_dma() yet but if needed for a SoC, we'll add it. > I need to hear Arnd's comment on this. I am seeing an issue without this change. Probably it needs a change else where. I will post the error I am getting to this list. Murali
On 02/02/2015 07:18 AM, Catalin Marinas wrote: > On Fri, Jan 30, 2015 at 06:06:27PM +0000, Murali Karicheri wrote: >> On 01/28/2015 12:30 PM, Catalin Marinas wrote: >>> I think we can remove this check altogether (we leaved without it for a >>> while) but we need to add 1 when calculating the mask: >>> >>> dev->coherent_dma_mask = min(DMA_BIT_MASK(32), >>> DMA_BIT_MASK(ilog2(size + 1))); >>> >> For Keystone, the dma_addr is to be taken care as well to determine the >> mask. The above will not work. > > This was discussed before (not on this thread) and dma_addr should not > affect the mask, it only affects the pfn offset. > >> Based on the discussion so far, this is the function I have come up with >> incorporating the suggestions. Please review this and see if I have >> missed out any. This works fine on Keystone. >> >> void of_dma_configure(struct device *dev, struct device_node *np) >> { >> u64 dma_addr = 0, paddr, size; >> int ret; >> bool coherent; >> unsigned long offset = 0; >> struct iommu_ops *iommu; >> >> /* >> * Set default size to cover the 32-bit. Drivers are expected to setup >> * the correct size and dma_mask. >> */ >> size = 1ULL<< 32; >> >> ret = of_dma_get_range(np,&dma_addr,&paddr,&size); >> if (!ret) { >> offset = PFN_DOWN(paddr - dma_addr); >> if (!size) { >> dev_err(dev, "Invalid size (%llx)\n", >> size); >> return; >> } >> if (size& 1) { >> size = size + 1; >> dev_warn(dev, "Incorrect usage of size (%llx)\n", >> size); >> } >> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); >> } >> dev->dma_pfn_offset = offset; >> >> /* >> * Coherent DMA masks larger than 32-bit must be explicitly set by the >> * driver. >> */ >> dev->coherent_dma_mask = min(DMA_BIT_MASK(32), >> DMA_BIT_MASK(ilog2(dma_addr + size))); > > That's not correct, coherent_dma_mask should still be calculated solely > based on size, not dma_addr. > > Functions like swiotlb_dma_supported() use phys_to_dma() which on ARM > (32-bit) subtracts the dma_pfn_offset, so the mask based on size works > fine. > > In the arm64 tree, we haven't taken dma_pfn_offset into account for > phys_to_dma() yet but if needed for a SoC, we'll add it. > Catalin, The size based dma mask calculation itself can be a separate topic patch. This series is adding important fix to get the PCI driver functional and I would like to get this merged as soon as possible. I also want to hear from Arnd about yout comment as we had discussed this in the initial discussion of this patch series and 8/8 is essentially added based on that discussion. I will add a simple check to catch and warn the incorrect size setting in DT for dma-range as suggested by Rob Herring and create a new patch to for size based mask calculation. I will be sending v6 (expected to be merged soon) today and will work to add a new patch. Before that we need to agree on what is the content of the patch. 1. On keystone, DMA address start at 0x8000_0000 and DMA-able memory is 2G from the above base address. So without taking into account the dma_addr, mask calculated will be 0x7fff_ffff where as we need that to be 0xffff_ffff for keystone. So need to use this in the calculation. 2. W.r.t arm_pfn_offset, this was added to support Keystone as the DDR physical address for LPAE startes at 0x8_0000_0000 and the pfn offset is calculated as the PFN of (0x8_0000_0000 - 0x8000_0000) to do the dma address to DDR address translation. I haven't looked at swiotlb_dma_supported() but will do so. Murali
Murali, On Thu, Feb 05, 2015 at 09:42:27PM +0000, Murali Karicheri wrote: > On 02/02/2015 07:18 AM, Catalin Marinas wrote: > > On Fri, Jan 30, 2015 at 06:06:27PM +0000, Murali Karicheri wrote: > >> On 01/28/2015 12:30 PM, Catalin Marinas wrote: > >>> I think we can remove this check altogether (we leaved without it for a > >>> while) but we need to add 1 when calculating the mask: > >>> > >>> dev->coherent_dma_mask = min(DMA_BIT_MASK(32), > >>> DMA_BIT_MASK(ilog2(size + 1))); > >>> > >> For Keystone, the dma_addr is to be taken care as well to determine the > >> mask. The above will not work. > > > > This was discussed before (not on this thread) and dma_addr should not > > affect the mask, it only affects the pfn offset. > > > >> Based on the discussion so far, this is the function I have come up with > >> incorporating the suggestions. Please review this and see if I have > >> missed out any. This works fine on Keystone. > >> > >> void of_dma_configure(struct device *dev, struct device_node *np) > >> { > >> u64 dma_addr = 0, paddr, size; > >> int ret; > >> bool coherent; > >> unsigned long offset = 0; > >> struct iommu_ops *iommu; > >> > >> /* > >> * Set default size to cover the 32-bit. Drivers are expected to setup > >> * the correct size and dma_mask. > >> */ > >> size = 1ULL<< 32; > >> > >> ret = of_dma_get_range(np,&dma_addr,&paddr,&size); > >> if (!ret) { > >> offset = PFN_DOWN(paddr - dma_addr); > >> if (!size) { > >> dev_err(dev, "Invalid size (%llx)\n", > >> size); > >> return; > >> } > >> if (size& 1) { > >> size = size + 1; > >> dev_warn(dev, "Incorrect usage of size (%llx)\n", > >> size); > >> } > >> dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); > >> } > >> dev->dma_pfn_offset = offset; > >> > >> /* > >> * Coherent DMA masks larger than 32-bit must be explicitly set by the > >> * driver. > >> */ > >> dev->coherent_dma_mask = min(DMA_BIT_MASK(32), > >> DMA_BIT_MASK(ilog2(dma_addr + size))); > > > > That's not correct, coherent_dma_mask should still be calculated solely > > based on size, not dma_addr. > > > > Functions like swiotlb_dma_supported() use phys_to_dma() which on ARM > > (32-bit) subtracts the dma_pfn_offset, so the mask based on size works > > fine. > > > > In the arm64 tree, we haven't taken dma_pfn_offset into account for > > phys_to_dma() yet but if needed for a SoC, we'll add it. > > The size based dma mask calculation itself can be a separate topic > patch. This series is adding important fix to get the PCI driver > functional and I would like to get this merged as soon as possible. Well as long as you don't break the existing users of of_dma_configure() ;). > I also want to hear from Arnd about yout comment as we had discussed > this in the initial discussion of this patch series and 8/8 is > essentially added based on that discussion. I will add a simple check > to catch and warn the incorrect size setting in DT for dma-range as > suggested by Rob Herring and create a new patch to for size based mask > calculation. I will be sending v6 (expected to be merged soon) today > and will work to add a new patch. Before that we need to agree on what > is the content of the patch. > > 1. On keystone, DMA address start at 0x8000_0000 and DMA-able memory is > 2G from the above base address. So without taking into account the > dma_addr, mask calculated will be 0x7fff_ffff where as we need that to > be 0xffff_ffff for keystone. So need to use this in the calculation. Ah, you are right. I confused dma_addr in your patch with the offset. > 2. W.r.t arm_pfn_offset, this was added to support Keystone as the DDR > physical address for LPAE startes at 0x8_0000_0000 and the pfn offset is > calculated as the PFN of (0x8_0000_0000 - 0x8000_0000) to do the dma > address to DDR address translation. Indeed. I'll look at your patches again tomorrow.
diff --git a/drivers/of/device.c b/drivers/of/device.c index 2de320d..0a5ff54 100644 --- a/drivers/of/device.c +++ b/drivers/of/device.c @@ -105,12 +105,24 @@ void of_dma_configure(struct device *dev, struct device_node *np) ret = of_dma_get_range(np, &dma_addr, &paddr, &size); if (ret < 0) { dma_addr = offset = 0; - size = dev->coherent_dma_mask; + size = dev->coherent_dma_mask + 1; } else { offset = PFN_DOWN(paddr - dma_addr); + /* + * Add a work around to treat the size as mask + 1 in case + * it is defined in DT as a mask. + */ + if (size & 1) + size = size + 1; dev_dbg(dev, "dma_pfn_offset(%#08lx)\n", offset); } + /* if size is 0, we have an overflow of u64 */ + if (!size) { + dev_err(dev, "invalid size\n"); + return; + } + dev->dma_pfn_offset = offset; coherent = of_dma_is_coherent(np);
Fix the dma-range size when the DT attribute is missing. i.e set size to dev->coherent_dma_mask + 1 instead of dev->coherent_dma_mask. To detect overflow when mask is set to max of u64, add a check, log error and return. Some platform use mask format for size in DTS. So add a work around to catch this and fix. Cc: Joerg Roedel <joro@8bytes.org> Cc: Grant Likely <grant.likely@linaro.org> Cc: Rob Herring <robh+dt@kernel.org> Cc: Bjorn Helgaas <bhelgaas@google.com> Cc: Will Deacon <will.deacon@arm.com> Cc: Russell King <linux@arm.linux.org.uk> Cc: Arnd Bergmann <arnd@arndb.de> Cc: Suravee Suthikulpanit <Suravee.Suthikulpanit@amd.com> Signed-off-by: Murali Karicheri <m-karicheri2@ti.com> --- drivers/of/device.c | 14 +++++++++++++- 1 file changed, 13 insertions(+), 1 deletion(-)