Message ID | 1461407219-72027-10-git-send-email-oulijun@huawei.com (mailing list archive) |
---|---|
State | Superseded |
Headers | show |
On Sat, Apr 23, 2016 at 06:26:47PM +0800, Lijun Ou wrote: > This patch mainly setup hca for RoCE. it will do a series of > initial works as follows: > 1. init uar table, allocate uar resource > 2. init pd table > 3. init cq table > 4. init mr table > 5. init qp table > > Signed-off-by: Lijun Ou <oulijun@huawei.com> > Signed-off-by: Wei Hu(Xavier) <xavier.huwei@huawei.com> > --- > drivers/infiniband/hw/hns/hns_roce_alloc.c | 104 ++++++++++++++++ > drivers/infiniband/hw/hns/hns_roce_cq.c | 25 ++++ > drivers/infiniband/hw/hns/hns_roce_device.h | 69 ++++++++++ > drivers/infiniband/hw/hns/hns_roce_eq.c | 1 - > drivers/infiniband/hw/hns/hns_roce_icm.c | 88 +++++++++++++ > drivers/infiniband/hw/hns/hns_roce_icm.h | 9 ++ > drivers/infiniband/hw/hns/hns_roce_main.c | 79 ++++++++++++ > drivers/infiniband/hw/hns/hns_roce_mr.c | 187 ++++++++++++++++++++++++++++ > drivers/infiniband/hw/hns/hns_roce_pd.c | 65 ++++++++++ > drivers/infiniband/hw/hns/hns_roce_qp.c | 30 +++++ > 10 files changed, 656 insertions(+), 1 deletion(-) > create mode 100644 drivers/infiniband/hw/hns/hns_roce_alloc.c > create mode 100644 drivers/infiniband/hw/hns/hns_roce_mr.c > create mode 100644 drivers/infiniband/hw/hns/hns_roce_pd.c > > diff --git a/drivers/infiniband/hw/hns/hns_roce_alloc.c b/drivers/infiniband/hw/hns/hns_roce_alloc.c > new file mode 100644 > index 0000000..0c76f1b > --- /dev/null > +++ b/drivers/infiniband/hw/hns/hns_roce_alloc.c > @@ -0,0 +1,104 @@ > +/* > + * Copyright (c) 2016 Hisilicon Limited. > + * > + * This program is free software; you can redistribute it and/or modify > + * it under the terms of the GNU General Public License as published by > + * the Free Software Foundation; either version 2 of the License, or > + * (at your option) any later version. > + */ > + > +#include <linux/bitmap.h> > +#include <linux/dma-mapping.h> > +#include <linux/errno.h> > +#include <linux/mm.h> > +#include <linux/slab.h> > +#include <linux/vmalloc.h> > +#include "hns_roce_device.h" > + > +int hns_roce_bitmap_alloc(struct hns_roce_bitmap *bitmap, u32 *obj) > +{ > + int ret = 0; > + > + spin_lock(&bitmap->lock); > + *obj = find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); > + if (*obj >= bitmap->max) { > + bitmap->top = (bitmap->top + bitmap->max + bitmap->reserved_top) > + & bitmap->mask; > + *obj = find_first_zero_bit(bitmap->table, bitmap->max); find_first_zero_bit function returns "unsigned long" which may or may not be equal to u32 on some architectures.
On 2016/4/24 15:54, Leon Romanovsky wrote: > On Sat, Apr 23, 2016 at 06:26:47PM +0800, Lijun Ou wrote: >> This patch mainly setup hca for RoCE. it will do a series of >> initial works as follows: >> 1. init uar table, allocate uar resource >> 2. init pd table >> 3. init cq table >> 4. init mr table >> 5. init qp table >> >> Signed-off-by: Lijun Ou <oulijun@huawei.com> >> Signed-off-by: Wei Hu(Xavier) <xavier.huwei@huawei.com> >> --- >> drivers/infiniband/hw/hns/hns_roce_alloc.c | 104 ++++++++++++++++ >> drivers/infiniband/hw/hns/hns_roce_cq.c | 25 ++++ >> drivers/infiniband/hw/hns/hns_roce_device.h | 69 ++++++++++ >> drivers/infiniband/hw/hns/hns_roce_eq.c | 1 - >> drivers/infiniband/hw/hns/hns_roce_icm.c | 88 +++++++++++++ >> drivers/infiniband/hw/hns/hns_roce_icm.h | 9 ++ >> drivers/infiniband/hw/hns/hns_roce_main.c | 79 ++++++++++++ >> drivers/infiniband/hw/hns/hns_roce_mr.c | 187 ++++++++++++++++++++++++++++ >> drivers/infiniband/hw/hns/hns_roce_pd.c | 65 ++++++++++ >> drivers/infiniband/hw/hns/hns_roce_qp.c | 30 +++++ >> 10 files changed, 656 insertions(+), 1 deletion(-) >> create mode 100644 drivers/infiniband/hw/hns/hns_roce_alloc.c >> create mode 100644 drivers/infiniband/hw/hns/hns_roce_mr.c >> create mode 100644 drivers/infiniband/hw/hns/hns_roce_pd.c >> >> diff --git a/drivers/infiniband/hw/hns/hns_roce_alloc.c b/drivers/infiniband/hw/hns/hns_roce_alloc.c >> new file mode 100644 >> index 0000000..0c76f1b >> --- /dev/null >> +++ b/drivers/infiniband/hw/hns/hns_roce_alloc.c >> @@ -0,0 +1,104 @@ >> +/* >> + * Copyright (c) 2016 Hisilicon Limited. >> + * >> + * This program is free software; you can redistribute it and/or modify >> + * it under the terms of the GNU General Public License as published by >> + * the Free Software Foundation; either version 2 of the License, or >> + * (at your option) any later version. >> + */ >> + >> +#include <linux/bitmap.h> >> +#include <linux/dma-mapping.h> >> +#include <linux/errno.h> >> +#include <linux/mm.h> >> +#include <linux/slab.h> >> +#include <linux/vmalloc.h> >> +#include "hns_roce_device.h" >> + >> +int hns_roce_bitmap_alloc(struct hns_roce_bitmap *bitmap, u32 *obj) >> +{ >> + int ret = 0; >> + >> + spin_lock(&bitmap->lock); >> + *obj = find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); >> + if (*obj >= bitmap->max) { >> + bitmap->top = (bitmap->top + bitmap->max + bitmap->reserved_top) >> + & bitmap->mask; >> + *obj = find_first_zero_bit(bitmap->table, bitmap->max); > > find_first_zero_bit function returns "unsigned long" which may or may > not be equal to u32 on some architectures. > Hi Leon, I appreciate your keen eye. this code is meant for ARM64bit therefore should run corretly for 64-bit AARCH64. I will consider changing it as part of good partice and better portability " I will give a primary plan to modified it. for example: *obj = (u32)find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); Beause the max size of bitmap->table is u32 in current version. int hns_roce_bitmap_init(struct hns_roce_bitmap *bitmap, u32 num, u32 mask, u32 reserved_bot, u32 reserved_top) { u32 i; if (num != roundup_pow_of_two(num)) return -EINVAL; bitmap->last = 0; bitmap->top = 0; bitmap->max = num - reserved_top; bitmap->mask = mask; bitmap->reserved_top = reserved_top; spin_lock_init(&bitmap->lock); bitmap->table = kcalloc(BITS_TO_LONGS(bitmap->max), sizeof(long), GFP_KERNEL); Is this plan ok? Thanks Lijun Ou -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Tue, Apr 26, 2016 at 02:34:44PM +0800, oulijun wrote: > On 2016/4/24 15:54, Leon Romanovsky wrote: > > On Sat, Apr 23, 2016 at 06:26:47PM +0800, Lijun Ou wrote: > >> This patch mainly setup hca for RoCE. it will do a series of > >> initial works as follows: > >> 1. init uar table, allocate uar resource > >> 2. init pd table > >> 3. init cq table > >> 4. init mr table > >> 5. init qp table > >> > >> Signed-off-by: Lijun Ou <oulijun@huawei.com> > >> Signed-off-by: Wei Hu(Xavier) <xavier.huwei@huawei.com> > >> --- > >> drivers/infiniband/hw/hns/hns_roce_alloc.c | 104 ++++++++++++++++ > >> drivers/infiniband/hw/hns/hns_roce_cq.c | 25 ++++ > >> drivers/infiniband/hw/hns/hns_roce_device.h | 69 ++++++++++ > >> drivers/infiniband/hw/hns/hns_roce_eq.c | 1 - > >> drivers/infiniband/hw/hns/hns_roce_icm.c | 88 +++++++++++++ > >> drivers/infiniband/hw/hns/hns_roce_icm.h | 9 ++ > >> drivers/infiniband/hw/hns/hns_roce_main.c | 79 ++++++++++++ > >> drivers/infiniband/hw/hns/hns_roce_mr.c | 187 ++++++++++++++++++++++++++++ > >> drivers/infiniband/hw/hns/hns_roce_pd.c | 65 ++++++++++ > >> drivers/infiniband/hw/hns/hns_roce_qp.c | 30 +++++ > >> 10 files changed, 656 insertions(+), 1 deletion(-) > >> create mode 100644 drivers/infiniband/hw/hns/hns_roce_alloc.c > >> create mode 100644 drivers/infiniband/hw/hns/hns_roce_mr.c > >> create mode 100644 drivers/infiniband/hw/hns/hns_roce_pd.c > >> > >> diff --git a/drivers/infiniband/hw/hns/hns_roce_alloc.c b/drivers/infiniband/hw/hns/hns_roce_alloc.c > >> new file mode 100644 > >> index 0000000..0c76f1b > >> --- /dev/null > >> +++ b/drivers/infiniband/hw/hns/hns_roce_alloc.c > >> @@ -0,0 +1,104 @@ > >> +/* > >> + * Copyright (c) 2016 Hisilicon Limited. > >> + * > >> + * This program is free software; you can redistribute it and/or modify > >> + * it under the terms of the GNU General Public License as published by > >> + * the Free Software Foundation; either version 2 of the License, or > >> + * (at your option) any later version. > >> + */ > >> + > >> +#include <linux/bitmap.h> > >> +#include <linux/dma-mapping.h> > >> +#include <linux/errno.h> > >> +#include <linux/mm.h> > >> +#include <linux/slab.h> > >> +#include <linux/vmalloc.h> > >> +#include "hns_roce_device.h" > >> + > >> +int hns_roce_bitmap_alloc(struct hns_roce_bitmap *bitmap, u32 *obj) > >> +{ > >> + int ret = 0; > >> + > >> + spin_lock(&bitmap->lock); > >> + *obj = find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); > >> + if (*obj >= bitmap->max) { > >> + bitmap->top = (bitmap->top + bitmap->max + bitmap->reserved_top) > >> + & bitmap->mask; > >> + *obj = find_first_zero_bit(bitmap->table, bitmap->max); > > > > find_first_zero_bit function returns "unsigned long" which may or may > > not be equal to u32 on some architectures. > > > Hi Leon, > I appreciate your keen eye. this code is meant for ARM64bit therefore should run corretly for 64-bit AARCH64. > I will consider changing it as part of good partice and better portability " > I will give a primary plan to modified it. > for example: > *obj = (u32)find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); > Beause the max size of bitmap->table is u32 in current version. > > int hns_roce_bitmap_init(struct hns_roce_bitmap *bitmap, u32 num, u32 mask, > u32 reserved_bot, u32 reserved_top) > { > u32 i; > > if (num != roundup_pow_of_two(num)) > return -EINVAL; > > bitmap->last = 0; > bitmap->top = 0; > bitmap->max = num - reserved_top; > bitmap->mask = mask; > bitmap->reserved_top = reserved_top; > spin_lock_init(&bitmap->lock); > bitmap->table = kcalloc(BITS_TO_LONGS(bitmap->max), sizeof(long), > GFP_KERNEL); > > Is this plan ok? No, You are submitting new driver, please do it properly (without casting) from the beginning. > > Thanks > Lijun Ou > > -- > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html
Tue, Apr 26, 2016 at 04:18:21PM CEST, leon@kernel.org wrote: >On Tue, Apr 26, 2016 at 02:34:44PM +0800, oulijun wrote: >> On 2016/4/24 15:54, Leon Romanovsky wrote: <snip> >> >> +int hns_roce_bitmap_alloc(struct hns_roce_bitmap *bitmap, u32 *obj) >> >> +{ >> >> + int ret = 0; >> >> + >> >> + spin_lock(&bitmap->lock); >> >> + *obj = find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); >> >> + if (*obj >= bitmap->max) { >> >> + bitmap->top = (bitmap->top + bitmap->max + bitmap->reserved_top) >> >> + & bitmap->mask; >> >> + *obj = find_first_zero_bit(bitmap->table, bitmap->max); >> > >> > find_first_zero_bit function returns "unsigned long" which may or may >> > not be equal to u32 on some architectures. >> > >> Hi Leon, >> I appreciate your keen eye. this code is meant for ARM64bit therefore should run corretly for 64-bit AARCH64. The driver should run correctly on any arch. >> I will consider changing it as part of good partice and better portability " >> I will give a primary plan to modified it. >> for example: >> *obj = (u32)find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); >> Beause the max size of bitmap->table is u32 in current version. >> >> int hns_roce_bitmap_init(struct hns_roce_bitmap *bitmap, u32 num, u32 mask, >> u32 reserved_bot, u32 reserved_top) >> { >> u32 i; >> >> if (num != roundup_pow_of_two(num)) >> return -EINVAL; >> >> bitmap->last = 0; >> bitmap->top = 0; >> bitmap->max = num - reserved_top; >> bitmap->mask = mask; >> bitmap->reserved_top = reserved_top; >> spin_lock_init(&bitmap->lock); >> bitmap->table = kcalloc(BITS_TO_LONGS(bitmap->max), sizeof(long), >> GFP_KERNEL); >> >> Is this plan ok? > >No, >You are submitting new driver, please do it properly (without casting) >from the beginning. > >> >> Thanks >> Lijun Ou >> >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On 2016/4/26 22:25, Jiri Pirko wrote: > Tue, Apr 26, 2016 at 04:18:21PM CEST, leon@kernel.org wrote: >> On Tue, Apr 26, 2016 at 02:34:44PM +0800, oulijun wrote: >>> On 2016/4/24 15:54, Leon Romanovsky wrote: > > <snip> > >>>>> +int hns_roce_bitmap_alloc(struct hns_roce_bitmap *bitmap, u32 *obj) >>>>> +{ >>>>> + int ret = 0; >>>>> + >>>>> + spin_lock(&bitmap->lock); >>>>> + *obj = find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); >>>>> + if (*obj >= bitmap->max) { >>>>> + bitmap->top = (bitmap->top + bitmap->max + bitmap->reserved_top) >>>>> + & bitmap->mask; >>>>> + *obj = find_first_zero_bit(bitmap->table, bitmap->max); >>>> >>>> find_first_zero_bit function returns "unsigned long" which may or may >>>> not be equal to u32 on some architectures. >>>> >>> Hi Leon, >>> I appreciate your keen eye. this code is meant for ARM64bit therefore should run corretly for 64-bit AARCH64. > > The driver should run correctly on any arch. > Hi Jiri Pirko, Our driver run in ARM64 platform by depending on Kconfig. It will be configure in Kconfig file. Thanks Lijun Ou > >>> I will consider changing it as part of good partice and better portability " >>> I will give a primary plan to modified it. >>> for example: >>> *obj = (u32)find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); >>> Beause the max size of bitmap->table is u32 in current version. >>> >>> int hns_roce_bitmap_init(struct hns_roce_bitmap *bitmap, u32 num, u32 mask, >>> u32 reserved_bot, u32 reserved_top) >>> { >>> u32 i; >>> >>> if (num != roundup_pow_of_two(num)) >>> return -EINVAL; >>> >>> bitmap->last = 0; >>> bitmap->top = 0; >>> bitmap->max = num - reserved_top; >>> bitmap->mask = mask; >>> bitmap->reserved_top = reserved_top; >>> spin_lock_init(&bitmap->lock); >>> bitmap->table = kcalloc(BITS_TO_LONGS(bitmap->max), sizeof(long), >>> GFP_KERNEL); >>> >>> Is this plan ok? >> >> No, >> You are submitting new driver, please do it properly (without casting) >>from the beginning. >> >>> >>> Thanks >>> Lijun Ou >>> >>> -- >>> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in >>> the body of a message to majordomo@vger.kernel.org >>> More majordomo info at http://vger.kernel.org/majordomo-info.html > > > > . > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On 2016/4/26 22:18, Leon Romanovsky wrote: > On Tue, Apr 26, 2016 at 02:34:44PM +0800, oulijun wrote: >> On 2016/4/24 15:54, Leon Romanovsky wrote: >>> On Sat, Apr 23, 2016 at 06:26:47PM +0800, Lijun Ou wrote: >>>> This patch mainly setup hca for RoCE. it will do a series of >>>> initial works as follows: >>>> 1. init uar table, allocate uar resource >>>> 2. init pd table >>>> 3. init cq table >>>> 4. init mr table >>>> 5. init qp table >>>> >>>> Signed-off-by: Lijun Ou <oulijun@huawei.com> >>>> Signed-off-by: Wei Hu(Xavier) <xavier.huwei@huawei.com> >>>> --- >>>> drivers/infiniband/hw/hns/hns_roce_alloc.c | 104 ++++++++++++++++ >>>> drivers/infiniband/hw/hns/hns_roce_cq.c | 25 ++++ >>>> drivers/infiniband/hw/hns/hns_roce_device.h | 69 ++++++++++ >>>> drivers/infiniband/hw/hns/hns_roce_eq.c | 1 - >>>> drivers/infiniband/hw/hns/hns_roce_icm.c | 88 +++++++++++++ >>>> drivers/infiniband/hw/hns/hns_roce_icm.h | 9 ++ >>>> drivers/infiniband/hw/hns/hns_roce_main.c | 79 ++++++++++++ >>>> drivers/infiniband/hw/hns/hns_roce_mr.c | 187 ++++++++++++++++++++++++++++ >>>> drivers/infiniband/hw/hns/hns_roce_pd.c | 65 ++++++++++ >>>> drivers/infiniband/hw/hns/hns_roce_qp.c | 30 +++++ >>>> 10 files changed, 656 insertions(+), 1 deletion(-) >>>> create mode 100644 drivers/infiniband/hw/hns/hns_roce_alloc.c >>>> create mode 100644 drivers/infiniband/hw/hns/hns_roce_mr.c >>>> create mode 100644 drivers/infiniband/hw/hns/hns_roce_pd.c >>>> >>>> diff --git a/drivers/infiniband/hw/hns/hns_roce_alloc.c b/drivers/infiniband/hw/hns/hns_roce_alloc.c >>>> new file mode 100644 >>>> index 0000000..0c76f1b >>>> --- /dev/null >>>> +++ b/drivers/infiniband/hw/hns/hns_roce_alloc.c >>>> @@ -0,0 +1,104 @@ >>>> +/* >>>> + * Copyright (c) 2016 Hisilicon Limited. >>>> + * >>>> + * This program is free software; you can redistribute it and/or modify >>>> + * it under the terms of the GNU General Public License as published by >>>> + * the Free Software Foundation; either version 2 of the License, or >>>> + * (at your option) any later version. >>>> + */ >>>> + >>>> +#include <linux/bitmap.h> >>>> +#include <linux/dma-mapping.h> >>>> +#include <linux/errno.h> >>>> +#include <linux/mm.h> >>>> +#include <linux/slab.h> >>>> +#include <linux/vmalloc.h> >>>> +#include "hns_roce_device.h" >>>> + >>>> +int hns_roce_bitmap_alloc(struct hns_roce_bitmap *bitmap, u32 *obj) >>>> +{ >>>> + int ret = 0; >>>> + >>>> + spin_lock(&bitmap->lock); >>>> + *obj = find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); >>>> + if (*obj >= bitmap->max) { >>>> + bitmap->top = (bitmap->top + bitmap->max + bitmap->reserved_top) >>>> + & bitmap->mask; >>>> + *obj = find_first_zero_bit(bitmap->table, bitmap->max); >>> >>> find_first_zero_bit function returns "unsigned long" which may or may >>> not be equal to u32 on some architectures. >>> >> Hi Leon, >> I appreciate your keen eye. this code is meant for ARM64bit therefore should run corretly for 64-bit AARCH64. >> I will consider changing it as part of good partice and better portability " >> I will give a primary plan to modified it. >> for example: >> *obj = (u32)find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); >> Beause the max size of bitmap->table is u32 in current version. >> >> int hns_roce_bitmap_init(struct hns_roce_bitmap *bitmap, u32 num, u32 mask, >> u32 reserved_bot, u32 reserved_top) >> { >> u32 i; >> >> if (num != roundup_pow_of_two(num)) >> return -EINVAL; >> >> bitmap->last = 0; >> bitmap->top = 0; >> bitmap->max = num - reserved_top; >> bitmap->mask = mask; >> bitmap->reserved_top = reserved_top; >> spin_lock_init(&bitmap->lock); >> bitmap->table = kcalloc(BITS_TO_LONGS(bitmap->max), sizeof(long), >> GFP_KERNEL); >> >> Is this plan ok? > > No, > You are submitting new driver, please do it properly (without casting) > from the beginning. > oh, i see. I will give a better plan to modify it. Thanks Lijun Ou >> >> Thanks >> Lijun Ou >> >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-rdma" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Wed, Apr 27, 2016 at 6:34 AM, oulijun <oulijun@huawei.com> wrote: > On 2016/4/26 22:25, Jiri Pirko wrote: >> Tue, Apr 26, 2016 at 04:18:21PM CEST, leon@kernel.org wrote: >>>> I appreciate your keen eye. this code is meant for ARM64bit therefore should run corretly for 64-bit AARCH64. >> The driver should run correctly on any arch. > Hi Jiri Pirko, > Our driver run in ARM64 platform by depending on Kconfig. It will be configure in Kconfig file. Can you elaborate what design aspects in the driver or anywhere else should impose that limitation? Or. -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On 2016/4/30 12:33, Or Gerlitz wrote: > On Wed, Apr 27, 2016 at 6:34 AM, oulijun <oulijun@huawei.com> wrote: >> On 2016/4/26 22:25, Jiri Pirko wrote: >>> Tue, Apr 26, 2016 at 04:18:21PM CEST, leon@kernel.org wrote: >>>>> I appreciate your keen eye. this code is meant for ARM64bit therefore should run corretly for 64-bit AARCH64. >>> The driver should run correctly on any arch. >> Hi Jiri Pirko, >> Our driver run in ARM64 platform by depending on Kconfig. It will be configure in Kconfig file. > Can you elaborate what design aspects in the driver or anywhere else > should impose that limitation? > > Or. Hi, Or Gerlitz 1. Oulijun resloved the problem, and sent PATCH V6 on 2016-4-28. Thanks for more comments. 2. This driver for Hisilicon RoCE Engine run in ARM SoCs. The Hisilicon Network Subsystem is a long term evolution IP which is supposed to be used in Hisilicon ICT SoCs. HNS(Hisilicon Network Subsystem) has a hardware support of performing RDMA with RoCE engine. This Kconfig related with this driver as below: config INFINIBAND_HISILICON_HNS tristate "Hisilicon Hns ROCE Driver" depends on NET_VENDOR_HISILICON depends on ARM64 && HNS && HNS_DSAF && HNS_ENET Regards Wei Hu > -- > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > > . > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Tue, May 3, 2016 at 10:57 AM, Wei Hu (Xavier) <xavier.huwei@huawei.com> wrote: > On 2016/4/30 12:33, Or Gerlitz wrote: >> Can you elaborate what design aspects in the driver or anywhere else >> should impose that limitation? > 1. Oulijun resolved the problem, and sent PATCH V6 on 2016-4-28. Thanks for > more comments. V6 still conditions things on ARM > 2. This driver for Hisilicon RoCE Engine run in ARM SoCs. > The Hisilicon Network Subsystem is a long term evolution IP which is > supposed to be used in Hisilicon ICT SoCs. HNS(Hisilicon Network Subsystem) > has a hardware support of performing RDMA with RoCE engine. I understand that the HW is targeted just for ARM environments. Is this the reason why you want to impose this build limitation or there's something in the driver design or code that is not going to work in other environments? > This Kconfig related with this driver as below: > > config INFINIBAND_HISILICON_HNS > tristate "Hisilicon Hns ROCE Driver" > depends on NET_VENDOR_HISILICON > depends on ARM64 && HNS && HNS_DSAF && HNS_ENET this is understood, my question came to better understand the limitations -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On 2016/5/3 16:14, Or Gerlitz wrote: > On Tue, May 3, 2016 at 10:57 AM, Wei Hu (Xavier) > <xavier.huwei@huawei.com> wrote: >> On 2016/4/30 12:33, Or Gerlitz wrote: > >>> Can you elaborate what design aspects in the driver or anywhere else >>> should impose that limitation? >> 1. Oulijun resolved the problem, and sent PATCH V6 on 2016-4-28. Thanks for >> more comments. > V6 still conditions things on ARM > > >> 2. This driver for Hisilicon RoCE Engine run in ARM SoCs. >> The Hisilicon Network Subsystem is a long term evolution IP which is >> supposed to be used in Hisilicon ICT SoCs. HNS(Hisilicon Network Subsystem) >> has a hardware support of performing RDMA with RoCE engine. > I understand that the HW is targeted just for ARM environments. Is > this the reason > why you want to impose this build limitation or there's something in > the driver design > or code that is not going to work in other environments? yes. Regards Wei Hu > >> This Kconfig related with this driver as below: >> >> config INFINIBAND_HISILICON_HNS >> tristate "Hisilicon Hns ROCE Driver" >> depends on NET_VENDOR_HISILICON >> depends on ARM64 && HNS && HNS_DSAF && HNS_ENET > this is understood, my question came to better understand the limitations > > . > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
diff --git a/drivers/infiniband/hw/hns/hns_roce_alloc.c b/drivers/infiniband/hw/hns/hns_roce_alloc.c new file mode 100644 index 0000000..0c76f1b --- /dev/null +++ b/drivers/infiniband/hw/hns/hns_roce_alloc.c @@ -0,0 +1,104 @@ +/* + * Copyright (c) 2016 Hisilicon Limited. + * + * This program is free software; you can redistribute it and/or modify + * it under the terms of the GNU General Public License as published by + * the Free Software Foundation; either version 2 of the License, or + * (at your option) any later version. + */ + +#include <linux/bitmap.h> +#include <linux/dma-mapping.h> +#include <linux/errno.h> +#include <linux/mm.h> +#include <linux/slab.h> +#include <linux/vmalloc.h> +#include "hns_roce_device.h" + +int hns_roce_bitmap_alloc(struct hns_roce_bitmap *bitmap, u32 *obj) +{ + int ret = 0; + + spin_lock(&bitmap->lock); + *obj = find_next_zero_bit(bitmap->table, bitmap->max, bitmap->last); + if (*obj >= bitmap->max) { + bitmap->top = (bitmap->top + bitmap->max + bitmap->reserved_top) + & bitmap->mask; + *obj = find_first_zero_bit(bitmap->table, bitmap->max); + } + + if (*obj < bitmap->max) { + set_bit(*obj, bitmap->table); + bitmap->last = (*obj + 1); + if (bitmap->last == bitmap->max) + bitmap->last = 0; + *obj |= bitmap->top; + } else { + ret = -1; + } + + spin_unlock(&bitmap->lock); + + return ret; +} + +void hns_roce_bitmap_free(struct hns_roce_bitmap *bitmap, u32 obj) +{ + hns_roce_bitmap_free_range(bitmap, obj, 1); +} + +void hns_roce_bitmap_free_range(struct hns_roce_bitmap *bitmap, u32 obj, + int cnt) +{ + int i; + + obj &= bitmap->max + bitmap->reserved_top - 1; + + spin_lock(&bitmap->lock); + for (i = 0; i < cnt; i++) + clear_bit(obj + i, bitmap->table); + + bitmap->last = min(bitmap->last, obj); + bitmap->top = (bitmap->top + bitmap->max + bitmap->reserved_top) + & bitmap->mask; + spin_unlock(&bitmap->lock); +} + +int hns_roce_bitmap_init(struct hns_roce_bitmap *bitmap, u32 num, u32 mask, + u32 reserved_bot, u32 reserved_top) +{ + u32 i; + + if (num != roundup_pow_of_two(num)) + return -EINVAL; + + bitmap->last = 0; + bitmap->top = 0; + bitmap->max = num - reserved_top; + bitmap->mask = mask; + bitmap->reserved_top = reserved_top; + spin_lock_init(&bitmap->lock); + bitmap->table = kcalloc(BITS_TO_LONGS(bitmap->max), sizeof(long), + GFP_KERNEL); + if (!bitmap->table) + return -ENOMEM; + + for (i = 0; i < reserved_bot; ++i) + set_bit(i, bitmap->table); + + return 0; +} + +void hns_roce_bitmap_cleanup(struct hns_roce_bitmap *bitmap) +{ + kfree(bitmap->table); +} + +void hns_roce_cleanup_bitmap(struct hns_roce_dev *hr_dev) +{ + hns_roce_cleanup_qp_table(hr_dev); + hns_roce_cleanup_cq_table(hr_dev); + hns_roce_cleanup_mr_table(hr_dev); + hns_roce_cleanup_pd_table(hr_dev); + hns_roce_cleanup_uar_table(hr_dev); +} diff --git a/drivers/infiniband/hw/hns/hns_roce_cq.c b/drivers/infiniband/hw/hns/hns_roce_cq.c index 1dc8635..f7baf82 100644 --- a/drivers/infiniband/hw/hns/hns_roce_cq.c +++ b/drivers/infiniband/hw/hns/hns_roce_cq.c @@ -52,3 +52,28 @@ void hns_roce_cq_event(struct hns_roce_dev *hr_dev, u32 cqn, int event_type) if (atomic_dec_and_test(&cq->refcount)) complete(&cq->free); } + +int hns_roce_init_cq_table(struct hns_roce_dev *hr_dev) +{ + struct hns_roce_cq_table *cq_table = &hr_dev->cq_table; + struct device *dev = &hr_dev->pdev->dev; + int ret; + + spin_lock_init(&cq_table->lock); + INIT_RADIX_TREE(&cq_table->tree, GFP_ATOMIC); + + ret = hns_roce_bitmap_init(&cq_table->bitmap, hr_dev->caps.num_cqs, + hr_dev->caps.num_cqs - 1, + hr_dev->caps.reserved_cqs, 0); + if (ret) { + dev_err(dev, "init_cq_table.Failed to bitmap_init.\n"); + return ret; + } + + return 0; +} + +void hns_roce_cleanup_cq_table(struct hns_roce_dev *hr_dev) +{ + hns_roce_bitmap_cleanup(&hr_dev->cq_table.bitmap); +} diff --git a/drivers/infiniband/hw/hns/hns_roce_device.h b/drivers/infiniband/hw/hns/hns_roce_device.h index decd4fe..569ec42 100644 --- a/drivers/infiniband/hw/hns/hns_roce_device.h +++ b/drivers/infiniband/hw/hns/hns_roce_device.h @@ -91,6 +91,38 @@ enum { HNS_ROCE_CMD_SUCCESS = 1, }; +struct hns_roce_uar { + u64 pfn; + u32 index; +}; + +struct hns_roce_bitmap { + /* Bitmap Traversal last a bit which is 1 */ + u32 last; + u32 top; + u32 max; + u32 reserved_top; + u32 mask; + spinlock_t lock; + unsigned long *table; +}; + +/* Order bitmap length -- bit num compute formula: 1 << (max_order - order) */ +/* Order = 0: bitmap is biggest, order = max bitmap is least (only a bit) */ +/* Every bit repesent to a partner free/used status in bitmap */ +/* +* Initial, bits of other bitmap are all 0 except that a bit of max_order is 1 +* Bit = 1 represent to idle and available; bit = 0: not available +*/ +struct hns_roce_buddy { + /* Members point to every order level bitmap */ + unsigned long **bits; + /* Represent to avail bits of the order level bitmap */ + u32 *num_free; + int max_order; + spinlock_t lock; +}; + struct hns_roce_icm_table { /* ICM type: 0 = qpc 1 = mtt 2 = cqc 3 = srq 4 = other */ u32 type; @@ -107,6 +139,8 @@ struct hns_roce_icm_table { }; struct hns_roce_mr_table { + struct hns_roce_bitmap mtpt_bitmap; + struct hns_roce_buddy mtt_buddy; struct hns_roce_icm_table mtt_table; struct hns_roce_icm_table mtpt_table; }; @@ -124,13 +158,19 @@ struct hns_roce_cq { struct completion free; }; +struct hns_roce_uar_table { + struct hns_roce_bitmap bitmap; +}; + struct hns_roce_qp_table { + struct hns_roce_bitmap bitmap; spinlock_t lock; struct hns_roce_icm_table qp_table; struct hns_roce_icm_table irrl_table; }; struct hns_roce_cq_table { + struct hns_roce_bitmap bitmap; spinlock_t lock; struct radix_tree_root tree; struct hns_roce_icm_table table; @@ -262,6 +302,9 @@ struct hns_roce_hw { struct hns_roce_dev { struct ib_device ib_dev; struct platform_device *pdev; + struct hns_roce_uar priv_uar; + spinlock_t sm_lock; + spinlock_t cq_db_lock; spinlock_t bt_cmd_lock; struct hns_roce_ib_iboe iboe; @@ -277,6 +320,8 @@ struct hns_roce_dev { u32 hw_rev; struct hns_roce_cmdq cmd; + struct hns_roce_bitmap pd_bitmap; + struct hns_roce_uar_table uar_table; struct hns_roce_mr_table mr_table; struct hns_roce_cq_table cq_table; struct hns_roce_qp_table qp_table; @@ -299,6 +344,11 @@ static inline struct hns_roce_qp qpn & (hr_dev->caps.num_qps - 1)); } +int hns_roce_init_uar_table(struct hns_roce_dev *dev); +int hns_roce_uar_alloc(struct hns_roce_dev *dev, struct hns_roce_uar *uar); +void hns_roce_uar_free(struct hns_roce_dev *dev, struct hns_roce_uar *uar); +void hns_roce_cleanup_uar_table(struct hns_roce_dev *dev); + int hns_roce_cmd_init(struct hns_roce_dev *hr_dev); void hns_roce_cmd_cleanup(struct hns_roce_dev *hr_dev); void hns_roce_cmd_event(struct hns_roce_dev *hr_dev, u16 token, u8 status, @@ -306,9 +356,28 @@ void hns_roce_cmd_event(struct hns_roce_dev *hr_dev, u16 token, u8 status, int hns_roce_cmd_use_events(struct hns_roce_dev *hr_dev); void hns_roce_cmd_use_polling(struct hns_roce_dev *hr_dev); +int hns_roce_init_pd_table(struct hns_roce_dev *hr_dev); +int hns_roce_init_mr_table(struct hns_roce_dev *hr_dev); int hns_roce_init_eq_table(struct hns_roce_dev *hr_dev); +int hns_roce_init_cq_table(struct hns_roce_dev *hr_dev); +int hns_roce_init_qp_table(struct hns_roce_dev *hr_dev); +void hns_roce_cleanup_pd_table(struct hns_roce_dev *hr_dev); +void hns_roce_cleanup_mr_table(struct hns_roce_dev *hr_dev); void hns_roce_cleanup_eq_table(struct hns_roce_dev *hr_dev); +void hns_roce_cleanup_cq_table(struct hns_roce_dev *hr_dev); +void hns_roce_cleanup_qp_table(struct hns_roce_dev *hr_dev); + +int hns_roce_bitmap_alloc(struct hns_roce_bitmap *bitmap, u32 *obj); +void hns_roce_bitmap_free(struct hns_roce_bitmap *bitmap, u32 obj); +int hns_roce_bitmap_init(struct hns_roce_bitmap *bitmap, u32 num, u32 mask, + u32 reserved_bot, u32 resetrved_top); +void hns_roce_bitmap_cleanup(struct hns_roce_bitmap *bitmap); +void hns_roce_cleanup_bitmap(struct hns_roce_dev *hr_dev); +int hns_roce_bitmap_alloc_range(struct hns_roce_bitmap *bitmap, int cnt, + int align, u32 *obj); +void hns_roce_bitmap_free_range(struct hns_roce_bitmap *bitmap, u32 obj, + int cnt); void hns_roce_cq_completion(struct hns_roce_dev *hr_dev, u32 cqn); void hns_roce_cq_event(struct hns_roce_dev *hr_dev, u32 cqn, int event_type); diff --git a/drivers/infiniband/hw/hns/hns_roce_eq.c b/drivers/infiniband/hw/hns/hns_roce_eq.c index 7d848a0..67eda84 100644 --- a/drivers/infiniband/hw/hns/hns_roce_eq.c +++ b/drivers/infiniband/hw/hns/hns_roce_eq.c @@ -579,7 +579,6 @@ static void hns_roce_free_eq(struct hns_roce_dev *hr_dev, kfree(eq->buf_list); } - void hns_roce_int_mask_en(struct hns_roce_dev *hr_dev) { void __iomem *reg_caepceirqmsk; diff --git a/drivers/infiniband/hw/hns/hns_roce_icm.c b/drivers/infiniband/hw/hns/hns_roce_icm.c index d0f276d..cae6ec8 100644 --- a/drivers/infiniband/hw/hns/hns_roce_icm.c +++ b/drivers/infiniband/hw/hns/hns_roce_icm.c @@ -338,6 +338,94 @@ int hns_roce_unmap_icm(struct hns_roce_dev *hr_dev, return ret; } +int hns_roce_table_get(struct hns_roce_dev *hr_dev, + struct hns_roce_icm_table *table, int obj) +{ + struct device *dev = &hr_dev->pdev->dev; + int ret = 0; + int i; + + i = (obj & (table->num_obj - 1)) / (HNS_ROCE_TABLE_CHUNK_SIZE / + table->obj_size); + + mutex_lock(&table->mutex); + + if (table->icm[i]) { + ++table->icm[i]->refcount; + goto out; + } + + table->icm[i] = hns_roce_alloc_icm(hr_dev, + HNS_ROCE_TABLE_CHUNK_SIZE >> PAGE_SHIFT, + (table->lowmem ? GFP_KERNEL : + GFP_HIGHUSER) | __GFP_NOWARN, + table->coherent); + if (!table->icm[i]) { + ret = -ENOMEM; + dev_err(dev, "hns_roce_alloc_icm failed\n"); + goto out; + } + + /* Inform icm entry mem pa(128K/page, pa starting address)for hw */ + if (hns_roce_map_icm(hr_dev, table, obj)) { + ret = -ENODEV; + dev_err(dev, "map icm table failed.\n"); + goto out; + } + + ++table->icm[i]->refcount; +out: + mutex_unlock(&table->mutex); + return ret; +} + +void hns_roce_table_put(struct hns_roce_dev *hr_dev, + struct hns_roce_icm_table *table, int obj) +{ + struct device *dev = &hr_dev->pdev->dev; + int i; + + i = (obj & (table->num_obj - 1)) / + (HNS_ROCE_TABLE_CHUNK_SIZE / table->obj_size); + + mutex_lock(&table->mutex); + + if (--table->icm[i]->refcount == 0) { + /* Clear base address table */ + if (hns_roce_unmap_icm(hr_dev, table, obj)) + dev_warn(dev, "unmap icm table failed.\n"); + + hns_roce_free_icm(hr_dev, table->icm[i], table->coherent); + table->icm[i] = NULL; + } + + mutex_unlock(&table->mutex); +} + +int hns_roce_table_get_range(struct hns_roce_dev *hr_dev, + struct hns_roce_icm_table *table, int start, + int end) +{ + int inc = HNS_ROCE_TABLE_CHUNK_SIZE / table->obj_size; + int i = 0, ret = 0; + + /* Allocate MTT entry memory according to chunk(128K) */ + for (i = start; i <= end; i += inc) { + ret = hns_roce_table_get(hr_dev, table, i); + if (ret) + goto fail; + } + + return 0; + +fail: + while (i > start) { + i -= inc; + hns_roce_table_put(hr_dev, table, i); + } + return ret; +} + int hns_roce_init_icm_table(struct hns_roce_dev *hr_dev, struct hns_roce_icm_table *table, u32 type, int obj_size, int nobj, int reserved, diff --git a/drivers/infiniband/hw/hns/hns_roce_icm.h b/drivers/infiniband/hw/hns/hns_roce_icm.h index ce96d89..78f8e90 100644 --- a/drivers/infiniband/hw/hns/hns_roce_icm.h +++ b/drivers/infiniband/hw/hns/hns_roce_icm.h @@ -50,6 +50,15 @@ struct hns_roce_icm_iter { void hns_roce_free_icm(struct hns_roce_dev *hr_dev, struct hns_roce_icm *icm, int coherent); +int hns_roce_table_get(struct hns_roce_dev *hr_dev, + struct hns_roce_icm_table *table, int obj); +void hns_roce_table_put(struct hns_roce_dev *hr_dev, + struct hns_roce_icm_table *table, int obj); + +int hns_roce_table_get_range(struct hns_roce_dev *hr_dev, + struct hns_roce_icm_table *table, int start, + int end); + int hns_roce_init_icm_table(struct hns_roce_dev *hr_dev, struct hns_roce_icm_table *table, u32 type, int obj_size, int nobj, int reserved, diff --git a/drivers/infiniband/hw/hns/hns_roce_main.c b/drivers/infiniband/hw/hns/hns_roce_main.c index 3da0c03..e478fc6 100644 --- a/drivers/infiniband/hw/hns/hns_roce_main.c +++ b/drivers/infiniband/hw/hns/hns_roce_main.c @@ -184,6 +184,75 @@ err_unmap_mtt: } /** +* hns_roce_setup_hca - setup host channel adapter +* @hr_dev: pointer to hns roce device +* Return : int +*/ +int hns_roce_setup_hca(struct hns_roce_dev *hr_dev) +{ + int ret; + struct device *dev = &hr_dev->pdev->dev; + + spin_lock_init(&hr_dev->sm_lock); + spin_lock_init(&hr_dev->cq_db_lock); + spin_lock_init(&hr_dev->bt_cmd_lock); + + ret = hns_roce_init_uar_table(hr_dev); + if (ret) { + dev_err(dev, "Failed to initialize uar table. aborting\n"); + return ret; + } + + ret = hns_roce_uar_alloc(hr_dev, &hr_dev->priv_uar); + if (ret) { + dev_err(dev, "Failed to allocate priv_uar.\n"); + goto err_uar_table_free; + } + + ret = hns_roce_init_pd_table(hr_dev); + if (ret) { + dev_err(dev, "Failed to init protected domain table.\n"); + goto err_uar_alloc_free; + } + + ret = hns_roce_init_mr_table(hr_dev); + if (ret) { + dev_err(dev, "Failed to init memory region table.\n"); + goto err_pd_table_free; + } + + ret = hns_roce_init_cq_table(hr_dev); + if (ret) { + dev_err(dev, "Failed to init completion queue table.\n"); + goto err_mr_table_free; + } + + ret = hns_roce_init_qp_table(hr_dev); + if (ret) { + dev_err(dev, "Failed to init queue pair table.\n"); + goto err_cq_table_free; + } + + return 0; + +err_cq_table_free: + hns_roce_cleanup_cq_table(hr_dev); + +err_mr_table_free: + hns_roce_cleanup_mr_table(hr_dev); + +err_pd_table_free: + hns_roce_cleanup_pd_table(hr_dev); + +err_uar_alloc_free: + hns_roce_uar_free(hr_dev, &hr_dev->priv_uar); + +err_uar_table_free: + hns_roce_cleanup_uar_table(hr_dev); + return ret; +} + +/** * hns_roce_probe - RoCE driver entrance * @pdev: pointer to platform device * Return : int @@ -256,6 +325,15 @@ static int hns_roce_probe(struct platform_device *pdev) goto error_failed_init_icm; } + ret = hns_roce_setup_hca(hr_dev); + if (ret) { + dev_err(dev, "setup hca fail!\n"); + goto error_failed_setup_hca; + } + +error_failed_setup_hca: + hns_roce_cleanup_icm(hr_dev); + error_failed_init_icm: if (hr_dev->cmd_mod) hns_roce_cmd_use_polling(hr_dev); @@ -286,6 +364,7 @@ static int hns_roce_remove(struct platform_device *pdev) struct hns_roce_dev *hr_dev = platform_get_drvdata(pdev); int ret = 0; + hns_roce_cleanup_bitmap(hr_dev); hns_roce_cleanup_icm(hr_dev); if (hr_dev->cmd_mod) diff --git a/drivers/infiniband/hw/hns/hns_roce_mr.c b/drivers/infiniband/hw/hns/hns_roce_mr.c new file mode 100644 index 0000000..1adcdda --- /dev/null +++ b/drivers/infiniband/hw/hns/hns_roce_mr.c @@ -0,0 +1,187 @@ +/* + * Copyright (c) 2016 Hisilicon Limited. + * + * This program is free software; you can redistribute it and/or modify + * it under the terms of the GNU General Public License as published by + * the Free Software Foundation; either version 2 of the License, or + * (at your option) any later version. + */ + +#include <linux/init.h> +#include <linux/errno.h> +#include <linux/slab.h> +#include <linux/platform_device.h> +#include "hns_roce_device.h" + +static int hns_roce_buddy_alloc(struct hns_roce_buddy *buddy, int order, + u32 *seg) +{ + int o; + u32 m; + + spin_lock(&buddy->lock); + + for (o = order; o <= buddy->max_order; ++o) { + if (buddy->num_free[o]) { + m = 1 << (buddy->max_order - o); + *seg = find_first_bit(buddy->bits[o], m); + if (*seg < m) + goto found; + } + } + spin_unlock(&buddy->lock); + return -1; + + found: + clear_bit(*seg, buddy->bits[o]); + --buddy->num_free[o]; + + while (o > order) { + --o; + *seg <<= 1; + set_bit(*seg ^ 1, buddy->bits[o]); + ++buddy->num_free[o]; + } + + spin_unlock(&buddy->lock); + + *seg <<= order; + return 0; +} + +static void hns_roce_buddy_free(struct hns_roce_buddy *buddy, u32 seg, + int order) +{ + seg >>= order; + + spin_lock(&buddy->lock); + + while (test_bit(seg ^ 1, buddy->bits[order])) { + clear_bit(seg ^ 1, buddy->bits[order]); + --buddy->num_free[order]; + seg >>= 1; + ++order; + } + + set_bit(seg, buddy->bits[order]); + ++buddy->num_free[order]; + + spin_unlock(&buddy->lock); +} + +static int hns_roce_buddy_init(struct hns_roce_buddy *buddy, int max_order) +{ + int i, s; + + buddy->max_order = max_order; + spin_lock_init(&buddy->lock); + + buddy->bits = kzalloc((buddy->max_order + 1) * sizeof(long *), + GFP_KERNEL); + buddy->num_free = kzalloc((buddy->max_order + 1) * sizeof(int *), + GFP_KERNEL); + if (!buddy->bits || !buddy->num_free) + goto err_out; + + for (i = 0; i <= buddy->max_order; ++i) { + s = BITS_TO_LONGS(1 << (buddy->max_order - i)); + buddy->bits[i] = kmalloc_array(s, sizeof(long), GFP_KERNEL); + if (!buddy->bits[i]) + goto err_out_free; + + bitmap_zero(buddy->bits[i], 1 << (buddy->max_order - i)); + } + + set_bit(0, buddy->bits[buddy->max_order]); + buddy->num_free[buddy->max_order] = 1; + + return 0; + +err_out_free: + for (i = 0; i <= buddy->max_order; ++i) + kfree(buddy->bits[i]); + +err_out: + kfree(buddy->bits); + kfree(buddy->num_free); + return -ENOMEM; +} + +static void hns_roce_buddy_cleanup(struct hns_roce_buddy *buddy) +{ + int i; + + for (i = 0; i <= buddy->max_order; ++i) + kfree(buddy->bits[i]); + + kfree(buddy->bits); + kfree(buddy->num_free); +} + +static int hns_roce_alloc_mtt_range(struct hns_roce_dev *hr_dev, int order, + u32 *seg) +{ + struct hns_roce_mr_table *mr_table = &hr_dev->mr_table; + int ret = 0; + + ret = hns_roce_buddy_alloc(&mr_table->mtt_buddy, order, seg); + if (ret == -1) + return -1; + + if (hns_roce_table_get_range(hr_dev, &mr_table->mtt_table, *seg, + *seg + (1 << order) - 1)) { + hns_roce_buddy_free(&mr_table->mtt_buddy, *seg, order); + return -1; + } + + return 0; +} + +int hns_roce_init_mr_table(struct hns_roce_dev *hr_dev) +{ + struct hns_roce_mr_table *mr_table = &hr_dev->mr_table; + struct device *dev = &hr_dev->pdev->dev; + u32 first_seg; + int ret = 0; + + ret = hns_roce_bitmap_init(&mr_table->mtpt_bitmap, + hr_dev->caps.num_mtpts, + hr_dev->caps.num_mtpts - 1, + hr_dev->caps.reserved_mrws, 0); + if (ret) + return ret; + + ret = hns_roce_buddy_init(&mr_table->mtt_buddy, + ilog2(hr_dev->caps.num_mtt_segs)); + if (ret) + goto err_buddy; + + if (hr_dev->caps.reserved_mtts) { + if (hns_roce_alloc_mtt_range(hr_dev, + fls(hr_dev->caps.reserved_mtts - 1), + &first_seg) == -1) { + dev_err(dev, "MTT table of order %d is too small.\n", + mr_table->mtt_buddy.max_order); + ret = -ENOMEM; + goto err_reserve_mtts; + } + } + + return 0; + +err_reserve_mtts: + hns_roce_buddy_cleanup(&mr_table->mtt_buddy); + +err_buddy: + hns_roce_bitmap_cleanup(&mr_table->mtpt_bitmap); + return ret; +} + +void hns_roce_cleanup_mr_table(struct hns_roce_dev *hr_dev) +{ + struct hns_roce_mr_table *mr_table = &hr_dev->mr_table; + + hns_roce_buddy_cleanup(&mr_table->mtt_buddy); + hns_roce_bitmap_cleanup(&mr_table->mtpt_bitmap); +} + diff --git a/drivers/infiniband/hw/hns/hns_roce_pd.c b/drivers/infiniband/hw/hns/hns_roce_pd.c new file mode 100644 index 0000000..fb0f7c65 --- /dev/null +++ b/drivers/infiniband/hw/hns/hns_roce_pd.c @@ -0,0 +1,65 @@ +/* + * Copyright (c) 2016 Hisilicon Limited. + * + * This program is free software; you can redistribute it and/or modify + * it under the terms of the GNU General Public License as published by + * the Free Software Foundation; either version 2 of the License, or + * (at your option) any later version. + */ + +#include <asm/page.h> +#include <linux/init.h> +#include <linux/platform_device.h> +#include <linux/slab.h> +#include <rdma/ib_smi.h> +#include <rdma/ib_umem.h> +#include <rdma/ib_user_verbs.h> +#include "hns_roce_common.h" +#include "hns_roce_device.h" + +int hns_roce_init_pd_table(struct hns_roce_dev *hr_dev) +{ + return hns_roce_bitmap_init(&hr_dev->pd_bitmap, hr_dev->caps.num_pds, + hr_dev->caps.num_pds - 1, + hr_dev->caps.reserved_pds, 0); +} + +void hns_roce_cleanup_pd_table(struct hns_roce_dev *hr_dev) +{ + hns_roce_bitmap_cleanup(&hr_dev->pd_bitmap); +} + +int hns_roce_uar_alloc(struct hns_roce_dev *hr_dev, struct hns_roce_uar *uar) +{ + struct resource *res; + int ret = 0; + /* Using bitmap to manager UAR index */ + ret = hns_roce_bitmap_alloc(&hr_dev->uar_table.bitmap, &uar->index); + if (ret == -1) + return -ENOMEM; + + uar->index = (uar->index - 1) % hr_dev->caps.phy_num_uars + 1; + + res = platform_get_resource(hr_dev->pdev, IORESOURCE_MEM, 0); + uar->pfn = (res->start) >> PAGE_SHIFT + uar->index; + + return 0; +} + +void hns_roce_uar_free(struct hns_roce_dev *hr_dev, struct hns_roce_uar *uar) +{ + hns_roce_bitmap_free(&hr_dev->uar_table.bitmap, uar->index); +} + +int hns_roce_init_uar_table(struct hns_roce_dev *hr_dev) +{ + return hns_roce_bitmap_init(&hr_dev->uar_table.bitmap, + hr_dev->caps.num_uars, + hr_dev->caps.num_uars - 1, + hr_dev->caps.reserved_uars, 0); +} + +void hns_roce_cleanup_uar_table(struct hns_roce_dev *hr_dev) +{ + hns_roce_bitmap_cleanup(&hr_dev->uar_table.bitmap); +} diff --git a/drivers/infiniband/hw/hns/hns_roce_qp.c b/drivers/infiniband/hw/hns/hns_roce_qp.c index e0e41ca..e8d396b 100644 --- a/drivers/infiniband/hw/hns/hns_roce_qp.c +++ b/drivers/infiniband/hw/hns/hns_roce_qp.c @@ -13,6 +13,8 @@ #include <rdma/ib_pack.h> #include "hns_roce_device.h" +#define SQP_NUM 12 + void hns_roce_qp_event(struct hns_roce_dev *hr_dev, u32 qpn, int event_type) { struct hns_roce_qp_table *qp_table = &hr_dev->qp_table; @@ -37,3 +39,31 @@ void hns_roce_qp_event(struct hns_roce_dev *hr_dev, u32 qpn, int event_type) if (atomic_dec_and_test(&qp->refcount)) complete(&qp->free); } + +int hns_roce_init_qp_table(struct hns_roce_dev *hr_dev) +{ + struct hns_roce_qp_table *qp_table = &hr_dev->qp_table; + int reserved_from_top = 0; + int ret; + + spin_lock_init(&qp_table->lock); + INIT_RADIX_TREE(&hr_dev->qp_table_tree, GFP_ATOMIC); + + /* A port include two SQP, six port total 12 */ + ret = hns_roce_bitmap_init(&qp_table->bitmap, hr_dev->caps.num_qps, + hr_dev->caps.num_qps - 1, + hr_dev->caps.sqp_start + SQP_NUM, + reserved_from_top); + if (ret) { + dev_err(&hr_dev->pdev->dev, "qp bitmap init failed!error=%d\n", + ret); + return ret; + } + + return 0; +} + +void hns_roce_cleanup_qp_table(struct hns_roce_dev *hr_dev) +{ + hns_roce_bitmap_cleanup(&hr_dev->qp_table.bitmap); +}