Message ID | 20240730073634.114407-1-anand.a.khoje@oracle.com (mailing list archive) |
---|---|
State | Handled Elsewhere |
Headers | show |
Series | [net-next,v7] net/mlx5: Reclaim max 50K pages at once | expand |
Hello: This patch was applied to netdev/net-next.git (main) by Jakub Kicinski <kuba@kernel.org>: On Tue, 30 Jul 2024 13:06:33 +0530 you wrote: > In non FLR context, at times CX-5 requests release of ~8 million FW pages. > This needs humongous number of cmd mailboxes, which to be released once > the pages are reclaimed. Release of humongous number of cmd mailboxes is > consuming cpu time running into many seconds. Which with non preemptible > kernels is leading to critical process starving on that cpu’s RQ. > On top of it, the FW does not use all the mailbox messages as it has a > limit of releasing 50K pages at once per MLX5_CMD_OP_MANAGE_PAGES + > MLX5_PAGES_TAKE device command. Hence, the allocation of these many > mailboxes is extra and adds unnecessary overhead. > To alleviate this, this change restricts the total number of pages > a worker will try to reclaim to maximum 50K pages in one go. > > [...] Here is the summary with links: - [net-next,v7] net/mlx5: Reclaim max 50K pages at once https://git.kernel.org/netdev/net-next/c/501c3005f031 You are awesome, thank you!
diff --git a/drivers/net/ethernet/mellanox/mlx5/core/pagealloc.c b/drivers/net/ethernet/mellanox/mlx5/core/pagealloc.c index d894a88..972e8e9 100644 --- a/drivers/net/ethernet/mellanox/mlx5/core/pagealloc.c +++ b/drivers/net/ethernet/mellanox/mlx5/core/pagealloc.c @@ -608,6 +608,11 @@ enum { RELEASE_ALL_PAGES_MASK = 0x4000, }; +/* This limit is based on the capability of the firmware as it cannot release + * more than 50000 back to the host in one go. + */ +#define MAX_RECLAIM_NPAGES (-50000) + static int req_pages_handler(struct notifier_block *nb, unsigned long type, void *data) { @@ -639,7 +644,16 @@ static int req_pages_handler(struct notifier_block *nb, req->dev = dev; req->func_id = func_id; - req->npages = npages; + + /* npages > 0 means HCA asking host to allocate/give pages, + * npages < 0 means HCA asking host to reclaim back the pages allocated. + * Here we are restricting the maximum number of pages that can be + * reclaimed to be MAX_RECLAIM_NPAGES. Note that MAX_RECLAIM_NPAGES is + * a negative value. + * Since MAX_RECLAIM is negative, we are using max() to restrict + * req->npages (and not min ()). + */ + req->npages = max_t(s32, npages, MAX_RECLAIM_NPAGES); req->ec_function = ec_function; req->release_all = release_all; INIT_WORK(&req->work, pages_work_handler);