[net] neigh: make sure used and confirmed times are valid

Message ID	20230202152551.56390-1-ja@ssi.bg (mailing list archive)
State	Accepted
Commit	c1d2ecdf5e38e3489ce8328238b558b3b2866fe1
Delegated to:	Netdev Maintainers
Headers	show Return-Path: <netdev-owner@vger.kernel.org> From: Julian Anastasov <ja@ssi.bg> To: "David S . Miller" <davem@davemloft.net> Cc: Eric Dumazet <edumazet@google.com>, Jakub Kicinski <kuba@kernel.org>, Paolo Abeni <pabeni@redhat.com>, netdev@vger.kernel.org, Zhang Changzhong <zhangchangzhong@huawei.com>, YueHaibing <yuehaibing@huawei.com> Subject: [PATCH net] neigh: make sure used and confirmed times are valid Date: Thu, 2 Feb 2023 17:25:51 +0200 Message-Id: <20230202152551.56390-1-ja@ssi.bg> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk
Series	[net] neigh: make sure used and confirmed times are valid \| expand [net] neigh: make sure used and confirmed times are valid

Message ID

20230202152551.56390-1-ja@ssi.bg (mailing list archive)

State

Accepted

Commit

c1d2ecdf5e38e3489ce8328238b558b3b2866fe1

Delegated to:

Netdev Maintainers

Headers

show

Return-Path: <netdev-owner@vger.kernel.org>
X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on
	aws-us-west-2-korg-lkml-1.web.codeaurora.org
Received: from vger.kernel.org (vger.kernel.org [23.128.96.18])
	by smtp.lore.kernel.org (Postfix) with ESMTP id 61981C678DB
	for <netdev@archiver.kernel.org>; Thu,  2 Feb 2023 15:29:16 +0000 (UTC)
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S233094AbjBBP3O (ORCPT <rfc822;netdev@archiver.kernel.org>);
        Thu, 2 Feb 2023 10:29:14 -0500
Received: from lindbergh.monkeyblade.net ([23.128.96.19]:47702 "EHLO
        lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
        with ESMTP id S233057AbjBBP26 (ORCPT
        <rfc822;netdev@vger.kernel.org>); Thu, 2 Feb 2023 10:28:58 -0500
Received: from mg.ssi.bg (mg.ssi.bg [193.238.174.37])
        by lindbergh.monkeyblade.net (Postfix) with ESMTP id D68C56F717
        for <netdev@vger.kernel.org>; Thu,  2 Feb 2023 07:28:24 -0800 (PST)
Received: from mg.ssi.bg (localhost [127.0.0.1])
        by mg.ssi.bg (Proxmox) with ESMTP id 3911B52B7E;
        Thu,  2 Feb 2023 17:27:21 +0200 (EET)
Received: from ink.ssi.bg (unknown [193.238.174.40])
        by mg.ssi.bg (Proxmox) with ESMTP id 1340552A76;
        Thu,  2 Feb 2023 17:27:18 +0200 (EET)
Received: from ja.ssi.bg (unknown [178.16.129.10])
        by ink.ssi.bg (Postfix) with ESMTPS id DAC083C0435;
        Thu,  2 Feb 2023 17:27:14 +0200 (EET)
Received: from ja.home.ssi.bg (localhost.localdomain [127.0.0.1])
        by ja.ssi.bg (8.17.1/8.16.1) with ESMTP id 312FREHI056617;
        Thu, 2 Feb 2023 17:27:14 +0200
Received: (from root@localhost)
        by ja.home.ssi.bg (8.17.1/8.17.1/Submit) id 312FREDK056616;
        Thu, 2 Feb 2023 17:27:14 +0200
From: Julian Anastasov <ja@ssi.bg>
To: "David S . Miller" <davem@davemloft.net>
Cc: Eric Dumazet <edumazet@google.com>,
        Jakub Kicinski <kuba@kernel.org>,
        Paolo Abeni <pabeni@redhat.com>, netdev@vger.kernel.org,
        Zhang Changzhong <zhangchangzhong@huawei.com>,
        YueHaibing <yuehaibing@huawei.com>
Subject: [PATCH net] neigh: make sure used and confirmed times are valid
Date: Thu,  2 Feb 2023 17:25:51 +0200
Message-Id: <20230202152551.56390-1-ja@ssi.bg>
X-Mailer: git-send-email 2.39.1
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit
Precedence: bulk
List-ID: <netdev.vger.kernel.org>
X-Mailing-List: netdev@vger.kernel.org
X-Patchwork-Delegate: kuba@kernel.org

Series

[net] neigh: make sure used and confirmed times are valid | expand

Context	Check	Description
netdev/tree_selection	success	Clearly marked for net
netdev/fixes_present	fail	Series targets non-next tree, but doesn't contain any Fixes tags
netdev/subject_prefix	success	Link
netdev/cover_letter	success	Single patches do not need cover letters
netdev/patch_count	success	Link
netdev/header_inline	success	No static functions without inline keyword in header files
netdev/build_32bit	success	Errors and warnings before: 4 this patch: 4
netdev/cc_maintainers	warning	1 maintainers not CCed: thomas.zeitlhofer+lkml@ze-it.at
netdev/build_clang	success	Errors and warnings before: 1 this patch: 1
netdev/module_param	success	Was 0 now: 0
netdev/verify_signedoff	success	Signed-off-by tag matches author and committer
netdev/check_selftest	success	No net selftest shell script
netdev/verify_fixes	success	No Fixes tag
netdev/build_allmodconfig_warn	success	Errors and warnings before: 4 this patch: 4
netdev/checkpatch	warning	WARNING: line length of 97 exceeds 80 columns
netdev/kdoc	success	Errors and warnings before: 0 this patch: 0
netdev/source_inline	success	Was 0 now: 0

Context

Check

Description

netdev/tree_selection

success

Clearly marked for net

netdev/fixes_present

fail

Series targets non-next tree, but doesn't contain any Fixes tags

netdev/subject_prefix

success

Link

netdev/cover_letter

success

Single patches do not need cover letters

netdev/patch_count

success

Link

netdev/header_inline

success

No static functions without inline keyword in header files

netdev/build_32bit

success

Errors and warnings before: 4 this patch: 4

netdev/cc_maintainers

warning

1 maintainers not CCed: thomas.zeitlhofer+lkml@ze-it.at

netdev/build_clang

success

Errors and warnings before: 1 this patch: 1

netdev/module_param

success

Was 0 now: 0

netdev/verify_signedoff

success

Signed-off-by tag matches author and committer

netdev/check_selftest

success

No net selftest shell script

netdev/verify_fixes

success

No Fixes tag

netdev/build_allmodconfig_warn

success

Errors and warnings before: 4 this patch: 4

netdev/checkpatch

warning

WARNING: line length of 97 exceeds 80 columns

netdev/kdoc

success

Errors and warnings before: 0 this patch: 0

netdev/source_inline

success

Was 0 now: 0

Commit Message

Julian Anastasov Feb. 2, 2023, 3:25 p.m. UTC

Entries can linger in cache without timer for days, thanks to
the gc_thresh1 limit. As result, without traffic, the confirmed
time can be outdated and to appear to be in the future. Later,
on traffic, NUD_STALE entries can switch to NUD_DELAY and start
the timer which can see the invalid confirmed time and wrongly
switch to NUD_REACHABLE state instead of NUD_PROBE. As result,
timer is set many days in the future. This is more visible on
32-bit platforms, with higher HZ value.

Why this is a problem? While we expect unused entries to expire,
such entries stay in REACHABLE state for too long, locked in
cache. They are not expired normally, only when cache is full.

Problem and the wrong state change reported by Zhang Changzhong:

172.16.1.18 dev bond0 lladdr 0a:0e:0f:01:12:01 ref 1 used 350521/15994171/350520 probes 4 REACHABLE

350520 seconds have elapsed since this entry was last updated, but it is
still in the REACHABLE state (base_reachable_time_ms is 30000),
preventing lladdr from being updated through probe.

Fix it by ensuring timer is started with valid used/confirmed
times. Considering the valid time range is LONG_MAX jiffies,
we try not to go too much in the past while we are in
DELAY/PROBE state. There are also places that need
used/updated times to be validated while timer is not running.

Reported-by: Zhang Changzhong <zhangchangzhong@huawei.com>
Signed-off-by: Julian Anastasov <ja@ssi.bg>
Tested-by: Zhang Changzhong <zhangchangzhong@huawei.com>
---
 net/core/neighbour.c | 18 +++++++++++++++---
 1 file changed, 15 insertions(+), 3 deletions(-)

 This solution prefers to add code in neigh_add_timer()
 assuming it is less used than the timer code. The
 alternative would be to add time_in_range* calls in
 neigh_timer_handler() to be more safe.
 Another solution would be to add the time correction
 only in __neigh_event_send() where we switch from
 STALE to DELAY as it looks to be the only path that
 is affected and where we switch to states that
 consider the confirmed time. OTOH, NUD_INCOMPLETE
 is not affected from invalid times.

Comments

patchwork-bot+netdevbpf@kernel.org Feb. 6, 2023, 9:10 a.m. UTC | #1

Hello:

This patch was applied to netdev/net.git (master)
by David S. Miller <davem@davemloft.net>:

On Thu,  2 Feb 2023 17:25:51 +0200 you wrote:
> Entries can linger in cache without timer for days, thanks to
> the gc_thresh1 limit. As result, without traffic, the confirmed
> time can be outdated and to appear to be in the future. Later,
> on traffic, NUD_STALE entries can switch to NUD_DELAY and start
> the timer which can see the invalid confirmed time and wrongly
> switch to NUD_REACHABLE state instead of NUD_PROBE. As result,
> timer is set many days in the future. This is more visible on
> 32-bit platforms, with higher HZ value.
> 
> [...]

Here is the summary with links:
  - [net] neigh: make sure used and confirmed times are valid
    https://git.kernel.org/netdev/net/c/c1d2ecdf5e38

You are awesome, thank you!

diff --git a/net/core/neighbour.c b/net/core/neighbour.c
index f00a79fc301b..4edd2176e238 100644
--- a/net/core/neighbour.c
+++ b/net/core/neighbour.c
@@ -269,7 +269,7 @@  static int neigh_forced_gc(struct neigh_table *tbl)
 			    (n->nud_state == NUD_NOARP) ||
 			    (tbl->is_multicast &&
 			     tbl->is_multicast(n->primary_key)) ||
-			    time_after(tref, n->updated))
+			    !time_in_range(n->updated, tref, jiffies))
 				remove = true;
 			write_unlock(&n->lock);
 
@@ -289,7 +289,17 @@  static int neigh_forced_gc(struct neigh_table *tbl)
 
 static void neigh_add_timer(struct neighbour *n, unsigned long when)
 {
+	/* Use safe distance from the jiffies - LONG_MAX point while timer
+	 * is running in DELAY/PROBE state but still show to user space
+	 * large times in the past.
+	 */
+	unsigned long mint = jiffies - (LONG_MAX - 86400 * HZ);
+
 	neigh_hold(n);
+	if (!time_in_range(n->confirmed, mint, jiffies))
+		n->confirmed = mint;
+	if (time_before(n->used, n->confirmed))
+		n->used = n->confirmed;
 	if (unlikely(mod_timer(&n->timer, when))) {
 		printk("NEIGH: BUG, double timer add, state is %x\n",
 		       n->nud_state);
@@ -1001,12 +1011,14 @@  static void neigh_periodic_work(struct work_struct *work)
 				goto next_elt;
 			}
 
-			if (time_before(n->used, n->confirmed))
+			if (time_before(n->used, n->confirmed) &&
+			    time_is_before_eq_jiffies(n->confirmed))
 				n->used = n->confirmed;
 
 			if (refcount_read(&n->refcnt) == 1 &&
 			    (state == NUD_FAILED ||
-			     time_after(jiffies, n->used + NEIGH_VAR(n->parms, GC_STALETIME)))) {
+			     !time_in_range_open(jiffies, n->used,
+						 n->used + NEIGH_VAR(n->parms, GC_STALETIME)))) {
 				*np = n->next;
 				neigh_mark_dead(n);
 				write_unlock(&n->lock);

[net] neigh: make sure used and confirmed times are valid

Checks

Commit Message

Comments

Patch