Message ID:   20240404211111.30493-1-krisman@suse.de
State:        Superseded
Delegated to: Netdev Maintainers
Series:       udp: Avoid call to compute_score on multiple sites
Gabriel Krisman Bertazi wrote:
> We've observed a 7-12% performance regression in iperf3 UDP ipv4 and
> ipv6 tests with multiple sockets on Zen3 cpus, which we traced back to
> commit f0ea27e7bfe1 ("udp: re-score reuseport groups when connected
> sockets are present"). The failing tests were those that would spawn
> UDP sockets per-cpu on systems that have a high number of cpus.
>
> Unsurprisingly, it is not caused by the extra re-scoring of the reused
> socket, but due to the compiler no longer inlining compute_score, once
> it has the extra call site in upd5_lib_lookup2. This is augmented by

udp4_lib_lookup2

> the "Safe RET" mitigation for SRSO, needed in our Zen3 cpus.
>
> We could just explicitly inline it, but compute_score() is quite a large
> function, around 300b. Inlining in two sites would almost double
> udp4_lib_lookup2, which is a silly thing to do just to workaround a
> mitigation. Instead, this patch shuffles the code a bit to avoid the
> multiple calls to compute_score. Since it is a static function used in
> one spot, the compiler can safely fold it in, as it did before, without
> increasing the text size.
>
> With this patch applied I ran my original iperf3 testcases. The failing
> cases all looked like this (ipv4):
>   iperf3 -c 127.0.0.1 --udp -4 -f K -b $R -l 8920 -t 30 -i 5 -P 64 -O 2 2>&1
>
> where $R is either 1G/10G/0 (max, unlimited). I ran 5 times each.
> baseline is 6.9.0-rc1-g962490525cff, just a recent checkout of Linus
> tree. harmean == harmonic mean; CV == coefficient of variation.
>
> ipv4:
>                    1G                  10G                 MAX
>              HARMEAN (CV)        HARMEAN (CV)        HARMEAN (CV)
> baseline  1726716.59(0.0401)  1751758.50(0.0068)  1425388.83(0.1276)
> patched   1842337.77(0.0711)  1861574.00(0.0774)  1888601.95(0.0580)
>
> ipv6:
>                    1G                  10G                 MAX
>              HARMEAN (CV)        HARMEAN (CV)        HARMEAN (CV)
> baseline: 1693636.28(0.0132)  1704418.23(0.0094)  1519681.83(0.1299)
> patched   1909754.24(0.0307)  1782295.80(0.0539)  1632803.48(0.1185)
>
> This restores the performance we had before the change above with this
> benchmark. We obviously don't expect any real impact when mitigations
> are disabled, but just to be sure it also doesn't regresses:
>
> mitigations=off ipv4:
>                    1G                  10G                 MAX
>              HARMEAN (CV)        HARMEAN (CV)        HARMEAN (CV)
> baseline  3230279.97(0.0066)  3229320.91(0.0060)  2605693.19(0.0697)
> patched   3242802.36(0.0073)  3239310.71(0.0035)  2502427.19(0.0882)
>
> Finally, I can see this restores compute_score inlining in my gcc
> without extra function attributes. Out of caution, I still added
> __always_inline in compute_score, to prevent future changes from
> un-inlining it again. Since it is only in one site, it should be fine.
>
> Cc: Lorenz Bauer <lmb@isovalent.com>
> Fixes: f0ea27e7bfe1 ("udp: re-score reuseport groups when connected sockets are present")
> Signed-off-by: Gabriel Krisman Bertazi <krisman@suse.de>
>
> ---
> Another idea would be shrinking compute_score and then inlining it. I'm
> not a network developer, but it seems that we can avoid most of the
> "same network" checks of calculate_score when passing a socket from the
> reusegroup. If that is the case, we can fork out a compute_score_fast
> that can be safely inlined at the second call site of the existing
> compute_score. I didn't pursue this any further.
> ---
>  net/ipv4/udp.c | 24 ++++++++++++++++++------
>  net/ipv6/udp.c | 23 ++++++++++++++++++-----
>  2 files changed, 36 insertions(+), 11 deletions(-)
>
> diff --git a/net/ipv6/udp.c b/net/ipv6/udp.c
> index 7c1e6469d091..883e62228432 100644
> --- a/net/ipv6/udp.c
> +++ b/net/ipv6/udp.c
> @@ -114,7 +114,11 @@ void udp_v6_rehash(struct sock *sk)
>  	udp_lib_rehash(sk, new_hash);
>  }
>
> -static int compute_score(struct sock *sk, struct net *net,
> +/* While large, compute_score is in the UDP hot path and only used once
> + * in udp4_lib_lookup2. Avoiding the function call by inlining it has

udp6_lib_lookup2

> + * yield measurable benefits in iperf3-based benchmarks.
> + */
> +static __always_inline int compute_score(struct sock *sk, struct net *net,
>  			 const struct in6_addr *saddr, __be16 sport,
>  			 const struct in6_addr *daddr, unsigned short hnum,
>  			 int dif, int sdif)
> @@ -166,16 +170,20 @@ static struct sock *udp6_lib_lookup2(struct net *net,
>  			int dif, int sdif, struct udp_hslot *hslot2,
>  			struct sk_buff *skb)
>  {
> -	struct sock *sk, *result;
> +	struct sock *sk, *result, *this;
>  	int score, badness;
>
>  	result = NULL;
>  	badness = -1;
>  	udp_portaddr_for_each_entry_rcu(sk, &hslot2->head) {
> -		score = compute_score(sk, net, saddr, sport,
> +		this = sk;
> +rescore:
> +		score = compute_score(this, net, saddr, sport,
>  				      daddr, hnum, dif, sdif);
>  		if (score > badness) {
>  			badness = score;
> +			if (this != sk)
> +				continue;

Can we just rely on score not increasing indefinitely on retry to break
out of the loop? Or, if an explicit "this is a rescore" boolean is
needed, a boolean makes the control flow easier to follow than a third
struct sk.
>
>  			if (sk->sk_state == TCP_ESTABLISHED) {
>  				result = sk;
> @@ -197,8 +205,13 @@ static struct sock *udp6_lib_lookup2(struct net *net,
>  			if (IS_ERR(result))
>  				continue;
>
> -			badness = compute_score(sk, net, saddr, sport,
> -						daddr, hnum, dif, sdif);
> +			/* compute_score is too long of a function to be
> +			 * inlined, and calling it again yields
> +			 * measureable overhead. Work around it by
> +			 * jumping backwards to score 'result'.
> +			 */
> +			this = result;
> +			goto rescore;
>  		}
>  	}
>  	return result;
> --
> 2.44.0
>
Willem de Bruijn wrote:
> Gabriel Krisman Bertazi wrote:
> > Finally, I can see this restores compute_score inlining in my gcc
> > without extra function attributes. Out of caution, I still added
> > __always_inline in compute_score, to prevent future changes from
> > un-inlining it again. Since it is only in one site, it should be fine.

[...]

> > -static int compute_score(struct sock *sk, struct net *net,
> > +/* While large, compute_score is in the UDP hot path and only used once
> > + * in udp4_lib_lookup2. Avoiding the function call by inlining it has
>
> udp6_lib_lookup2
>
> > + * yield measurable benefits in iperf3-based benchmarks.
> > + */
> > +static __always_inline int compute_score(struct sock *sk, struct net *net,
> >  			 const struct in6_addr *saddr, __be16 sport,
> >  			 const struct in6_addr *daddr, unsigned short hnum,

Forgot to mention: __always_inline is used very sparingly. I don't think
this qualifies. It did not have that attribute before, nor needs it.
diff --git a/net/ipv4/udp.c b/net/ipv4/udp.c
index 661d0e0d273f..8ce5c4e8663e 100644
--- a/net/ipv4/udp.c
+++ b/net/ipv4/udp.c
@@ -363,7 +363,11 @@ int udp_v4_get_port(struct sock *sk, unsigned short snum)
 	return udp_lib_get_port(sk, snum, hash2_nulladdr);
 }
 
-static int compute_score(struct sock *sk, struct net *net,
+/* While large, compute_score is in the UDP hot path and only used once
+ * in udp4_lib_lookup2. Avoiding the function call by inlining it has
+ * yield measurable benefits in iperf3-based benchmarks.
+ */
+static __always_inline int compute_score(struct sock *sk, struct net *net,
 			 __be32 saddr, __be16 sport,
 			 __be32 daddr, unsigned short hnum,
 			 int dif, int sdif)
@@ -425,16 +429,20 @@ static struct sock *udp4_lib_lookup2(struct net *net,
 				     struct udp_hslot *hslot2,
 				     struct sk_buff *skb)
 {
-	struct sock *sk, *result;
+	struct sock *sk, *result, *this;
 	int score, badness;
 
 	result = NULL;
 	badness = 0;
 	udp_portaddr_for_each_entry_rcu(sk, &hslot2->head) {
-		score = compute_score(sk, net, saddr, sport,
+		this = sk;
+rescore:
+		score = compute_score(this, net, saddr, sport,
 				      daddr, hnum, dif, sdif);
 		if (score > badness) {
 			badness = score;
+			if (this != sk)
+				continue;
 
 			if (sk->sk_state == TCP_ESTABLISHED) {
 				result = sk;
@@ -456,9 +464,13 @@ static struct sock *udp4_lib_lookup2(struct net *net,
 			if (IS_ERR(result))
 				continue;
 
-			badness = compute_score(result, net, saddr, sport,
-						daddr, hnum, dif, sdif);
-
+			/* compute_score is too long of a function to be
+			 * inlined, and calling it again yields
+			 * measureable overhead. Work around it by
+			 * jumping backwards to score 'this'.
+			 */
+			this = result;
+			goto rescore;
 		}
 	}
 	return result;
diff --git a/net/ipv6/udp.c b/net/ipv6/udp.c
index 7c1e6469d091..883e62228432 100644
--- a/net/ipv6/udp.c
+++ b/net/ipv6/udp.c
@@ -114,7 +114,11 @@ void udp_v6_rehash(struct sock *sk)
 	udp_lib_rehash(sk, new_hash);
 }
 
-static int compute_score(struct sock *sk, struct net *net,
+/* While large, compute_score is in the UDP hot path and only used once
+ * in udp4_lib_lookup2. Avoiding the function call by inlining it has
+ * yield measurable benefits in iperf3-based benchmarks.
+ */
+static __always_inline int compute_score(struct sock *sk, struct net *net,
 			 const struct in6_addr *saddr, __be16 sport,
 			 const struct in6_addr *daddr, unsigned short hnum,
 			 int dif, int sdif)
@@ -166,16 +170,20 @@ static struct sock *udp6_lib_lookup2(struct net *net,
 			int dif, int sdif, struct udp_hslot *hslot2,
 			struct sk_buff *skb)
 {
-	struct sock *sk, *result;
+	struct sock *sk, *result, *this;
 	int score, badness;
 
 	result = NULL;
 	badness = -1;
 	udp_portaddr_for_each_entry_rcu(sk, &hslot2->head) {
-		score = compute_score(sk, net, saddr, sport,
+		this = sk;
+rescore:
+		score = compute_score(this, net, saddr, sport,
 				      daddr, hnum, dif, sdif);
 		if (score > badness) {
 			badness = score;
+			if (this != sk)
+				continue;
 
 			if (sk->sk_state == TCP_ESTABLISHED) {
 				result = sk;
@@ -197,8 +205,13 @@ static struct sock *udp6_lib_lookup2(struct net *net,
 			if (IS_ERR(result))
 				continue;
 
-			badness = compute_score(sk, net, saddr, sport,
-						daddr, hnum, dif, sdif);
+			/* compute_score is too long of a function to be
+			 * inlined, and calling it again yields
+			 * measureable overhead. Work around it by
+			 * jumping backwards to score 'result'.
+			 */
+			this = result;
+			goto rescore;
 		}
 	}
 	return result;
We've observed a 7-12% performance regression in iperf3 UDP ipv4 and
ipv6 tests with multiple sockets on Zen3 cpus, which we traced back to
commit f0ea27e7bfe1 ("udp: re-score reuseport groups when connected
sockets are present"). The failing tests were those that would spawn
UDP sockets per-cpu on systems that have a high number of cpus.

Unsurprisingly, it is not caused by the extra re-scoring of the reused
socket, but due to the compiler no longer inlining compute_score, once
it has the extra call site in upd5_lib_lookup2. This is augmented by
the "Safe RET" mitigation for SRSO, needed in our Zen3 cpus.

We could just explicitly inline it, but compute_score() is quite a large
function, around 300b. Inlining in two sites would almost double
udp4_lib_lookup2, which is a silly thing to do just to workaround a
mitigation. Instead, this patch shuffles the code a bit to avoid the
multiple calls to compute_score. Since it is a static function used in
one spot, the compiler can safely fold it in, as it did before, without
increasing the text size.

With this patch applied I ran my original iperf3 testcases. The failing
cases all looked like this (ipv4):

  iperf3 -c 127.0.0.1 --udp -4 -f K -b $R -l 8920 -t 30 -i 5 -P 64 -O 2 2>&1

where $R is either 1G/10G/0 (max, unlimited). I ran 5 times each.
baseline is 6.9.0-rc1-g962490525cff, just a recent checkout of Linus
tree. harmean == harmonic mean; CV == coefficient of variation.

ipv4:
                   1G                  10G                 MAX
             HARMEAN (CV)        HARMEAN (CV)        HARMEAN (CV)
baseline  1726716.59(0.0401)  1751758.50(0.0068)  1425388.83(0.1276)
patched   1842337.77(0.0711)  1861574.00(0.0774)  1888601.95(0.0580)

ipv6:
                   1G                  10G                 MAX
             HARMEAN (CV)        HARMEAN (CV)        HARMEAN (CV)
baseline: 1693636.28(0.0132)  1704418.23(0.0094)  1519681.83(0.1299)
patched   1909754.24(0.0307)  1782295.80(0.0539)  1632803.48(0.1185)

This restores the performance we had before the change above with this
benchmark.
We obviously don't expect any real impact when mitigations are
disabled, but just to be sure it also doesn't regresses:

mitigations=off ipv4:
                   1G                  10G                 MAX
             HARMEAN (CV)        HARMEAN (CV)        HARMEAN (CV)
baseline  3230279.97(0.0066)  3229320.91(0.0060)  2605693.19(0.0697)
patched   3242802.36(0.0073)  3239310.71(0.0035)  2502427.19(0.0882)

Finally, I can see this restores compute_score inlining in my gcc
without extra function attributes. Out of caution, I still added
__always_inline in compute_score, to prevent future changes from
un-inlining it again. Since it is only in one site, it should be fine.

Cc: Lorenz Bauer <lmb@isovalent.com>
Fixes: f0ea27e7bfe1 ("udp: re-score reuseport groups when connected sockets are present")
Signed-off-by: Gabriel Krisman Bertazi <krisman@suse.de>

---
Another idea would be shrinking compute_score and then inlining it. I'm
not a network developer, but it seems that we can avoid most of the
"same network" checks of calculate_score when passing a socket from the
reusegroup. If that is the case, we can fork out a compute_score_fast
that can be safely inlined at the second call site of the existing
compute_score. I didn't pursue this any further.
---
 net/ipv4/udp.c | 24 ++++++++++++++++++------
 net/ipv6/udp.c | 23 ++++++++++++++++++-----
 2 files changed, 36 insertions(+), 11 deletions(-)