diff mbox series

[bpf,v3] bpf: Fix memory leaks in __check_func_call

Message ID 1667468524-4926-1-git-send-email-wangyufen@huawei.com (mailing list archive)
State Superseded
Delegated to: BPF
Headers show
Series [bpf,v3] bpf: Fix memory leaks in __check_func_call | expand

Checks

Context Check Description
netdev/tree_selection success Clearly marked for bpf
netdev/fixes_present success Fixes tag present in non-next series
netdev/subject_prefix success Link
netdev/cover_letter success Single patches do not need cover letters
netdev/patch_count success Link
netdev/header_inline success No static functions without inline keyword in header files
netdev/build_32bit success Errors and warnings before: 10 this patch: 10
netdev/cc_maintainers warning 5 maintainers not CCed: sdf@google.com haoluo@google.com jolsa@kernel.org kpsingh@kernel.org song@kernel.org
netdev/build_clang success Errors and warnings before: 5 this patch: 5
netdev/module_param success Was 0 now: 0
netdev/verify_signedoff success Signed-off-by tag matches author and committer
netdev/check_selftest success No net selftest shell script
netdev/verify_fixes success Fixes tag looks correct
netdev/build_allmodconfig_warn success Errors and warnings before: 10 this patch: 10
netdev/checkpatch success total: 0 errors, 0 warnings, 0 checks, 40 lines checked
netdev/kdoc success Errors and warnings before: 0 this patch: 0
netdev/source_inline success Was 0 now: 0
bpf/vmtest-bpf-VM_Test-2 success Logs for build for s390x with gcc
bpf/vmtest-bpf-VM_Test-27 success Logs for test_progs_parallel on x86_64 with gcc
bpf/vmtest-bpf-VM_Test-28 success Logs for test_progs_parallel on x86_64 with llvm-16
bpf/vmtest-bpf-VM_Test-30 success Logs for test_verifier on x86_64 with gcc
bpf/vmtest-bpf-VM_Test-31 success Logs for test_verifier on x86_64 with llvm-16
bpf/vmtest-bpf-VM_Test-29 success Logs for test_verifier on s390x with gcc
bpf/vmtest-bpf-VM_Test-26 success Logs for test_progs_parallel on s390x with gcc
bpf/vmtest-bpf-PR fail PR summary
bpf/vmtest-bpf-VM_Test-4 success Logs for build for x86_64 with gcc
bpf/vmtest-bpf-VM_Test-5 success Logs for build for x86_64 with llvm-16
bpf/vmtest-bpf-VM_Test-3 success Logs for build for s390x with gcc
bpf/vmtest-bpf-VM_Test-18 success Logs for test_progs_no_alu32_parallel on x86_64 with gcc
bpf/vmtest-bpf-VM_Test-19 success Logs for test_progs_no_alu32_parallel on x86_64 with llvm-16
bpf/vmtest-bpf-VM_Test-21 success Logs for test_progs_parallel on x86_64 with gcc
bpf/vmtest-bpf-VM_Test-22 success Logs for test_progs_parallel on x86_64 with llvm-16
bpf/vmtest-bpf-VM_Test-24 success Logs for test_verifier on x86_64 with gcc
bpf/vmtest-bpf-VM_Test-25 success Logs for test_verifier on x86_64 with llvm-16
bpf/vmtest-bpf-VM_Test-15 success Logs for test_progs_no_alu32 on x86_64 with gcc
bpf/vmtest-bpf-VM_Test-16 fail Logs for test_progs_no_alu32 on x86_64 with llvm-16
bpf/vmtest-bpf-VM_Test-12 success Logs for test_progs on x86_64 with gcc
bpf/vmtest-bpf-VM_Test-13 success Logs for test_progs on x86_64 with llvm-16
bpf/vmtest-bpf-VM_Test-9 success Logs for test_maps on x86_64 with gcc
bpf/vmtest-bpf-VM_Test-23 success Logs for test_verifier on s390x with gcc
bpf/vmtest-bpf-VM_Test-14 success Logs for test_progs_no_alu32 on s390x with gcc
bpf/vmtest-bpf-VM_Test-17 success Logs for test_progs_no_alu32_parallel on s390x with gcc
bpf/vmtest-bpf-VM_Test-20 success Logs for test_progs_parallel on s390x with gcc
bpf/vmtest-bpf-VM_Test-11 success Logs for test_progs on s390x with gcc
bpf/vmtest-bpf-VM_Test-1 pending Logs for ${{ matrix.test }} on ${{ matrix.arch }} with ${{ matrix.toolchain }}
bpf/vmtest-bpf-VM_Test-6 success Logs for build for x86_64 with gcc
bpf/vmtest-bpf-VM_Test-7 success Logs for build for x86_64 with llvm-16
bpf/vmtest-bpf-VM_Test-8 success Logs for llvm-toolchain
bpf/vmtest-bpf-VM_Test-10 success Logs for set-matrix

Commit Message

wangyufen Nov. 3, 2022, 9:42 a.m. UTC
kmemleak reports this issue:

unreferenced object 0xffff88817139d000 (size 2048):
  comm "test_progs", pid 33246, jiffies 4307381979 (age 45851.820s)
  hex dump (first 32 bytes):
    01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
    00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
  backtrace:
    [<0000000045f075f0>] kmalloc_trace+0x27/0xa0
    [<0000000098b7c90a>] __check_func_call+0x316/0x1230
    [<00000000b4c3c403>] check_helper_call+0x172e/0x4700
    [<00000000aa3875b7>] do_check+0x21d8/0x45e0
    [<000000001147357b>] do_check_common+0x767/0xaf0
    [<00000000b5a595b4>] bpf_check+0x43e3/0x5bc0
    [<0000000011e391b1>] bpf_prog_load+0xf26/0x1940
    [<0000000007f765c0>] __sys_bpf+0xd2c/0x3650
    [<00000000839815d6>] __x64_sys_bpf+0x75/0xc0
    [<00000000946ee250>] do_syscall_64+0x3b/0x90
    [<0000000000506b7f>] entry_SYSCALL_64_after_hwframe+0x63/0xcd

The root case here is: In function prepare_func_exit(), the callee is
not released in the abnormal scenario after "state->curframe--;". To
fix, move "state->curframe--;" to the very bottom of the function,
right when we free callee and reset frame[] pointer to NULL, as Andrii
suggested.

In addition, function __check_func_call() has a similar problem. In
the abnormal scenario before "state->curframe++;", the callee is alse
not released.

Fixes: 69c087ba6225 ("bpf: Add bpf_for_each_map_elem() helper")
Fixes: fd978bf7fd31 ("bpf: Add reference tracking to verifier")
Signed-off-by: Wang Yufen <wangyufen@huawei.com>
---
 kernel/bpf/verifier.c | 13 +++++++++----
 1 file changed, 9 insertions(+), 4 deletions(-)

Comments

Martin KaFai Lau Nov. 8, 2022, 12:46 a.m. UTC | #1
On 11/3/22 2:42 AM, Wang Yufen wrote:
> kmemleak reports this issue:
> 
> unreferenced object 0xffff88817139d000 (size 2048):
>    comm "test_progs", pid 33246, jiffies 4307381979 (age 45851.820s)
>    hex dump (first 32 bytes):
>      01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
>      00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00  ................
>    backtrace:
>      [<0000000045f075f0>] kmalloc_trace+0x27/0xa0
>      [<0000000098b7c90a>] __check_func_call+0x316/0x1230
>      [<00000000b4c3c403>] check_helper_call+0x172e/0x4700
>      [<00000000aa3875b7>] do_check+0x21d8/0x45e0
>      [<000000001147357b>] do_check_common+0x767/0xaf0
>      [<00000000b5a595b4>] bpf_check+0x43e3/0x5bc0
>      [<0000000011e391b1>] bpf_prog_load+0xf26/0x1940
>      [<0000000007f765c0>] __sys_bpf+0xd2c/0x3650
>      [<00000000839815d6>] __x64_sys_bpf+0x75/0xc0
>      [<00000000946ee250>] do_syscall_64+0x3b/0x90
>      [<0000000000506b7f>] entry_SYSCALL_64_after_hwframe+0x63/0xcd
> 
> The root case here is: In function prepare_func_exit(), the callee is
> not released in the abnormal scenario after "state->curframe--;". To
> fix, move "state->curframe--;" to the very bottom of the function,
> right when we free callee and reset frame[] pointer to NULL, as Andrii
> suggested.
> 
> In addition, function __check_func_call() has a similar problem. In
> the abnormal scenario before "state->curframe++;", the callee is alse
> not released.
> 
> Fixes: 69c087ba6225 ("bpf: Add bpf_for_each_map_elem() helper")
> Fixes: fd978bf7fd31 ("bpf: Add reference tracking to verifier")
> Signed-off-by: Wang Yufen <wangyufen@huawei.com>
> ---
>   kernel/bpf/verifier.c | 13 +++++++++----
>   1 file changed, 9 insertions(+), 4 deletions(-)
> 
> diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
> index 7f0a9f6..eff7a5a 100644
> --- a/kernel/bpf/verifier.c
> +++ b/kernel/bpf/verifier.c
> @@ -6736,11 +6736,11 @@ static int __check_func_call(struct bpf_verifier_env *env, struct bpf_insn *insn
>   	/* Transfer references to the callee */
>   	err = copy_reference_state(callee, caller);
>   	if (err)
> -		return err;
> +		goto err_out;
>   
>   	err = set_callee_state_cb(env, caller, callee, *insn_idx);
>   	if (err)
> -		return err;
> +		goto err_out;
>   
>   	clear_caller_saved_regs(env, caller->regs);
>   
> @@ -6757,6 +6757,11 @@ static int __check_func_call(struct bpf_verifier_env *env, struct bpf_insn *insn
>   		print_verifier_state(env, callee, true);
>   	}
>   	return 0;
> +
> +err_out:
> +	kfree(callee);

Is it sure that free_func_state() is not needed ?

> +	state->frame[state->curframe + 1] = NULL;
> +	return err;
>   }
>   
>   int map_set_for_each_callback_args(struct bpf_verifier_env *env,
> @@ -6970,8 +6975,7 @@ static int prepare_func_exit(struct bpf_verifier_env *env, int *insn_idx)
>   		return -EINVAL;
>   	}
>   
> -	state->curframe--;
> -	caller = state->frame[state->curframe];
> +	caller = state->frame[state->curframe - 1];
>   	if (callee->in_callback_fn) {
>   		/* enforce R0 return value range [0, 1]. */
>   		struct tnum range = callee->callback_ret_range;
> @@ -7001,6 +7005,7 @@ static int prepare_func_exit(struct bpf_verifier_env *env, int *insn_idx)
>   			return err;
>   	}
>   
> +	state->curframe--;

nit. state->curframe is always pointing to callee upto this point?  Instead of 
doing another +1 dance in the latter 'state->frame[state->curframe + 1] = 
NULL;', how about do it later like:

	/* clear everything in the callee */
         free_func_state(callee);
	state->frame[state->curframe--] = NULL;


It shouldn't affect the earlier print_verifier_state() which explicitly takes 
callee and caller as its arg, right?

>   	*insn_idx = callee->callsite + 1;
>   	if (env->log.level & BPF_LOG_LEVEL) {
>   		verbose(env, "returning from callee:\n");
wangyufen Nov. 8, 2022, 3:32 a.m. UTC | #2
在 2022/11/8 8:46, Martin KaFai Lau 写道:
> On 11/3/22 2:42 AM, Wang Yufen wrote:
>> kmemleak reports this issue:
>>
>> unreferenced object 0xffff88817139d000 (size 2048):
>>    comm "test_progs", pid 33246, jiffies 4307381979 (age 45851.820s)
>>    hex dump (first 32 bytes):
>>      01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
>>      00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 ................
>>    backtrace:
>>      [<0000000045f075f0>] kmalloc_trace+0x27/0xa0
>>      [<0000000098b7c90a>] __check_func_call+0x316/0x1230
>>      [<00000000b4c3c403>] check_helper_call+0x172e/0x4700
>>      [<00000000aa3875b7>] do_check+0x21d8/0x45e0
>>      [<000000001147357b>] do_check_common+0x767/0xaf0
>>      [<00000000b5a595b4>] bpf_check+0x43e3/0x5bc0
>>      [<0000000011e391b1>] bpf_prog_load+0xf26/0x1940
>>      [<0000000007f765c0>] __sys_bpf+0xd2c/0x3650
>>      [<00000000839815d6>] __x64_sys_bpf+0x75/0xc0
>>      [<00000000946ee250>] do_syscall_64+0x3b/0x90
>>      [<0000000000506b7f>] entry_SYSCALL_64_after_hwframe+0x63/0xcd
>>
>> The root case here is: In function prepare_func_exit(), the callee is
>> not released in the abnormal scenario after "state->curframe--;". To
>> fix, move "state->curframe--;" to the very bottom of the function,
>> right when we free callee and reset frame[] pointer to NULL, as Andrii
>> suggested.
>>
>> In addition, function __check_func_call() has a similar problem. In
>> the abnormal scenario before "state->curframe++;", the callee is alse
>> not released.
>>
>> Fixes: 69c087ba6225 ("bpf: Add bpf_for_each_map_elem() helper")
>> Fixes: fd978bf7fd31 ("bpf: Add reference tracking to verifier")
>> Signed-off-by: Wang Yufen <wangyufen@huawei.com>
>> ---
>>   kernel/bpf/verifier.c | 13 +++++++++----
>>   1 file changed, 9 insertions(+), 4 deletions(-)
>>
>> diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
>> index 7f0a9f6..eff7a5a 100644
>> --- a/kernel/bpf/verifier.c
>> +++ b/kernel/bpf/verifier.c
>> @@ -6736,11 +6736,11 @@ static int __check_func_call(struct 
>> bpf_verifier_env *env, struct bpf_insn *insn
>>       /* Transfer references to the callee */
>>       err = copy_reference_state(callee, caller);
>>       if (err)
>> -        return err;
>> +        goto err_out;
>>         err = set_callee_state_cb(env, caller, callee, *insn_idx);
>>       if (err)
>> -        return err;
>> +        goto err_out;
>>         clear_caller_saved_regs(env, caller->regs);
>>   @@ -6757,6 +6757,11 @@ static int __check_func_call(struct 
>> bpf_verifier_env *env, struct bpf_insn *insn
>>           print_verifier_state(env, callee, true);
>>       }
>>       return 0;
>> +
>> +err_out:
>> +    kfree(callee);
>
> Is it sure that free_func_state() is not needed ?

No, free_func_state() is needed. Sorry, I didn't notice that memory is 
alloced for callee->refs in

copy_reference_state()-> copy_array() -> kmalloc_track_caller().  will 
change in v4

Thanks.

Wang

>
>> +    state->frame[state->curframe + 1] = NULL;
>> +    return err;
>>   }
>>     int map_set_for_each_callback_args(struct bpf_verifier_env *env,
>> @@ -6970,8 +6975,7 @@ static int prepare_func_exit(struct 
>> bpf_verifier_env *env, int *insn_idx)
>>           return -EINVAL;
>>       }
>>   -    state->curframe--;
>> -    caller = state->frame[state->curframe];
>> +    caller = state->frame[state->curframe - 1];
>>       if (callee->in_callback_fn) {
>>           /* enforce R0 return value range [0, 1]. */
>>           struct tnum range = callee->callback_ret_range;
>> @@ -7001,6 +7005,7 @@ static int prepare_func_exit(struct 
>> bpf_verifier_env *env, int *insn_idx)
>>               return err;
>>       }
>>   +    state->curframe--;
>
> nit. state->curframe is always pointing to callee upto this point?  
> Instead of doing another +1 dance in the latter 
> 'state->frame[state->curframe + 1] = NULL;', how about do it later like:
>
>     /* clear everything in the callee */
>         free_func_state(callee);
>     state->frame[state->curframe--] = NULL;
>
>
> It shouldn't affect the earlier print_verifier_state() which 
> explicitly takes callee and caller as its arg, right?

Yes, state->curframe not affect print_verifier_state(),  doing 
"state->curframe--"  at the end of the function is better, also will 
change in v4.  Thanks!

>
>>       *insn_idx = callee->callsite + 1;
>>       if (env->log.level & BPF_LOG_LEVEL) {
>>           verbose(env, "returning from callee:\n");
>
diff mbox series

Patch

diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
index 7f0a9f6..eff7a5a 100644
--- a/kernel/bpf/verifier.c
+++ b/kernel/bpf/verifier.c
@@ -6736,11 +6736,11 @@  static int __check_func_call(struct bpf_verifier_env *env, struct bpf_insn *insn
 	/* Transfer references to the callee */
 	err = copy_reference_state(callee, caller);
 	if (err)
-		return err;
+		goto err_out;
 
 	err = set_callee_state_cb(env, caller, callee, *insn_idx);
 	if (err)
-		return err;
+		goto err_out;
 
 	clear_caller_saved_regs(env, caller->regs);
 
@@ -6757,6 +6757,11 @@  static int __check_func_call(struct bpf_verifier_env *env, struct bpf_insn *insn
 		print_verifier_state(env, callee, true);
 	}
 	return 0;
+
+err_out:
+	kfree(callee);
+	state->frame[state->curframe + 1] = NULL;
+	return err;
 }
 
 int map_set_for_each_callback_args(struct bpf_verifier_env *env,
@@ -6970,8 +6975,7 @@  static int prepare_func_exit(struct bpf_verifier_env *env, int *insn_idx)
 		return -EINVAL;
 	}
 
-	state->curframe--;
-	caller = state->frame[state->curframe];
+	caller = state->frame[state->curframe - 1];
 	if (callee->in_callback_fn) {
 		/* enforce R0 return value range [0, 1]. */
 		struct tnum range = callee->callback_ret_range;
@@ -7001,6 +7005,7 @@  static int prepare_func_exit(struct bpf_verifier_env *env, int *insn_idx)
 			return err;
 	}
 
+	state->curframe--;
 	*insn_idx = callee->callsite + 1;
 	if (env->log.level & BPF_LOG_LEVEL) {
 		verbose(env, "returning from callee:\n");