diff mbox series

ring-buffer: fix array bounds checking

Message ID 20241216164931.57323-1-aha310510@gmail.com (mailing list archive)
State New
Headers show
Series ring-buffer: fix array bounds checking | expand

Commit Message

Jeongjun Park Dec. 16, 2024, 4:49 p.m. UTC
If there is a case where the variable s is greater than or equal to nr_subbufs
before entering the loop, oob read or use-after-free will occur. This problem
occurs because the variable s is used as an index to dereference the
struct page before the variable value range check. This logic prevents the
wrong address value from being copied to the pages array through the subsequent
range check, but oob read still occurs, so the code needs to be modified.

Fixes: 117c39200d9d ("ring-buffer: Introducing ring-buffer mapping functions")
Signed-off-by: Jeongjun Park <aha310510@gmail.com>
---
 kernel/trace/ring_buffer.c | 10 +++++-----
 1 file changed, 5 insertions(+), 5 deletions(-)

--

Comments

Vincent Donnefort Dec. 16, 2024, 5:54 p.m. UTC | #1
On Tue, Dec 17, 2024 at 01:49:30AM +0900, Jeongjun Park wrote:
> If there is a case where the variable s is greater than or equal to nr_subbufs
> before entering the loop, oob read or use-after-free will occur. This problem
> occurs because the variable s is used as an index to dereference the
> struct page before the variable value range check. This logic prevents the
> wrong address value from being copied to the pages array through the subsequent
> range check, but oob read still occurs, so the code needs to be modified.

Hi Jeongjun, thanks for the patch.

Did you find a reproducer for that problem or has it just been found by code
inspection?

As discussed here [1], s >= nr_subbufs should really never happen as we already
cap nr_pages. 

[1] https://lore.kernel.org/all/78e20e98-bdfc-4d7b-a59c-988b81fcc58b@redhat.com/, 

> 
> Fixes: 117c39200d9d ("ring-buffer: Introducing ring-buffer mapping functions")
> Signed-off-by: Jeongjun Park <aha310510@gmail.com>
> ---
>  kernel/trace/ring_buffer.c | 10 +++++-----
>  1 file changed, 5 insertions(+), 5 deletions(-)
> 
> diff --git a/kernel/trace/ring_buffer.c b/kernel/trace/ring_buffer.c
> index 7e257e855dd1..83da74bf7bd6 100644
> --- a/kernel/trace/ring_buffer.c
> +++ b/kernel/trace/ring_buffer.c
> @@ -6994,9 +6994,9 @@ static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
>  {
>  	unsigned long nr_subbufs, nr_pages, nr_vma_pages, pgoff = vma->vm_pgoff;
>  	unsigned int subbuf_pages, subbuf_order;
> -	struct page **pages;
> +	struct page **pages, *page;
>  	int p = 0, s = 0;
> -	int err;
> +	int err, off;
>  
>  	/* Refuse MP_PRIVATE or writable mappings */
>  	if (vma->vm_flags & VM_WRITE || vma->vm_flags & VM_EXEC ||
> @@ -7055,14 +7055,14 @@ static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
>  	}
>  
>  	while (p < nr_pages) {
> -		struct page *page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
> -		int off = 0;
> -

I believe we can keep the struct page and off declaration within the while loop.

>  		if (WARN_ON_ONCE(s >= nr_subbufs)) {
>  			err = -EINVAL;
>  			goto out;
>  		}
>  
> +		page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
> +		off = 0;
> +
>  		for (; off < (1 << (subbuf_order)); off++, page++) {
>  			if (p >= nr_pages)
>  				break;
> --
Jeongjun Park Dec. 17, 2024, 1:28 a.m. UTC | #2
> Vincent Donnefort <vdonnefort@google.com> wrote:
> On Tue, Dec 17, 2024 at 01:49:30AM +0900, Jeongjun Park wrote:
>> If there is a case where the variable s is greater than or equal to nr_subbufs
>> before entering the loop, oob read or use-after-free will occur. This problem
>> occurs because the variable s is used as an index to dereference the
>> struct page before the variable value range check. This logic prevents the
>> wrong address value from being copied to the pages array through the subsequent
>> range check, but oob read still occurs, so the code needs to be modified.
> 
> Hi Jeongjun, thanks for the patch.
> 
> Did you find a reproducer for that problem or has it just been found by code
> inspection?
> 
> As discussed here [1], s >= nr_subbufs should really never happen as we already
> cap nr_pages.
> 
> [1] https://lore.kernel.org/all/78e20e98-bdfc-4d7b-a59c-988b81fcc58b@redhat.com/,

I didn't find the bug caused by this separately, but I found it while analyzing
the code. However, since it has been confirmed that syzbot
has a reproducer that generates oob and uaf, this will definitely be
reproduced.

The reason I suggested this patch is because I think the logic of the code
is a bit inappropriate. Normally, a range check is performed before using
a specific variable as an index of an array. Of course, in this loop, the page
structure pointer that was oob-read will not be copied to the pages array,
but I don't think it's very appropriate to read the array using a variable
value that may be out of range as an index before the range check.
Therefore, I suggest patching it like this.

> 
>> 
>> Fixes: 117c39200d9d ("ring-buffer: Introducing ring-buffer mapping functions")
>> Signed-off-by: Jeongjun Park <aha310510@gmail.com>
>> ---
>> kernel/trace/ring_buffer.c | 10 +++++-----
>> 1 file changed, 5 insertions(+), 5 deletions(-)
>> 
>> diff --git a/kernel/trace/ring_buffer.c b/kernel/trace/ring_buffer.c
>> index 7e257e855dd1..83da74bf7bd6 100644
>> --- a/kernel/trace/ring_buffer.c
>> +++ b/kernel/trace/ring_buffer.c
>> @@ -6994,9 +6994,9 @@ static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
>> {
>>    unsigned long nr_subbufs, nr_pages, nr_vma_pages, pgoff = vma->vm_pgoff;
>>    unsigned int subbuf_pages, subbuf_order;
>> -    struct page **pages;
>> +    struct page **pages, *page;
>>    int p = 0, s = 0;
>> -    int err;
>> +    int err, off;
>> 
>>    /* Refuse MP_PRIVATE or writable mappings */
>>    if (vma->vm_flags & VM_WRITE || vma->vm_flags & VM_EXEC ||
>> @@ -7055,14 +7055,14 @@ static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
>>    }
>> 
>>    while (p < nr_pages) {
>> -        struct page *page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
>> -        int off = 0;
>> -
> 
> I believe we can keep the struct page and off declaration within the while loop.

The reason I modified it this way is that, since this loop will always be 
entered if there are no other issues, these variables will be used in 
many situations, so I think it is quite inefficient to continue to declare variables 
in a loop where you don't know how many times it will be repeated. 
So, I think that declaring variables in advance and then continuously initializing 
their values ​​is advantageous in terms of performance and there are 
no other issues. What do you think?

Regards,

Jeongjun Park

> 
>>        if (WARN_ON_ONCE(s >= nr_subbufs)) {
>>            err = -EINVAL;
>>            goto out;
>>        }
>> 
>> +        page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
>> +        off = 0;
>> +
>>        for (; off < (1 << (subbuf_order)); off++, page++) {
>>            if (p >= nr_pages)
>>                break;
>> --
Vincent Donnefort Dec. 17, 2024, 8:38 a.m. UTC | #3
On Tue, Dec 17, 2024 at 10:28:49AM +0900, Jeongjun Park wrote:
> 
> 
> > Vincent Donnefort <vdonnefort@google.com> wrote:
> > On Tue, Dec 17, 2024 at 01:49:30AM +0900, Jeongjun Park wrote:
> >> If there is a case where the variable s is greater than or equal to nr_subbufs
> >> before entering the loop, oob read or use-after-free will occur. This problem
> >> occurs because the variable s is used as an index to dereference the
> >> struct page before the variable value range check. This logic prevents the
> >> wrong address value from being copied to the pages array through the subsequent
> >> range check, but oob read still occurs, so the code needs to be modified.
> > 
> > Hi Jeongjun, thanks for the patch.
> > 
> > Did you find a reproducer for that problem or has it just been found by code
> > inspection?
> > 
> > As discussed here [1], s >= nr_subbufs should really never happen as we already
> > cap nr_pages.
> > 
> > [1] https://lore.kernel.org/all/78e20e98-bdfc-4d7b-a59c-988b81fcc58b@redhat.com/,
> 
> I didn't find the bug caused by this separately, but I found it while analyzing
> the code. However, since it has been confirmed that syzbot
> has a reproducer that generates oob and uaf, this will definitely be
> reproduced.

Could you share that reproducer? Or at least the steps. As this situation should
never happen a, follow-up fix will be necessary.

> 
> The reason I suggested this patch is because I think the logic of the code
> is a bit inappropriate. Normally, a range check is performed before using
> a specific variable as an index of an array. Of course, in this loop, the page
> structure pointer that was oob-read will not be copied to the pages array,
> but I don't think it's very appropriate to read the array using a variable
> value that may be out of range as an index before the range check.
> Therefore, I suggest patching it like this.

Of course, no question about that.

> 
> > 
> >> 
> >> Fixes: 117c39200d9d ("ring-buffer: Introducing ring-buffer mapping functions")
> >> Signed-off-by: Jeongjun Park <aha310510@gmail.com>
> >> ---
> >> kernel/trace/ring_buffer.c | 10 +++++-----
> >> 1 file changed, 5 insertions(+), 5 deletions(-)
> >> 
> >> diff --git a/kernel/trace/ring_buffer.c b/kernel/trace/ring_buffer.c
> >> index 7e257e855dd1..83da74bf7bd6 100644
> >> --- a/kernel/trace/ring_buffer.c
> >> +++ b/kernel/trace/ring_buffer.c
> >> @@ -6994,9 +6994,9 @@ static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
> >> {
> >>    unsigned long nr_subbufs, nr_pages, nr_vma_pages, pgoff = vma->vm_pgoff;
> >>    unsigned int subbuf_pages, subbuf_order;
> >> -    struct page **pages;
> >> +    struct page **pages, *page;
> >>    int p = 0, s = 0;
> >> -    int err;
> >> +    int err, off;
> >> 
> >>    /* Refuse MP_PRIVATE or writable mappings */
> >>    if (vma->vm_flags & VM_WRITE || vma->vm_flags & VM_EXEC ||
> >> @@ -7055,14 +7055,14 @@ static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
> >>    }
> >> 
> >>    while (p < nr_pages) {
> >> -        struct page *page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
> >> -        int off = 0;
> >> -
> > 
> > I believe we can keep the struct page and off declaration within the while loop.
> 
> The reason I modified it this way is that, since this loop will always be 
> entered if there are no other issues, these variables will be used in 
> many situations, so I think it is quite inefficient to continue to declare variables 
> in a loop where you don't know how many times it will be repeated. 
> So, I think that declaring variables in advance and then continuously initializing 
> their values ​​is advantageous in terms of performance and there are 
> no other issues. What do you think?

I'm pretty sure the compiler would do the right thing here and no additional
step would result from declaring both variables inside the loop.

> 
> Regards,
> 
> Jeongjun Park
> 
> > 
> >>        if (WARN_ON_ONCE(s >= nr_subbufs)) {
> >>            err = -EINVAL;
> >>            goto out;
> >>        }
> >> 
> >> +        page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
> >> +        off = 0;
> >> +
> >>        for (; off < (1 << (subbuf_order)); off++, page++) {
> >>            if (p >= nr_pages)
> >>                break;
> >> --
Jeongjun Park Dec. 17, 2024, 1:42 p.m. UTC | #4
Vincent Donnefort <vdonnefort@google.com> wrote:
>
> On Tue, Dec 17, 2024 at 10:28:49AM +0900, Jeongjun Park wrote:
> >
> >
> > > Vincent Donnefort <vdonnefort@google.com> wrote:
> > > On Tue, Dec 17, 2024 at 01:49:30AM +0900, Jeongjun Park wrote:
> > >> If there is a case where the variable s is greater than or equal to nr_subbufs
> > >> before entering the loop, oob read or use-after-free will occur. This problem
> > >> occurs because the variable s is used as an index to dereference the
> > >> struct page before the variable value range check. This logic prevents the
> > >> wrong address value from being copied to the pages array through the subsequent
> > >> range check, but oob read still occurs, so the code needs to be modified.
> > >
> > > Hi Jeongjun, thanks for the patch.
> > >
> > > Did you find a reproducer for that problem or has it just been found by code
> > > inspection?
> > >
> > > As discussed here [1], s >= nr_subbufs should really never happen as we already
> > > cap nr_pages.
> > >
> > > [1] https://lore.kernel.org/all/78e20e98-bdfc-4d7b-a59c-988b81fcc58b@redhat.com/,
> >
> > I didn't find the bug caused by this separately, but I found it while analyzing
> > the code. However, since it has been confirmed that syzbot
> > has a reproducer that generates oob and uaf, this will definitely be
> > reproduced.
>
> Could you share that reproducer? Or at least the steps. As this situation should
> never happen a, follow-up fix will be necessary.

[1] When tested with a reproducer, pgoff was 8, subbuf_order was 0, and
subbuf_pages was 1. However, nr_subbufs was 3, so oob-read or uaf occurred.

[1] : https://syzkaller.appspot.com/text?tag=ReproC&x=14514730580000

>
> >
> > The reason I suggested this patch is because I think the logic of the code
> > is a bit inappropriate. Normally, a range check is performed before using
> > a specific variable as an index of an array. Of course, in this loop, the page
> > structure pointer that was oob-read will not be copied to the pages array,
> > but I don't think it's very appropriate to read the array using a variable
> > value that may be out of range as an index before the range check.
> > Therefore, I suggest patching it like this.
>
> Of course, no question about that.
>
> >
> > >
> > >>
> > >> Fixes: 117c39200d9d ("ring-buffer: Introducing ring-buffer mapping functions")
> > >> Signed-off-by: Jeongjun Park <aha310510@gmail.com>
> > >> ---
> > >> kernel/trace/ring_buffer.c | 10 +++++-----
> > >> 1 file changed, 5 insertions(+), 5 deletions(-)
> > >>
> > >> diff --git a/kernel/trace/ring_buffer.c b/kernel/trace/ring_buffer.c
> > >> index 7e257e855dd1..83da74bf7bd6 100644
> > >> --- a/kernel/trace/ring_buffer.c
> > >> +++ b/kernel/trace/ring_buffer.c
> > >> @@ -6994,9 +6994,9 @@ static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
> > >> {
> > >>    unsigned long nr_subbufs, nr_pages, nr_vma_pages, pgoff = vma->vm_pgoff;
> > >>    unsigned int subbuf_pages, subbuf_order;
> > >> -    struct page **pages;
> > >> +    struct page **pages, *page;
> > >>    int p = 0, s = 0;
> > >> -    int err;
> > >> +    int err, off;
> > >>
> > >>    /* Refuse MP_PRIVATE or writable mappings */
> > >>    if (vma->vm_flags & VM_WRITE || vma->vm_flags & VM_EXEC ||
> > >> @@ -7055,14 +7055,14 @@ static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
> > >>    }
> > >>
> > >>    while (p < nr_pages) {
> > >> -        struct page *page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
> > >> -        int off = 0;
> > >> -
> > >
> > > I believe we can keep the struct page and off declaration within the while loop.
> >
> > The reason I modified it this way is that, since this loop will always be
> > entered if there are no other issues, these variables will be used in
> > many situations, so I think it is quite inefficient to continue to declare variables
> > in a loop where you don't know how many times it will be repeated.
> > So, I think that declaring variables in advance and then continuously initializing
> > their values is advantageous in terms of performance and there are
> > no other issues. What do you think?
>
> I'm pretty sure the compiler would do the right thing here and no additional
> step would result from declaring both variables inside the loop.

Okay. In that case, I will just remove the variable declaration related patches
and send you the v2 patch right away.

Regards,

Jeongjun Park

>
> >
> > Regards,
> >
> > Jeongjun Park
> >
> > >
> > >>        if (WARN_ON_ONCE(s >= nr_subbufs)) {
> > >>            err = -EINVAL;
> > >>            goto out;
> > >>        }
> > >>
> > >> +        page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
> > >> +        off = 0;
> > >> +
> > >>        for (; off < (1 << (subbuf_order)); off++, page++) {
> > >>            if (p >= nr_pages)
> > >>                break;
> > >> --
Steven Rostedt Dec. 18, 2024, 5 p.m. UTC | #5
On Tue, 17 Dec 2024 22:42:53 +0900
Jeongjun Park <aha310510@gmail.com> wrote:

> > Could you share that reproducer? Or at least the steps. As this situation should
> > never happen a, follow-up fix will be necessary.  
> 
> [1] When tested with a reproducer, pgoff was 8, subbuf_order was 0, and
> subbuf_pages was 1. However, nr_subbufs was 3, so oob-read or uaf occurred.
> 
> [1] : https://syzkaller.appspot.com/text?tag=ReproC&x=14514730580000

This was fixed by Edwards patch.

> Okay. In that case, I will just remove the variable declaration related patches
> and send you the v2 patch right away.
> 

I'm not sure this is needed nor is it a bug.

	while (p < nr_pages) {
		struct page *page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
		int off = 0;

		if (WARN_ON_ONCE(s >= nr_subbufs)) {
			err = -EINVAL;
			goto out;
		}

The WARN_ON_ONCE() suggests that this should never happen. And I believe it shouldn't.

I'm fine if you want to make the change to:

	while (p < nr_pages) {
		struct page *page;
		int off = 0;

		if (WARN_ON_ONCE(s >= nr_subbufs)) {
			err = -EINVAL;
			goto out;
		}

		page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);

But it's not a bug fix. It's simply a cleanup that can wait till the next
merge window.

-- Steve
diff mbox series

Patch

diff --git a/kernel/trace/ring_buffer.c b/kernel/trace/ring_buffer.c
index 7e257e855dd1..83da74bf7bd6 100644
--- a/kernel/trace/ring_buffer.c
+++ b/kernel/trace/ring_buffer.c
@@ -6994,9 +6994,9 @@  static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
 {
 	unsigned long nr_subbufs, nr_pages, nr_vma_pages, pgoff = vma->vm_pgoff;
 	unsigned int subbuf_pages, subbuf_order;
-	struct page **pages;
+	struct page **pages, *page;
 	int p = 0, s = 0;
-	int err;
+	int err, off;
 
 	/* Refuse MP_PRIVATE or writable mappings */
 	if (vma->vm_flags & VM_WRITE || vma->vm_flags & VM_EXEC ||
@@ -7055,14 +7055,14 @@  static int __rb_map_vma(struct ring_buffer_per_cpu *cpu_buffer,
 	}
 
 	while (p < nr_pages) {
-		struct page *page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
-		int off = 0;
-
 		if (WARN_ON_ONCE(s >= nr_subbufs)) {
 			err = -EINVAL;
 			goto out;
 		}
 
+		page = virt_to_page((void *)cpu_buffer->subbuf_ids[s]);
+		off = 0;
+
 		for (; off < (1 << (subbuf_order)); off++, page++) {
 			if (p >= nr_pages)
 				break;