diff mbox series

riscv: stacktrace: fix dump_backtrace/walk_stackframe with NULL task

Message ID 20210627092659.46193-1-changbin.du@gmail.com (mailing list archive)
State New, archived
Headers show
Series riscv: stacktrace: fix dump_backtrace/walk_stackframe with NULL task | expand

Commit Message

Changbin Du June 27, 2021, 9:26 a.m. UTC
Some places try to show backtrace with NULL task, and expect the task is
'current'. For example, dump_stack()->show_stack(NULL,...). So the
stacktrace code should take care of this case.

Here is an oops caused by this issue when accessing the NULL task.

[   15.180813] Kernel panic - not syncing: No working init found.  Try passing init= option to kernel. See Linux Documentation/admin-guide/init.rst for guidance.
[   15.182382] CPU: 3 PID: 1 Comm: swapper/0 Not tainted 5.13.0-rc7-00111-g625acffd7ae2-dirty #18
[   15.183431] Hardware name: riscv-virtio,qemu (DT)
[   15.184253] Call Trace:
[   15.223617] Unable to handle kernel paging request at virtual address 0000000000001590
[   15.267378] Oops [#1]
[   15.268215] Modules linked in:
[   15.272027] CPU: 3 PID: 1 Comm: swapper/0 Not tainted 5.13.0-rc7-00111-g625acffd7ae2-dirty #18
[   15.273997] Hardware name: riscv-virtio,qemu (DT)
[   15.275134] epc : walk_stackframe+0xc4/0xdc
[   15.280146]  ra : dump_backtrace+0x30/0x38
[   15.280799] epc : ffffffff8000597e ra : ffffffff800059c6 sp : ffffffe002383d60
[   15.281622]  gp : ffffffff8179ad18 tp : ffffffe002378000 t0 : ffffffff81bc1a3f
[   15.282574]  t1 : 0000000000000001 t2 : 0000000000000000 s0 : ffffffe002383dc0
[   15.283782]  s1 : ffffffff812b7d18 a0 : 0000000000001000 a1 : 0000000000000000
[   15.285115]  a2 : ffffffff807ec668 a3 : ffffffff812b7d18 a4 : c76c00cabf08b500
[   15.286213]  a5 : 0000000000001000 a6 : 000000001a9ef260 a7 : 0000000000000000
[   15.287317]  s2 : 0000000000000000 s3 : 0000000000000000 s4 : 0000000000000000
[   15.288323]  s5 : ffffffff807ec668 s6 : ffffffff812b7d18 s7 : 0000000000000000
[   15.289530]  s8 : 0000000000000000 s9 : 0000000000000000 s10: 0000000000000000
[   15.290995]  s11: 0000000000000000 t3 : 0000000000000001 t4 : 0000000000000000
[   15.292465]  t5 : 206f74206e6f6974 t6 : ffffffe002383b28
[   15.293859] status: 0000000000000100 badaddr: 0000000000001590 cause: 000000000000000d
[   15.296035] [<ffffffff8000597e>] walk_stackframe+0xc4/0xdc
[   15.297342] [<ffffffff800059c6>] dump_backtrace+0x30/0x38
[   15.298333] [<ffffffff807ec6e0>] show_stack+0x40/0x4c
[   15.299765] [<ffffffff807f07ac>] dump_stack+0x7c/0x96
[   15.300553] [<ffffffff807ec8be>] panic+0x118/0x300
[   15.301147] [<ffffffff807f61e8>] kernel_init+0x12c/0x138
[   15.302056] [<ffffffff80003a22>] ret_from_exception+0x0/0xc
[   15.338628] ---[ end trace 0a3fa0cc7f3393cd ]---
[   15.339919] note: swapper/0[1] exited with preempt_count 1
[   15.341995] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b
[   15.343889] SMP: stopping secondary CPUs
[   16.802836] SMP: failed to stop secondary CPUs 0-3
[   16.806264] ---[ end Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b ]---

This patch fixes it by setting the task to current if it's NULL before
accessing it.

Signed-off-by: Changbin Du <changbin.du@gmail.com>
Fixes: 5d8544e2d0 ("RISC-V: Generic library routines and assembly")
---
 arch/riscv/kernel/stacktrace.c | 6 ++++++
 1 file changed, 6 insertions(+)

Comments

Jisheng Zhang June 28, 2021, 5:44 a.m. UTC | #1
On Sun, 27 Jun 2021 17:26:59 +0800
Changbin Du <changbin.du@gmail.com> wrote:


> 
> 
> Some places try to show backtrace with NULL task, and expect the task is
> 'current'. For example, dump_stack()->show_stack(NULL,...). So the
> stacktrace code should take care of this case.

I fixed this issue one week ago:

http://lists.infradead.org/pipermail/linux-riscv/2021-June/007258.html

> 
> Here is an oops caused by this issue when accessing the NULL task.
> 
> [   15.180813] Kernel panic - not syncing: No working init found.  Try passing init= option to kernel. See Linux Documentation/admin-guide/init.rst for guidance.
> [   15.182382] CPU: 3 PID: 1 Comm: swapper/0 Not tainted 5.13.0-rc7-00111-g625acffd7ae2-dirty #18
> [   15.183431] Hardware name: riscv-virtio,qemu (DT)
> [   15.184253] Call Trace:
> [   15.223617] Unable to handle kernel paging request at virtual address 0000000000001590
> [   15.267378] Oops [#1]
> [   15.268215] Modules linked in:
> [   15.272027] CPU: 3 PID: 1 Comm: swapper/0 Not tainted 5.13.0-rc7-00111-g625acffd7ae2-dirty #18
> [   15.273997] Hardware name: riscv-virtio,qemu (DT)
> [   15.275134] epc : walk_stackframe+0xc4/0xdc
> [   15.280146]  ra : dump_backtrace+0x30/0x38
> [   15.280799] epc : ffffffff8000597e ra : ffffffff800059c6 sp : ffffffe002383d60
> [   15.281622]  gp : ffffffff8179ad18 tp : ffffffe002378000 t0 : ffffffff81bc1a3f
> [   15.282574]  t1 : 0000000000000001 t2 : 0000000000000000 s0 : ffffffe002383dc0
> [   15.283782]  s1 : ffffffff812b7d18 a0 : 0000000000001000 a1 : 0000000000000000
> [   15.285115]  a2 : ffffffff807ec668 a3 : ffffffff812b7d18 a4 : c76c00cabf08b500
> [   15.286213]  a5 : 0000000000001000 a6 : 000000001a9ef260 a7 : 0000000000000000
> [   15.287317]  s2 : 0000000000000000 s3 : 0000000000000000 s4 : 0000000000000000
> [   15.288323]  s5 : ffffffff807ec668 s6 : ffffffff812b7d18 s7 : 0000000000000000
> [   15.289530]  s8 : 0000000000000000 s9 : 0000000000000000 s10: 0000000000000000
> [   15.290995]  s11: 0000000000000000 t3 : 0000000000000001 t4 : 0000000000000000
> [   15.292465]  t5 : 206f74206e6f6974 t6 : ffffffe002383b28
> [   15.293859] status: 0000000000000100 badaddr: 0000000000001590 cause: 000000000000000d
> [   15.296035] [<ffffffff8000597e>] walk_stackframe+0xc4/0xdc
> [   15.297342] [<ffffffff800059c6>] dump_backtrace+0x30/0x38
> [   15.298333] [<ffffffff807ec6e0>] show_stack+0x40/0x4c
> [   15.299765] [<ffffffff807f07ac>] dump_stack+0x7c/0x96
> [   15.300553] [<ffffffff807ec8be>] panic+0x118/0x300
> [   15.301147] [<ffffffff807f61e8>] kernel_init+0x12c/0x138
> [   15.302056] [<ffffffff80003a22>] ret_from_exception+0x0/0xc
> [   15.338628] ---[ end trace 0a3fa0cc7f3393cd ]---
> [   15.339919] note: swapper/0[1] exited with preempt_count 1
> [   15.341995] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b
> [   15.343889] SMP: stopping secondary CPUs
> [   16.802836] SMP: failed to stop secondary CPUs 0-3
> [   16.806264] ---[ end Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b ]---
> 
> This patch fixes it by setting the task to current if it's NULL before
> accessing it.
> 
> Signed-off-by: Changbin Du <changbin.du@gmail.com>
> Fixes: 5d8544e2d0 ("RISC-V: Generic library routines and assembly")

Hmm, this fixes tag should be
Fixes: eac2f3059e02 ("riscv: stacktrace: fix the riscv stacktrace when CONFIG_FRAME_POINTER enabled"


> ---
>  arch/riscv/kernel/stacktrace.c | 6 ++++++
>  1 file changed, 6 insertions(+)
> 
> diff --git a/arch/riscv/kernel/stacktrace.c b/arch/riscv/kernel/stacktrace.c
> index bde85fc53357..788b65eba965 100644
> --- a/arch/riscv/kernel/stacktrace.c
> +++ b/arch/riscv/kernel/stacktrace.c
> @@ -23,6 +23,9 @@ void notrace walk_stackframe(struct task_struct *task, struct pt_regs *regs,
>  {
>         unsigned long fp, sp, pc;
> 
> +       if (!task)
> +               task = current;
> +
>         if (regs) {
>                 fp = frame_pointer(regs);
>                 sp = user_stack_pointer(regs);
> @@ -73,6 +76,9 @@ void notrace walk_stackframe(struct task_struct *task,
>         unsigned long sp, pc;
>         unsigned long *ksp;
> 
> +       if (!task)
> +               task = current;
> +
>         if (regs) {
>                 sp = user_stack_pointer(regs);
>                 pc = instruction_pointer(regs);
> --
> 2.30.2
> 
> 
> _______________________________________________
> linux-riscv mailing list
> linux-riscv@lists.infradead.org
Changbin Du June 28, 2021, 11:20 p.m. UTC | #2
On Mon, Jun 28, 2021 at 01:44:04PM +0800, Jisheng Zhang wrote:
> On Sun, 27 Jun 2021 17:26:59 +0800
> Changbin Du <changbin.du@gmail.com> wrote:
> 
> 
> > 
> > 
> > Some places try to show backtrace with NULL task, and expect the task is
> > 'current'. For example, dump_stack()->show_stack(NULL,...). So the
> > stacktrace code should take care of this case.
> 
> I fixed this issue one week ago:
> 
> http://lists.infradead.org/pipermail/linux-riscv/2021-June/007258.html
> 
Good to know. Thanks!

> > 
> > Here is an oops caused by this issue when accessing the NULL task.
> > 
> > [   15.180813] Kernel panic - not syncing: No working init found.  Try passing init= option to kernel. See Linux Documentation/admin-guide/init.rst for guidance.
> > [   15.182382] CPU: 3 PID: 1 Comm: swapper/0 Not tainted 5.13.0-rc7-00111-g625acffd7ae2-dirty #18
> > [   15.183431] Hardware name: riscv-virtio,qemu (DT)
> > [   15.184253] Call Trace:
> > [   15.223617] Unable to handle kernel paging request at virtual address 0000000000001590
> > [   15.267378] Oops [#1]
> > [   15.268215] Modules linked in:
> > [   15.272027] CPU: 3 PID: 1 Comm: swapper/0 Not tainted 5.13.0-rc7-00111-g625acffd7ae2-dirty #18
> > [   15.273997] Hardware name: riscv-virtio,qemu (DT)
> > [   15.275134] epc : walk_stackframe+0xc4/0xdc
> > [   15.280146]  ra : dump_backtrace+0x30/0x38
> > [   15.280799] epc : ffffffff8000597e ra : ffffffff800059c6 sp : ffffffe002383d60
> > [   15.281622]  gp : ffffffff8179ad18 tp : ffffffe002378000 t0 : ffffffff81bc1a3f
> > [   15.282574]  t1 : 0000000000000001 t2 : 0000000000000000 s0 : ffffffe002383dc0
> > [   15.283782]  s1 : ffffffff812b7d18 a0 : 0000000000001000 a1 : 0000000000000000
> > [   15.285115]  a2 : ffffffff807ec668 a3 : ffffffff812b7d18 a4 : c76c00cabf08b500
> > [   15.286213]  a5 : 0000000000001000 a6 : 000000001a9ef260 a7 : 0000000000000000
> > [   15.287317]  s2 : 0000000000000000 s3 : 0000000000000000 s4 : 0000000000000000
> > [   15.288323]  s5 : ffffffff807ec668 s6 : ffffffff812b7d18 s7 : 0000000000000000
> > [   15.289530]  s8 : 0000000000000000 s9 : 0000000000000000 s10: 0000000000000000
> > [   15.290995]  s11: 0000000000000000 t3 : 0000000000000001 t4 : 0000000000000000
> > [   15.292465]  t5 : 206f74206e6f6974 t6 : ffffffe002383b28
> > [   15.293859] status: 0000000000000100 badaddr: 0000000000001590 cause: 000000000000000d
> > [   15.296035] [<ffffffff8000597e>] walk_stackframe+0xc4/0xdc
> > [   15.297342] [<ffffffff800059c6>] dump_backtrace+0x30/0x38
> > [   15.298333] [<ffffffff807ec6e0>] show_stack+0x40/0x4c
> > [   15.299765] [<ffffffff807f07ac>] dump_stack+0x7c/0x96
> > [   15.300553] [<ffffffff807ec8be>] panic+0x118/0x300
> > [   15.301147] [<ffffffff807f61e8>] kernel_init+0x12c/0x138
> > [   15.302056] [<ffffffff80003a22>] ret_from_exception+0x0/0xc
> > [   15.338628] ---[ end trace 0a3fa0cc7f3393cd ]---
> > [   15.339919] note: swapper/0[1] exited with preempt_count 1
> > [   15.341995] Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b
> > [   15.343889] SMP: stopping secondary CPUs
> > [   16.802836] SMP: failed to stop secondary CPUs 0-3
> > [   16.806264] ---[ end Kernel panic - not syncing: Attempted to kill init! exitcode=0x0000000b ]---
> > 
> > This patch fixes it by setting the task to current if it's NULL before
> > accessing it.
> > 
> > Signed-off-by: Changbin Du <changbin.du@gmail.com>
> > Fixes: 5d8544e2d0 ("RISC-V: Generic library routines and assembly")
> 
> Hmm, this fixes tag should be
> Fixes: eac2f3059e02 ("riscv: stacktrace: fix the riscv stacktrace when CONFIG_FRAME_POINTER enabled"
> 
> 
> > ---
> >  arch/riscv/kernel/stacktrace.c | 6 ++++++
> >  1 file changed, 6 insertions(+)
> > 
> > diff --git a/arch/riscv/kernel/stacktrace.c b/arch/riscv/kernel/stacktrace.c
> > index bde85fc53357..788b65eba965 100644
> > --- a/arch/riscv/kernel/stacktrace.c
> > +++ b/arch/riscv/kernel/stacktrace.c
> > @@ -23,6 +23,9 @@ void notrace walk_stackframe(struct task_struct *task, struct pt_regs *regs,
> >  {
> >         unsigned long fp, sp, pc;
> > 
> > +       if (!task)
> > +               task = current;
> > +
> >         if (regs) {
> >                 fp = frame_pointer(regs);
> >                 sp = user_stack_pointer(regs);
> > @@ -73,6 +76,9 @@ void notrace walk_stackframe(struct task_struct *task,
> >         unsigned long sp, pc;
> >         unsigned long *ksp;
> > 
> > +       if (!task)
> > +               task = current;
> > +
> >         if (regs) {
> >                 sp = user_stack_pointer(regs);
> >                 pc = instruction_pointer(regs);
> > --
> > 2.30.2
> > 
> > 
> > _______________________________________________
> > linux-riscv mailing list
> > linux-riscv@lists.infradead.org
>
Changbin Du July 27, 2021, 10:16 p.m. UTC | #3
On Mon, Jun 28, 2021 at 01:44:04PM +0800, Jisheng Zhang wrote:
> On Sun, 27 Jun 2021 17:26:59 +0800
> Changbin Du <changbin.du@gmail.com> wrote:
> 
> 
> > 
> > 
> > Some places try to show backtrace with NULL task, and expect the task is
> > 'current'. For example, dump_stack()->show_stack(NULL,...). So the
> > stacktrace code should take care of this case.
> 
> I fixed this issue one week ago:
> 
> http://lists.infradead.org/pipermail/linux-riscv/2021-June/007258.html

I still see this issue on mainline. Is your fix merged? Thanks!
Jisheng Zhang July 28, 2021, 1:39 p.m. UTC | #4
On Wed, 28 Jul 2021 06:16:56 +0800
Changbin Du <changbin.du@gmail.com> wrote:

> On Mon, Jun 28, 2021 at 01:44:04PM +0800, Jisheng Zhang wrote:
> > On Sun, 27 Jun 2021 17:26:59 +0800
> > Changbin Du <changbin.du@gmail.com> wrote:
> > 
> >   
> > > 
> > > 
> > > Some places try to show backtrace with NULL task, and expect the task is
> > > 'current'. For example, dump_stack()->show_stack(NULL,...). So the
> > > stacktrace code should take care of this case.  
> > 
> > I fixed this issue one week ago:
> > 
> > http://lists.infradead.org/pipermail/linux-riscv/2021-June/007258.html  
> 
> I still see this issue on mainline. Is your fix merged? Thanks!

Nope, the fix is missed twice. Palmer has added the fix patch into
fix branch, I help it will be in next rc

Regards
diff mbox series

Patch

diff --git a/arch/riscv/kernel/stacktrace.c b/arch/riscv/kernel/stacktrace.c
index bde85fc53357..788b65eba965 100644
--- a/arch/riscv/kernel/stacktrace.c
+++ b/arch/riscv/kernel/stacktrace.c
@@ -23,6 +23,9 @@  void notrace walk_stackframe(struct task_struct *task, struct pt_regs *regs,
 {
 	unsigned long fp, sp, pc;
 
+	if (!task)
+		task = current;
+
 	if (regs) {
 		fp = frame_pointer(regs);
 		sp = user_stack_pointer(regs);
@@ -73,6 +76,9 @@  void notrace walk_stackframe(struct task_struct *task,
 	unsigned long sp, pc;
 	unsigned long *ksp;
 
+	if (!task)
+		task = current;
+
 	if (regs) {
 		sp = user_stack_pointer(regs);
 		pc = instruction_pointer(regs);