diff mbox series

[V2] rcu: Make sure new krcp free business is handled after the wanted rcu grace period.

Message ID 1680266529-28429-1-git-send-email-ziwei.dai@unisoc.com (mailing list archive)
State Accepted
Commit e222f9a512539c3f4093a55d16624d9da614800b
Headers show
Series [V2] rcu: Make sure new krcp free business is handled after the wanted rcu grace period. | expand

Commit Message

Ziwei Dai (代子为), March 31, 2023, 12:42 p.m. UTC
In kfree_rcu_monitor(), new free business at krcp is attached to any free
channel at krwp. kfree_rcu_monitor() is responsible for making sure new free
business is handled after the rcu grace period. But if any channel at krwp is
already non-free, that means there is on-going rcu work, which can cause the
kvfree_call_rcu()-triggered free business to be done before the wanted rcu
grace period ends.

This commit makes kfree_rcu_monitor() ignore any krwp that has a non-free
channel, fixing the issue of kvfree_call_rcu() losing effectiveness.

Below is the css_set obj "from_cset" use-after-free case caused by
kvfree_call_rcu() losing effectiveness.
CPU 0 calls rcu_read_lock(), then uses "from_cset"; then a hard irq comes and
the task is scheduled out.
CPU 1 calls kfree_rcu(cset, rcu_head), intending to free "from_cset" after a new gp.
But "from_cset" is freed right after the current gp ends, and is then reallocated.
CPU 0's task is scheduled back in and references a member of "from_cset", which causes a crash.

CPU 0					CPU 1
count_memcg_event_mm()
|rcu_read_lock()  <---
|mem_cgroup_from_task()
 |// css_set_ptr is the "from_cset" mentioned on CPU 1
 |css_set_ptr = rcu_dereference((task)->cgroups)
 |// Hard irq comes, current task is scheduled out.

					cgroup_attach_task()
					|cgroup_migrate()
					|cgroup_migrate_execute()
					|css_set_move_task(task, from_cset, to_cset, true)
					|cgroup_move_task(task, to_cset)
					|rcu_assign_pointer(.., to_cset)
					|...
					|cgroup_migrate_finish()
					|put_css_set_locked(from_cset)
					|from_cset->refcount return 0
					|kfree_rcu(cset, rcu_head) // means to free from_cset after new gp
					|add_ptr_to_bulk_krc_lock()
					|schedule_delayed_work(&krcp->monitor_work, ..)

					kfree_rcu_monitor()
					|krcp->bulk_head[0]'s work attached to krwp->bulk_head_free[]
					|queue_rcu_work(system_wq, &krwp->rcu_work)
					|if rwork->rcu.work is not in WORK_STRUCT_PENDING_BIT state,
					|call_rcu(&rwork->rcu, rcu_work_rcufn) <--- request a new gp

					// There is a previous call_rcu(.., rcu_work_rcufn).
					// That gp ends, and rcu_work_rcufn() is called.
					rcu_work_rcufn()
					|__queue_work(.., rwork->wq, &rwork->work);

					|kfree_rcu_work()
					|krwp->bulk_head_free[0] bulk is freed before new gp end!!!
					|The "from_cset" is freed before new gp end.

// the task is scheduled in after many ms.
 |css_set_ptr->subsys[subsys_id] <--- Caused kernel crash, because css_set_ptr is freed.

v2: Use helper function instead of inserted code block at kfree_rcu_monitor().

Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of kfree_rcu() work")
Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
---
 kernel/rcu/tree.c | 27 +++++++++++++++++++--------
 1 file changed, 19 insertions(+), 8 deletions(-)

Comments

Uladzislau Rezki March 31, 2023, 3:01 p.m. UTC | #1
On Fri, Mar 31, 2023 at 08:42:09PM +0800, Ziwei Dai wrote:
> [ ... ]
> 
> diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
> index 8e880c0..7b95ee9 100644
> --- a/kernel/rcu/tree.c
> +++ b/kernel/rcu/tree.c
> @@ -3024,6 +3024,18 @@ static void kfree_rcu_work(struct work_struct *work)
>  	return !!READ_ONCE(krcp->head);
>  }
>  
> +static bool
> +need_wait_for_krwp_work(struct kfree_rcu_cpu_work *krwp)
> +{
> +	int i;
> +
> +	for (i = 0; i < FREE_N_CHANNELS; i++)
> +		if (!list_empty(&krwp->bulk_head_free[i]))
> +			return true;
> +
> +	return !!krwp->head_free;
> +}
> +
>  static int krc_count(struct kfree_rcu_cpu *krcp)
>  {
>  	int sum = atomic_read(&krcp->head_count);
> @@ -3107,15 +3119,14 @@ static void kfree_rcu_monitor(struct work_struct *work)
>  	for (i = 0; i < KFREE_N_BATCHES; i++) {
>  		struct kfree_rcu_cpu_work *krwp = &(krcp->krw_arr[i]);
>  
> -		// Try to detach bulk_head or head and attach it over any
> -		// available corresponding free channel. It can be that
> -		// a previous RCU batch is in progress, it means that
> -		// immediately to queue another one is not possible so
> -		// in that case the monitor work is rearmed.
> -		if ((!list_empty(&krcp->bulk_head[0]) && list_empty(&krwp->bulk_head_free[0])) ||
> -			(!list_empty(&krcp->bulk_head[1]) && list_empty(&krwp->bulk_head_free[1])) ||
> -				(READ_ONCE(krcp->head) && !krwp->head_free)) {
> +		// Try to detach bulk_head or head and attach it, only when
> +		// all channels are free.  Any channel is not free means at krwp
> +		// there is on-going rcu work to handle krwp's free business.
> +		if (need_wait_for_krwp_work(krwp))
> +			continue;
>  
> +		// kvfree_rcu_drain_ready() might handle this krcp, if so give up.
> +		if (need_offload_krc(krcp)) {
>  			// Channel 1 corresponds to the SLAB-pointer bulk path.
>  			// Channel 2 corresponds to vmalloc-pointer bulk path.
>  			for (j = 0; j < FREE_N_CHANNELS; j++) {
> -- 
> 1.9.1
> 
It looks correct to me. I will test it over weekend.

--
Uladzislau Rezki
Uladzislau Rezki April 3, 2023, 7:28 a.m. UTC | #2
On Fri, Mar 31, 2023 at 05:01:52PM +0200, Uladzislau Rezki wrote:
> On Fri, Mar 31, 2023 at 08:42:09PM +0800, Ziwei Dai wrote:
> > [ ... ]
> It looks correct to me. I will test it over weekend.
> 
Reviewed-by: Uladzislau Rezki (Sony) <urezki@gmail.com>

--
Uladzislau Rezki
Mukesh Ojha April 3, 2023, 2:01 p.m. UTC | #3
On 3/31/2023 6:12 PM, Ziwei Dai wrote:
> [ ... ]
>   |css_set_ptr->subsys[subsys_id] <--- Caused kernel crash, because css_set_ptr is freed.
> 

I too have observed this issue on 5.15, where I see the below callstack.
And I see two different views of the memory: one is this callstack, and in
the other I see a proper value of memcg in the ramdump.

It looks like a different view of the memory across the rcu critical section.

[119653.424434][T730913] binder: 30858:30913 ioctl c0306201 7633b56a78 returned -14
[119653.426095][T730918] Unable to handle kernel paging request at virtual address 000000034088e7be

-000|mem_cgroup_disabled(inline)
-000|__count_memcg_events([X0] memcg = 0x000000034088D9AE, [X1] idx = PGFAULT = 0x11, [X2] count = 0x1)
     |  [X0] memcg = 0x000000034088D9AE
     |  [X1] idx = PGFAULT = 0x11
     |  [X2] count = 0x1
     |  [locdesc] __ptr = 0x0
-001|arch_local_irq_restore(inline)
     |  [X26] flags = 0x0
-001|count_memcg_events(inline)
     |  [X26] flags = 0x0
-001|count_memcg_event_mm(inline)
     |  [locdesc] idx = PGFAULT = 0x11
-001|do_handle_mm_fault([X24] vma = 0xFFFFFF89CC704870, [X20] address = 0x000000763395AA78, [X22] flags =
     |  [X24] vma = 0xFFFFFF89CC704870
     |  [X20] address = 0x000000763395AA78
     |  [X22] flags = 0x0215
     |  [X23] seq = 0x0
     |  [X19] regs = 0xFFFFFFC05EA73AD0
     |  [locdesc] ret = 0x0
-002|handle_mm_fault(inline)
     |  [X22] address = 0x000000763395AA78
     |  [X24] flags = 0x0215
     |  [X21] regs = 0xFFFFFFC05EA73AD0
-002|__do_page_fault(inline)
     |  [X23] mm = 0xFFFFFF8812F65400
     |  [X22] addr = 0x000000763395AA78
     |  [X24] mm_flags = 0x0215
     |  [X27] vm_flags = 0x2
     |  [X21] regs = 0xFFFFFFC05EA73AD0
[119653.427011][T730918]  __count_memcg_events+0x2c/0x274
[119653.427014][T730918]  do_handle_mm_fault+0xe4/0x2c8
[119653.427017][T730918]  do_page_fault+0x4c0/0x688
[119653.427022][T730918]  do_translation_fault+0x48/0x64
[119653.427025][T730918]  do_mem_abort+0x68/0x148
[119653.427028][T730918]  el1_abort+0x40/0x64
[119653.427032][T730918]  el1h_64_sync_handler+0x60/0xa0
[119653.427035][T730918]  el1h_64_sync+0x7c/0x80
[119653.427037][T730918]  __arch_copy_to_user+0x60/0x224
[119653.427040][T730918]  binder_ioctl_write_read+0x28c/0x590
[119653.427046][T730918]  binder_ioctl+0x1b0/0xf00
[119653.427049][T730918]  __arm64_sys_ioctl+0x184/0x210
[119653.427052][T730918]  invoke_syscall+0x60/0x150
[119653.427055][T730918]  el0_svc_common+0x8c/0xf8
[119653.427057][T730918]  do_el0_svc+0x28/0xa0
[119653.427059][T730918]  el0_svc+0x24/0x84
[119653.427061][T730918]  el0t_64_sync_handler+0x88/0xec
[119653.427064][T730918]  el0t_64_sync+0x1b4/0x1b8
[119653.427067][T730918] Code: a9044ff4 d503201f 71016c3f 54000d42 (f9470808)
[119653.427069][T730918] ---[ end trace a3882cb531ca3dd0 ]---


Reported-by: Mukesh Ojha <quic_mojha@quicinc.com>

I will try to test this patch.

Thanks,
--Mukesh

> [ ... ]
Paul E. McKenney April 3, 2023, 10:58 p.m. UTC | #4
On Fri, Mar 31, 2023 at 08:42:09PM +0800, Ziwei Dai wrote:
> [ ... ]

Good catch, thank you!!!

How difficult was this to trigger?  If it can be triggered easily,
this of course needs to go into mainline sooner rather than later.

Longer term, would it make sense to run the three channels through RCU
separately, in order to avoid one channel refraining from starting a grace
period just because some other channel has callbacks waiting for a grace
period to complete?  One argument against might be energy efficiency, but
perhaps the ->gp_snap field could be used to get the best of both worlds.

Either way, this fixes only one bug of two.  The second bug is in the
kfree_rcu() tests, which should have caught this bug.  Thoughts on a
good fix for those tests?

I have applied Uladzislau's and Mukesh's tags, and done the usual
wordsmithing as shown at the end of this message.  Please let me know
if I messed anything up.

> ---
>  kernel/rcu/tree.c | 27 +++++++++++++++++++--------
>  1 file changed, 19 insertions(+), 8 deletions(-)
> 
> diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
> index 8e880c0..7b95ee9 100644
> --- a/kernel/rcu/tree.c
> +++ b/kernel/rcu/tree.c
> @@ -3024,6 +3024,18 @@ static void kfree_rcu_work(struct work_struct *work)
>  	return !!READ_ONCE(krcp->head);
>  }
>  
> +static bool
> +need_wait_for_krwp_work(struct kfree_rcu_cpu_work *krwp)
> +{
> +	int i;
> +
> +	for (i = 0; i < FREE_N_CHANNELS; i++)
> +		if (!list_empty(&krwp->bulk_head_free[i]))
> +			return true;
> +
> +	return !!krwp->head_free;

This is fixed from v1, good!

> +}
> [ ... ]

------------------------------------------------------------------------

commit e222f9a512539c3f4093a55d16624d9da614800b
Author: Ziwei Dai <ziwei.dai@unisoc.com>
Date:   Fri Mar 31 20:42:09 2023 +0800

    rcu: Avoid freeing new kfree_rcu() memory after old grace period
    
    Memory passed to kvfree_rcu() that is to be freed is tracked by a
    per-CPU kfree_rcu_cpu structure, which in turn contains pointers
    to kvfree_rcu_bulk_data structures that contain pointers to memory
    that has not yet been handed to RCU, along with an kfree_rcu_cpu_work
    structure that tracks the memory that has already been handed to RCU.
    These structures track three categories of memory: (1) Memory for
    kfree(), (2) Memory for kvfree(), and (3) Memory for both that arrived
    during an OOM episode.  The first two categories are tracked in a
    cache-friendly manner involving a dynamically allocated page of pointers
    (the aforementioned kvfree_rcu_bulk_data structures), while the third
    uses a simple (but decidedly cache-unfriendly) linked list through the
    rcu_head structures in each block of memory.
    
    On a given CPU, these three categories are handled as a unit, with that
    CPU's kfree_rcu_cpu_work structure having one pointer for each of the
    three categories.  Clearly, new memory for a given category cannot be
    placed in the corresponding kfree_rcu_cpu_work structure until any old
    memory has had its grace period elapse and thus has been removed.  And
    the kfree_rcu_monitor() function does in fact check for this.
    
    Except that the kfree_rcu_monitor() function checks these pointers one
    at a time.  This means that if the previous kfree_rcu() memory passed
    to RCU had only category 1 and the current one has only category 2, the
    kfree_rcu_monitor() function will send that current category-2 memory
    along immediately.  This can result in memory being freed too soon,
    that is, out from under unsuspecting RCU readers.
    
    To see this, consider the following sequence of events, in which:
    
    o       Task A on CPU 0 calls rcu_read_lock(), then uses "from_cset",
            then is preempted.
    
    o       CPU 1 calls kfree_rcu(cset, rcu_head) in order to free "from_cset"
            after a later grace period.  Except that "from_cset" is freed
            right after the previous grace period ended, so that "from_cset"
            is immediately freed.  Task A resumes and references "from_cset"'s
            member, after which nothing good happens.
    
    In full detail:
    
    CPU 0                                   CPU 1
    ----------------------                  ----------------------
    count_memcg_event_mm()
    |rcu_read_lock()  <---
    |mem_cgroup_from_task()
     |// css_set_ptr is the "from_cset" mentioned on CPU 1
     |css_set_ptr = rcu_dereference((task)->cgroups)
     |// Hard irq comes, current task is scheduled out.
    
                                            cgroup_attach_task()
                                            |cgroup_migrate()
                                            |cgroup_migrate_execute()
                                            |css_set_move_task(task, from_cset, to_cset, true)
                                            |cgroup_move_task(task, to_cset)
                                            |rcu_assign_pointer(.., to_cset)
                                            |...
                                            |cgroup_migrate_finish()
                                            |put_css_set_locked(from_cset)
                                            |from_cset->refcount return 0
                                            |kfree_rcu(cset, rcu_head) // free from_cset after new gp
                                            |add_ptr_to_bulk_krc_lock()
                                            |schedule_delayed_work(&krcp->monitor_work, ..)
    
                                            kfree_rcu_monitor()
                                            |krcp->bulk_head[0]'s work attached to krwp->bulk_head_free[]
                                            |queue_rcu_work(system_wq, &krwp->rcu_work)
                                            |if rwork->rcu.work is not in WORK_STRUCT_PENDING_BIT state,
                                            |call_rcu(&rwork->rcu, rcu_work_rcufn) <--- request new gp
    
                                            // There is a previous call_rcu(.., rcu_work_rcufn).
                                            // That gp ends, and rcu_work_rcufn() is called.
                                            rcu_work_rcufn()
                                            |__queue_work(.., rwork->wq, &rwork->work);
    
                                            |kfree_rcu_work()
                                            |krwp->bulk_head_free[0] bulk is freed before new gp end!!!
                                            |The "from_cset" is freed before new gp end.
    
    // the task resumes some time later.
     |css_set_ptr->subsys[subsys_id] <--- Caused kernel crash, because css_set_ptr is freed.
    
    This commit therefore causes kfree_rcu_monitor() to refrain from moving
    kfree_rcu() memory to the kfree_rcu_cpu_work structure until the RCU
    grace period has completed for all three categories.
    
    v2: Use helper function instead of inserted code block at kfree_rcu_monitor().
    
    Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of kfree_rcu() work")
    Reported-by: Mukesh Ojha <quic_mojha@quicinc.com>
    Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
    Reviewed-by: Uladzislau Rezki (Sony) <urezki@gmail.com>
    Signed-off-by: Paul E. McKenney <paulmck@kernel.org>

diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
index 859ee02f6614..e2dbea6cee4b 100644
--- a/kernel/rcu/tree.c
+++ b/kernel/rcu/tree.c
@@ -3051,6 +3051,18 @@ need_offload_krc(struct kfree_rcu_cpu *krcp)
 	return !!READ_ONCE(krcp->head);
 }
 
+static bool
+need_wait_for_krwp_work(struct kfree_rcu_cpu_work *krwp)
+{
+	int i;
+
+	for (i = 0; i < FREE_N_CHANNELS; i++)
+		if (!list_empty(&krwp->bulk_head_free[i]))
+			return true;
+
+	return !!krwp->head_free;
+}
+
 static int krc_count(struct kfree_rcu_cpu *krcp)
 {
 	int sum = atomic_read(&krcp->head_count);
@@ -3134,15 +3146,14 @@ static void kfree_rcu_monitor(struct work_struct *work)
 	for (i = 0; i < KFREE_N_BATCHES; i++) {
 		struct kfree_rcu_cpu_work *krwp = &(krcp->krw_arr[i]);
 
-		// Try to detach bulk_head or head and attach it over any
-		// available corresponding free channel. It can be that
-		// a previous RCU batch is in progress, it means that
-		// immediately to queue another one is not possible so
-		// in that case the monitor work is rearmed.
-		if ((!list_empty(&krcp->bulk_head[0]) && list_empty(&krwp->bulk_head_free[0])) ||
-			(!list_empty(&krcp->bulk_head[1]) && list_empty(&krwp->bulk_head_free[1])) ||
-				(READ_ONCE(krcp->head) && !krwp->head_free)) {
+		// Try to detach bulk_head or head and attach it, only when
+		// all channels are free.  Any channel is not free means at krwp
+		// there is on-going rcu work to handle krwp's free business.
+		if (need_wait_for_krwp_work(krwp))
+			continue;
 
+		// kvfree_rcu_drain_ready() might handle this krcp, if so give up.
+		if (need_offload_krc(krcp)) {
 			// Channel 1 corresponds to the SLAB-pointer bulk path.
 			// Channel 2 corresponds to vmalloc-pointer bulk path.
 			for (j = 0; j < FREE_N_CHANNELS; j++) {
代子为 (Ziwei Dai) April 4, 2023, 2:49 a.m. UTC | #5
Hello Paul!

> -----Original Message-----
> From: Paul E. McKenney <paulmck@kernel.org>
> Sent: April 4, 2023 6:58
> To: 代子为 (Ziwei Dai) <Ziwei.Dai@unisoc.com>
> Cc: urezki@gmail.com; frederic@kernel.org; quic_neeraju@quicinc.com;
> josh@joshtriplett.org; rostedt@goodmis.org;
> mathieu.desnoyers@efficios.com; jiangshanlai@gmail.com;
> joel@joelfernandes.org; rcu@vger.kernel.org; linux-kernel@vger.kernel.org;
> 王双 (Shuang Wang) <shuang.wang@unisoc.com>; 辛依凡 (Yifan Xin)
> <Yifan.Xin@unisoc.com>; 王科 (Ke Wang) <Ke.Wang@unisoc.com>; 闫学文
> (Xuewen Yan) <Xuewen.Yan@unisoc.com>; 牛志国 (Zhiguo Niu)
> <Zhiguo.Niu@unisoc.com>; 黄朝阳 (Zhaoyang Huang)
> <zhaoyang.huang@unisoc.com>
> Subject: Re: [PATCH V2] rcu: Make sure new krcp free business is handled after
> the wanted rcu grace period.
> 
> 
> CAUTION: This email originated from outside of the organization. Do not click
> links or open attachments unless you recognize the sender and know the
> content is safe.
> 
> 
> 
> On Fri, Mar 31, 2023 at 08:42:09PM +0800, Ziwei Dai wrote:
> > In kfree_rcu_monitor(), new free business at krcp is attached to any
> > free channel at krwp. kfree_rcu_monitor() is responsible to make sure
> > new free business is handled after the rcu grace period. But if there
> > is any none-free channel at krwp already, that means there is an
> > on-going rcu work, which will cause the kvfree_call_rcu()-triggered
> > free business is done before the wanted rcu grace period ends.
> >
> > This commit ignore krwp which has non-free channel at
> > kfree_rcu_monitor(), to fix the issue that kvfree_call_rcu() loses
> effectiveness.
> >
> > Below is the css_set obj "from_cset" use-after-free case caused by
> > kvfree_call_rcu() losing effectiveness.
> > CPU 0 calls rcu_read_lock(), then use "from_cset", then hard irq
> > comes, the task is schedule out.
> > CPU 1 calls kfree_rcu(cset, rcu_head), willing to free "from_cset" after new
> gp.
> > But "from_cset" is freed right after current gp end. "from_cset" is
> reallocated.
> > CPU 0 's task arrives back, references "from_cset"'s member, which causes
> crash.
> >
> > CPU 0                                 CPU 1
> > count_memcg_event_mm()
> > |rcu_read_lock()  <---
> > |mem_cgroup_from_task()
> >  |// css_set_ptr is the "from_cset" mentioned on CPU 1  |css_set_ptr =
> > rcu_dereference((task)->cgroups)  |// Hard irq comes, current task is
> > scheduled out.
> >
> >                                       cgroup_attach_task()
> >                                       |cgroup_migrate()
> >                                       |cgroup_migrate_execute()
> >                                       |css_set_move_task(task,
> from_cset, to_cset, true)
> >                                       |cgroup_move_task(task,
> to_cset)
> >                                       |rcu_assign_pointer(.., to_cset)
> >                                       |...
> >                                       |cgroup_migrate_finish()
> >
> |put_css_set_locked(from_cset)
> >                                       |from_cset->refcount return 0
> >                                       |kfree_rcu(cset, rcu_head) //
> means to free from_cset after new gp
> >                                       |add_ptr_to_bulk_krc_lock()
> >
> > |schedule_delayed_work(&krcp->monitor_work, ..)
> >
> >                                       kfree_rcu_monitor()
> >                                       |krcp->bulk_head[0]'s work
> attached to krwp->bulk_head_free[]
> >                                       |queue_rcu_work(system_wq,
> &krwp->rcu_work)
> >                                       |if rwork->rcu.work is not in
> WORK_STRUCT_PENDING_BIT state,
> >                                       |call_rcu(&rwork->rcu,
> > rcu_work_rcufn) <--- request a new gp
> >
> >                                       // There is a previous call_rcu(..,
> rcu_work_rcufn)
> >                                       // gp end, rcu_work_rcufn() is
> called.
> >                                       rcu_work_rcufn()
> >                                       |__queue_work(.., rwork->wq,
> > &rwork->work);
> >
> >                                       |kfree_rcu_work()
> >                                       |krwp->bulk_head_free[0] bulk
> is freed before new gp end!!!
> >                                       |The "from_cset" is freed
> before new gp end.
> >
> > // the task is scheduled in after many ms.
> >  |css_set_ptr->subsys[(subsys_id) <--- Caused kernel crash, because
> css_set_ptr is freed.
> >
> > v2: Use helper function instead of inserted code block at
> kfree_rcu_monitor().
> >
> > Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of
> > kfree_rcu() work")
> > Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
> 
> Good catch, thank you!!!
> 
> How difficult was this to trigger?  If it can be triggered easily, this of course
> needs to go into mainline sooner rather than later.

Roughly, we can reproduce this issue within two rounds of a 48-hour stress test,
using 20 devices running the 5.15 kernel. If KASAN is enabled, the reproduction
rate is higher. So I think sooner is better.

> 
> Longer term, would it make sense to run the three channels through RCU
> separately, in order to avoid one channel refraining from starting a grace
> period just because some other channel has callbacks waiting for a grace
> period to complete?  One argument against might be energy efficiency, but
> perhaps the ->gp_snap field could be used to get the best of both worlds.

I see kvfree_rcu_drain_ready(krcp) is already called at the beginning of
kfree_rcu_monitor(). It polls the ->gp_snap field to decide whether to free
channel objects immediately or only after a grace period.
So both energy efficiency and timing seem to be considered?

> Either way, this fixes only one bug of two.  The second bug is in the
> kfree_rcu() tests, which should have caught this bug.  Thoughts on a good fix
> for those tests?

In the RCU reader scenario, I inserted an msleep() between "rcu_read_lock(), get
the pointer via rcu_dereference()" and "dereference the pointer, using the member";
with that, we can reproduce this issue very quickly in the stress test. Can the
kfree_rcu() tests insert an msleep()?

> I have applied Uladzislau's and Mukesh's tags, and done the usual
> wordsmithing as shown at the end of this message.  Please let me know if I
> messed anything up.

Thank you for the improvement on the patch! It seems better now.

> > ---
> >  kernel/rcu/tree.c | 27 +++++++++++++++++++--------
> >  1 file changed, 19 insertions(+), 8 deletions(-)
> >
> > diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c index
> > 8e880c0..7b95ee9 100644
> > --- a/kernel/rcu/tree.c
> > +++ b/kernel/rcu/tree.c
> > @@ -3024,6 +3024,18 @@ static void kfree_rcu_work(struct work_struct
> *work)
> >       return !!READ_ONCE(krcp->head);
> >  }
> >
> > +static bool
> > +need_wait_for_krwp_work(struct kfree_rcu_cpu_work *krwp) {
> > +     int i;
> > +
> > +     for (i = 0; i < FREE_N_CHANNELS; i++)
> > +             if (!list_empty(&krwp->bulk_head_free[i]))
> > +                     return true;
> > +
> > +     return !!krwp->head_free;
> 
> This is fixed from v1, good!
> 
> > +}
> > +
> >  static int krc_count(struct kfree_rcu_cpu *krcp)  {
> >       int sum = atomic_read(&krcp->head_count); @@ -3107,15
> +3119,14
> > @@ static void kfree_rcu_monitor(struct work_struct *work)
> >       for (i = 0; i < KFREE_N_BATCHES; i++) {
> >               struct kfree_rcu_cpu_work *krwp = &(krcp->krw_arr[i]);
> >
> > -             // Try to detach bulk_head or head and attach it over any
> > -             // available corresponding free channel. It can be that
> > -             // a previous RCU batch is in progress, it means that
> > -             // immediately to queue another one is not possible so
> > -             // in that case the monitor work is rearmed.
> > -             if ((!list_empty(&krcp->bulk_head[0]) &&
> list_empty(&krwp->bulk_head_free[0])) ||
> > -                     (!list_empty(&krcp->bulk_head[1]) &&
> list_empty(&krwp->bulk_head_free[1])) ||
> > -                             (READ_ONCE(krcp->head)
> && !krwp->head_free)) {
> > +             // Try to detach bulk_head or head and attach it, only when
> > +             // all channels are free.  Any channel is not free means at
> krwp
> > +             // there is on-going rcu work to handle krwp's free
> business.
> > +             if (need_wait_for_krwp_work(krwp))
> > +                     continue;
> >
> > +             // kvfree_rcu_drain_ready() might handle this krcp, if so
> give up.
> > +             if (need_offload_krc(krcp)) {
> >                       // Channel 1 corresponds to the SLAB-pointer
> bulk path.
> >                       // Channel 2 corresponds to vmalloc-pointer bulk
> path.
> >                       for (j = 0; j < FREE_N_CHANNELS; j++) {
> > --
> > 1.9.1
> 
> ------------------------------------------------------------------------
> 
> commit e222f9a512539c3f4093a55d16624d9da614800b
> Author: Ziwei Dai <ziwei.dai@unisoc.com>
> Date:   Fri Mar 31 20:42:09 2023 +0800
> 
>     rcu: Avoid freeing new kfree_rcu() memory after old grace period
> 
>     Memory passed to kvfree_rcu() that is to be freed is tracked by a
>     per-CPU kfree_rcu_cpu structure, which in turn contains pointers
>     to kvfree_rcu_bulk_data structures that contain pointers to memory
>     that has not yet been handed to RCU, along with an kfree_rcu_cpu_work
>     structure that tracks the memory that has already been handed to RCU.
>     These structures track three categories of memory: (1) Memory for
>     kfree(), (2) Memory for kvfree(), and (3) Memory for both that arrived
>     during an OOM episode.  The first two categories are tracked in a
>     cache-friendly manner involving a dynamically allocated page of pointers
>     (the aforementioned kvfree_rcu_bulk_data structures), while the third
>     uses a simple (but decidedly cache-unfriendly) linked list through the
>     rcu_head structures in each block of memory.
> 
>     On a given CPU, these three categories are handled as a unit, with that
>     CPU's kfree_rcu_cpu_work structure having one pointer for each of the
>     three categories.  Clearly, new memory for a given category cannot be
>     placed in the corresponding kfree_rcu_cpu_work structure until any old
>     memory has had its grace period elapse and thus has been removed.
> And
>     the kfree_rcu_monitor() function does in fact check for this.
> 
>     Except that the kfree_rcu_monitor() function checks these pointers one
>     at a time.  This means that if the previous kfree_rcu() memory passed
>     to RCU had only category 1 and the current one has only category 2, the
>     kfree_rcu_monitor() function will send that current category-2 memory
>     along immediately.  This can result in memory being freed too soon,
>     that is, out from under unsuspecting RCU readers.
> 
>     To see this, consider the following sequence of events, in which:
> 
>     o       Task A on CPU 0 calls rcu_read_lock(), then uses "from_cset",
>             then is preempted.
> 
>     o       CPU 1 calls kfree_rcu(cset, rcu_head) in order to free
> "from_cset"
>             after a later grace period.  Except that "from_cset" is freed
>             right after the previous grace period ended, so that
> "from_cset"
>             is immediately freed.  Task A resumes and references
> "from_cset"'s
>             member, after which nothing good happens.
> 
>     In full detail:
> 
>     CPU 0                                   CPU 1
>     ----------------------                  ----------------------
>     count_memcg_event_mm()
>     |rcu_read_lock()  <---
>     |mem_cgroup_from_task()
>      |// css_set_ptr is the "from_cset" mentioned on CPU 1
>      |css_set_ptr = rcu_dereference((task)->cgroups)
>      |// Hard irq comes, current task is scheduled out.
> 
>                                             cgroup_attach_task()
>                                             |cgroup_migrate()
> 
> |cgroup_migrate_execute()
>                                             |css_set_move_task(task,
> from_cset, to_cset, true)
>                                             |cgroup_move_task(task,
> to_cset)
>                                             |rcu_assign_pointer(..,
> to_cset)
>                                             |...
>                                             |cgroup_migrate_finish()
> 
> |put_css_set_locked(from_cset)
>                                             |from_cset->refcount
> return 0
>                                             |kfree_rcu(cset, rcu_head)
> // free from_cset after new gp
> 
> |add_ptr_to_bulk_krc_lock()
> 
> |schedule_delayed_work(&krcp->monitor_work, ..)
> 
>                                             kfree_rcu_monitor()
>                                             |krcp->bulk_head[0]'s
> work attached to krwp->bulk_head_free[]
> 
> |queue_rcu_work(system_wq, &krwp->rcu_work)
>                                             |if rwork->rcu.work is not
> in WORK_STRUCT_PENDING_BIT state,
>                                             |call_rcu(&rwork->rcu,
> rcu_work_rcufn) <--- request new gp
> 
>                                             // There is a perious
> call_rcu(.., rcu_work_rcufn)
>                                             // gp end,
> rcu_work_rcufn() is called.
>                                             rcu_work_rcufn()
>                                             |__queue_work(..,
> rwork->wq, &rwork->work);
> 
>                                             |kfree_rcu_work()
>                                             |krwp->bulk_head_free[0]
> bulk is freed before new gp end!!!
>                                             |The "from_cset" is freed
> before new gp end.
> 
>     // the task resumes some time later.
>      |css_set_ptr->subsys[(subsys_id) <--- Caused kernel crash, because
> css_set_ptr is freed.
> 
>     This commit therefore causes kfree_rcu_monitor() to refrain from
> moving
>     kfree_rcu() memory to the kfree_rcu_cpu_work structure until the RCU
>     grace period has completed for all three categories.
> 
>     v2: Use helper function instead of inserted code block at
> kfree_rcu_monitor().
> 
>     Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of kfree_rcu()
> work")
>     Reported-by: Mukesh Ojha <quic_mojha@quicinc.com>
>     Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
>     Reviewed-by: Uladzislau Rezki (Sony) <urezki@gmail.com>
>     Signed-off-by: Paul E. McKenney <paulmck@kernel.org>
> 
> diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c index
> 859ee02f6614..e2dbea6cee4b 100644
> --- a/kernel/rcu/tree.c
> +++ b/kernel/rcu/tree.c
> @@ -3051,6 +3051,18 @@ need_offload_krc(struct kfree_rcu_cpu *krcp)
>         return !!READ_ONCE(krcp->head);
>  }
> 
> +static bool
> +need_wait_for_krwp_work(struct kfree_rcu_cpu_work *krwp) {
> +       int i;
> +
> +       for (i = 0; i < FREE_N_CHANNELS; i++)
> +               if (!list_empty(&krwp->bulk_head_free[i]))
> +                       return true;
> +
> +       return !!krwp->head_free;
> +}
> +
>  static int krc_count(struct kfree_rcu_cpu *krcp)  {
>         int sum = atomic_read(&krcp->head_count); @@ -3134,15
> +3146,14 @@ static void kfree_rcu_monitor(struct work_struct *work)
>         for (i = 0; i < KFREE_N_BATCHES; i++) {
>                 struct kfree_rcu_cpu_work *krwp = &(krcp->krw_arr[i]);
> 
> -               // Try to detach bulk_head or head and attach it over any
> -               // available corresponding free channel. It can be that
> -               // a previous RCU batch is in progress, it means that
> -               // immediately to queue another one is not possible so
> -               // in that case the monitor work is rearmed.
> -               if ((!list_empty(&krcp->bulk_head[0]) &&
> list_empty(&krwp->bulk_head_free[0])) ||
> -                       (!list_empty(&krcp->bulk_head[1]) &&
> list_empty(&krwp->bulk_head_free[1])) ||
> -                               (READ_ONCE(krcp->head)
> && !krwp->head_free)) {
> +               // Try to detach bulk_head or head and attach it, only
> when
> +               // all channels are free.  Any channel is not free means at
> krwp
> +               // there is on-going rcu work to handle krwp's free
> business.
> +               if (need_wait_for_krwp_work(krwp))
> +                       continue;
> 
> +               // kvfree_rcu_drain_ready() might handle this krcp, if so
> give up.
> +               if (need_offload_krc(krcp)) {
>                         // Channel 1 corresponds to the SLAB-pointer
> bulk path.
>                         // Channel 2 corresponds to vmalloc-pointer bulk
> path.
>                         for (j = 0; j < FREE_N_CHANNELS; j++) {
Paul E. McKenney April 4, 2023, 3:23 a.m. UTC | #6
On Tue, Apr 04, 2023 at 02:49:15AM +0000, 代子为 (Ziwei Dai) wrote:
> Hello Paul!
> 
> > -----Original Message-----
> > From: Paul E. McKenney <paulmck@kernel.org>
> > Sent: April 4, 2023 6:58
> > To: 代子为 (Ziwei Dai) <Ziwei.Dai@unisoc.com>
> > Cc: urezki@gmail.com; frederic@kernel.org; quic_neeraju@quicinc.com;
> > josh@joshtriplett.org; rostedt@goodmis.org;
> > mathieu.desnoyers@efficios.com; jiangshanlai@gmail.com;
> > joel@joelfernandes.org; rcu@vger.kernel.org; linux-kernel@vger.kernel.org;
> > 王双 (Shuang Wang) <shuang.wang@unisoc.com>; 辛依凡 (Yifan Xin)
> > <Yifan.Xin@unisoc.com>; 王科 (Ke Wang) <Ke.Wang@unisoc.com>; 闫学文
> > (Xuewen Yan) <Xuewen.Yan@unisoc.com>; 牛志国 (Zhiguo Niu)
> > <Zhiguo.Niu@unisoc.com>; 黄朝阳 (Zhaoyang Huang)
> > <zhaoyang.huang@unisoc.com>
> > Subject: Re: [PATCH V2] rcu: Make sure new krcp free business is handled after
> > the wanted rcu grace period.
> > 
> > 
> > CAUTION: This email originated from outside of the organization. Do not click
> > links or open attachments unless you recognize the sender and know the
> > content is safe.
> > 
> > 
> > 
> > On Fri, Mar 31, 2023 at 08:42:09PM +0800, Ziwei Dai wrote:
> > > In kfree_rcu_monitor(), new free business at krcp is attached to any
> > > free channel at krwp. kfree_rcu_monitor() is responsible to make sure
> > > new free business is handled after the rcu grace period. But if there
> > > is any none-free channel at krwp already, that means there is an
> > > on-going rcu work, which will cause the kvfree_call_rcu()-triggered
> > > free business is done before the wanted rcu grace period ends.
> > >
> > > This commit ignore krwp which has non-free channel at
> > > kfree_rcu_monitor(), to fix the issue that kvfree_call_rcu() loses
> > effectiveness.
> > >
> > > Below is the css_set obj "from_cset" use-after-free case caused by
> > > kvfree_call_rcu() losing effectiveness.
> > > CPU 0 calls rcu_read_lock(), then use "from_cset", then hard irq
> > > comes, the task is schedule out.
> > > CPU 1 calls kfree_rcu(cset, rcu_head), willing to free "from_cset" after new
> > gp.
> > > But "from_cset" is freed right after current gp end. "from_cset" is
> > reallocated.
> > > CPU 0 's task arrives back, references "from_cset"'s member, which causes
> > crash.
> > >
> > > CPU 0                                 CPU 1
> > > count_memcg_event_mm()
> > > |rcu_read_lock()  <---
> > > |mem_cgroup_from_task()
> > >  |// css_set_ptr is the "from_cset" mentioned on CPU 1  |css_set_ptr =
> > > rcu_dereference((task)->cgroups)  |// Hard irq comes, current task is
> > > scheduled out.
> > >
> > >                                       cgroup_attach_task()
> > >                                       |cgroup_migrate()
> > >                                       |cgroup_migrate_execute()
> > >                                       |css_set_move_task(task,
> > from_cset, to_cset, true)
> > >                                       |cgroup_move_task(task,
> > to_cset)
> > >                                       |rcu_assign_pointer(.., to_cset)
> > >                                       |...
> > >                                       |cgroup_migrate_finish()
> > >
> > |put_css_set_locked(from_cset)
> > >                                       |from_cset->refcount return 0
> > >                                       |kfree_rcu(cset, rcu_head) //
> > means to free from_cset after new gp
> > >                                       |add_ptr_to_bulk_krc_lock()
> > >
> > > |schedule_delayed_work(&krcp->monitor_work, ..)
> > >
> > >                                       kfree_rcu_monitor()
> > >                                       |krcp->bulk_head[0]'s work
> > attached to krwp->bulk_head_free[]
> > >                                       |queue_rcu_work(system_wq,
> > &krwp->rcu_work)
> > >                                       |if rwork->rcu.work is not in
> > WORK_STRUCT_PENDING_BIT state,
> > >                                       |call_rcu(&rwork->rcu,
> > > rcu_work_rcufn) <--- request a new gp
> > >
> > >                                       // There is a previous call_rcu(..,
> > rcu_work_rcufn)
> > >                                       // gp end, rcu_work_rcufn() is
> > called.
> > >                                       rcu_work_rcufn()
> > >                                       |__queue_work(.., rwork->wq,
> > > &rwork->work);
> > >
> > >                                       |kfree_rcu_work()
> > >                                       |krwp->bulk_head_free[0] bulk
> > is freed before new gp end!!!
> > >                                       |The "from_cset" is freed
> > before new gp end.
> > >
> > > // the task is scheduled in after many ms.
> > >  |css_set_ptr->subsys[(subsys_id) <--- Caused kernel crash, because
> > css_set_ptr is freed.
> > >
> > > v2: Use helper function instead of inserted code block at
> > kfree_rcu_monitor().
> > >
> > > Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of
> > > kfree_rcu() work")
> > > Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
> > 
> > Good catch, thank you!!!
> > 
> > How difficult was this to trigger?  If it can be triggered easily, this of course
> > needs to go into mainline sooner rather than later.
> 
> Roughly we can reproduce this issue within two rounds of 48h stress test, 
> with 20 k5.15 devices. If KASAN is enabled, the reproduce rate is higher.
> So I think sooner is better.

Thank you for the info!  This is in theory an old bug, but if you can
easily find out, does it trigger for you on v6.2 or earlier?

> > Longer term, would it make sense to run the three channels through RCU
> > separately, in order to avoid one channel refraining from starting a grace
> > period just because some other channel has callbacks waiting for a grace
> > period to complete?  One argument against might be energy efficiency, but
> > perhaps the ->gp_snap field could be used to get the best of both worlds.
> 
> I see kvfree_rcu_drain_ready(krcp) is already called at the beginning of
> kfree_rcu_monitor(), which polls the ->gp_snap field, to decide
> whether to free channel objects immediately or after gp.
> Both energy efficiency and timing seems be considered?

My concern is that running the channels separately might mean more grace
periods (and thus more energy draw) on nearly idle devices, such devices
usually being the ones for which energy efficiency matters most.

But perhaps Vlad, Neeraj, or Joel has some insight on this, given
that they are the ones working on battery-powered devices.

> > Either way, this fixes only one bug of two.  The second bug is in the
> > kfree_rcu() tests, which should have caught this bug.  Thoughts on a good fix
> > for those tests?
> 
> I inserted a msleep() between "rcu_read_lock(), get pointer via rcu_dereference()"
> and "reference pointer, using the member", at the rcu scenario, then we can
> reproduce this issue very soon in stress test. Can kfree_rcu() tests insert msleep()?

Another approach is to separate concerns, so that readers interact with
grace periods in the rcutorture.c tests, and to add the interaction
of to-be-freed memory with grace periods in the rcuscale kvfree tests.
I took a step in this direction with this commit on the -rcu tree's
"dev" branch:

efbe7927f479 ("rcu/kvfree: Add debug to check grace periods")

Given this, might it be possible to make rcuscale.c's kfree_rcu()
testing create patterns of usage of the three channels so as to
catch this bug that way?

> > I have applied Uladzislau's and Mukesh's tags, and done the usual
> > wordsmithing as shown at the end of this message.  Please let me know if I
> > messed anything up.
> 
> Thank you for the improvement on the patch! It seems better now.

No problem and thank you again for the debugging and the fix!

							Thanx, Paul

> > > ---
> > >  kernel/rcu/tree.c | 27 +++++++++++++++++++--------
> > >  1 file changed, 19 insertions(+), 8 deletions(-)
> > >
> > > diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c index
> > > 8e880c0..7b95ee9 100644
> > > --- a/kernel/rcu/tree.c
> > > +++ b/kernel/rcu/tree.c
> > > @@ -3024,6 +3024,18 @@ static void kfree_rcu_work(struct work_struct
> > *work)
> > >       return !!READ_ONCE(krcp->head);
> > >  }
> > >
> > > +static bool
> > > +need_wait_for_krwp_work(struct kfree_rcu_cpu_work *krwp) {
> > > +     int i;
> > > +
> > > +     for (i = 0; i < FREE_N_CHANNELS; i++)
> > > +             if (!list_empty(&krwp->bulk_head_free[i]))
> > > +                     return true;
> > > +
> > > +     return !!krwp->head_free;
> > 
> > This is fixed from v1, good!
> > 
> > > +}
> > > +
> > >  static int krc_count(struct kfree_rcu_cpu *krcp)  {
> > >       int sum = atomic_read(&krcp->head_count); @@ -3107,15
> > +3119,14
> > > @@ static void kfree_rcu_monitor(struct work_struct *work)
> > >       for (i = 0; i < KFREE_N_BATCHES; i++) {
> > >               struct kfree_rcu_cpu_work *krwp = &(krcp->krw_arr[i]);
> > >
> > > -             // Try to detach bulk_head or head and attach it over any
> > > -             // available corresponding free channel. It can be that
> > > -             // a previous RCU batch is in progress, it means that
> > > -             // immediately to queue another one is not possible so
> > > -             // in that case the monitor work is rearmed.
> > > -             if ((!list_empty(&krcp->bulk_head[0]) &&
> > list_empty(&krwp->bulk_head_free[0])) ||
> > > -                     (!list_empty(&krcp->bulk_head[1]) &&
> > list_empty(&krwp->bulk_head_free[1])) ||
> > > -                             (READ_ONCE(krcp->head)
> > && !krwp->head_free)) {
> > > +             // Try to detach bulk_head or head and attach it, only when
> > > +             // all channels are free.  Any channel is not free means at
> > krwp
> > > +             // there is on-going rcu work to handle krwp's free
> > business.
> > > +             if (need_wait_for_krwp_work(krwp))
> > > +                     continue;
> > >
> > > +             // kvfree_rcu_drain_ready() might handle this krcp, if so
> > give up.
> > > +             if (need_offload_krc(krcp)) {
> > >                       // Channel 1 corresponds to the SLAB-pointer
> > bulk path.
> > >                       // Channel 2 corresponds to vmalloc-pointer bulk
> > path.
> > >                       for (j = 0; j < FREE_N_CHANNELS; j++) {
> > > --
> > > 1.9.1
> > 
> > ------------------------------------------------------------------------
> > 
> > commit e222f9a512539c3f4093a55d16624d9da614800b
> > Author: Ziwei Dai <ziwei.dai@unisoc.com>
> > Date:   Fri Mar 31 20:42:09 2023 +0800
> > 
> >     rcu: Avoid freeing new kfree_rcu() memory after old grace period
> > 
> >     Memory passed to kvfree_rcu() that is to be freed is tracked by a
> >     per-CPU kfree_rcu_cpu structure, which in turn contains pointers
> >     to kvfree_rcu_bulk_data structures that contain pointers to memory
> >     that has not yet been handed to RCU, along with an kfree_rcu_cpu_work
> >     structure that tracks the memory that has already been handed to RCU.
> >     These structures track three categories of memory: (1) Memory for
> >     kfree(), (2) Memory for kvfree(), and (3) Memory for both that arrived
> >     during an OOM episode.  The first two categories are tracked in a
> >     cache-friendly manner involving a dynamically allocated page of pointers
> >     (the aforementioned kvfree_rcu_bulk_data structures), while the third
> >     uses a simple (but decidedly cache-unfriendly) linked list through the
> >     rcu_head structures in each block of memory.
> > 
> >     On a given CPU, these three categories are handled as a unit, with that
> >     CPU's kfree_rcu_cpu_work structure having one pointer for each of the
> >     three categories.  Clearly, new memory for a given category cannot be
> >     placed in the corresponding kfree_rcu_cpu_work structure until any old
> >     memory has had its grace period elapse and thus has been removed.
> > And
> >     the kfree_rcu_monitor() function does in fact check for this.
> > 
> >     Except that the kfree_rcu_monitor() function checks these pointers one
> >     at a time.  This means that if the previous kfree_rcu() memory passed
> >     to RCU had only category 1 and the current one has only category 2, the
> >     kfree_rcu_monitor() function will send that current category-2 memory
> >     along immediately.  This can result in memory being freed too soon,
> >     that is, out from under unsuspecting RCU readers.
> > 
> >     To see this, consider the following sequence of events, in which:
> > 
> >     o       Task A on CPU 0 calls rcu_read_lock(), then uses "from_cset",
> >             then is preempted.
> > 
> >     o       CPU 1 calls kfree_rcu(cset, rcu_head) in order to free
> >             "from_cset" after a later grace period.  Except that
> >             "from_cset" is freed right after the previous grace period
> >             ended, so that "from_cset" is immediately freed.  Task A
> >             resumes and references "from_cset"'s member, after which
> >             nothing good happens.
> > 
> >     In full detail:
> > 
> >     CPU 0                                   CPU 1
> >     ----------------------                  ----------------------
> >     count_memcg_event_mm()
> >     |rcu_read_lock()  <---
> >     |mem_cgroup_from_task()
> >      |// css_set_ptr is the "from_cset" mentioned on CPU 1
> >      |css_set_ptr = rcu_dereference((task)->cgroups)
> >      |// Hard irq comes, current task is scheduled out.
> > 
> >                                             cgroup_attach_task()
> >                                             |cgroup_migrate()
> >                                             |cgroup_migrate_execute()
> >                                             |css_set_move_task(task, from_cset, to_cset, true)
> >                                             |cgroup_move_task(task, to_cset)
> >                                             |rcu_assign_pointer(.., to_cset)
> >                                             |...
> >                                             |cgroup_migrate_finish()
> >                                             |put_css_set_locked(from_cset)
> >                                             |from_cset->refcount return 0
> >                                             |kfree_rcu(cset, rcu_head) // free from_cset after new gp
> >                                             |add_ptr_to_bulk_krc_lock()
> >                                             |schedule_delayed_work(&krcp->monitor_work, ..)
> > 
> >                                             kfree_rcu_monitor()
> >                                             |krcp->bulk_head[0]'s work attached to krwp->bulk_head_free[]
> >                                             |queue_rcu_work(system_wq, &krwp->rcu_work)
> >                                             |if rwork->rcu.work is not in WORK_STRUCT_PENDING_BIT state,
> >                                             |call_rcu(&rwork->rcu, rcu_work_rcufn) <--- request new gp
> > 
> >                                             // There is a previous call_rcu(.., rcu_work_rcufn)
> >                                             // gp end, rcu_work_rcufn() is called.
> >                                             rcu_work_rcufn()
> >                                             |__queue_work(.., rwork->wq, &rwork->work);
> > 
> >                                             |kfree_rcu_work()
> >                                             |krwp->bulk_head_free[0] bulk is freed before new gp end!!!
> >                                             |The "from_cset" is freed before new gp end.
> > 
> >     // the task resumes some time later.
> >      |css_set_ptr->subsys[subsys_id] <--- Caused kernel crash, because css_set_ptr is freed.
> > 
> >     This commit therefore causes kfree_rcu_monitor() to refrain from
> >     moving kfree_rcu() memory to the kfree_rcu_cpu_work structure until
> >     the RCU grace period has completed for all three categories.
> > 
> >     v2: Use helper function instead of inserted code block at
> >     kfree_rcu_monitor().
> > 
> >     Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of kfree_rcu() work")
> >     Reported-by: Mukesh Ojha <quic_mojha@quicinc.com>
> >     Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
> >     Reviewed-by: Uladzislau Rezki (Sony) <urezki@gmail.com>
> >     Signed-off-by: Paul E. McKenney <paulmck@kernel.org>
> > 
> > diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
> > index 859ee02f6614..e2dbea6cee4b 100644
> > --- a/kernel/rcu/tree.c
> > +++ b/kernel/rcu/tree.c
> > @@ -3051,6 +3051,18 @@ need_offload_krc(struct kfree_rcu_cpu *krcp)
> >         return !!READ_ONCE(krcp->head);
> >  }
> > 
> > +static bool
> > +need_wait_for_krwp_work(struct kfree_rcu_cpu_work *krwp)
> > +{
> > +       int i;
> > +
> > +       for (i = 0; i < FREE_N_CHANNELS; i++)
> > +               if (!list_empty(&krwp->bulk_head_free[i]))
> > +                       return true;
> > +
> > +       return !!krwp->head_free;
> > +}
> > +
> >  static int krc_count(struct kfree_rcu_cpu *krcp)
> >  {
> >         int sum = atomic_read(&krcp->head_count);
> > @@ -3134,15 +3146,14 @@ static void kfree_rcu_monitor(struct work_struct *work)
> >         for (i = 0; i < KFREE_N_BATCHES; i++) {
> >                 struct kfree_rcu_cpu_work *krwp = &(krcp->krw_arr[i]);
> > 
> > -               // Try to detach bulk_head or head and attach it over any
> > -               // available corresponding free channel. It can be that
> > -               // a previous RCU batch is in progress, it means that
> > -               // immediately to queue another one is not possible so
> > -               // in that case the monitor work is rearmed.
> > -               if ((!list_empty(&krcp->bulk_head[0]) && list_empty(&krwp->bulk_head_free[0])) ||
> > -                       (!list_empty(&krcp->bulk_head[1]) && list_empty(&krwp->bulk_head_free[1])) ||
> > -                               (READ_ONCE(krcp->head) && !krwp->head_free)) {
> > +               // Try to detach bulk_head or head and attach it, only when
> > +               // all channels are free.  Any channel is not free means at krwp
> > +               // there is on-going rcu work to handle krwp's free business.
> > +               if (need_wait_for_krwp_work(krwp))
> > +                       continue;
> > 
> > +               // kvfree_rcu_drain_ready() might handle this krcp, if so give up.
> > +               if (need_offload_krc(krcp)) {
> >                         // Channel 1 corresponds to the SLAB-pointer bulk path.
> >                         // Channel 2 corresponds to vmalloc-pointer bulk path.
> >                         for (j = 0; j < FREE_N_CHANNELS; j++) {
Joel Fernandes April 5, 2023, 5:39 p.m. UTC | #7
On Fri, Mar 31, 2023 at 8:43 AM Ziwei Dai <ziwei.dai@unisoc.com> wrote:
>
> In kfree_rcu_monitor(), new free business at krcp is attached to any free
> channel at krwp. kfree_rcu_monitor() is responsible for making sure new free
> business is handled after the rcu grace period. But if there is any non-free
> channel at krwp already, that means there is on-going rcu work, which will
> cause the kvfree_call_rcu()-triggered free business to be done before the
> wanted rcu grace period ends.
>
> This commit ignores any krwp which has a non-free channel at
> kfree_rcu_monitor(), to fix the issue that kvfree_call_rcu() loses
> effectiveness.
>
> Below is the css_set obj "from_cset" use-after-free case caused by
> kvfree_call_rcu() losing effectiveness.
> CPU 0 calls rcu_read_lock(), then uses "from_cset", then a hard irq comes,
> and the task is scheduled out.
> CPU 1 calls kfree_rcu(cset, rcu_head), willing to free "from_cset" after a new gp.
> But "from_cset" is freed right after the current gp ends. "from_cset" is reallocated.
> CPU 0's task arrives back and references "from_cset"'s member, which causes a crash.
>
> CPU 0                                   CPU 1
> count_memcg_event_mm()
> |rcu_read_lock()  <---
> |mem_cgroup_from_task()
>  |// css_set_ptr is the "from_cset" mentioned on CPU 1
>  |css_set_ptr = rcu_dereference((task)->cgroups)
>  |// Hard irq comes, current task is scheduled out.
>
>                                         cgroup_attach_task()
>                                         |cgroup_migrate()
>                                         |cgroup_migrate_execute()
>                                         |css_set_move_task(task, from_cset, to_cset, true)
>                                         |cgroup_move_task(task, to_cset)
>                                         |rcu_assign_pointer(.., to_cset)
>                                         |...
>                                         |cgroup_migrate_finish()
>                                         |put_css_set_locked(from_cset)
>                                         |from_cset->refcount return 0
>                                         |kfree_rcu(cset, rcu_head) // means to free from_cset after new gp
>                                         |add_ptr_to_bulk_krc_lock()
>                                         |schedule_delayed_work(&krcp->monitor_work, ..)
>
>                                         kfree_rcu_monitor()
>                                         |krcp->bulk_head[0]'s work attached to krwp->bulk_head_free[]
>                                         |queue_rcu_work(system_wq, &krwp->rcu_work)
>                                         |if rwork->rcu.work is not in WORK_STRUCT_PENDING_BIT state,
>                                         |call_rcu(&rwork->rcu, rcu_work_rcufn) <--- request a new gp
>
>                                         // There is a previous call_rcu(.., rcu_work_rcufn)
>                                         // gp end, rcu_work_rcufn() is called.
>                                         rcu_work_rcufn()
>                                         |__queue_work(.., rwork->wq, &rwork->work);
>
>                                         |kfree_rcu_work()
>                                         |krwp->bulk_head_free[0] bulk is freed before new gp end!!!
>                                         |The "from_cset" is freed before new gp end.
>
> // the task is scheduled in after many ms.
>  |css_set_ptr->subsys[subsys_id] <--- Caused kernel crash, because css_set_ptr is freed.
>
> v2: Use helper function instead of inserted code block at kfree_rcu_monitor().
>
> Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of kfree_rcu() work")
> Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>

Please update the fixes tag to:
5f3c8d620447 ("rcu/tree: Maintain separate array for vmalloc ptrs")

The issue happened when 5f3c8d620447 started looking at multiple
channels at the same time in the same work handler function.

I think a better fix might be to separate out the work handler
functions for each channel separately. That way we get more
parallelism.

but since this is urgent,
Acked-by: Joel Fernandes (Google) <joel@joelfernandes.org>

thanks,

 - Joel



> ---
>  kernel/rcu/tree.c | 27 +++++++++++++++++++--------
>  1 file changed, 19 insertions(+), 8 deletions(-)
>
> diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
> index 8e880c0..7b95ee9 100644
> --- a/kernel/rcu/tree.c
> +++ b/kernel/rcu/tree.c
> @@ -3024,6 +3024,18 @@ static void kfree_rcu_work(struct work_struct *work)
>         return !!READ_ONCE(krcp->head);
>  }
>
> +static bool
> +need_wait_for_krwp_work(struct kfree_rcu_cpu_work *krwp)
> +{
> +       int i;
> +
> +       for (i = 0; i < FREE_N_CHANNELS; i++)
> +               if (!list_empty(&krwp->bulk_head_free[i]))
> +                       return true;
> +
> +       return !!krwp->head_free;
> +}
> +
>  static int krc_count(struct kfree_rcu_cpu *krcp)
>  {
>         int sum = atomic_read(&krcp->head_count);
> @@ -3107,15 +3119,14 @@ static void kfree_rcu_monitor(struct work_struct *work)
>         for (i = 0; i < KFREE_N_BATCHES; i++) {
>                 struct kfree_rcu_cpu_work *krwp = &(krcp->krw_arr[i]);
>
> -               // Try to detach bulk_head or head and attach it over any
> -               // available corresponding free channel. It can be that
> -               // a previous RCU batch is in progress, it means that
> -               // immediately to queue another one is not possible so
> -               // in that case the monitor work is rearmed.
> -               if ((!list_empty(&krcp->bulk_head[0]) && list_empty(&krwp->bulk_head_free[0])) ||
> -                       (!list_empty(&krcp->bulk_head[1]) && list_empty(&krwp->bulk_head_free[1])) ||
> -                               (READ_ONCE(krcp->head) && !krwp->head_free)) {
> +               // Try to detach bulk_head or head and attach it, only when
> +               // all channels are free.  Any channel is not free means at krwp
> +               // there is on-going rcu work to handle krwp's free business.
> +               if (need_wait_for_krwp_work(krwp))
> +                       continue;
>
> +               // kvfree_rcu_drain_ready() might handle this krcp, if so give up.
> +               if (need_offload_krc(krcp)) {
>                         // Channel 1 corresponds to the SLAB-pointer bulk path.
>                         // Channel 2 corresponds to vmalloc-pointer bulk path.
>                         for (j = 0; j < FREE_N_CHANNELS; j++) {
> --
> 1.9.1
>
Joel Fernandes April 5, 2023, 6:12 p.m. UTC | #8
On Wed, Apr 5, 2023 at 1:39 PM Joel Fernandes <joel@joelfernandes.org> wrote:
>
> On Fri, Mar 31, 2023 at 8:43 AM Ziwei Dai <ziwei.dai@unisoc.com> wrote:
> >
> > [...]
> >
> > Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of kfree_rcu() work")
> > Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
>
> Please update the fixes tag to:
> 5f3c8d620447 ("rcu/tree: Maintain separate array for vmalloc ptrs")

Vlad pointed out in another thread that the fix is actually to 34c881745549.

So just to be sure, it could be updated to:
Fixes: 34c881745549 ("rcu: Support kfree_bulk() interface in kfree_rcu()")
Fixes: 5f3c8d620447 ("rcu/tree: Maintain separate array for vmalloc ptrs")

thanks,

 - Joel
Paul E. McKenney April 5, 2023, 6:46 p.m. UTC | #9
On Wed, Apr 05, 2023 at 02:12:02PM -0400, Joel Fernandes wrote:
> On Wed, Apr 5, 2023 at 1:39 PM Joel Fernandes <joel@joelfernandes.org> wrote:
> >
> > On Fri, Mar 31, 2023 at 8:43 AM Ziwei Dai <ziwei.dai@unisoc.com> wrote:
> > >
> > > [...]
> > >
> > > Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of kfree_rcu() work")
> > > Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
> >
> > Please update the fixes tag to:
> > 5f3c8d620447 ("rcu/tree: Maintain separate array for vmalloc ptrs")
> 
> Vlad pointed out in another thread that the fix is actually to 34c881745549.
> 
> So just to be sure, it could be updated to:
> Fixes: 34c881745549 ("rcu: Support kfree_bulk() interface in kfree_rcu()")
> Fixes: 5f3c8d620447 ("rcu/tree: Maintain separate array for vmalloc ptrs")

Ziwei Dai, does this change in Fixes look good to you?

If so, I will update the commit log in this commit that I am planning
to submit into v6.3.  It is strictly speaking not a v6.3 regression,
but it is starting to show up in the wild and the patch is contained
enough to be considered an urgent fix.

							Thanx, Paul
代子为 (Ziwei Dai) April 6, 2023, 1:38 a.m. UTC | #10
> -----Original Message-----
> From: Paul E. McKenney <paulmck@kernel.org>
> Sent: April 6, 2023, 2:46
> To: Joel Fernandes <joel@joelfernandes.org>
> Cc: 代子为 (Ziwei Dai) <Ziwei.Dai@unisoc.com>; urezki@gmail.com; frederic@kernel.org; quic_neeraju@quicinc.com;
> josh@joshtriplett.org; rostedt@goodmis.org; mathieu.desnoyers@efficios.com; jiangshanlai@gmail.com; rcu@vger.kernel.org;
> linux-kernel@vger.kernel.org; 王双 (Shuang Wang) <shuang.wang@unisoc.com>; 辛依凡 (Yifan Xin) <Yifan.Xin@unisoc.com>;
> 王科 (Ke Wang) <Ke.Wang@unisoc.com>; 闫学文 (Xuewen Yan) <Xuewen.Yan@unisoc.com>; 牛志国 (Zhiguo Niu) <Zhiguo.Niu@unisoc.com>;
> 黄朝阳 (Zhaoyang Huang) <zhaoyang.huang@unisoc.com>
> Subject: Re: [PATCH V2] rcu: Make sure new krcp free business is handled after the wanted rcu grace period.
> 
> 
> CAUTION: This email originated from outside of the organization. Do not click links or open attachments
> unless you recognize the sender and know the content is safe.
> 
> 
> 
> On Wed, Apr 05, 2023 at 02:12:02PM -0400, Joel Fernandes wrote:
> > On Wed, Apr 5, 2023 at 1:39 PM Joel Fernandes <joel@joelfernandes.org> wrote:
> > >
> > > On Fri, Mar 31, 2023 at 8:43 AM Ziwei Dai <ziwei.dai@unisoc.com> wrote:
> > > >
> > > > [...]
> > > >
> > > > Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of kfree_rcu() work")
> > > > Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
> > >
> > > Please update the fixes tag to:
> > > 5f3c8d620447 ("rcu/tree: Maintain separate array for vmalloc ptrs")
> >
> > Vlad pointed out in another thread that the fix is actually to 34c881745549.
> >
> > So just to be sure, it could be updated to:
> > Fixes: 34c881745549 ("rcu: Support kfree_bulk() interface in kfree_rcu()")
> > Fixes: 5f3c8d620447 ("rcu/tree: Maintain separate array for vmalloc ptrs")
> 
> Ziwei Dai, does this change in Fixes look good to you?
> 
> If so, I will update the commit log in this commit that I am planning to submit into v6.3.  It is strictly speaking not a v6.3 regression,
> but it is starting to show up in the wild and the patch is contained enough to be considered an urgent fix.
> 
>                                                         Thanx, Paul

Hi Paul, it looks good to me and thanks!
Paul E. McKenney April 6, 2023, 3:46 a.m. UTC | #11
On Thu, Apr 06, 2023 at 01:38:09AM +0000, 代子为 (Ziwei Dai) wrote:
> 
> 
> > [...]
> > 
> > On Wed, Apr 05, 2023 at 02:12:02PM -0400, Joel Fernandes wrote:
> > > On Wed, Apr 5, 2023 at 1:39 PM Joel Fernandes <joel@joelfernandes.org> wrote:
> > > >
> > > > On Fri, Mar 31, 2023 at 8:43 AM Ziwei Dai <ziwei.dai@unisoc.com> wrote:
> > > > >
> > > > > In kfree_rcu_monitor(), new free business at krcp is attached to
> > > > > any free channel at krwp. kfree_rcu_monitor() is responsible to
> > > > > make sure new free business is handled after the rcu grace period.
> > > > > But if there is any none-free channel at krwp already, that means
> > > > > there is an on-going rcu work, which will cause the
> > > > > kvfree_call_rcu()-triggered free business is done before the wanted rcu grace period ends.
> > > > >
> > > > > This commit ignore krwp which has non-free channel at
> > > > > kfree_rcu_monitor(), to fix the issue that kvfree_call_rcu() loses effectiveness.
> > > > >
> > > > > Below is the css_set obj "from_cset" use-after-free case caused by
> > > > > kvfree_call_rcu() losing effectiveness.
> > > > > CPU 0 calls rcu_read_lock(), then use "from_cset", then hard irq
> > > > > comes, the task is schedule out.
> > > > > CPU 1 calls kfree_rcu(cset, rcu_head), willing to free "from_cset" after new gp.
> > > > > But "from_cset" is freed right after current gp end. "from_cset" is reallocated.
> > > > > CPU 0 's task arrives back, references "from_cset"'s member, which causes crash.
> > > > >
> > > > > CPU 0                                   CPU 1
> > > > > count_memcg_event_mm()
> > > > > |rcu_read_lock()  <---
> > > > > |mem_cgroup_from_task()
> > > > >  |// css_set_ptr is the "from_cset" mentioned on CPU 1
> > > > > |css_set_ptr = rcu_dereference((task)->cgroups)  |// Hard irq
> > > > > comes, current task is scheduled out.
> > > > >
> > > > >                                         cgroup_attach_task()
> > > > >                                         |cgroup_migrate()
> > > > >                                         |cgroup_migrate_execute()
> > > > >                                         |css_set_move_task(task, from_cset, to_cset, true)
> > > > >                                         |cgroup_move_task(task, to_cset)
> > > > >                                         |rcu_assign_pointer(.., to_cset)
> > > > >                                         |...
> > > > >                                         |cgroup_migrate_finish()
> > > > >                                         |put_css_set_locked(from_cset)
> > > > >                                         |from_cset->refcount return 0
> > > > >                                         |kfree_rcu(cset, rcu_head) // means to free from_cset after new gp
> > > > >                                         |add_ptr_to_bulk_krc_lock()
> > > > >
> > > > > |schedule_delayed_work(&krcp->monitor_work, ..)
> > > > >
> > > > >                                         kfree_rcu_monitor()
> > > > >                                         |krcp->bulk_head[0]'s work attached to krwp->bulk_head_free[]
> > > > >                                         |queue_rcu_work(system_wq, &krwp->rcu_work)
> > > > >                                         |if rwork->rcu.work is not in WORK_STRUCT_PENDING_BIT state,
> > > > >                                         |call_rcu(&rwork->rcu, rcu_work_rcufn) <--- request a new gp
> > > > >
> > > > >                                         // There is a previous call_rcu(.., rcu_work_rcufn).
> > > > >                                         // That gp ends, so rcu_work_rcufn() is called.
> > > > >                                         rcu_work_rcufn()
> > > > >                                         |__queue_work(.., rwork->wq, &rwork->work);
> > > > >
> > > > >                                         |kfree_rcu_work()
> > > > >                                         |krwp->bulk_head_free[0] bulk is freed before the new gp ends!!!
> > > > >                                         |The "from_cset" is freed before the new gp ends.
> > > > >
> > > > > // The task is scheduled back in after many ms.
> > > > >  |css_set_ptr->subsys[subsys_id] <--- Causes a kernel crash, because css_set_ptr is freed.
> > > > >
> > > > > v2: Use helper function instead of inserted code block at kfree_rcu_monitor().
> > > > >
> > > > > Fixes: c014efeef76a ("rcu: Add multiple in-flight batches of kfree_rcu() work")
> > > > > Signed-off-by: Ziwei Dai <ziwei.dai@unisoc.com>
> > > >
> > > > Please update the fixes tag to:
> > > > 5f3c8d620447 ("rcu/tree: Maintain separate array for vmalloc ptrs")
> > >
> > > Vlad pointed out in another thread that the fix is actually to 34c881745549.
> > >
> > > So just to be sure, it could be updated to:
> > > > Fixes: 34c881745549 ("rcu: Support kfree_bulk() interface in kfree_rcu()")
> > > > Fixes: 5f3c8d620447 ("rcu/tree: Maintain separate array for vmalloc ptrs")
> > 
> > Ziwei Dai, does this change in Fixes look good to you?
> > 
> > If so, I will update the commit log in this commit that I am planning to submit into v6.3.  It is strictly speaking not a v6.3 regression,
> > but it is starting to show up in the wild and the patch is contained enough to be considered an urgent fix.
> > 
> >                                                         Thanx, Paul
> 
> Hi Paul, it looks good to me and thanks!

Thank you, and I will fix on my next rebase.

							Thanx, Paul
Uladzislau Rezki April 6, 2023, 4:44 a.m. UTC | #12
On Wed, Apr 05, 2023 at 08:46:15PM -0700, Paul E. McKenney wrote:
> On Thu, Apr 06, 2023 at 01:38:09AM +0000, 代子为 (Ziwei Dai) wrote:
> > 
> > [...]
> > 
> > Hi Paul, it looks good to me and thanks!
> 
> Thank you, and I will fix on my next rebase.
> 
After heavy testing overnight, I do not see that any warnings
are triggered:

Tested-by: Uladzislau Rezki (Sony) <urezki@gmail.com>

--
Uladzislau Rezki
Paul E. McKenney April 6, 2023, 2:13 p.m. UTC | #13
On Thu, Apr 06, 2023 at 06:44:15AM +0200, Uladzislau Rezki wrote:
> On Wed, Apr 05, 2023 at 08:46:15PM -0700, Paul E. McKenney wrote:
> > On Thu, Apr 06, 2023 at 01:38:09AM +0000, 代子为 (Ziwei Dai) wrote:
> > > 
> > > [...]
> > 
> After heavy testing over night i do not see that any warnings
> are triggered:
> 
> Tested-by: Uladzislau Rezki (Sony) <urezki@gmail.com>

Thank you very much!!!  I will apply this during my upcoming rebase.

							Thanx, Paul
diff mbox series

Patch

diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c
index 8e880c0..7b95ee9 100644
--- a/kernel/rcu/tree.c
+++ b/kernel/rcu/tree.c
@@ -3024,6 +3024,18 @@  static void kfree_rcu_work(struct work_struct *work)
 	return !!READ_ONCE(krcp->head);
 }
 
+static bool
+need_wait_for_krwp_work(struct kfree_rcu_cpu_work *krwp)
+{
+	int i;
+
+	for (i = 0; i < FREE_N_CHANNELS; i++)
+		if (!list_empty(&krwp->bulk_head_free[i]))
+			return true;
+
+	return !!krwp->head_free;
+}
+
 static int krc_count(struct kfree_rcu_cpu *krcp)
 {
 	int sum = atomic_read(&krcp->head_count);
@@ -3107,15 +3119,14 @@  static void kfree_rcu_monitor(struct work_struct *work)
 	for (i = 0; i < KFREE_N_BATCHES; i++) {
 		struct kfree_rcu_cpu_work *krwp = &(krcp->krw_arr[i]);
 
-		// Try to detach bulk_head or head and attach it over any
-		// available corresponding free channel. It can be that
-		// a previous RCU batch is in progress, it means that
-		// immediately to queue another one is not possible so
-		// in that case the monitor work is rearmed.
-		if ((!list_empty(&krcp->bulk_head[0]) && list_empty(&krwp->bulk_head_free[0])) ||
-			(!list_empty(&krcp->bulk_head[1]) && list_empty(&krwp->bulk_head_free[1])) ||
-				(READ_ONCE(krcp->head) && !krwp->head_free)) {
+		// Try to detach bulk_head or head and attach it, only when
+		// all channels are free.  Any channel is not free means at krwp
+		// there is on-going rcu work to handle krwp's free business.
+		if (need_wait_for_krwp_work(krwp))
+			continue;
 
+		// kvfree_rcu_drain_ready() might handle this krcp, if so give up.
+		if (need_offload_krc(krcp)) {
 			// Channel 1 corresponds to the SLAB-pointer bulk path.
 			// Channel 2 corresponds to vmalloc-pointer bulk path.
 			for (j = 0; j < FREE_N_CHANNELS; j++) {