回复: [PATCH] io-wq: Fix UAF when wakeup wqe in hash waitqueue

From: Zhang, Qiang
Date: Mon May 24 2021 - 05:19:30 EST




________________________________________
发件人: Hillf Danton <hdanton@xxxxxxxx>
发送时间: 2021年5月24日 16:25
收件人: Zhang, Qiang
抄送: axboe@xxxxxxxxx; asml.silence@xxxxxxxxx; syzbot+6cb11ade52aa17095297@xxxxxxxxxxxxxxxxxxxxxxxxx; io-uring@xxxxxxxxxxxxxxx; linux-kernel@xxxxxxxxxxxxxxx
主题: Re: [PATCH] io-wq: Fix UAF when wakeup wqe in hash waitqueue

[Please note: This e-mail is from an EXTERNAL e-mail address]

On Mon, 24 May 2021 15:18:44 +0800
> From: Zqiang <qiang.zhang@xxxxxxxxxxxxx>
>
> The syzbot report a UAF when iou-wrk accessing wqe of the hash
> waitqueue. in the case of sharing a hash waitqueue between two
> io-wq, when one of the io-wq is destroyed, all iou-wrk in this
> io-wq are awakened, all wqe belonging to this io-wq are removed
> from hash waitqueue, after that, all iou-wrk belonging to this
> io-wq begin running, suppose following scenarios, wqe[0] and wqe[1]
> belong to this io-wq, and these work has same hash value.
>
> CPU0 CPU1
> iou-wrk0(wqe[0]) iou-wrk1(wqe[1])
>
> while test_bit IO_WQ_BIT_EXIT while test_bit IO_WQ_BIT_EXIT
> io_worker_handle_work
> schedule_timeout (sleep be break by wakeup io_get_next_work
> and the IO_WQ_BIT_EXIT be set) set_bit hash
>
> test_bit IO_WQ_BIT_EXIT (return true)
> wqe->work_list (is not empty)
> io_get_next_work
> io_wq_is_hashed
> test_and_set_bit hash (is true) (hash!=-1U&&!next_hashed) true
> (there is no work other than hash work)
> io_wait_on_hash clear_bit hash
> spin_lock wq_has_sleeper (is false)
> list_empty(&wqe->wait.entry) (is true)
> __add_wait_queue (hash->wait is empty,not wakeup
> and IO_WQ_BIT_EXIT has been set,
> ........ the wqe->work_list is empty exit
> (there is no work other than hash work while loop)
> io_get_next_work will return NULL)
> return NULL (the wqe->work_list is empty
> the io_worker_handle_work is not
> called)
> io_worker_exit io_worker_exit
>
> In the above scenario, wqe may be mistakenly removing
> opportunities from the queue, this leads to when the wqe is
> released, it still in hash waitqueue. when a iou-wrk belonging
> to another io-wq access hash waitqueue will trigger UAF,
> To avoid this phenomenon, after all iou-wrk thread belonging to the
> io-wq exit, remove wqe from the hash waiqueue, at this time,
> there will be no operation to queue the wqe.
>
> Reported-by: syzbot+6cb11ade52aa17095297@xxxxxxxxxxxxxxxxxxxxxxxxx
> Signed-off-by: Zqiang <qiang.zhang@xxxxxxxxxxxxx>
> ---
> fs/io-wq.c | 9 ++++++---
> 1 file changed, 6 insertions(+), 3 deletions(-)
>
> diff --git a/fs/io-wq.c b/fs/io-wq.c
> index 5361a9b4b47b..911a1274aabd 100644
> --- a/fs/io-wq.c
> +++ b/fs/io-wq.c
> @@ -1003,13 +1003,16 @@ static void io_wq_exit_workers(struct io_wq *wq)
> struct io_wqe *wqe = wq->wqes[node];
>
> io_wq_for_each_worker(wqe, io_wq_worker_wake, NULL);
> - spin_lock_irq(&wq->hash->wait.lock);
> - list_del_init(&wq->wqes[node]->wait.entry);
> - spin_unlock_irq(&wq->hash->wait.lock);
> }
> rcu_read_unlock();
> io_worker_ref_put(wq);
> wait_for_completion(&wq->worker_done);
> +
> + for_each_node(node) {
> + spin_lock_irq(&wq->hash->wait.lock);
> + list_del_init(&wq->wqes[node]->wait.entry);
> + spin_unlock_irq(&wq->hash->wait.lock);
> + }
> put_task_struct(wq->task);
> wq->task = NULL;
> }
> --
> 2.17.1

>Scratch scalp one inch off to work out how this is a cure given a) uaf makes
>no sense without free and b) how io workers could survive
>wait_for_completion(&wq->worker_done).
>
>If they could OTOH then this is not the pill for the leak in worker_refs.

Hello Pavel Begunkov, Hillf Danton

Sorry there is a problem with the calltrace described in my message. Please ignore this modification

Thanks
Qiang