sched/fair: Fix fault in reweight_entity

Syzbot found a GPF in reweight_entity. This has been bisected to
commit 4ef0c5c6b5ba ("kernel/sched: Fix sched_fork() access an invalid
sched_task_group")

There is a race between sched_post_fork() and setpriority(PRIO_PGRP)
within a thread group that causes a null-ptr-deref in
reweight_entity() in CFS. The scenario is that the main process spawns
number of new threads, which then call setpriority(PRIO_PGRP, 0, -20),
wait, and exit. For each of the new threads the copy_process() gets
invoked, which adds the new task_struct and calls sched_post_fork()
for it.

In the above scenario there is a possibility that
setpriority(PRIO_PGRP) and set_one_prio() will be called for a thread
in the group that is just being created by copy_process(), and for
which the sched_post_fork() has not been executed yet. This will
trigger a null pointer dereference in reweight_entity(), as it will
try to access the run queue pointer, which hasn't been set.

Before the mentioned change the cfs_rq pointer for the task has been
set in sched_fork(), which is called much earlier in copy_process(),
before the new task is added to the thread_group. Now it is done in
the sched_post_fork(), which is called after that. To fix the issue
the remove the update_load param from the update_load param() function
and call reweight_task() only if the task flag doesn't have the
TASK_NEW flag set.

Fixes: 4ef0c5c6b5ba ("kernel/sched: Fix sched_fork() access an invalid sched_task_group")
Reported-by: syzbot+af7a719bc92395ee41b3@syzkaller.appspotmail.com
Signed-off-by: Tadeusz Struk <tadeusz.struk@linaro.org>
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Reviewed-by: Dietmar Eggemann <dietmar.eggemann@arm.com>
Cc: stable@vger.kernel.org
Link: https://lkml.kernel.org/r/20220203161846.1160750-1-tadeusz.struk@linaro.org

authored by Tadeusz Struk and committed by Peter Zijlstra 13765de8 26291c54

Changed files
+6 -5
kernel
sched
+6 -5
kernel/sched/core.c
··· 1214 1214 } 1215 1215 #endif 1216 1216 1217 - static void set_load_weight(struct task_struct *p, bool update_load) 1217 + static void set_load_weight(struct task_struct *p) 1218 1218 { 1219 + bool update_load = !(READ_ONCE(p->__state) & TASK_NEW); 1219 1220 int prio = p->static_prio - MAX_RT_PRIO; 1220 1221 struct load_weight *load = &p->se.load; 1221 1222 ··· 4407 4406 p->static_prio = NICE_TO_PRIO(0); 4408 4407 4409 4408 p->prio = p->normal_prio = p->static_prio; 4410 - set_load_weight(p, false); 4409 + set_load_weight(p); 4411 4410 4412 4411 /* 4413 4412 * We don't need the reset flag anymore after the fork. It has ··· 6922 6921 put_prev_task(rq, p); 6923 6922 6924 6923 p->static_prio = NICE_TO_PRIO(nice); 6925 - set_load_weight(p, true); 6924 + set_load_weight(p); 6926 6925 old_prio = p->prio; 6927 6926 p->prio = effective_prio(p); 6928 6927 ··· 7213 7212 */ 7214 7213 p->rt_priority = attr->sched_priority; 7215 7214 p->normal_prio = normal_prio(p); 7216 - set_load_weight(p, true); 7215 + set_load_weight(p); 7217 7216 } 7218 7217 7219 7218 /* ··· 9446 9445 #endif 9447 9446 } 9448 9447 9449 - set_load_weight(&init_task, false); 9448 + set_load_weight(&init_task); 9450 9449 9451 9450 /* 9452 9451 * The boot idle thread does lazy MMU switching as well: