sched/fair: Fix update_cfs_group() vs DELAY_DEQUEUE

Normally dequeue_entities() will continue to dequeue an empty group entity;
except DELAY_DEQUEUE changes things -- it retains empty entities such that they
might continue to compete and burn off some lag.

However, doing this results in update_cfs_group() re-computing the cgroup
weight 'slice' for an empty group, which it (rightly) figures isn't much at
all. This in turn means that the delayed entity is not competing at the
expected weight. Worse, the very low weight causes its lag to be inflated,
which combined with avg_vruntime() using scale_load_down(), leads to artifacts.

As such, don't adjust the weight for empty group entities and let them compete
at their original weight.

Fixes: 152e11f6df29 ("sched/fair: Implement delayed dequeue")
Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org>
Link: https://lkml.kernel.org/r/20250110115720.GA17405@noisy.programming.kicks-ass.net

Changed files
+5 -1
kernel
sched
+5 -1
kernel/sched/fair.c
··· 3956 3956 struct cfs_rq *gcfs_rq = group_cfs_rq(se); 3957 3957 long shares; 3958 3958 3959 - if (!gcfs_rq) 3959 + /* 3960 + * When a group becomes empty, preserve its weight. This matters for 3961 + * DELAY_DEQUEUE. 3962 + */ 3963 + if (!gcfs_rq || !gcfs_rq->load.weight) 3960 3964 return; 3961 3965 3962 3966 if (throttled_hierarchy(gcfs_rq))