Hello,
I'm using the EASv5 3.18 tree with cpufreq_sched. With the sched governor enabled I've noticed that after a migration or after a switch from a non-fair task to the idle task, the source CPU goes idle and its (possibly max) capacity request stays in place, preventing other requests from going through until that source CPU decides to wake up and take up some work. I know that there are some ongoing discussions about how to actually enforce a frequency reduction when a CPU enters idle to save power, but this seems to be a more immediate problem since the other CPU(s)' requests are also basically ignored. How about a reset_capacity call in pick_next_task_idle? Throttling is a concern I suppose, but I think the check in dequeue_task_fair is doing the same thing already, so the following would just repeat for non_fair_class->idle_task.
diff --git a/kernel/sched/idle_task.c b/kernel/sched/idle_task.c
index c65dac8..555c21d 100644
--- a/kernel/sched/idle_task.c
+++ b/kernel/sched/idle_task.c
@@ -28,6 +28,8 @@ pick_next_task_idle(struct rq *rq, struct task_struct *prev)
 {
 	put_prev_task(rq, prev);
 
+	cpufreq_sched_reset_cap(cpu_of(rq));
+
 	schedstat_inc(rq, sched_goidle);
 	return rq->idle;
 }
Thanks, Vikram
Hi Vikram,
On 11/11/2015 11:16 AM, Vikram Mulukutla wrote:
I'm using the EASv5 3.18 tree with cpufreq_sched. With the sched governor enabled I've noticed that after a migration or after a switch from a non-fair task to the idle task, the source CPU goes idle and its (possibly max) capacity request stays in place, preventing other requests from going through until that source CPU decides to wake up and take up some work.
I haven't been able to reconcile this behavior with the code. There are hooks in dequeue_task_fair and the migration paths that should update the CFS capacity vote if the CPU CFS runqueue is empty.
Since the dequeue_task_fair path calls cpufreq_sched_reset_cap, it will zero out the CPU's capacity vote but this API does not trigger a re-evaluation of the overall required cluster capacity and set a new OPP. Another event (on any CPU in the cluster) will need to occur which will cause the cluster capacity to be re-evaluated. Any chance this is what you are seeing?
I know that there are some ongoing discussions about how to actually enforce a frequency reduction when a CPU enters idle to save power, but this seems to be a more immediate problem since the other CPU(s)' requests are also basically ignored.
If there's a sequence where we can ignore a CPU's request for higher capacity then agreed, that is a more immediate problem.
thanks, Steve
On 11/11/2015 05:02 PM, Steve Muckle wrote:
I'm using the EASv5 3.18 tree with cpufreq_sched. With the sched
governor enabled I've noticed that after a migration or after a switch from a non-fair task to the idle task, the source CPU goes idle and its (possibly max) capacity request stays in place, preventing other requests from going through until that source CPU decides to wake up and take up some work.
I haven't been able to reconcile this behavior with the code. There are hooks in dequeue_task_fair and the migration paths that should update the CFS capacity vote if the CPU CFS runqueue is empty.
There's a bit of strangeness in cpufreq_sched where the CPU driving the max request in the cluster must be the one to set the new OPP. Not sure if this could be related to what you're seeing. I'm about to send out another email on it as I want to take that logic out.
Hi Steve,
On 11/11/2015 5:02 PM, Steve Muckle wrote:
I haven't been able to reconcile this behavior with the code. There are hooks in dequeue_task_fair and the migration paths that should update the CFS capacity vote if the CPU CFS runqueue is empty.
Since the dequeue_task_fair path calls cpufreq_sched_reset_cap, it will zero out the CPU's capacity vote but this API does not trigger a re-evaluation of the overall required cluster capacity and set a new OPP. Another event (on any CPU in the cluster) will need to occur which will cause the cluster capacity to be re-evaluated. Any chance this is what you are seeing?
I took a second look - the problem is that the capacity reset in dequeue_task_fair will only kick in if the task being dequeued is going to sleep, which may not be true in the case of forced preemption. So the dequeue before switching to the migration thread doesn't actually reset the capacity.
In the particular case that I was seeing, the max request was actually ignored because of the cpufreq_sched throttling scheme, which resulted in future requests being ignored:
1) There is a single task TaskA on CPU0.
2) CPU0 makes a max frequency request as part of enqueue_task_fair that is ignored because of throttling, but per_cpu(0,capacity) is set to max (in fact it's 1278 after the capacity margin is added).
3) TaskA is forcefully preempted. dequeue_task_fair is invoked but does not reset capacity since task_sleep=false.
4) migration/0 moves TaskA off of CPU0 and onto CPU1.
5) CPU0 switches to the swapper.
6) Now CPU1 attempts to raise its frequency request (after some decay of TaskA's util so not fmax), but all its requests are ignored since CPU0 has the max request.
7) CPU0 remains in idle for a long time.
It seems that pick_next_idle_task is a better place to reset the capacity unconditionally, but pulling the reset_cap out of the if(task_sleep) block would also work I think?
Thanks, Vikram
On 11/11/2015 7:42 PM, Vikram Mulukutla wrote:
Hi Steve,
On 11/11/2015 5:02 PM, Steve Muckle wrote:
I haven't been able to reconcile this behavior with the code. There are hooks in dequeue_task_fair and the migration paths that should update the CFS capacity vote if the CPU CFS runqueue is empty.
Since the dequeue_task_fair path calls cpufreq_sched_reset_cap, it will zero out the CPU's capacity vote but this API does not trigger a re-evaluation of the overall required cluster capacity and set a new OPP. Another event (on any CPU in the cluster) will need to occur which will cause the cluster capacity to be re-evaluated. Any chance this is what you are seeing?
I took a second look - the problem is that the capacity reset in dequeue_task_fair will only kick in if the task being dequeued is going to sleep, which may not be true in the case of forced preemption. So the dequeue before switching to the migration thread doesn't actually reset the capacity.
In the particular case that I was seeing, the max request was actually ignored because of the cpufreq_sched throttling scheme, which resulted in future requests being ignored:
- There is a single task TaskA on CPU0.
- CPU0 makes a max frequency request as part of enqueue_task_fair
that is ignored because of throttling, but per_cpu(0,capacity) is set to max (in fact it's 1278 after the capacity margin is added).
To be precise, the max request in (2) is made because TaskA was enqueued on CPU0.
- TaskA is forcefully preempted. dequeue_task_fair is invoked but does not reset capacity since task_sleep=false.
- migration/0 moves TaskA off of CPU0 and onto CPU1.
- CPU0 switches to the swapper.
- Now CPU1 attempts to raise its frequency request (after some decay of TaskA's util so not fmax), but all its requests are ignored since CPU0 has the max request.
- CPU0 remains in idle for a long time.
It seems that pick_next_idle_task is a better place to reset the capacity unconditionally, but pulling the reset_cap out of the if(task_sleep) block would also work I think?
Thanks, Vikram
Hi Vikram,
On 11/11/15, Vikram Mulukutla wrote:
On 11/11/2015 7:42 PM, Vikram Mulukutla wrote:
Hi Steve,
On 11/11/2015 5:02 PM, Steve Muckle wrote:
I haven't been able to reconcile this behavior with the code. There are hooks in dequeue_task_fair and the migration paths that should update the CFS capacity vote if the CPU CFS runqueue is empty.
Since the dequeue_task_fair path calls cpufreq_sched_reset_cap, it will zero out the CPU's capacity vote but this API does not trigger a re-evaluation of the overall required cluster capacity and set a new OPP. Another event (on any CPU in the cluster) will need to occur which will cause the cluster capacity to be re-evaluated. Any chance this is what you are seeing?
I took a second look - the problem is that the capacity reset in dequeue_task_fair will only kick in if the task being dequeued is going to sleep, which may not be true in the case of forced preemption. So the dequeue before switching to the migration thread doesn't actually reset the capacity.
In the particular case that I was seeing, the max request was actually ignored because of the cpufreq_sched throttling scheme, which resulted in future requests being ignored:
- There is a single task TaskA on CPU0.
- CPU0 makes a max frequency request as part of enqueue_task_fair
that is ignored because of throttling, but per_cpu(0,capacity) is set to max (in fact it's 1278 after the capacity margin is added).
To be precise, the max request in (2) is made because TaskA was enqueued on CPU0.
- TaskA is forcefully preempted. dequeue_task_fair is invoked but
does not reset capacity since task_sleep=false.
So, why is TaskA preempted? Activation of some other task?
Also, we check for task_sleep=false because we have triggering points for load balancing operations (as you might move more than one task at a time). I'm wondering why those points don't cover this case.
Thanks,
- Juri
- migration/0 moves TaskA off of CPU0 and onto CPU1.
- CPU0 switches to the swapper
- Now CPU1 attempts to raise its frequency request (after some decay of TaskA's util so not fmax), but all its requests are ignored since CPU0 has the max request.
- CPU0 remains in idle for a long time.
It seems that pick_next_idle_task is a better place to reset the capacity unconditionally, but pulling the reset_cap out of the if(task_sleep) block would also work I think?
Thanks, Vikram
On 11/12/2015 1:37 AM, Juri Lelli wrote:
Hi Vikram,
On 11/11/15, Vikram Mulukutla wrote:
On 11/11/2015 7:42 PM, Vikram Mulukutla wrote:
- There is a single task TaskA on CPU0.
- CPU0 makes a max frequency request as part of enqueue_task_fair
that is ignored because of throttling, but per_cpu(0,capacity) is set to max (in fact it's 1278 after the capacity margin is added).
To be precise, the max request in (2) is made because TaskA was enqueued on CPU0.
- TaskA is forcefully preempted. dequeue_task_fair is invoked but
does not reset capacity since task_sleep=false.
So, why is TaskA preempted? Activation of some other task?
Also, we check for task_sleep=false because we have triggering points for load balancing operations (as you might move more than one task at a time). I'm wondering why those points don't cover this case.
So in the particular trace I was looking at, it's the sched_exec call that forces the process off of CPU0 to CPU1. TaskA started running on CPU0, called do_exec, and that led to sched_exec->select_task_rq_fair. That resulted in CPU1 being selected, which then caused migration/0 to run on CPU0 to force TaskA onto CPU1. CPU0 then switched to idle and went to sleep with a max cap request.
I suppose a task setting its own affinity would also potentially result in this situation?
Thanks, Vikram
On 11/12/2015 02:03 PM, Vikram Mulukutla wrote:
So in the particular trace I was looking at it's the sched_exec call that forces the process off of CPU0 to CPU1. So, TaskA started running on CPU0, called do_exec and that led to sched_exec->select_task_rq_fair. That resulted in CPU1 being selected which then caused migration/0 to run on the CPU0 to force TaskA on to CPU1. CPU0 then switched to idle and went to sleep with a max cap request.
I suppose a task setting its own affinity would also potentially result in this situation?
Yep. It looks like the migration paths in core.c were missed and don't have the hooks to update CPU capacity votes.
I've added this to my todo list. Not sure what your tree looks like/what version of the code you've got but you probably want to put the same hooks used to adjust CPU capacity in the load balance path in migration_cpu_stop() as well. In the latest that's update_capacity_of(cpu).