Hi Patrick,
On Tue, Jun 28, 2016 at 12:12:14PM +0100, Patrick Bellasi wrote:
On 23-Jun 21:43, Leo Yan wrote:
When load_avg is much higher than util_avg, it indicates that either the task has a higher priority (so a larger weight is accumulated into load_avg) or the task spends much more time in the runnable state.
In both cases, replace the util_avg value with load_avg. This inflates the utilization signal, so a single big task has a better chance to migrate to a big CPU.
Signed-off-by: Leo Yan <leo.yan@linaro.org>
---
 include/linux/sched.h |  1 +
 kernel/sched/fair.c   | 35 +++++++++++++++++++++++++++++++++++
 2 files changed, 36 insertions(+)
diff --git a/include/linux/sched.h b/include/linux/sched.h
index 644c39a..5d6bb25 100644
--- a/include/linux/sched.h
+++ b/include/linux/sched.h
@@ -1166,6 +1166,7 @@ struct load_weight {
  * for entity, support any load.weight always runnable
  */
 struct sched_avg {
+	u64 last_migrate_time;
 	u64 last_update_time, load_sum;
 	u32 util_sum, period_contrib;
 	unsigned long load_avg, util_avg;
diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c
index 185efe1..7fbfd41 100644
--- a/kernel/sched/fair.c
+++ b/kernel/sched/fair.c
@@ -674,6 +674,7 @@ void init_entity_runnable_average(struct sched_entity *se)
 {
 	struct sched_avg *sa = &se->avg;
 
+	sa->last_migrate_time = 0;
 	sa->last_update_time = 0;
 	/*
 	 * sched_avg's period_contrib should be strictly less then 1024, so
@@ -2771,6 +2772,7 @@ static void attach_entity_load_avg(struct cfs_rq *cfs_rq, struct sched_entity *se)
 
 skip_aging:
 	se->avg.last_update_time = cfs_rq->avg.last_update_time;
+	se->avg.last_migrate_time = cfs_rq->avg.last_update_time;
 	cfs_rq->avg.load_avg += se->avg.load_avg;
 	cfs_rq->avg.load_sum += se->avg.load_sum;
@@ -5228,6 +5230,11 @@ static inline unsigned long task_util(struct task_struct *p)
 	return p->se.avg.util_avg;
 }
 
+static inline unsigned long task_load(struct task_struct *p)
+{
+	return p->se.avg.load_avg;
+}
+
 unsigned int capacity_margin = 1280; /* ~20% margin */
 
 static inline unsigned long boosted_task_util(struct task_struct *task);
@@ -5369,8 +5376,35 @@ static inline unsigned long boosted_task_util(struct task_struct *task)
 {
 	unsigned long util = task_util(task);
+	unsigned long load = task_load(task);
Shouldn't this be:

	unsigned long load = clamp(task_load(task), 0, SCHED_LOAD_SCALE);

Or should we do the clamping only after the condition checks below, so that the comparison still sees the original value?
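Something like the sketch below is what I have in mind (illustrative only, using the patch's current condition ordering; min_t() here has the same effect as clamp() for an unsigned value): compare against the raw load_avg so a heavy task can still trip the check, and cap it only at the point where load replaces util, so the returned value stays within the usual utilization range:

	if ((load * capacity_margin > capacity_of(cpu) * 1024) &&
	    (load * 1024 > util * capacity_margin))
		util = min_t(unsigned long, load, SCHED_LOAD_SCALE);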
 	unsigned long margin = schedtune_task_margin(task);
Again, as a general comment, should we enable these mods only for boosted tasks? In this function we can use the condition "margin != 0" to identify when it's worth enabling the switch from utilization to load... thus increasing the chances to migrate to big CPUs only for boosted tasks.
This patch is not aimed at boosted tasks; it targets the case where a single big task has sufficient runnable time on the rq but still cannot migrate to a big core. If a task has been given a boost margin via cgroup or sysfs, I prefer to use that specified value directly.
What do you think about this?
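To give a rough feel for the case I am targeting (illustrative numbers only, ignoring frequency and CPU-capacity scaling): a nice 0 task that is runnable 100% of the time on a busy little CPU but actually running only ~60% of it converges to util_avg ~ 0.6 * 1024 ~ 614 while load_avg ~ 1024; likewise a nice -10 task (load.weight 9548) that is runnable half the time ends up with load_avg ~ 0.5 * 9548 ~ 4774 against util_avg ~ 512. In both cases the util signal alone never looks big enough to trigger an up-migration, while the load signal does.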
+	int cpu = task_cpu(task);
+	struct sched_entity *se = &task->se;
+	u64 delta;
+
+	/*
+	 * Change to use the load metric if two conditions are met:
+	 * - load is 20% higher than util, which means the task spends an
+	 *   extra ~20% of its time runnable and waiting to run, or the
+	 *   task has a higher priority than nice 0; in that case use the
+	 *   load signal rather than the util signal;
+	 * - load reaches the CPU "over-utilized" criteria.
Maybe add that the 20% margin is defined by the value:
capacity_margin / SCHED_LOAD_SCALE
Yes, this is more consistent. Will fix.
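For the record, spelling out the arithmetic with the current defaults (capacity_margin = 1280 and SCHED_LOAD_SCALE = 1024, i.e. a ratio of 1280/1024 = 1.25): the check load * 1024 > util * capacity_margin means util < load * 1024/1280, i.e. util is below ~80% of load; and load * capacity_margin > capacity_of(cpu) * 1024 means load > capacity_of(cpu) * 1024/1280, i.e. load is above ~80% of the CPU's capacity, matching the usual over-utilized threshold.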
+	 */
+	if ((load * capacity_margin > capacity_of(cpu) * 1024) &&
+	    (load * 1024 > util * capacity_margin))
Just for readability and to match the previous comment:
	if ((util * capacity_margin < load * 1024) &&
	    (load * capacity_margin > capacity_of(cpu) * 1024))
Will fix.
+		util = load;
+	else {
+		/*
+		 * Avoid the ping-pong issue: make sure the task can run at
+		 * least once on the higher-capacity CPU.
+		 */
+		delta = se->avg.last_update_time - se->avg.last_migrate_time;
+		if (delta < sysctl_sched_latency &&
+		    capacity_of(cpu) == cpu_rq(cpu)->rd->max_cpu_capacity.val)
+			util = load;
+	}
 
 	trace_sched_boost_task(task, util, margin);
 
 	return util + margin;
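For completeness, since it is easy to miss when reading the else branch: delta and sysctl_sched_latency are both in nanoseconds (last_update_time and last_migrate_time are stamped from the PELT clock), so on a max-capacity CPU the load signal keeps being reported for roughly one sched_latency window after the last attach. If I read the defaults right (6 ms base latency, scaled up by the log of the CPU count), that is on the order of a few tens of milliseconds, which should be enough for the task to run at least once on the big CPU before the plain util value can pull it back down. I can extend the comment to say this explicitly.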
@@ -9100,6 +9134,7 @@ static void task_move_group_fair(struct task_struct *p)
 #ifdef CONFIG_SMP
 	/* Tell se's cfs_rq has been changed -- migrated */
 	p->se.avg.last_update_time = 0;
+	p->se.avg.last_migrate_time = 0;
 #endif
 	attach_task_cfs_rq(p);
 }
-- 
1.9.1
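One last note on the new field, mostly to check my own understanding: last_migrate_time is stamped from cfs_rq->avg.last_update_time in attach_entity_load_avg() and zeroed here together with last_update_time, so the next attach after a migration or group move re-stamps it. The delta used in boosted_task_util() is then effectively the helper below (hypothetical, not part of the patch, only to illustrate the semantics):

/*
 * Hypothetical helper, only to illustrate what the new field encodes:
 * nanoseconds of PELT time since the entity was last attached to a
 * cfs_rq, i.e. since its last migration or group move.
 */
static inline u64 se_time_since_migrate(struct sched_entity *se)
{
	return se->avg.last_update_time - se->avg.last_migrate_time;
}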
-- 
#include <best/regards.h>
Patrick Bellasi