On Tue, Jan 07, 2014 at 10:39:39AM +0000, Preeti U Murthy wrote:
On 01/07/2014 03:20 PM, Peter Zijlstra wrote:
On Tue, Jan 07, 2014 at 03:10:21PM +0530, Preeti U Murthy wrote:
What if we want to add arch specific flags to the NUMA domain? Currently with Peter's patch: https://lkml.org/lkml/2013/11/5/239 and this patch, the arch can modify the sd flags of the topology levels up to just before the NUMA domain. In sd_init_numa(), the flags for the NUMA domain get initialized. Perhaps we need to call into arch here to probe for additional flags?
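To make the question concrete, here is a toy userspace sketch of what "calling into arch" from the NUMA flag setup could look like. It is not kernel code: the hook name mock_arch_numa_flags(), the MOCK_SD_* flag names and their values are all invented for illustration, and only sd_init_numa() is named in the thread itself.

```c
/* Toy userspace mock, NOT kernel code. All names and values are made up. */
#define MOCK_SD_LOAD_BALANCE 0x1u
#define MOCK_SD_NUMA         0x2u
#define MOCK_SD_ARCH_EXTRA   0x4u	/* hypothetical arch-specific flag */

/* Hypothetical hook: an arch returns extra flags for a given NUMA level. */
static unsigned int mock_arch_numa_flags(int numa_level)
{
	(void)numa_level;	/* this toy arch treats all NUMA levels alike */
	return MOCK_SD_ARCH_EXTRA;
}

/* Stand-in for the flag initialization done in sd_init_numa(). */
static unsigned int mock_sd_init_numa_flags(int numa_level)
{
	unsigned int flags = MOCK_SD_LOAD_BALANCE | MOCK_SD_NUMA;

	/* The proposed step: let the arch add flags on top of the defaults. */
	flags |= mock_arch_numa_flags(numa_level);
	return flags;
}
```

In this sketch the generic defaults stay in one place and the arch can only add flags, which is one possible design; whether an arch should also be able to clear flags is left open here.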
What are you thinking of? I was hoping all NUMA details were captured in the distance table.
It's far easier to talk of specifics in this case.
If the processor can be core gated, then we would save very little power by consolidating all the load onto a single node in a NUMA domain. Whether 6 cores run on one node or 3 cores run on each of two nodes, the power is drawn by 6 cores in all.
Not being a NUMA expert, I would have thought that load consolidation at node level would nearly always save power even when cpus can be power gated individually. The number of cpus awake is the same, but you only need to power the caches, memory, and other node peripherals for one node instead of two in your example. Wouldn't that save power?
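The two positions can be compared with a toy model, assuming total power is just a per-core cost plus a per-node overhead (caches, memory, node peripherals). The function name and all wattage figures below are made-up placeholders, not measurements from any real system.

```c
/*
 * Toy power model, NOT kernel code. Assumes power splits cleanly into
 * a per-active-core cost and a per-active-node overhead; all inputs
 * are hypothetical placeholders in milliwatts.
 */
static unsigned int toy_power_mw(unsigned int active_cores,
				 unsigned int active_nodes,
				 unsigned int core_mw,
				 unsigned int node_mw)
{
	return active_cores * core_mw + active_nodes * node_mw;
}
```

With placeholder values of 1000 mW per core and 500 mW per node, toy_power_mw(6, 1, 1000, 500) gives 6500 and toy_power_mw(6, 2, 1000, 500) gives 7000, so consolidation saves exactly one node's overhead. If the node overhead is taken as 0, both layouts cost the same, which is the core-gating argument quoted above.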
Memory/cache intensive workloads might benefit from spreading at node level though.
Am I missing something?
Morten