On 8/26/21 1:35 PM, Tejun Heo wrote:
Hello, Waiman.
Let's stop iterating on the patchset until we reach a consensus.
On Wed, Aug 25, 2021 at 05:37:49PM -0400, Waiman Long wrote:
- The "cpuset.cpus" is not empty and the list of CPUs are exclusive, i.e. they are not shared by any of its siblings.
Part of it can be reached by cpus going offline.
- The parent cgroup is a partition root.
This condition can happen if a parent stop being a partition.
- The "cpuset.cpus" is also a proper subset of the parent's
"cpuset.cpus.effective".
- The "cpuset.cpus" is a subset of the parent's
This can happen if cpus go offline.
- There is no child cgroups with cpuset enabled. This is for eliminating corner cases that have to be handled if such a condition is allowed.
This may make sense as a short cut for us but doesn't really stem from interface or behavior requirements.
Of the four conditions listed, two are bogus (the states can be reached through a different path and the configuration success or failure can be timing dependent if configuration racaes against cpu hotplug operations) and one maybe makes sense half-way and one is more of a shortcut.
Can't we just replace these with transitions to invalid state with proper explanation? That'd get rid of the error handling duplications from both the kernel and user side, make automated configurations which may race against hot plug operations reliable, and consistently provide users with why something failed.
What I am doing here is setting a high bar for transitioning from member to either "root" or "isolated". Once it becomes a partition, there are multiple ways that can make it invalid. I am fine with that. However, I am not sure it is a good idea to allow users to echo "root" to cpuset.cpus.partition anywhere in the cgroup hierarchy and require them to read it back to see if it succeed.
All the checking are done with cpuset_rwsem held. So there shouldn't be any racing. Of course, a hotplug can immediately follow and make the partition invalid.
Cheers, Longman