On Mon, Apr 20, 2020 at 12:45:36PM +0100, Edward Cree wrote:
On 16/04/2020 19:49, Sasha Levin wrote:
Just a question while I process your explanation (thanks for doing it!): wouldn't this be done by the neural network?
Yes, in the basic case. (Hopefully we're agreed that this is a long way from "I'm not sure what a fixes tag has to do with inclusion in a stable tree.", which is how this whole brouhaha started.)
My point was more that having or not having a fixes tag on it's own doesn't guarantee inclusion in the stable trees - that's why we have and explicit stable tag. What was said was (for me) the equivalent of "my commit message contains the word 'panic', why wasn't it picked?"
A Fixes tag affects the probability of a commit being picked up by AUTOSEL, yes, but it's not a reliable way to include or exclude patches from the stable tree.
It may also sound counter-intuitive but my long term plan (hope) is for AUTOSEL to die because maintainers got better at tagging patches. I don't want to keep doing this forever :)
It learns what a stable worthy commit is (and what isn't), and applies weights based on these findings, right? So if it learns that most non-stable commits don't have a fixes tag, it's likely to use that and "require" other inputs to have enough weight to compensate over a missing fixes tag so that it'll pass the threshold, no?
Yes. The problem comes when there are other inputs the NN doesn't have, that ought to screen off some of the information it's using. This is probably best illustrated by an unrealistic extreme case...
It's actually not that unrealistic. We have a few subsystems that do a great job with patch selection, and I usually don't find any other patches to pick up from there, while some other subsystems in the kernel require us to pick almost every patch that flows in there (think files that contain device quirks for example).
I've tried to address that by also including the modified filename into the inputs of the NN, so that the NN is better at acting differently based on the subsystem/filename being patched.
For mlx5, for example, there are two ways it would differentiate it from everything else:
- Commit subject lines usually start with net/mlx5, which is used as input to the NN. - Filenames touch drivers/net/ethernet/mellanox/mlx5/*
Anyway, yes - I understand your bigger point here around missing information from the NN. I'd like to think that based on previous experience it does a good job of balancing everything, but I might be mistaken.
Let's imagine hypothetically that the maintainer of drivers/blub is an absolutely perfect judge of which patches should go to -stable, and that the transmission path from him to the stable trees never loses a patch. This would mean that every autosel patch in drivers/blub is necessarily a false positive, because all the 'true positives' it might have found have already been taken out of the pool, so to speak. But if the NN is just trained to discriminate patches on whether they end up going to stable, it won't see any difference between a drivers/blub patch that the maintainer sent to stable straight away and a drivers/wibble patch that the latter's less diligent maintainer didn't forward and that only got picked up later when a stable kernel user encountered the bug it was fixing. As long as the NN doesn't have that piece of information, it's going to either generate lots of false positives in drivers/blub or lots of false negatives in drivers/wibble. Now obviously drivers/blub doesn't exist, no maintainer is 100% perfect at -stable submissions; but any difference will produce the same effect on a smaller scale, with the 'blubbish' maintainers seeing a high false positive fraction while from the 'wibblesome' maintainer's point of view autosel is working great. And since the 'blubs' are the ones who're putting effort of their own into stable selection already, they get aggrieved at having to also put effort into catching the false positives from a system that doesn't seem to be doing much for them, and everyone ends up shouting at each other as we're seeing here.
(Do you want me to do another worked numerical example demonstrating the above, or does it make enough sense in words not to need one?)
Nope, the example above works, thanks!