-----Original Message----- From: Iurii Zaikin
On Thu, Oct 17, 2019 at 4:54 PM Tim.Bird@sony.com wrote:
-----Original Message----- From: Iurii Zaikin
You can do all of this and allow users to supply another set of data. It doesn't gave to be one or the other.
What is the use case for running a unit test on a different data set than what it comes with?
I just gave some ideas in another message (our emails crossed), but one use case is to allow someone besides the test author to inject additional data points, and to do so without having to re-compile the code.
They might do this for multiple reasons:
- to experiment with additional data points
- to try to diagnose a problem they are seeing
- to fill gaps they see in existing data points
Whether this makes sense depends on a lot of factors. I suspect the timestamp test code is not a good candidate for this, as the code is simple enough that adding a new test case is pretty trivial. For some other types of tests, adding the data via an external file could be easier than changing the code of the test.
I think feeding test data without recompiling looks attractive right now because in order to run a single test you need to compile and link the whole kernel. I believe KUnit's strategic goal is the ability to only compile the relevant bits, which is admittedly very far off. Normally, in application programming the amount of code that needs to be recompiled in order to run a test suite is small enough that the added complexity of enabling the test to get the data from external sources is not warranted. Typically, external files are used when something is not practical to include in the source file directly due to size or complexity, i.e. a large snippet of text, an image file, some binary data etc. Such needs are typically addressed by the test author rather than the core test framework. Now, in application programming you can do a lot of things like reading a file which is trickier in kernel. But again we've come to supporting a use case for test data which has to be fabricated through some involved process or otherwise not easily included in the source file.
I strongly agree with everything you say here.
And if you come up with an additional test case, why not just add it and leave it there?
You should do exactly that. But the part you glossed over is the "coming up with an additional test case" part.
Having a data-driven test can, in some circumstances, allow one to more easily come up with additional interesting test cases. This is where Ted is exactly right about fuzzers. You don't want to execute a fuzzer as part of your unit testing. But you might want to execute a fuzzer to come up with additional unit test cases to add. And fuzzers are easier to write for data files than for C code. (I run a grave risk of de-railing the conversation by referring back to fuzzers, just when I believe we are coming to agreement about a number of ideas. :-). Just to be clear, I'm not promoting fuzzers for unit testing.
Regards, -- Tim
Unit tests are cheap, even if a case proves to be redundant, the mere fact that the code under test made you think of such a case is sufficient to permanently include the test case into the test suite.