On 10/7/21 4:22 AM, David Ahern wrote:
On 10/6/21 3:35 PM, Leonard Crestez wrote:
I counted the [FAIL] or [ OK ] markers but not the output of nettest itself. I don't know what to look for, I guess I could diff the outputs?
Shouldn't it be sufficient to compare the exit codes of the nettest client?
mistakes happen. The 700+ tests that exist were verified by me when I submitted the script - that each test passes when it should and fails when it should. "FAIL" has many reasons. I tried to have separate exit codes for nettest.c to capture the timeouts vs ECONNREFUSED, etc., but I could easily have made a mistake. scanning the output is the best way. Most of the 'supposed to fail' tests have a HINT saying why it should fail.
It is not good to have a test for which correctness is ambiguous to such an extent, it makes reliable future changes difficult. In theory an uniform TAP format is supposed to solve this but it is not applied inside selftests/net.
I attempted to write a script to compare two logs in their current format: https://gitlab.com/cdleonard/kselftest-parse-nettest-fcnal
It does a bunch of nasty scrubbing of irrelevant behavior and got it to the point where no diffs are found between repeated runs on the same machine.
One nasty issue was that many tests kill processes inside log_test so relevant output may be shown either before or after the "TEST: " result line. This was solved by associating output until the next ### with previous test.
The output is also modified by a previous change to not capture server output separately and instead let it be combined with that of the client. That change is required for this one, doing out=$(nettest -k) does not return on fork unless the pipe is also closed.
I did not look at your change, mine is relatively minimal because it only changes who decide when the server goes into the background: the shell script or the server itself. This makes it work very easily even for tests with multiple server instances.
The logging issue is why I went with 1 binary do both server and client after nettest.c got support for changing namespaces.
It's possible to just compare the "client" and "server" logs separately by sorting them on their prefix.
I think a decent approach would be to do a bulk replace for all "run_cmd{,_nsb,_nsc} nettest" with a new "run_nettest" function that passes all arguments to nettest itself. That run_nettest function could include a leading common "-t" arg that is parsed at the top of fcnal-test.sh.
-- Regards, Leonard