[PATCH v2 0/3] kunit: allow running test suites/cases individually

List overview All Threads
Download

newer

older

[PATCH v1 0/6] kunit: build kunit...

[PATCH 1/1] selftests: KVM: set...

Daniel Latypov

28 Sep 2021 28 Sep '21

10:29 p.m.

Allow running each suite or each test case alone per kernel boot. The motivation for this is to debug "test hermeticity" issues. This new --run_isolated flag would be a good first step to try and narrow down root causes.

Context: sometimes tests pass/fail depending on what ran before them. Memory corruption errors in particular might only cause noticeable issues later on. But you can also have the opposite, where "fixing" one test causes another to start failing.

Usage: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=suite $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=test $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=test example

The last one would provide output like ======== [PASSED] example ======== [PASSED] example_simple_test ============================================================ Testing complete. 1 tests run. 0 failed. 0 crashed. 0 skipped. Starting KUnit Kernel (2/3)... ============================================================ ======== [SKIPPED] example ======== [SKIPPED] example_skip_test # SKIP this test should be skipped ============================================================ Testing complete. 1 tests run. 0 failed. 0 crashed. 1 skipped. Starting KUnit Kernel (3/3)... ============================================================ ======== [SKIPPED] example ======== [SKIPPED] example_mark_skipped_test # SKIP this test should be skipped ============================================================ Testing complete. 1 tests run. 0 failed. 0 crashed. 1 skipped.

See the last patch's description for a bit more detail.

Meta: The first patch is from another series with just a reworded commit message, https://lore.kernel.org/linux-kselftest/20210805235145.2528054-2-dlatypov@go...

This patch series is based on Shuah's kunit branch: https://git.kernel.org/pub/scm/linux/kernel/git/shuah/linux-kselftest.git/?h...

Changes: v1 -> v2: rebase onto Shuah's kunit branch, fix missing code in patch 1.

Daniel Latypov (3): kunit: add 'kunit.action' param to allow listing out tests kunit: tool: factor exec + parse steps into a function kunit: tool: support running each suite/test separately

lib/kunit/executor.c | 45 ++++++++- tools/testing/kunit/kunit.py | 127 +++++++++++++++++-------- tools/testing/kunit/kunit_tool_test.py | 40 ++++++++ 3 files changed, 167 insertions(+), 45 deletions(-)

base-commit: 3b29021ddd10cfb6b2565c623595bd3b02036f33

-- 2.33.0.685.g46640cef36-goog

Show replies by date

Daniel Latypov

28 Sep 28 Sep

10:29 p.m.

New subject: [PATCH v2 1/3] kunit: add 'kunit.action' param to allow listing out tests

Context: It's difficult to map a given .kunitconfig => set of enabled tests. Letting kunit.py figure that out would be useful.

This patch: * is intended to be an implementation detail used only by kunit.py * adds a kunit.action module param with one valid non-null value, "list" * for the "list" action, it simply prints out "<suite>.<test>" * leaves the kunit.py changes to make use of this for another patch.

Note: kunit.filter_glob is respected for this and all future actions.

Hack: we print a TAP header and fake test plan to allow kunit.py to use the same code to pick up KUnit output that it does for normal tests. Since this is intended to be an implementation detail, it seems fine for now. Maybe in the future we ouptut each test as SKIPPED or the like.

Go with a more generic "action" param, since it seems like we might eventually have more modes besides just running or listing tests, e.g. * perhaps a benchmark mode that reruns test cases and reports timing * perhaps a deflake mode that reruns test cases that failed * perhaps a mode where we randomize test order to try and catch hermeticity bugs like "test a only passes if run after test b"

Tested: $ ./tools/testing/kunit/kunit.py run --kernel_arg=kunit.action=list --raw_output=kunit ... TAP version 14 1..1 example.example_simple_test example.example_skip_test example.example_mark_skipped_test reboot: System halted

Signed-off-by: Daniel Latypov dlatypov@google.com --- lib/kunit/executor.c | 45 +++++++++++++++++++++++++++++++++++++++----- 1 file changed, 40 insertions(+), 5 deletions(-)

diff --git a/lib/kunit/executor.c b/lib/kunit/executor.c index bab3ab940acc..8b38c91b4fac 100644 --- a/lib/kunit/executor.c +++ b/lib/kunit/executor.c @@ -15,9 +15,16 @@ extern struct kunit_suite * const * const __kunit_suites_end[]; #if IS_BUILTIN(CONFIG_KUNIT)

static char *filter_glob_param; +static char *action_param; + module_param_named(filter_glob, filter_glob_param, charp, 0); MODULE_PARM_DESC(filter_glob, "Filter which KUnit test suites/tests run at boot-time, e.g. list* or list*.*del_test"); +module_param_named(action, action_param, charp, 0); +MODULE_PARM_DESC(action, + "Changes KUnit executor behavior, valid values are:\n" + "<none>: run the tests like normal\n" + "'list' to list test names instead of running them.\n");

/* glob_match() needs NULL terminated strings, so we need a copy of filter_glob_param. */ struct kunit_test_filter { @@ -196,9 +203,35 @@ static void kunit_print_tap_header(struct suite_set *suite_set) pr_info("1..%d\n", num_of_suites); }

-int kunit_run_all_tests(void) +static void kunit_exec_run_tests(struct suite_set *suite_set) { struct kunit_suite * const * const *suites; + + kunit_print_tap_header(suite_set); + + for (suites = suite_set->start; suites < suite_set->end; suites++) + __kunit_test_suites_init(*suites); +} + +static void kunit_exec_list_tests(struct suite_set *suite_set) +{ + unsigned int i; + struct kunit_suite * const * const *suites; + struct kunit_case *test_case; + + /* Hack: print a tap header so kunit.py can find the start of KUnit output. */ + kunit_print_tap_header(suite_set); + + for (suites = suite_set->start; suites < suite_set->end; suites++) + for (i = 0; (*suites)[i] != NULL; i++) { + kunit_suite_for_each_test_case((*suites)[i], test_case) { + pr_info("%s.%s\n", (*suites)[i]->name, test_case->name); + } + } +} + +int kunit_run_all_tests(void) +{ struct suite_set suite_set = { .start = __kunit_suites_start, .end = __kunit_suites_end, @@ -207,10 +240,12 @@ int kunit_run_all_tests(void) if (filter_glob_param) suite_set = kunit_filter_suites(&suite_set, filter_glob_param);

- kunit_print_tap_header(&suite_set); - - for (suites = suite_set.start; suites < suite_set.end; suites++) - __kunit_test_suites_init(*suites); + if (!action_param) + kunit_exec_run_tests(&suite_set); + else if (strcmp(action_param, "list") == 0) + kunit_exec_list_tests(&suite_set); + else + pr_err("kunit executor: unknown action '%s'\n", action_param);

if (filter_glob_param) { /* a copy was made of each array */ kunit_free_suite_set(suite_set);

-- 2.33.0.685.g46640cef36-goog

David Gow

29 Sep 29 Sep

4:38 a.m.

New subject: [PATCH v2 1/3] kunit: add 'kunit.action' param to allow listing out tests

On Wed, Sep 29, 2021 at 6:29 AM Daniel Latypov dlatypov@google.com wrote:

...

Context: It's difficult to map a given .kunitconfig => set of enabled tests. Letting kunit.py figure that out would be useful.

This patch:

is intended to be an implementation detail used only by kunit.py

adds a kunit.action module param with one valid non-null value, "list"

for the "list" action, it simply prints out "<suite>.<test>"

leaves the kunit.py changes to make use of this for another patch.

Note: kunit.filter_glob is respected for this and all future actions.

Hack: we print a TAP header and fake test plan to allow kunit.py to use the same code to pick up KUnit output that it does for normal tests. Since this is intended to be an implementation detail, it seems fine for now. Maybe in the future we ouptut each test as SKIPPED or the like.

I'm still a little uneasy using the "TAP version 14" header here, and then proceeding to include a list of tests which, in and of itself, isn't valid TAP. I don't think we need to solve this perfectly now: we can always change it if something comes out of, e.g., the KTAP standardisation, but I'd rather we have something -- even something temporary -- which is easily distinguishable from an actual TAP result.

Even if we had "TAP version 14 - test list" or something so that kunit_tool picked up on it without further changes, that'd be fine, though something like "KUnit Test List" would be better.

Also, I'd still rather we lose the "1..1" test suite list, though I can live without if I have to, given that it is actually giving the correct number of suites.

...

From the kernel side, all this should take is replacing the call to

kunit_print_tap_header() with a direct pr_info() call. Maybe there'd need to be some minor kunit_tool changes in patch 3, too, but nothing excessive.

Also, nit: "ouptut" should be "output"

...

Go with a more generic "action" param, since it seems like we might eventually have more modes besides just running or listing tests, e.g.

perhaps a benchmark mode that reruns test cases and reports timing

perhaps a deflake mode that reruns test cases that failed

perhaps a mode where we randomize test order to try and catch hermeticity bugs like "test a only passes if run after test b"

Tested: $ ./tools/testing/kunit/kunit.py run --kernel_arg=kunit.action=list --raw_output=kunit ... TAP version 14 1..1 example.example_simple_test example.example_skip_test example.example_mark_skipped_test reboot: System halted

Signed-off-by: Daniel Latypov dlatypov@google.com

Otherwise, I'm quite happy with this: it works well on my end, and the implementation makes sense.

So this is: Reviewed-by: David Gow davidgow@google.com

(But I'd rather the TAP header bit change if possible...)

-- David

...

lib/kunit/executor.c | 45 +++++++++++++++++++++++++++++++++++++++----- 1 file changed, 40 insertions(+), 5 deletions(-)

diff --git a/lib/kunit/executor.c b/lib/kunit/executor.c index bab3ab940acc..8b38c91b4fac 100644 --- a/lib/kunit/executor.c +++ b/lib/kunit/executor.c @@ -15,9 +15,16 @@ extern struct kunit_suite * const * const __kunit_suites_end[]; #if IS_BUILTIN(CONFIG_KUNIT)

static char *filter_glob_param; +static char *action_param;

module_param_named(filter_glob, filter_glob_param, charp, 0); MODULE_PARM_DESC(filter_glob, "Filter which KUnit test suites/tests run at boot-time, e.g. list* or list*.*del_test"); +module_param_named(action, action_param, charp, 0); +MODULE_PARM_DESC(action,
           "Changes KUnit executor behavior, valid values are:\n"
           "<none>: run the tests like normal\n"
           "'list' to list test names instead of running them.\n");
/* glob_match() needs NULL terminated strings, so we need a copy of filter_glob_param. */ struct kunit_test_filter { @@ -196,9 +203,35 @@ static void kunit_print_tap_header(struct suite_set *suite_set) pr_info("1..%d\n", num_of_suites); }

-int kunit_run_all_tests(void) +static void kunit_exec_run_tests(struct suite_set *suite_set) { struct kunit_suite * const * const *suites;
  kunit_print_tap_header(suite_set);
  for (suites = suite_set->start; suites < suite_set->end; suites++)
          __kunit_test_suites_init(*suites);
+}

+static void kunit_exec_list_tests(struct suite_set *suite_set) +{
  unsigned int i;
  struct kunit_suite * const * const *suites;
  struct kunit_case *test_case;
  /* Hack: print a tap header so kunit.py can find the start of KUnit output. */
  kunit_print_tap_header(suite_set);

As noted, would rather this be something like pr_info("KUnit Test List");

...

  for (suites = suite_set->start; suites < suite_set->end; suites++)

          for (i = 0; (*suites)[i] != NULL; i++) {

                  kunit_suite_for_each_test_case((*suites)[i], test_case) {

                          pr_info("%s.%s\n", (*suites)[i]->name, test_case->name);

```
                  }
```
```
          }
```

+int kunit_run_all_tests(void) +{ struct suite_set suite_set = { .start = __kunit_suites_start, .end = __kunit_suites_end, @@ -207,10 +240,12 @@ int kunit_run_all_tests(void) if (filter_glob_param) suite_set = kunit_filter_suites(&suite_set, filter_glob_param);

```
  kunit_print_tap_header(&suite_set);
```

  for (suites = suite_set.start; suites < suite_set.end; suites++)

          __kunit_test_suites_init(*suites);

```
  if (!action_param)
```

          kunit_exec_run_tests(&suite_set);

  else if (strcmp(action_param, "list") == 0)

          kunit_exec_list_tests(&suite_set);

```
  else
```

          pr_err("kunit executor: unknown action '%s'\n", action_param);

  if (filter_glob_param) { /* a copy was made of each array */
          kunit_free_suite_set(suite_set);

-- 2.33.0.685.g46640cef36-goog

Daniel Latypov

4:55 p.m.

New subject: [PATCH v2 1/3] kunit: add 'kunit.action' param to allow listing out tests

On Tue, Sep 28, 2021 at 9:38 PM David Gow davidgow@google.com wrote:

...

On Wed, Sep 29, 2021 at 6:29 AM Daniel Latypov dlatypov@google.com wrote:

...
Context: It's difficult to map a given .kunitconfig => set of enabled tests. Letting kunit.py figure that out would be useful.

This patch:

is intended to be an implementation detail used only by kunit.py

adds a kunit.action module param with one valid non-null value, "list"

for the "list" action, it simply prints out "<suite>.<test>"

leaves the kunit.py changes to make use of this for another patch.

Note: kunit.filter_glob is respected for this and all future actions.

Hack: we print a TAP header and fake test plan to allow kunit.py to use the same code to pick up KUnit output that it does for normal tests. Since this is intended to be an implementation detail, it seems fine for now. Maybe in the future we ouptut each test as SKIPPED or the like.

I'm still a little uneasy using the "TAP version 14" header here, and then proceeding to include a list of tests which, in and of itself, isn't valid TAP. I don't think we need to solve this perfectly now: we can always change it if something comes out of, e.g., the KTAP standardisation, but I'd rather we have something -- even something temporary -- which is easily distinguishable from an actual TAP result.

Even if we had "TAP version 14 - test list" or something so that

Currently, the regex kunit_parser.py uses is anchored with $. So I'm not able to add anything at the end of the line.

We could potentially add a new line afterwards with "# KUnit Test List" or similar?

I'd prefer not to since I'm not sure it matters _too_ much. Someone would have to go out of their way to print the list of tests, and if so, I'm not sure they should be trying to parse TAP output from it.

...

kunit_tool picked up on it without further changes, that'd be fine, though something like "KUnit Test List" would be better.

Also, I'd still rather we lose the "1..1" test suite list, though I can live without if I have to, given that it is actually giving the correct number of suites.

From the kernel side, all this should take is replacing the call to kunit_print_tap_header() with a direct pr_info() call. Maybe there'd need to be some minor kunit_tool changes in patch 3, too, but nothing excessive.

That's fair. I had mainly just wanted to avoid hard-copying a copy of the TAP version.

I've made the change so it doesn't print the test plan and it works.

The only change needed in patch 3 is diff --git a/tools/testing/kunit/kunit.py b/tools/testing/kunit/kunit.py index e7b92caba53d..ca4f877234c2 100755 --- a/tools/testing/kunit/kunit.py +++ b/tools/testing/kunit/kunit.py @@ -101,8 +101,7 @@ def _list_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest) filter_glob=request.filter_glob, build_dir=request.build_dir) output = kunit_parser.extract_tap_lines(output) - # Hack! Drop the TAP version header and top-level test plan. - output.pop() + # Hack! Drop the dummy TAP version header that the executor prints out. output.pop() return list(output)

...

Also, nit: "ouptut" should be "output"

Ah, fixed.

...

...
Go with a more generic "action" param, since it seems like we might eventually have more modes besides just running or listing tests, e.g.

perhaps a benchmark mode that reruns test cases and reports timing

perhaps a deflake mode that reruns test cases that failed

perhaps a mode where we randomize test order to try and catch hermeticity bugs like "test a only passes if run after test b"

Tested: $ ./tools/testing/kunit/kunit.py run --kernel_arg=kunit.action=list --raw_output=kunit ... TAP version 14 1..1 example.example_simple_test example.example_skip_test example.example_mark_skipped_test reboot: System halted

Signed-off-by: Daniel Latypov dlatypov@google.com

Otherwise, I'm quite happy with this: it works well on my end, and the implementation makes sense.

So this is: Reviewed-by: David Gow davidgow@google.com

(But I'd rather the TAP header bit change if possible...)

-- David

...
lib/kunit/executor.c | 45 +++++++++++++++++++++++++++++++++++++++----- 1 file changed, 40 insertions(+), 5 deletions(-)

diff --git a/lib/kunit/executor.c b/lib/kunit/executor.c index bab3ab940acc..8b38c91b4fac 100644 --- a/lib/kunit/executor.c +++ b/lib/kunit/executor.c @@ -15,9 +15,16 @@ extern struct kunit_suite * const * const __kunit_suites_end[]; #if IS_BUILTIN(CONFIG_KUNIT)

static char *filter_glob_param; +static char *action_param;

module_param_named(filter_glob, filter_glob_param, charp, 0); MODULE_PARM_DESC(filter_glob, "Filter which KUnit test suites/tests run at boot-time, e.g. list* or list*.*del_test"); +module_param_named(action, action_param, charp, 0); +MODULE_PARM_DESC(action,
           "Changes KUnit executor behavior, valid values are:\n"
           "<none>: run the tests like normal\n"
           "'list' to list test names instead of running them.\n");
/* glob_match() needs NULL terminated strings, so we need a copy of filter_glob_param. */ struct kunit_test_filter { @@ -196,9 +203,35 @@ static void kunit_print_tap_header(struct suite_set *suite_set) pr_info("1..%d\n", num_of_suites); }

-int kunit_run_all_tests(void) +static void kunit_exec_run_tests(struct suite_set *suite_set) { struct kunit_suite * const * const *suites;
  kunit_print_tap_header(suite_set);
  for (suites = suite_set->start; suites < suite_set->end; suites++)
          __kunit_test_suites_init(*suites);
+}

+static void kunit_exec_list_tests(struct suite_set *suite_set) +{
  unsigned int i;
  struct kunit_suite * const * const *suites;
  struct kunit_case *test_case;
  /* Hack: print a tap header so kunit.py can find the start of KUnit output. */
  kunit_print_tap_header(suite_set);
As noted, would rather this be something like pr_info("KUnit Test List");

...
  for (suites = suite_set->start; suites < suite_set->end; suites++)
          for (i = 0; (*suites)[i] != NULL; i++) {
                  kunit_suite_for_each_test_case((*suites)[i], test_case) {
                          pr_info("%s.%s\n", (*suites)[i]->name, test_case->name);
                  }
          }
+}

+int kunit_run_all_tests(void) +{ struct suite_set suite_set = { .start = __kunit_suites_start, .end = __kunit_suites_end, @@ -207,10 +240,12 @@ int kunit_run_all_tests(void) if (filter_glob_param) suite_set = kunit_filter_suites(&suite_set, filter_glob_param);
  kunit_print_tap_header(&suite_set);
  for (suites = suite_set.start; suites < suite_set.end; suites++)
          __kunit_test_suites_init(*suites);
  if (!action_param)
          kunit_exec_run_tests(&suite_set);
  else if (strcmp(action_param, "list") == 0)
          kunit_exec_list_tests(&suite_set);
  else
          pr_err("kunit executor: unknown action '%s'\n", action_param);

  if (filter_glob_param) { /* a copy was made of each array */
          kunit_free_suite_set(suite_set);
-- 2.33.0.685.g46640cef36-goog

Daniel Latypov

28 Sep 28 Sep

10:29 p.m.

New subject: [PATCH v2 2/3] kunit: tool: factor exec + parse steps into a function

Currently this code is copy-pasted between the normal "run" subcommand and the "exec" subcommand.

Given we don't have any interest in just executing the tests without giving the user any indication what happened (i.e. parsing the output), make a function that does both this things and can be reused.

This will be useful when we allow more complicated ways of running tests, e.g. invoking the kernel multiple times instead of just once, etc.

We remove input_data from the ParseRequest so the callers don't have to pass in a dummy value for this field. Named tuples are also immutable, so if they did pass in a dummy, exec_tests() would need to make a copy to call parse_tests().

Removing it also makes KunitParseRequest match the other *Request types, as they only contain user arguments/flags, not data.

Signed-off-by: Daniel Latypov dlatypov@google.com --- tools/testing/kunit/kunit.py | 44 ++++++++++++++++-------------------- 1 file changed, 19 insertions(+), 25 deletions(-)

diff --git a/tools/testing/kunit/kunit.py b/tools/testing/kunit/kunit.py index 66f67af97971..31eec9f6ecc3 100755 --- a/tools/testing/kunit/kunit.py +++ b/tools/testing/kunit/kunit.py @@ -34,7 +34,7 @@ KunitExecRequest = namedtuple('KunitExecRequest', ['timeout', 'build_dir', 'alltests', 'filter_glob', 'kernel_args']) KunitParseRequest = namedtuple('KunitParseRequest', - ['raw_output', 'input_data', 'build_dir', 'json']) + ['raw_output', 'build_dir', 'json']) KunitRequest = namedtuple('KunitRequest', ['raw_output','timeout', 'jobs', 'build_dir', 'alltests', 'filter_glob', 'kernel_args', 'json', 'make_options']) @@ -91,23 +91,25 @@ def build_tests(linux: kunit_kernel.LinuxSourceTree, 'built kernel successfully', build_end - build_start)

-def exec_tests(linux: kunit_kernel.LinuxSourceTree, - request: KunitExecRequest) -> KunitResult: +def exec_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest, + parse_request: KunitParseRequest) -> KunitResult: kunit_parser.print_with_timestamp('Starting KUnit Kernel ...') test_start = time.time() - result = linux.run_kernel( + run_result = linux.run_kernel( args=request.kernel_args, timeout=None if request.alltests else request.timeout, - filter_glob=request.filter_glob, + filter_glob=request.filter_glob, build_dir=request.build_dir)

test_end = time.time() + exec_time = test_end - test_start

- return KunitResult(KunitStatus.SUCCESS, - result, - test_end - test_start) + # Named tuples are immutable, so we rebuild them here manually + result = parse_tests(parse_request, run_result) + + return KunitResult(status=result.status, result=result.result, elapsed_time=exec_time)

-def parse_tests(request: KunitParseRequest) -> KunitResult: +def parse_tests(request: KunitParseRequest, input_data: Iterable[str]) -> KunitResult: parse_start = time.time()

test_result = kunit_parser.TestResult(kunit_parser.TestStatus.SUCCESS, @@ -115,7 +117,7 @@ def parse_tests(request: KunitParseRequest) -> KunitResult: 'Tests not Parsed.')

if request.raw_output: - output: Iterable[str] = request.input_data + output: Iterable[str] = input_data if request.raw_output == 'all': pass elif request.raw_output == 'kunit': @@ -126,7 +128,7 @@ def parse_tests(request: KunitParseRequest) -> KunitResult: print(line.rstrip())

else: - test_result = kunit_parser.parse_run_tests(request.input_data) + test_result = kunit_parser.parse_run_tests(input_data) parse_end = time.time()

if request.json: @@ -165,15 +167,11 @@ def run_tests(linux: kunit_kernel.LinuxSourceTree, exec_request = KunitExecRequest(request.timeout, request.build_dir, request.alltests, request.filter_glob, request.kernel_args) - exec_result = exec_tests(linux, exec_request) - if exec_result.status != KunitStatus.SUCCESS: - return exec_result - parse_request = KunitParseRequest(request.raw_output, - exec_result.result, request.build_dir, request.json) - parse_result = parse_tests(parse_request) + + exec_result = exec_tests(linux, exec_request, parse_request)

run_end = time.time()

@@ -184,7 +182,7 @@ def run_tests(linux: kunit_kernel.LinuxSourceTree, config_result.elapsed_time, build_result.elapsed_time, exec_result.elapsed_time)) - return parse_result + return exec_result

def add_common_opts(parser) -> None: parser.add_argument('--build_dir', @@ -381,15 +379,12 @@ def main(argv, linux=None): cli_args.alltests, cli_args.filter_glob, cli_args.kernel_args) - exec_result = exec_tests(linux, exec_request) parse_request = KunitParseRequest(cli_args.raw_output, - exec_result.result, cli_args.build_dir, cli_args.json) - result = parse_tests(parse_request) + result = exec_tests(linux, exec_request, parse_request) kunit_parser.print_with_timestamp(( - 'Elapsed time: %.3fs\n') % ( - exec_result.elapsed_time)) + 'Elapsed time: %.3fs\n') % (result.elapsed_time)) if result.status != KunitStatus.SUCCESS: sys.exit(1) elif cli_args.subcommand == 'parse': @@ -399,10 +394,9 @@ def main(argv, linux=None): with open(cli_args.file, 'r') as f: kunit_output = f.read().splitlines() request = KunitParseRequest(cli_args.raw_output, - kunit_output, None, cli_args.json) - result = parse_tests(request) + result = parse_tests(request, kunit_output) if result.status != KunitStatus.SUCCESS: sys.exit(1) else:

-- 2.33.0.685.g46640cef36-goog

David Gow

29 Sep 29 Sep

4:38 a.m.

New subject: [PATCH v2 2/3] kunit: tool: factor exec + parse steps into a function

On Wed, Sep 29, 2021 at 6:29 AM Daniel Latypov dlatypov@google.com wrote:

...

Currently this code is copy-pasted between the normal "run" subcommand and the "exec" subcommand.

Given we don't have any interest in just executing the tests without giving the user any indication what happened (i.e. parsing the output), make a function that does both this things and can be reused.

This will be useful when we allow more complicated ways of running tests, e.g. invoking the kernel multiple times instead of just once, etc.

We remove input_data from the ParseRequest so the callers don't have to pass in a dummy value for this field. Named tuples are also immutable, so if they did pass in a dummy, exec_tests() would need to make a copy to call parse_tests().

Removing it also makes KunitParseRequest match the other *Request types, as they only contain user arguments/flags, not data.

Signed-off-by: Daniel Latypov dlatypov@google.com

Thanks: while I did sort of like parse_tests() being called more explicitly, upon reflection this is much nicer.

Reviewed-by: David Gow davidgow@google.com

-- David

Daniel Latypov

28 Sep 28 Sep

10:29 p.m.

New subject: [PATCH v2 3/3] kunit: tool: support running each suite/test separately

The new --run_isolated flag makes the tool boot the kernel once per suite or test, preventing leftover state from one suite to impact the other. This can be useful as a starting point to debugging test hermeticity issues.

Note: it takes a lot longer, so people should not use it normally.

Consider the following very simplified example:

bool disable_something_for_test = false; void function_being_tested() { ... if (disable_something_for_test) return; ... }

static void test_before(struct kunit *test) { disable_something_for_test = true; function_being_tested(); /* oops, we forgot to reset it back to false */ }

static void test_after(struct kunit *test) { /* oops, now "fixing" test_before can cause test_after to fail! */ function_being_tested(); }

Presented like this, the issues are obvious, but it gets a lot more complicated to track down as the amount of test setup and helper functions increases.

Another use case is memory corruption. It might not be surface as an failure/crash in the test case or suite that caused it. I've noticed in kunit's own unit tests, the 3rd suite after might be the one to finally crash after an out-of-bounds write, for example.

Example usage:

Per suite: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=suite ... Starting KUnit Kernel (1/7)... ============================================================ ======== [PASSED] kunit_executor_test ======== .... Testing complete. 5 tests run. 0 failed. 0 crashed. 0 skipped. Starting KUnit Kernel (2/7)... ============================================================ ======== [PASSED] kunit-try-catch-test ======== ...

Per test: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=test Starting KUnit Kernel (1/23)... ============================================================ ======== [PASSED] kunit_executor_test ======== [PASSED] parse_filter_test ============================================================ Testing complete. 1 tests run. 0 failed. 0 crashed. 0 skipped. Starting KUnit Kernel (2/23)... ============================================================ ======== [PASSED] kunit_executor_test ======== [PASSED] filter_subsuite_test ...

It works with filters as well: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=suite example ... Starting KUnit Kernel (1/1)... ============================================================ ======== [PASSED] example ======== ...

It also handles test filters, '*.*skip*' runs these 3 tests: kunit_status.kunit_status_mark_skipped_test example.example_skip_test example.example_mark_skipped_test

Signed-off-by: Daniel Latypov dlatypov@google.com --- tools/testing/kunit/kunit.py | 91 ++++++++++++++++++++------ tools/testing/kunit/kunit_tool_test.py | 40 +++++++++++ 2 files changed, 112 insertions(+), 19 deletions(-)

diff --git a/tools/testing/kunit/kunit.py b/tools/testing/kunit/kunit.py index 31eec9f6ecc3..e7b92caba53d 100755 --- a/tools/testing/kunit/kunit.py +++ b/tools/testing/kunit/kunit.py @@ -16,7 +16,7 @@ assert sys.version_info >= (3, 7), "Python version is too old"

from collections import namedtuple from enum import Enum, auto -from typing import Iterable +from typing import Iterable, List

import kunit_config import kunit_json @@ -31,13 +31,13 @@ KunitBuildRequest = namedtuple('KunitBuildRequest', ['jobs', 'build_dir', 'alltests', 'make_options']) KunitExecRequest = namedtuple('KunitExecRequest', - ['timeout', 'build_dir', 'alltests', - 'filter_glob', 'kernel_args']) + ['timeout', 'build_dir', 'alltests', + 'filter_glob', 'kernel_args', 'run_isolated']) KunitParseRequest = namedtuple('KunitParseRequest', ['raw_output', 'build_dir', 'json']) KunitRequest = namedtuple('KunitRequest', ['raw_output','timeout', 'jobs', 'build_dir', 'alltests', 'filter_glob', - 'kernel_args', 'json', 'make_options']) + 'kernel_args', 'run_isolated', 'json', 'make_options'])

KernelDirectoryPath = sys.argv[0].split('tools/testing/kunit/')[0]

@@ -91,21 +91,66 @@ def build_tests(linux: kunit_kernel.LinuxSourceTree, 'built kernel successfully', build_end - build_start)

-def exec_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest, - parse_request: KunitParseRequest) -> KunitResult: - kunit_parser.print_with_timestamp('Starting KUnit Kernel ...') - test_start = time.time() - run_result = linux.run_kernel( - args=request.kernel_args, - timeout=None if request.alltests else request.timeout, - filter_glob=request.filter_glob, - build_dir=request.build_dir) +def _list_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest) -> List[str]: + args = ['kunit.action=list'] + if request.kernel_args: + args.extend(request.kernel_args) + + output = linux.run_kernel(args=args, + timeout=None if request.alltests else request.timeout, + filter_glob=request.filter_glob, + build_dir=request.build_dir) + output = kunit_parser.extract_tap_lines(output) + # Hack! Drop the TAP version header and top-level test plan. + output.pop() + output.pop() + return list(output) + +def _suites_from_test_list(tests: List[str]) -> List[str]: + """Extracts all the suites from an ordered list of tests.""" + suites = [] # type: List[str] + for t in tests: + parts = t.split('.', maxsplit=2) + if len(parts) != 2: + raise ValueError(f'internal KUnit error, test name should be of the form "<suite>.<test>", got "{t}"') + suite, case = parts + if not suites or suites[-1] != suite: + suites.append(suite) + return suites +

- test_end = time.time() - exec_time = test_end - test_start

- # Named tuples are immutable, so we rebuild them here manually - result = parse_tests(parse_request, run_result) +def exec_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest, + parse_request: KunitParseRequest) -> KunitResult: + filter_globs = [request.filter_glob] + if request.run_isolated: + tests = _list_tests(linux, request) + if request.run_isolated == 'test': + filter_globs = tests + if request.run_isolated == 'suite': + filter_globs = _suites_from_test_list(tests) + # Apply the test-part of the user's glob, if present. + if '.' in request.filter_glob: + test_glob = request.filter_glob.split('.', maxsplit=2)[1] + filter_globs = [g + '.'+ test_glob for g in filter_globs] + + overall_status = kunit_parser.TestStatus.SUCCESS + exec_time = 0 + for i, filter_glob in enumerate(filter_globs): + kunit_parser.print_with_timestamp('Starting KUnit Kernel ({}/{})...'.format(i+1, len(filter_globs))) + + test_start = time.time() + run_result = linux.run_kernel( + args=request.kernel_args, + timeout=None if request.alltests else request.timeout, + filter_glob=filter_glob, + build_dir=request.build_dir) + + test_end = time.time() + exec_time += test_end - test_start + + result = parse_tests(parse_request, run_result) + overall_status = kunit_parser.max_status(overall_status, result.status)

return KunitResult(status=result.status, result=result.result, elapsed_time=exec_time)

@@ -166,7 +211,7 @@ def run_tests(linux: kunit_kernel.LinuxSourceTree,

exec_request = KunitExecRequest(request.timeout, request.build_dir, request.alltests, request.filter_glob, - request.kernel_args) + request.kernel_args, request.run_isolated) parse_request = KunitParseRequest(request.raw_output, request.build_dir, request.json) @@ -250,6 +295,12 @@ def add_exec_opts(parser) -> None: parser.add_argument('--kernel_args', help='Kernel command-line parameters. Maybe be repeated', action='append') + parser.add_argument('--run_isolated', help='If set, boot the kernel for each ' + 'individual suite/test. This is can be useful for debugging ' + 'a non-hermetic test, one that might pass/fail based on ' + 'what ran before it.', + type=str, + choices=['suite', 'test']),

def add_parse_opts(parser) -> None: parser.add_argument('--raw_output', help='If set don't format output from kernel. ' @@ -323,6 +374,7 @@ def main(argv, linux=None): cli_args.alltests, cli_args.filter_glob, cli_args.kernel_args, + cli_args.run_isolated, cli_args.json, cli_args.make_options) result = run_tests(linux, request) @@ -378,7 +430,8 @@ def main(argv, linux=None): cli_args.build_dir, cli_args.alltests, cli_args.filter_glob, - cli_args.kernel_args) + cli_args.kernel_args, + cli_args.run_isolated) parse_request = KunitParseRequest(cli_args.raw_output, cli_args.build_dir, cli_args.json) diff --git a/tools/testing/kunit/kunit_tool_test.py b/tools/testing/kunit/kunit_tool_test.py index 619c4554cbff..bcfdc6664c9d 100755 --- a/tools/testing/kunit/kunit_tool_test.py +++ b/tools/testing/kunit/kunit_tool_test.py @@ -477,6 +477,46 @@ class KUnitMainTest(unittest.TestCase): args=['a=1','b=2'], build_dir='.kunit', filter_glob='', timeout=300) self.print_mock.assert_any_call(StrContains('Testing complete.'))

+ def test_list_tests(self): + want = ['suite.test1', 'suite.test2', 'suite2.test1'] + self.linux_source_mock.run_kernel.return_value = ['TAP version 14', '1..0'] + want + + got = kunit._list_tests(self.linux_source_mock, + kunit.KunitExecRequest(300, '.kunit', False, 'suite*', None, 'suite')) + + self.assertEqual(got, want) + # Should respect the user's filter glob when listing tests. + self.linux_source_mock.run_kernel.assert_called_once_with( + args=['kunit.action=list'], build_dir='.kunit', filter_glob='suite*', timeout=300) + + + @mock.patch.object(kunit, '_list_tests') + def test_run_isolated_by_suite(self, mock_tests): + mock_tests.return_value = ['suite.test1', 'suite.test2', 'suite2.test1'] + kunit.main(['exec', '--run_isolated=suite', 'suite*.test*'], self.linux_source_mock) + + # Should respect the user's filter glob when listing tests. + mock_tests.assert_called_once_with(mock.ANY, + kunit.KunitExecRequest(300, '.kunit', False, 'suite*.test*', None, 'suite')) + self.linux_source_mock.run_kernel.assert_has_calls([ + mock.call(args=None, build_dir='.kunit', filter_glob='suite.test*', timeout=300), + mock.call(args=None, build_dir='.kunit', filter_glob='suite2.test*', timeout=300), + ]) + + @mock.patch.object(kunit, '_list_tests') + def test_run_isolated_by_test(self, mock_tests): + mock_tests.return_value = ['suite.test1', 'suite.test2', 'suite2.test1'] + kunit.main(['exec', '--run_isolated=test', 'suite*'], self.linux_source_mock) + + # Should respect the user's filter glob when listing tests. + mock_tests.assert_called_once_with(mock.ANY, + kunit.KunitExecRequest(300, '.kunit', False, 'suite*', None, 'test')) + self.linux_source_mock.run_kernel.assert_has_calls([ + mock.call(args=None, build_dir='.kunit', filter_glob='suite.test1', timeout=300), + mock.call(args=None, build_dir='.kunit', filter_glob='suite.test2', timeout=300), + mock.call(args=None, build_dir='.kunit', filter_glob='suite2.test1', timeout=300), + ]) +

if __name__ == '__main__': unittest.main()

-- 2.33.0.685.g46640cef36-goog

David Gow

29 Sep 29 Sep

4:39 a.m.

New subject: [PATCH v2 3/3] kunit: tool: support running each suite/test separately

On Wed, Sep 29, 2021 at 6:29 AM Daniel Latypov dlatypov@google.com wrote:

...

The new --run_isolated flag makes the tool boot the kernel once per suite or test, preventing leftover state from one suite to impact the other. This can be useful as a starting point to debugging test hermeticity issues.

Note: it takes a lot longer, so people should not use it normally.

Consider the following very simplified example:

bool disable_something_for_test = false; void function_being_tested() { ... if (disable_something_for_test) return; ... }

static void test_before(struct kunit *test) { disable_something_for_test = true; function_being_tested(); /* oops, we forgot to reset it back to false */ }

static void test_after(struct kunit *test) { /* oops, now "fixing" test_before can cause test_after to fail! */ function_being_tested(); }

Presented like this, the issues are obvious, but it gets a lot more complicated to track down as the amount of test setup and helper functions increases.

Another use case is memory corruption. It might not be surface as an

Nit: "might not be surfaced" or "might not surface"

...

failure/crash in the test case or suite that caused it. I've noticed in kunit's own unit tests, the 3rd suite after might be the one to finally crash after an out-of-bounds write, for example.

Example usage:

Per suite: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=suite ... Starting KUnit Kernel (1/7)... ============================================================ ======== [PASSED] kunit_executor_test ======== .... Testing complete. 5 tests run. 0 failed. 0 crashed. 0 skipped. Starting KUnit Kernel (2/7)... ============================================================ ======== [PASSED] kunit-try-catch-test ======== ...

Per test: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=test Starting KUnit Kernel (1/23)... ============================================================ ======== [PASSED] kunit_executor_test ======== [PASSED] parse_filter_test ============================================================ Testing complete. 1 tests run. 0 failed. 0 crashed. 0 skipped. Starting KUnit Kernel (2/23)... ============================================================ ======== [PASSED] kunit_executor_test ======== [PASSED] filter_subsuite_test ...

It works with filters as well: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=suite example ... Starting KUnit Kernel (1/1)... ============================================================ ======== [PASSED] example ======== ...

It also handles test filters, '*.*skip*' runs these 3 tests: kunit_status.kunit_status_mark_skipped_test example.example_skip_test example.example_mark_skipped_test

Signed-off-by: Daniel Latypov dlatypov@google.com

Really glad to finally have this feature.

A few minor comments below (mostly mypy issues, though one of them seems to be a real bug). Otherwise, the main concern I have is that there's no final total of the tests run/passed/failed. That may be easier to implement post the parser rework patch, though, so I don't think we should hold this up for it: https://lore.kernel.org/linux-kselftest/20210901190623.315736-1-rmoar@google...

Also, if we changed the TAP header stuff in patch 1, there'd need to be a couple of minor changes here to support it.

Minor issues below aside, this is Reviewed-by: David Gow davidgow@google.com

Cheers, -- David

...

tools/testing/kunit/kunit.py | 91 ++++++++++++++++++++------ tools/testing/kunit/kunit_tool_test.py | 40 +++++++++++ 2 files changed, 112 insertions(+), 19 deletions(-)

diff --git a/tools/testing/kunit/kunit.py b/tools/testing/kunit/kunit.py index 31eec9f6ecc3..e7b92caba53d 100755 --- a/tools/testing/kunit/kunit.py +++ b/tools/testing/kunit/kunit.py @@ -16,7 +16,7 @@ assert sys.version_info >= (3, 7), "Python version is too old"

from collections import namedtuple from enum import Enum, auto -from typing import Iterable +from typing import Iterable, List

import kunit_config import kunit_json @@ -31,13 +31,13 @@ KunitBuildRequest = namedtuple('KunitBuildRequest', ['jobs', 'build_dir', 'alltests', 'make_options']) KunitExecRequest = namedtuple('KunitExecRequest',
                         ['timeout', 'build_dir', 'alltests',
                          'filter_glob', 'kernel_args'])
                        ['timeout', 'build_dir', 'alltests',
                         'filter_glob', 'kernel_args', 'run_isolated'])
KunitParseRequest = namedtuple('KunitParseRequest', ['raw_output', 'build_dir', 'json']) KunitRequest = namedtuple('KunitRequest', ['raw_output','timeout', 'jobs', 'build_dir', 'alltests', 'filter_glob',
                                     'kernel_args', 'json', 'make_options'])
                                     'kernel_args', 'run_isolated', 'json', 'make_options'])
KernelDirectoryPath = sys.argv[0].split('tools/testing/kunit/')[0]

@@ -91,21 +91,66 @@ def build_tests(linux: kunit_kernel.LinuxSourceTree, 'built kernel successfully', build_end - build_start)

-def exec_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest,
         parse_request: KunitParseRequest) -> KunitResult:
  kunit_parser.print_with_timestamp('Starting KUnit Kernel ...')
  test_start = time.time()
  run_result = linux.run_kernel(
          args=request.kernel_args,
          timeout=None if request.alltests else request.timeout,
          filter_glob=request.filter_glob,
          build_dir=request.build_dir)
+def _list_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest) -> List[str]:
  args = ['kunit.action=list']
  if request.kernel_args:
          args.extend(request.kernel_args)
  output = linux.run_kernel(args=args,
                     timeout=None if request.alltests else request.timeout,
                     filter_glob=request.filter_glob,
                     build_dir=request.build_dir)
  output = kunit_parser.extract_tap_lines(output)

mypy gives an error here: tools/testing/kunit/kunit.py:103: error: Incompatible types in assignment (expression has type "LineStream", variable has type "Iterator[str]") tools/testing/kunit/kunit.py:103: note: 'LineStream' is missing following 'Iterator' protocol member: tools/testing/kunit/kunit.py:103: note: __next__

...

  # Hack! Drop the TAP version header and top-level test plan.

```
  output.pop()
```
```
  output.pop()
```

Similarly, mypy is complaining that: tools/testing/kunit/kunit.py:105: error: "Iterator[str]" has no attribute "pop" tools/testing/kunit/kunit.py:106: error: "Iterator[str]" has no attribute "pop"

Also, we could get rid of one of these output.pop() lines if we removed the test plan in Patch 1.

...

```
  return list(output)
```

+def _suites_from_test_list(tests: List[str]) -> List[str]:

  """Extracts all the suites from an ordered list of tests."""

```
  suites = []  # type: List[str]
```
```
  for t in tests:
```

          parts = t.split('.', maxsplit=2)

```
          if len(parts) != 2:
```

                  raise ValueError(f'internal KUnit error, test name should be of the form "<suite>.<test>", got "{t}"')

```
          suite, case = parts
```

          if not suites or suites[-1] != suite:

```
                  suites.append(suite)
```
```
  return suites
```

```
  test_end = time.time()
```
```
  exec_time = test_end - test_start
```

  # Named tuples are immutable, so we rebuild them here manually

  result = parse_tests(parse_request, run_result)

+def exec_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest,

         parse_request: KunitParseRequest) -> KunitResult:

```
  filter_globs = [request.filter_glob]
```
```
  if request.run_isolated:
```

          tests = _list_tests(linux, request)

          if request.run_isolated == 'test':

```
                  filter_globs = tests
```

          if request.run_isolated == 'suite':

                  filter_globs = _suites_from_test_list(tests)

                  # Apply the test-part of the user's glob, if present.

                  if '.' in request.filter_glob:

                          test_glob = request.filter_glob.split('.', maxsplit=2)[1]

                          filter_globs = [g + '.'+ test_glob for g in filter_globs]

  overall_status = kunit_parser.TestStatus.SUCCESS

```
  exec_time = 0
```

This doesn't seem to work. My test run, for example, gave: Elapsed time: 71.706s total, 0.001s configuring, 3.241s building, 0.001s running

The 0.001s running time seems way too small here.

Digging into this, it might be related to this mypy warning: tools/testing/kunit/kunit.py:150: error: Incompatible types in assignment (expression has type "float", variable has type "int")

Should it be exec_time = 0.0 or similar?

...

  for i, filter_glob in enumerate(filter_globs):

          kunit_parser.print_with_timestamp('Starting KUnit Kernel ({}/{})...'.format(i+1, len(filter_globs)))

```
          test_start = time.time()
```

          run_result = linux.run_kernel(

                  args=request.kernel_args,

                  timeout=None if request.alltests else request.timeout,

                  filter_glob=filter_glob,

                  build_dir=request.build_dir)

```
          test_end = time.time()
```

          exec_time += test_end - test_start

          result = parse_tests(parse_request, run_result)

          overall_status = kunit_parser.max_status(overall_status, result.status)

  return KunitResult(status=result.status, result=result.result, elapsed_time=exec_time)

@@ -166,7 +211,7 @@ def run_tests(linux: kunit_kernel.LinuxSourceTree,

    exec_request = KunitExecRequest(request.timeout, request.build_dir,
                             request.alltests, request.filter_glob,

                           request.kernel_args)

                           request.kernel_args, request.run_isolated)
  parse_request = KunitParseRequest(request.raw_output,
                                    request.build_dir,
                                    request.json)

@@ -250,6 +295,12 @@ def add_exec_opts(parser) -> None: parser.add_argument('--kernel_args', help='Kernel command-line parameters. Maybe be repeated', action='append')

  parser.add_argument('--run_isolated', help='If set, boot the kernel for each '

                      'individual suite/test. This is can be useful for debugging '

                      'a non-hermetic test, one that might pass/fail based on '

                      'what ran before it.',

```
                      type=str,
```

                      choices=['suite', 'test']),

                                 cli_args.run_isolated,
                                 cli_args.json,
                                 cli_args.make_options)
          result = run_tests(linux, request)

@@ -378,7 +430,8 @@ def main(argv, linux=None): cli_args.build_dir, cli_args.alltests, cli_args.filter_glob,

                                          cli_args.kernel_args)

                                          cli_args.kernel_args,

                                          cli_args.run_isolated)
          parse_request = KunitParseRequest(cli_args.raw_output,
                                            cli_args.build_dir,
                                            cli_args.json)

diff --git a/tools/testing/kunit/kunit_tool_test.py b/tools/testing/kunit/kunit_tool_test.py index 619c4554cbff..bcfdc6664c9d 100755 --- a/tools/testing/kunit/kunit_tool_test.py +++ b/tools/testing/kunit/kunit_tool_test.py @@ -477,6 +477,46 @@ class KUnitMainTest(unittest.TestCase): args=['a=1','b=2'], build_dir='.kunit', filter_glob='', timeout=300) self.print_mock.assert_any_call(StrContains('Testing complete.'))

```
  def test_list_tests(self):
```

          want = ['suite.test1', 'suite.test2', 'suite2.test1']

          self.linux_source_mock.run_kernel.return_value = ['TAP version 14', '1..0'] + want

          got = kunit._list_tests(self.linux_source_mock,

                               kunit.KunitExecRequest(300, '.kunit', False, 'suite*', None, 'suite'))

```
          self.assertEqual(got, want)
```

          # Should respect the user's filter glob when listing tests.

          self.linux_source_mock.run_kernel.assert_called_once_with(

                  args=['kunit.action=list'], build_dir='.kunit', filter_glob='suite*', timeout=300)

  @mock.patch.object(kunit, '_list_tests')

  def test_run_isolated_by_suite(self, mock_tests):

          mock_tests.return_value = ['suite.test1', 'suite.test2', 'suite2.test1']

          kunit.main(['exec', '--run_isolated=suite', 'suite*.test*'], self.linux_source_mock)

          # Should respect the user's filter glob when listing tests.

          mock_tests.assert_called_once_with(mock.ANY,

                               kunit.KunitExecRequest(300, '.kunit', False, 'suite*.test*', None, 'suite'))

          self.linux_source_mock.run_kernel.assert_has_calls([

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite.test*', timeout=300),

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite2.test*', timeout=300),

```
          ])
```

  @mock.patch.object(kunit, '_list_tests')

  def test_run_isolated_by_test(self, mock_tests):

          mock_tests.return_value = ['suite.test1', 'suite.test2', 'suite2.test1']

          kunit.main(['exec', '--run_isolated=test', 'suite*'], self.linux_source_mock)

          # Should respect the user's filter glob when listing tests.

          mock_tests.assert_called_once_with(mock.ANY,

                               kunit.KunitExecRequest(300, '.kunit', False, 'suite*', None, 'test'))

          self.linux_source_mock.run_kernel.assert_has_calls([

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite.test1', timeout=300),

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite.test2', timeout=300),

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite2.test1', timeout=300),

```
          ])
```

if __name__ == '__main__': unittest.main() -- 2.33.0.685.g46640cef36-goog

Daniel Latypov

7:22 p.m.

New subject: [PATCH v2 3/3] kunit: tool: support running each suite/test separately

...On Tue, Sep 28, 2021 at 9:39 PM David Gow davidgow@google.com wrote:

...

On Wed, Sep 29, 2021 at 6:29 AM Daniel Latypov dlatypov@google.com wrote:

...
The new --run_isolated flag makes the tool boot the kernel once per suite or test, preventing leftover state from one suite to impact the other. This can be useful as a starting point to debugging test hermeticity issues.

Note: it takes a lot longer, so people should not use it normally.

Consider the following very simplified example:

bool disable_something_for_test = false; void function_being_tested() { ... if (disable_something_for_test) return; ... }

static void test_before(struct kunit *test) { disable_something_for_test = true; function_being_tested(); /* oops, we forgot to reset it back to false */ }

static void test_after(struct kunit *test) { /* oops, now "fixing" test_before can cause test_after to fail! */ function_being_tested(); }

Presented like this, the issues are obvious, but it gets a lot more complicated to track down as the amount of test setup and helper functions increases.

Another use case is memory corruption. It might not be surface as an

Nit: "might not be surfaced" or "might not surface"

Fixed: might not be surface as an => might not be surfaced as a

...

...
failure/crash in the test case or suite that caused it. I've noticed in kunit's own unit tests, the 3rd suite after might be the one to finally crash after an out-of-bounds write, for example.

Example usage:

Per suite: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=suite ... Starting KUnit Kernel (1/7)... ============================================================ ======== [PASSED] kunit_executor_test ======== .... Testing complete. 5 tests run. 0 failed. 0 crashed. 0 skipped. Starting KUnit Kernel (2/7)... ============================================================ ======== [PASSED] kunit-try-catch-test ======== ...

Per test: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=test Starting KUnit Kernel (1/23)... ============================================================ ======== [PASSED] kunit_executor_test ======== [PASSED] parse_filter_test ============================================================ Testing complete. 1 tests run. 0 failed. 0 crashed. 0 skipped. Starting KUnit Kernel (2/23)... ============================================================ ======== [PASSED] kunit_executor_test ======== [PASSED] filter_subsuite_test ...

It works with filters as well: $ ./tools/testing/kunit/kunit.py run --kunitconfig=lib/kunit --run_isolated=suite example ... Starting KUnit Kernel (1/1)... ============================================================ ======== [PASSED] example ======== ...

It also handles test filters, '*.*skip*' runs these 3 tests: kunit_status.kunit_status_mark_skipped_test example.example_skip_test example.example_mark_skipped_test

Signed-off-by: Daniel Latypov dlatypov@google.com

Really glad to finally have this feature.

A few minor comments below (mostly mypy issues, though one of them seems to be a real bug). Otherwise, the main concern I have is that there's no final total of the tests run/passed/failed. That may be easier to implement post the parser rework patch, though, so I don't think we should hold this up for it: https://lore.kernel.org/linux-kselftest/20210901190623.315736-1-rmoar@google...

Also, if we changed the TAP header stuff in patch 1, there'd need to be a couple of minor changes here to support it.

Yeah, I've applied the changes locally.

...

Minor issues below aside, this is Reviewed-by: David Gow davidgow@google.com

Cheers, -- David

...
tools/testing/kunit/kunit.py | 91 ++++++++++++++++++++------ tools/testing/kunit/kunit_tool_test.py | 40 +++++++++++ 2 files changed, 112 insertions(+), 19 deletions(-)

diff --git a/tools/testing/kunit/kunit.py b/tools/testing/kunit/kunit.py index 31eec9f6ecc3..e7b92caba53d 100755 --- a/tools/testing/kunit/kunit.py +++ b/tools/testing/kunit/kunit.py @@ -16,7 +16,7 @@ assert sys.version_info >= (3, 7), "Python version is too old"

from collections import namedtuple from enum import Enum, auto -from typing import Iterable +from typing import Iterable, List

import kunit_config import kunit_json @@ -31,13 +31,13 @@ KunitBuildRequest = namedtuple('KunitBuildRequest', ['jobs', 'build_dir', 'alltests', 'make_options']) KunitExecRequest = namedtuple('KunitExecRequest',
                         ['timeout', 'build_dir', 'alltests',
                          'filter_glob', 'kernel_args'])
                        ['timeout', 'build_dir', 'alltests',
                         'filter_glob', 'kernel_args', 'run_isolated'])
KunitParseRequest = namedtuple('KunitParseRequest', ['raw_output', 'build_dir', 'json']) KunitRequest = namedtuple('KunitRequest', ['raw_output','timeout', 'jobs', 'build_dir', 'alltests', 'filter_glob',
                                     'kernel_args', 'json', 'make_options'])
                                     'kernel_args', 'run_isolated', 'json', 'make_options'])
KernelDirectoryPath = sys.argv[0].split('tools/testing/kunit/')[0]

@@ -91,21 +91,66 @@ def build_tests(linux: kunit_kernel.LinuxSourceTree, 'built kernel successfully', build_end - build_start)

-def exec_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest,
         parse_request: KunitParseRequest) -> KunitResult:
  kunit_parser.print_with_timestamp('Starting KUnit Kernel ...')
  test_start = time.time()
  run_result = linux.run_kernel(
          args=request.kernel_args,
          timeout=None if request.alltests else request.timeout,
          filter_glob=request.filter_glob,
          build_dir=request.build_dir)
+def _list_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest) -> List[str]:
  args = ['kunit.action=list']
  if request.kernel_args:
          args.extend(request.kernel_args)
  output = linux.run_kernel(args=args,
                     timeout=None if request.alltests else request.timeout,
                     filter_glob=request.filter_glob,
                     build_dir=request.build_dir)
  output = kunit_parser.extract_tap_lines(output)
mypy gives an error here: tools/testing/kunit/kunit.py:103: error: Incompatible types in assignment (expression has type "LineStream", variable has type "Iterator[str]") tools/testing/kunit/kunit.py:103: note: 'LineStream' is missing following 'Iterator' protocol member: tools/testing/kunit/kunit.py:103: note: __next__

Not an issue in the sense that it doesn't cause any runtime issues. I've fixed it by introducing a new variable lines = kunit_parser.extract_tap_lines(output)

...

...
  # Hack! Drop the TAP version header and top-level test plan.
  output.pop()
  output.pop()
Similarly, mypy is complaining that: tools/testing/kunit/kunit.py:105: error: "Iterator[str]" has no attribute "pop" tools/testing/kunit/kunit.py:106: error: "Iterator[str]" has no attribute "pop"

Also, we could get rid of one of these output.pop() lines if we removed the test plan in Patch 1.

Yeah, dropped one of the pops since the test plan is gone. That should be all we need.

The mypy error goes away when using a new variable since it'll realize we're dealing with a LineStream that does have a pop() func.

...

```
  return list(output)
```

+def _suites_from_test_list(tests: List[str]) -> List[str]:

  """Extracts all the suites from an ordered list of tests."""

```
  suites = []  # type: List[str]
```
```
  for t in tests:
```

          parts = t.split('.', maxsplit=2)

```
          if len(parts) != 2:
```

                  raise ValueError(f'internal KUnit error, test name should be of the form "<suite>.<test>", got "{t}"')

```
          suite, case = parts
```

          if not suites or suites[-1] != suite:

```
                  suites.append(suite)
```
```
  return suites
```

```
  test_end = time.time()
```
```
  exec_time = test_end - test_start
```

  # Named tuples are immutable, so we rebuild them here manually

  result = parse_tests(parse_request, run_result)

+def exec_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest,

         parse_request: KunitParseRequest) -> KunitResult:

```
  filter_globs = [request.filter_glob]
```
```
  if request.run_isolated:
```

          tests = _list_tests(linux, request)

          if request.run_isolated == 'test':

```
                  filter_globs = tests
```

          if request.run_isolated == 'suite':

                  filter_globs = _suites_from_test_list(tests)

                  # Apply the test-part of the user's glob, if present.

                  if '.' in request.filter_glob:

                          test_glob = request.filter_glob.split('.', maxsplit=2)[1]

                          filter_globs = [g + '.'+ test_glob for g in filter_globs]

  overall_status = kunit_parser.TestStatus.SUCCESS

```
  exec_time = 0
```

This doesn't seem to work. My test run, for example, gave: Elapsed time: 71.706s total, 0.001s configuring, 3.241s building, 0.001s running

The 0.001s running time seems way too small here.

Digging into this, it might be related to this mypy warning: tools/testing/kunit/kunit.py:150: error: Incompatible types in assignment (expression has type "float", variable has type "int")

Should it be exec_time = 0.0 or similar?

It should be 0.0 to be more clear about the type and make mypy happy.

But that doesn't "fix" the low runtime. That's actually a pre-existing bug in kunit.py.

The problem is that `run_kernel()` doesn't actually block until the kernel exits. run_kernel() just gives back an iterator.

If I do run_result = list(linux.run_kernel(...)), that forces it to block until the end. That gives a more reasonable time breakdown: Elapsed time: 14.969s total, 0.001s configuring, 3.012s building, 8.927s running

But doing so makes it impossible for us to parse results as they come in. So I can instead do

diff --git a/tools/testing/kunit/kunit.py b/tools/testing/kunit/kunit.py index 3a3195d3c87d..59ef30b39602 100755 --- a/tools/testing/kunit/kunit.py +++ b/tools/testing/kunit/kunit.py @@ -145,10 +145,10 @@ def exec_tests(linux: kunit_kernel.LinuxSourceTree, request: KunitExecRequest, filter_glob=filter_glob, build_dir=request.build_dir)

+ result = parse_tests(parse_request, run_result) test_end = time.time() exec_time += test_end - test_start

- result = parse_tests(parse_request, run_result) overall_status = kunit_parser.max_status(overall_status, result.status)

return KunitResult(status=result.status, result=result.result, elapsed_time=exec_time)

and that'll produce a more realistic number as well. But it'll factor in the parsing overhead into execution time.

...

  for i, filter_glob in enumerate(filter_globs):

          kunit_parser.print_with_timestamp('Starting KUnit Kernel ({}/{})...'.format(i+1, len(filter_globs)))

```
          test_start = time.time()
```

          run_result = linux.run_kernel(

                  args=request.kernel_args,

                  timeout=None if request.alltests else request.timeout,

                  filter_glob=filter_glob,

                  build_dir=request.build_dir)

```
          test_end = time.time()
```

          exec_time += test_end - test_start

          result = parse_tests(parse_request, run_result)

          overall_status = kunit_parser.max_status(overall_status, result.status)

  return KunitResult(status=result.status, result=result.result, elapsed_time=exec_time)

@@ -166,7 +211,7 @@ def run_tests(linux: kunit_kernel.LinuxSourceTree,

    exec_request = KunitExecRequest(request.timeout, request.build_dir,
                             request.alltests, request.filter_glob,

                           request.kernel_args)

                           request.kernel_args, request.run_isolated)
  parse_request = KunitParseRequest(request.raw_output,
                                    request.build_dir,
                                    request.json)

@@ -250,6 +295,12 @@ def add_exec_opts(parser) -> None: parser.add_argument('--kernel_args', help='Kernel command-line parameters. Maybe be repeated', action='append')

  parser.add_argument('--run_isolated', help='If set, boot the kernel for each '

                      'individual suite/test. This is can be useful for debugging '

                      'a non-hermetic test, one that might pass/fail based on '

                      'what ran before it.',

```
                      type=str,
```

                      choices=['suite', 'test']),

                                 cli_args.run_isolated,
                                 cli_args.json,
                                 cli_args.make_options)
          result = run_tests(linux, request)

@@ -378,7 +430,8 @@ def main(argv, linux=None): cli_args.build_dir, cli_args.alltests, cli_args.filter_glob,

                                          cli_args.kernel_args)

                                          cli_args.kernel_args,

                                          cli_args.run_isolated)
          parse_request = KunitParseRequest(cli_args.raw_output,
                                            cli_args.build_dir,
                                            cli_args.json)

```
  def test_list_tests(self):
```

          want = ['suite.test1', 'suite.test2', 'suite2.test1']

          self.linux_source_mock.run_kernel.return_value = ['TAP version 14', '1..0'] + want

          got = kunit._list_tests(self.linux_source_mock,

                               kunit.KunitExecRequest(300, '.kunit', False, 'suite*', None, 'suite'))

```
          self.assertEqual(got, want)
```

          # Should respect the user's filter glob when listing tests.

          self.linux_source_mock.run_kernel.assert_called_once_with(

                  args=['kunit.action=list'], build_dir='.kunit', filter_glob='suite*', timeout=300)

  @mock.patch.object(kunit, '_list_tests')

  def test_run_isolated_by_suite(self, mock_tests):

          mock_tests.return_value = ['suite.test1', 'suite.test2', 'suite2.test1']

          kunit.main(['exec', '--run_isolated=suite', 'suite*.test*'], self.linux_source_mock)

          # Should respect the user's filter glob when listing tests.

          mock_tests.assert_called_once_with(mock.ANY,

                               kunit.KunitExecRequest(300, '.kunit', False, 'suite*.test*', None, 'suite'))

          self.linux_source_mock.run_kernel.assert_has_calls([

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite.test*', timeout=300),

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite2.test*', timeout=300),

```
          ])
```

  @mock.patch.object(kunit, '_list_tests')

  def test_run_isolated_by_test(self, mock_tests):

          mock_tests.return_value = ['suite.test1', 'suite.test2', 'suite2.test1']

          kunit.main(['exec', '--run_isolated=test', 'suite*'], self.linux_source_mock)

          # Should respect the user's filter glob when listing tests.

          mock_tests.assert_called_once_with(mock.ANY,

                               kunit.KunitExecRequest(300, '.kunit', False, 'suite*', None, 'test'))

          self.linux_source_mock.run_kernel.assert_has_calls([

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite.test1', timeout=300),

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite.test2', timeout=300),

                  mock.call(args=None, build_dir='.kunit', filter_glob='suite2.test1', timeout=300),

```
          ])
```

if __name__ == '__main__': unittest.main() -- 2.33.0.685.g46640cef36-goog

1494

days inactive

1495

days old

linux-kselftest-mirror@lists.linaro.org

8 comments

participants

tags (0)

participants (2)

Daniel Latypov
David Gow