On Wed, Feb 21, 2024 at 10:22 PM David Gow davidgow@google.com wrote:
On Thu, 22 Feb 2024 at 04:10, 'Justin Stitt' via KUnit Development kunit-dev@googlegroups.com wrote:
Hi,
On Wed, Feb 21, 2024 at 05:27:15PM +0800, David Gow wrote:
The correct format specifier for p - n (both p and n are pointers) is %td, as the type should be ptrdiff_t.
I think %tu is better. d specifies a signed type. I don't doubt that the warning is fixed but I think %tu represents the type semantics here.
While I agree that this should never be negative, I'd still lean on this being a signed type, for two reasons:
- I think, if there's a bug in this code, it's easier to debug this if
a 'negative' value were to appear as such.
- While, as I understand it, the C spec does provide for a
ptrdiff_t-sized unsigned printf specifier in '%tu', the difference between two pointers is always signed:
"When two pointers are subtracted, both shall point to elements of the same array object, or one past the last element of the array object; the result is the difference of the subscripts of the two array elements. The size of the result is implementation-defined, and its type (a signed integer type) is ptrdiff_t defined in the <stddef.h> header"
(Technically, the kernel's ptrdiff_t type isn't defined in stddef.h, so a bit of deviation from the spec is happening anyway, though.)
If there's a particularly good reason to make this unsigned in this case, I'd be happy to change it, of course. But I'd otherwise prefer to keep it as-is.
Copying the line for context, it's about `p-r` where p = memchr_inv(&r[1], 0, sizeof(r) - sizeof(r[0])); `p-r` should never be negative unless something has gone horribly horribly wrong.
So in this particular case, either %tu or %td would be fine.
(sorta bikeshedding warning) But, I'd personally lean towards using the signed %td in tests to guard against typos in test code as _a guiding principle._
This is especially true given that the failure messages aren't verified since they are mostly "dead code." You can have crazy incorrect things going on in the format arguments, see patch 1/9 in this series [1]. One of kunit's own tests would do a read from a ~random memory region if that specific assertion failed. Not a good look ;) We never noticed until this series enabled the format string checks. You also can't expect reviewers to go through and modify every assertion to fail to see what the failure mode looks like, so these kinds of errors will continue to slip through.
*So IMO, we should generally adopt a more defensive stance when it comes to these.*
Also consider the user experience if there is a failure and I accidentally wrote `r-p` here. Someone else sees an error report from this test and needs to investigate.
What message is easier to deal with?
in test 18 at -5 out of bound
or
in test 18 at 18446744073709551611 out of bound
Sure, I can eventually figure out what both messages mean, but it's a immediately obvious from the first that there's a a) real error: something is wrong at index 5 b) test code error: there's a flipped sign somewhere
So I'd strongly prefer the current version of the patch over one with %tu. Reviewed-by: Daniel Latypov dlatypov@google.com
[1] https://lore.kernel.org/linux-kselftest/20240221092728.1281499-2-davidgow@go...
Thanks, Daniel