Progress (combines two weeks):
* UM-2 [QEMU upstream maintainership]
- investigated various failures of avocado tests on s390x host.
These are at least four different unrelated endianness bugs in
different device, board or CPU models. Sent out a set of patches
fixing at most of these.
- TCWG / Arm syncup meeting
- Linaro Connect
- caught up on code review and sent a target-arm pull request
- sent patch fixing a bug where our PNG screenshots have the pixel
colour channels in the wrong order
- while I had a conversation from Connect fresh in my mind, went
back and re-looked at the page-table-walk bug with handling the
NSW/SW bits that I sent patches for a couple of weeks ago. Re-did
the code to consistently handle these bits in the right place and
sent out another version of the patchset
- reviewed a very large patchset from RTH that changes how
TCG handles atomic accesses so we can implement FEAT_LSE2
- investigated a bug report about a regression with AArch32 EL1
under an AArch64 EL2; wrote patch
-- PMM
Hello,
# TCWG Infrastructure
- Reviewed a bunch of Gerrit requests for our CI scripts.
- Learning Jenkins job builder syntax and working on a job definition to
run fast_check_{gcc,gdb} jobs on Gerrit review requests for
jenkins-scripts and ABE.
- Changed CI scripts to save ABE and component testsuite logs when the ABE
build or check step fails. This should help debug a case where a GDB CI
job failed due to (probably) a corruption of the gdb.sum testsuite
result file.
# Misc
- Some preparations for the trip to Linaro Connect.
--
Thiago
Progress:
* UM-2 [QEMU upstream maintainership]
- sent out v3 of the 'deprecate singlestep option' patchset
- sorted out the draft QEMU Summit invite list and invitation text
- spent some time trying to understand how the imx SoCs arrange
their ethernet controllers and PHYs so I can review a patchset
- sent patch fixing minor configure nit where it wasn't honouring
--cross-prefix for the ObjC compiler
- got the 8.0 release out of the door and handed over merge handling
to rth
- sent some patchsets updating a few old .txt files to reStructuredText
- still thinking about our page-table-walk code -- I'm not convinced
it's doing the right thing with NSW/SW bits, even after last
week's patchset...
-- PMM
Hello,
# TCWG Infrastructure
- Fixed problem in ABE with MinGW build of GDB's master branch.
- Fixed problem in upstream GDB with MinGW build of GDB's master branch.
- Analysed backlog of failed check_gdb jobs.
- A couple of "misbisected" commits, including one pointing to a glibc commit.
- One detected regression also noticed by Christophe, and fixed by the community.
- One Arm failure that passes on QEMU TCG.
- Reviewed a number of Gerrit requests about improving ABE and CI scripts.
--
Thiago
Progress:
* UM-2 [QEMU upstream maintainership]
* trying to get a release out of the door -- due among other things
to the Easter holiday weekend, rc4 is a bit late and has rather
more changes than I would prefer
* more triaging of Coverity issues
* sent a few trivial bug fix patches
* started reviewing and queueing arm patches for 8.1
* investigated a FEAT_SEL2 bug that showed up with some proposed
changes to TF-A/Hafnium. The architecture and QEMU's code
are pretty hairy in this area but I think I have identified
the correct fix...
-- PMM
Hello,
[GNU-796] Stabilize GDB testsuite results in the CI
- Submitted to GDB mailing list and committed patches fixing the
gdb.asm/asm-source.exp regression found by the check_gdb job. Turned out
to be a small bug in the stabs debug format support.
- Committed Gerrit review request disabling Python on MinGW cross builds.
[GNU-767] Support changing SVE vector length in remote debugging
- Investigating a follow-up question Simon asked during review of the
patch series fixing a corner case when reading auxv from multi-threaded
inferiors.
--
Thiago
Progress:
* UM-2 [QEMU upstream maintainership]
* rc3 this week
* reworked my "deprecate the misleading -singlestep option"
patchset to deal with review feedback, sent out v2
* KVM Forum programme committee work
* investigated a regression where the mps3-an547 board
asserts on startup; for 8.0 we will just revert the change
that introduced the failure, since it was only an optimization
* some triage of Coverity issue reports; sent some patches for
some easy ones, queried original code submitter for others
-- PMM
Hi Alexandros,
Linaro benchmarking CI flagged this patch. After it clang seems to miscompile 602.gcc_s from SPEC CPU2017 for "-O3 -flto" on aarch64-linux-gnu. Also, it appears that 600.perlbench_s slows down by 9%.
Could you investigate, please? Let me know if you need any assistance in reproducing the problem.
Our Benchmarking CI is still in active development, and there are false positives, but this report seems to be legit [1].
Kind regards,
[1] https://ci.linaro.org/job/tcwg_bmk-code_speed-cpu2017speed--llvm-aarch64-ma…
--
Maxim Kuvyrkov
https://www.linaro.org
Hello,
[GNU-796] Stabilize GDB testsuite results in the CI
- Committed ABE change suggested by Maxim to mention failures that cause
additional testsuite runs.
- Tried to update Python MinGW build to 3.x, but it's a rabbit hole.
Proposing to disable Python support when cross-building GDB with MinGW
instead.
- First regression detected by the tcwg_gnu_native_check_gdb CI job
reported upstream¹. Started investigating it.
[GNU-767] Support changing SVE vector length in remote debugging
- Created a testcase to exercise the corner case mentioned by Pedro.
Found out that GDB is also susceptible to it. Fixed both GDB and
gdbserver. Posted patch series upstream².
--
Thiago
¹ https://inbox.sourceware.org/gdb-patches/871ql6dhf4.fsf@linaro.org/
² https://inbox.sourceware.org/gdb-patches/20230331034432.3037148-1-thiago.ba…
Progress (short week, holiday and recovering from covid...):
* UM-2 [QEMU upstream maintainership]
* 8.0 release related work rumbles on
* debugged and fixed a regression caused by my work on HSTR_EL2 traps
* QEMU-530 [QEMU ARM v9.4 Baseline CPU for TCG]
* implemented FEAT_PAN3 (and fixed a minor bug in
syndrome reporting that I noticed in the process)
-- PMM
Hello,
[GNU-796] Stabilize GDB testsuite results in the CI:
- Enabled gdb.gdb/unittest.exp in fast_check_gdb job. It should be
stable now that tcwg-jade-02's kernel has been upgraded.
- Enabled bisections for check_gdb jobs.
- Implemented Maxim's idea to handle GDB testsuite's tests that can FAIL
but are silent when they pass (i.e., the "gdb_test -nopass"
statements).
[GNU-767] Support changing SVE vector length in remote debugging
- Back in February I upstreamed a couple of preparation patches and
Pedro noticed an unintended change in behaviour that can affect a
corner case, so I am working on a fix for that now.
--
Thiago
Hello,
[GNU-796] Stabilize GDB testsuite results in the CI:
- Finished implementing ABE's support for rerunning failed tests using
Maxim's idea of leveraging validate_failures.py to determine which
tests need to be rerun. Submitted v3 and v4 versions, and committed
v4.
- Investigated why tcwg-abet-tested jobs have been failing in the CI.
Found out that Jenkins sets a bogus core.hooksPath in the git repo
config. Submitted and merged a gerrit request fixing the problem.
- Investigated why cross-build gdbserver needs GMP and MPFR (and is thus
failing to build) in tcwg-gnu-build jobs. Submitted Gerrit review
request to fix it.
- Started looking into why re-enabling GDB testsuite parallelism makes
it run fewer tests.
--
Thiago
Progress:
* UM-2 [QEMU upstream maintainership]
- went through and made estimates for a pile of JIRA tasks
relating to implementing features we're missing to get to ARMv9.4
- diagnosed why --enable-werror wasn't affecting warnings from
the kerneldoc docs generator, and sent a patch fixing it
- some bug triage, looking for issues that ought to be fixed for 8.0
- fixed a division-by-zero bug in the cadence UART model
- fixed a documentation markup mistake that made the docs look odd
- investigating/fixing some more CI failures
- minor bits of travel/conference related admin
-- PMM
Hello,
[GNU-796] Stabilize GDB testsuite results in the CI:
- Submitted and merged a couple of review requests removing obsolete CI
jobs related to release automation and release regression detection
- Finished version of scripts that have most of the logic for rerunning
testsuites in ABE. Submitted as a review request. Addressed review
comments and submitted v2.
- Maxim found out a way to simplify the logic of determining whether a
new testsuite run is warranted by using validate_failures.py, which
I'm now implementing.
--
Thiago
Progress:
* UM-2 [QEMU upstream maintainership]
- Softfreeze was this Tuesday; lots of wrangling of pull requests
- Code review; last arm pullreq before softfreeze
- various admin type bits and pieces
-- PMM
Hello,
[GNU-796] Stabilize GDB testsuite results in the CI:
- Moved most of logic to rerun failed tests from tcwg_gnu-build.sh to
Abe and a new script based on the sum file parser of
compare_dg_tests.pl. The new script creates a "merged" sum file from
all the runs. Currently changing Abe to make use of it.
- Did a few cleanups in compare_dg_tests.pl as I was going through its
code. Also added KFAIL status support to it, which the GDB testsuite
needs.
- Removed a few obsolete CI jobs which came out of the woodwork when I
merged the compare_dg_tests.pl cleanups.
--
Thiago
Project Orko
============
- did [launch presentation] for SOAFEE and LEDGE SC
[launch presentation]
<https://docs.google.com/presentation/d/1CGYIK2W0VSo0kih9ExwfFlkGww7rL5fjMIb…>
Enable Arm Architecture in QEMU
===============================
- bunch of planning for FEAT_GCS, see [QEMU-517] and related
- spoke to Lauterbach on behalf of QC, gave pointers on debug
[QEMU-517] <https://linaro.atlassian.net/browse/QEMU-517>
FEAT_RME, CCA Realms ([QEMU-466])
=================================
- did a bit of review of rth's precursor patches
[QEMU-466] <https://linaro.atlassian.net/browse/QEMU-466>
QEMU Upstream Work ([UM-2])
===========================
- finished up [MR for using locally built QEMU in TuxRun]
- posted [PATCH v4 00/26] gdbstub/next: re-organise and split build
Message-Id: <20230302190846.2593720-1-alex.bennee(a)linaro.org>
- posted [PULL v2 00/24] testing updates (gitlab, cirrus, docker,
avocado, windows) Message-Id:
<20230301151604.1948813-1-alex.bennee(a)linaro.org>
- these help address the CI minutes we are burning through monthly
now
[UM-2] <https://linaro.atlassian.net/browse/UM-2>
[testing/next] <https://github.com/stsquad/qemu/tree/testing/next>
[MR for using locally built QEMU in TuxRun]
<https://gitlab.com/Linaro/tuxrun/-/merge_requests/179>
Completed Reviews [6/6]
=======================
[PATCH v2 0/4] Fix deadlock when dying because of a signal
Message-Id: <20230213125238.331881-1-iii(a)linux.ibm.com>
[PATCH 0/8] hw/arm: Cleanups around QOM style
Message-Id: <20230220115114.25237-1-philmd(a)linaro.org>
[PATCH v2 00/24] hw/ide: QOM/QDev housekeeping
Message-Id: <20230220091358.17038-1-philmd(a)linaro.org>
[PATCH v2] gdbstub: move update guest debug to accel ops
Message-Id: <20230207131721.49233-1-mads(a)ynddal.dk>
[PATCH 0/5] iotests: make meson aware of individual I/O tests
Message-Id: <20230302184606.418541-1-berrange(a)redhat.com>
[PATCH v2] TCG plugin API extension to read guest memory content by an address
Message-Id: <5c50db42136d4a908b261c66b132b043(a)yadro.com>
Other
=====
Absences
========
Current Review Queue
====================
TODO [PATCH 1/6] Add the Android Emulator hypervisor driver (AEHD) accelerator.
Message-Id: <20230303022618.4098825-1-hshan(a)google.com>
======================================================================================================================================
TODO [RFC PATCH v2 00/11] Add stage-2 translation for SMMUv3
Message-Id: <20230226220650.1480786-1-smostafa(a)google.com>
======================================================================================================================
TODO [PATCH v2 00/28] tcg: Simplify temporary usage
Message-Id: <20230222232715.15034-1-richard.henderson(a)linaro.org>
====================================================================================================================
--
Alex Bennée
Virtualisation Tech Lead @ Linaro
Hello,
# [GNU-796] Stabilize GDB testsuite results in the CI
- Cleaned up change in tcwg_gnu-build.sh that reruns failed tests and
created Gerrit review request¹.
- Based on Laurent's and Maxim's suggestions, decided to put the code which
retries the failed tests in Abe, so now I'm working on that.
--
Thiago
¹ https://review.linaro.org/c/toolchain/jenkins-scripts/+/43265
Progress:
* UM-2 [QEMU upstream maintainership]
- Code review: big FEAT_LSE2 support series, 8.3 pointer auth,
gdbstub support for M-profile sysregs, another round of
FEAT_RME, and more
- discussions about how to handle the fact that QEMU has
run out of gitlab CI minutes, and some tweaking of my scripts
to work around the lack of minutes
-- PMM