This and the two previous patches make things easier, but I still cannot get the expected results.
For single-iteration, two-thread cases such as [1], it works as expected. For cases with more than
one iteration, such as the two-iteration case [2], the results are not what I expected. Logs are in [3].