ubuntu_ltp_* tests unable to finish properly with B-azure-fips
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
ubuntu-kernel-tests | Fix Released | Undecided | Po-Hsu Lin |
Bug Description
In sru-s20240429 and sru-s20240610, the ubuntu_ltp_* tests were unable to finish properly with the B-azure-fips kernel, and eventually triggered a `sut-test` failure.
Here is the result from sru-s20240610
* ubuntu_ltp
- report cuts off at the fs:fs_fill test, failed on Standard_D4_v4 only.
* ubuntu_
- report cuts off at the memcg_test_3 test, failed on Standard_B1ms
- report cuts off at the memcg_stress test, failed on Standard_D4_v4, Standard_
* ubuntu_ltp_cve
- report cuts off at the cve-2016-8655 test, failed on Standard_B1ms, Standard_D4_v4
- report cuts off at the cve-2018-18559 test, failed on Standard_
* ubuntu_ltp_syscalls
- report cuts off at the setsockopt06 test, failed on Standard_B1ms
- report cuts off at the bind06 test, failed on Standard_D4_v4, Standard_
The result from sru-s20240610 is quite similar, except that ubuntu_ltp_cve this time cuts off at the cve-2016-8655 test on Standard_
Note that cve-2016-8655 is actually the setsockopt06 test, and cve-2018-18559 is the bind06 test.
I have done some experiments on Standard_
* ubuntu_
- If we skip the memcg_stress test, the run finishes properly.
* ubuntu_ltp_cve:
- If we skip the cve-2016-8655 and cve-2018-18559 tests, the run finishes properly.
* ubuntu_
- If we skip the bind06 and writev03 tests, the run finishes properly (setsockopt06 works fine in this case, not sure why).
Here is the code to skip a certain test:
diff --git a/ubuntu_
index 4f93c546..684a8ed2 100644
--- a/ubuntu_
+++ b/ubuntu_
@@ -24,6 +24,9 @@ if result == 'GOOD':
# Special case for msgstress04 (lp:1943802 / lp:1943652)
if testcase == 'msgstress04':
+ if testcase in ['bind06', 'writev03'] and platform.release() == '4.15.0-
+ print('skipping bind06 for testing purpose')
+ continue
else:
print("ERROR: test failed to build, skipping all the sub tests")
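The skip condition in the diff above can also be written as a small standalone helper, which makes it easier to try different skip lists. This is only a sketch: `should_skip`, `SKIP_TESTS` and `RELEASE_PREFIX` are hypothetical names, and the prefix match is an assumption that stands in for the exact kernel release string, which is truncated in this report.

```python
import platform


def should_skip(testcase, kernel_release, skip_tests, release_prefix):
    """Return True when `testcase` should be skipped on this kernel.

    `release_prefix` is a hypothetical stand-in for the exact release
    string compared in the diff, which is truncated in this report.
    """
    return testcase in skip_tests and kernel_release.startswith(release_prefix)


SKIP_TESTS = {"bind06", "writev03"}
RELEASE_PREFIX = "4.15.0-"  # assumption: match any 4.15.0 kernel build

for testcase in ["bind06", "writev03", "setsockopt06"]:
    if should_skip(testcase, platform.release(), SKIP_TESTS, RELEASE_PREFIX):
        # Report the test that is actually being skipped.
        print("skipping {} for testing purpose".format(testcase))
        continue
    print("would run {}".format(testcase))
```

Unlike the message in the diff, this prints the name of whichever test is skipped, so writev03 is not reported as bind06.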
With my manual test on Standard_D4_v4 with 4.15.0-
And the setsockopt06 and bind06 tests can pass without any immediate crash. I am not sure what causes the failure we see here.
It's also worth noting that "running something" seems to be limited to commands that keep generating output. Commands like "dmesg -w" and "tail -f /var/log/syslog" will hang too if there is no new output.
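One way to check the observation that only commands producing output survive is to wrap a silent command so that something is printed periodically while it runs. The following is a minimal sketch under that assumption, not something used in the test suite: `run_with_heartbeat` and its parameters are hypothetical names, and whether a heartbeat actually keeps the session alive here is untested.

```python
import subprocess
import sys
import threading


def run_with_heartbeat(cmd, interval=60.0, emit=None):
    """Run `cmd`, calling `emit()` every `interval` seconds until it exits.

    Returns a tuple (exit_code, number_of_heartbeats).
    """
    if emit is None:
        def emit():
            print("[heartbeat] still running", file=sys.stderr, flush=True)

    done = threading.Event()
    beats = 0

    def beat():
        nonlocal beats
        # Event.wait() returns False on timeout and True once `done` is set.
        while not done.wait(interval):
            emit()
            beats += 1

    worker = threading.Thread(target=beat, daemon=True)
    worker.start()
    try:
        exit_code = subprocess.call(cmd)
    finally:
        done.set()
        worker.join()
    return exit_code, beats
```

For example, running `run_with_heartbeat(["sleep", "300"], interval=60)` on the target system over ssh would show whether a 300-second sleep completes once there is periodic output.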
According to Magali, the last bionic fips openssh update is from January, so this might be something else in the kernel.
== Original bug report ==
On azure-fips platforms multiple tests in ubuntu_ltp, ubuntu_
tags: | added: azure bionic fips sru-s20240429 sru-s20240610 ubuntu-ltp ubuntu-ltp-controllers ubuntu-ltp-cve ubuntu-ltp-syscalls |
summary: |
- ubuntu_ltp_* tests completing but causes system to hang
+ ubuntu_ltp_* tests unable to finish properly with B-azure-fips |
description: | updated |
Changed in ubuntu-kernel-tests: | |
assignee: | nobody → Po-Hsu Lin (cypressyew) |
status: | New → In Progress |
Even this simple command, run from the host against the target system, fails:
$ time ssh $USER@$SUT_IP "date; sleep 300 ; date"
Thu Aug 8 15:49:46 UTC 2024
^C
real 7m50.194s
user 0m0.005s
sys 0m0.018s
A 260-second sleep does not work either.
$ time sutssh 52.175.206.46 "date; sleep 260 ; date"
Thu Aug 8 16:06:48 UTC 2024
^C
real 10m8.938s
user 0m0.017s
sys 0m0.000s
A 250-second sleep works.
$ time ssh $USER@$SUT_IP "date; sleep 250 ; date"
Thu Aug 8 16:10:55 UTC 2024
Thu Aug 8 16:15:05 UTC 2024
real 4m11.454s
user 0m0.020s
sys 0m0.002s
Now I wonder whether these cases are really failing, or whether they simply go too long without producing any output.
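The manual probing above (250 seconds works, 260 seconds does not) could be automated with a bisection between a known-good and a known-bad sleep duration. This is a rough sketch only: `ssh_sleep_survives` and `longest_surviving_sleep` are hypothetical helpers, the 60-second grace period is an arbitrary choice, and the search assumes that every duration longer than a failing one also fails.

```python
import subprocess


def ssh_sleep_survives(target, seconds, grace=60):
    """Return True if `ssh target "date; sleep N; date"` prints both dates.

    The run is abandoned `grace` seconds after the sleep should have ended.
    """
    cmd = ["ssh", target, "date; sleep {}; date".format(seconds)]
    try:
        out = subprocess.run(cmd, stdout=subprocess.PIPE,
                             stderr=subprocess.PIPE,
                             universal_newlines=True,
                             timeout=seconds + grace)
    except subprocess.TimeoutExpired:
        return False
    return out.returncode == 0 and len(out.stdout.strip().splitlines()) == 2


def longest_surviving_sleep(probe, works, fails):
    """Bisect between a known-good and a known-bad duration, in seconds.

    `probe(seconds)` must return True when the session survives.
    """
    while fails - works > 1:
        mid = (works + fails) // 2
        if probe(mid):
            works = mid
        else:
            fails = mid
    return works
```

With the numbers from this comment it would be called as `longest_surviving_sleep(lambda s: ssh_sleep_survives("user@sut", s), 250, 260)`, where `user@sut` is a placeholder for the real target. Each probe takes at least as long as the sleep itself, so narrowing a 10-second window takes roughly a quarter of an hour.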