memory leak on AWS kernels when using docker
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
linux-aws (Ubuntu) | New | Undecided | Unassigned |
Focal | New | Undecided | Unassigned |
Bug Description
Ever since the "ubuntu-
The scenario: our app runs in a Docker container. Occasionally it catches a crash within itself, and when that happens it spawns another process that triggers a gdb dump of the parent app. Normally this works fine, but under these specific kernels memory usage keeps growing until it hits the container's memory limit, at which point the container is killed.
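For anyone trying to observe the same growth, here is a minimal sketch of how the container's memory usage could be sampled from inside the container. It assumes the cgroup v1 layout (the file paths below are an assumption about the host setup, not something confirmed in this report), and the helper names are hypothetical.

```python
from pathlib import Path

# Assumption: cgroup v1 memory controller files as seen from inside the container.
USAGE_FILE = Path("/sys/fs/cgroup/memory/memory.usage_in_bytes")
LIMIT_FILE = Path("/sys/fs/cgroup/memory/memory.limit_in_bytes")


def read_bytes(path):
    """Return the integer byte count stored in a cgroup memory file."""
    return int(Path(path).read_text().strip())


def usage_fraction(usage_file=USAGE_FILE, limit_file=LIMIT_FILE):
    """Return current memory usage as a fraction of the container limit."""
    return read_bytes(usage_file) / read_bytes(limit_file)


def is_growing(samples, min_increase):
    """Return True if usage never decreased between consecutive samples
    and grew by at least min_increase bytes from first to last sample."""
    if len(samples) < 2:
        return False
    never_decreased = all(b >= a for a, b in zip(samples, samples[1:]))
    return never_decreased and samples[-1] - samples[0] >= min_increase
```

Calling `read_bytes(USAGE_FILE)` before and after each gdb dump and passing the collected values to `is_growing` would give a rough indication of whether usage is climbing toward the limit on a given kernel.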
I have tested using several of the latest available Ubuntu AMIs including the latest "ubuntu-
I also tested a bunch of the mainline kernels and found the fix was introduced for this memory leak in the v5.9-rc4 kernel (https:/
Do you all have any idea if or when that set of changes will be backported into a supported kernel for Ubuntu 18.04 or 20.04?
Release we are running:
root@<redacted>:~# lsb_release -rd
Description: Ubuntu 18.04.5 LTS
Release: 18.04
Docker / containerd.io versions:
- containerd.io: 1.4.4-1
- docker-ce: 5:20.10.
Latest supported kernel I tried which still sees the memory leak:
root@hostname:~# apt-cache policy linux-aws
linux-aws:
Installed: 5.4.0.1045.27
Candidate: 5.4.0.1045.27
Version table:
*** 5.4.0.1045.27 500
500 http://
500 http://
100 /var/lib/
4.15.0.1007.7 500
500 http://
Thanks,
Paul
Thank you Paul for the bug report.
As additional information, the 5.3.0 series kernels are no longer supported. If you want to continue using Ubuntu 18.04 with the additional security and bug fixes, you will eventually need to upgrade to a 5.4.0-based AWS kernel.
Are you able to point to exactly which commit from v5.9-rc4 fixes the memory leak you are experiencing? Most of the commits mentioning "leak" in their titles have already been applied to the 5.4 kernels, so if you can still reproduce the issue with the latest 5.4 AWS kernel, we are probably missing another fix.
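One possible way to narrow this down is to list the commits that landed between the previous release candidate and v5.9-rc4, then separate those whose titles mention "leak" from the rest. The sketch below assumes a local clone of the mainline kernel tree and assumes that v5.9-rc3 still shows the leak (not confirmed here); the function names are hypothetical. Running `git bisect` between the two tags would be the more thorough approach.

```python
import subprocess


def git_log_command(old_tag, new_tag):
    """Build a git command listing non-merge commits in old_tag..new_tag."""
    return ["git", "log", "--oneline", "--no-merges", f"{old_tag}..{new_tag}"]


def parse_oneline(output):
    """Parse `git log --oneline` output into (abbreviated_sha, title) tuples."""
    commits = []
    for line in output.splitlines():
        if not line.strip():
            continue  # skip blank lines
        sha, _, title = line.partition(" ")
        commits.append((sha, title))
    return commits


def split_by_keyword(commits, keyword="leak"):
    """Split commits into (title mentions keyword, title does not)."""
    with_kw = [c for c in commits if keyword.lower() in c[1].lower()]
    without_kw = [c for c in commits if keyword.lower() not in c[1].lower()]
    return with_kw, without_kw


def list_commits(repo_dir, old_tag="v5.9-rc3", new_tag="v5.9-rc4"):
    """Run git in repo_dir (assumed to be a mainline kernel clone)."""
    result = subprocess.run(
        git_log_command(old_tag, new_tag),
        cwd=repo_dir,
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_oneline(result.stdout)
```

The commits in the second list returned by `split_by_keyword` are the ones a title search for "leak" would miss, so they are the more likely place to find a fix that has not yet been applied to the 5.4 kernels.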
Thank you.