The screen freezes randomly

Bug #1913503 reported by Dongwon Cho
12
This bug affects 2 people
Affects Status Importance Assigned to Milestone
nvidia-graphics-drivers-460 (Ubuntu)
Confirmed
Undecided
Unassigned

Bug Description

The screen freezes randomly and I found the following kernel messages by ssh into it.

Jan 28 10:43:03 pc3 kernel: [142570.789087] NVRM: GPU at PCI:0000:01:00: GPU-1fb1859c-c844-063e-e293-10ec0f968a53
Jan 28 10:43:03 pc3 kernel: [142570.789091] NVRM: GPU Board Serial Number:
Jan 28 10:43:03 pc3 kernel: [142570.789094] NVRM: Xid (PCI:0000:01:00): 62, pid=2938, 0000(0000) 00000000 00000000
Jan 28 10:43:12 pc3 kernel: [142579.931126] NVRM: Xid (PCI:0000:01:00): 16, pid=0, Head 00000000 Count 00a0e329
Jan 28 10:43:37 pc3 kernel: [142604.506543] NVRM: Xid (PCI:0000:01:00): 16, pid=7777, Head 00000000 Count 00a0e32a
Jan 28 10:43:45 pc3 kernel: [142612.698367] NVRM: Xid (PCI:0000:01:00): 16, pid=7777, Head 00000000 Count 00a0e32b
Jan 28 10:43:53 pc3 kernel: [142620.890164] NVRM: Xid (PCI:0000:01:00): 16, pid=7777, Head 00000000 Count 00a0e32c
Jan 28 10:44:01 pc3 kernel: [142629.081968] NVRM: Xid (PCI:0000:01:00): 16, pid=7777, Head 00000000 Count 00a0e32d
Jan 28 10:44:09 pc3 kernel: [142637.273780] NVRM: Xid (PCI:0000:01:00): 16, pid=7777, Head 00000000 Count 00a0e32e
Jan 28 10:44:18 pc3 kernel: [142645.465600] NVRM: Xid (PCI:0000:01:00): 16, pid=7777, Head 00000000 Count 00a0e32f
Jan 28 10:44:26 pc3 kernel: [142653.657401] NVRM: Xid (PCI:0000:01:00): 16, pid=7777, Head 00000000 Count 00a0e330

lsb_release -rd
Description: Ubuntu 20.04.1 LTS
Release: 20.04

uname -a
Linux pc3 5.8.0-40-generic #45~20.04.1-Ubuntu SMP Fri Jan 15 11:35:04 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux

modinfo nvidia
filename: /lib/modules/5.8.0-40-generic/updates/dkms/nvidia.ko
alias: char-major-195-*
version: 460.32.03
supported: external
license: NVIDIA
srcversion: 1744B50B284E53E625E0B19
alias: pci:v000010DEd*sv*sd*bc03sc02i00*
alias: pci:v000010DEd*sv*sd*bc03sc00i00*
depends:
retpoline: Y
name: nvidia
vermagic: 5.8.0-40-generic SMP mod_unload
sig_id: PKCS#7
signer: pc3 Secure Boot Module Signature key
sig_key: 47:E3:CF:85:CF:19:77:13:E1:A4:A2:24:4E:69:EA:E3:B6:B6:6B:E8
sig_hashalgo: sha512
signature: 01:3D:1A:A1:1D:CF:D1:61:4E:9D:DA:96:E6:F5:9D:96:BF:A5:BE:B1:
  DF:AD:44:09:28:2D:6E:AD:4A:91:75:90:97:8C:06:7E:B9:E0:10:07:
  2A:84:D5:3B:2A:4D:9B:77:80:A6:9C:BC:40:43:A6:3A:9F:3B:44:13:
  24:31:8E:BD:EC:BB:0D:E8:90:9B:B2:14:E5:7D:DA:D1:75:AF:BF:5C:
  0B:C3:55:7C:D1:8E:BB:7D:83:3C:3B:AC:EB:AE:87:4F:0C:9F:C5:F9:
  AD:E7:53:9B:2B:16:C4:51:45:A5:3F:DE:DD:1B:B8:51:0B:F8:EA:A1:
  A6:CA:85:BC:3F:72:F0:03:8A:1A:85:DD:44:91:B5:09:23:35:9E:14:
  0B:C7:96:73:FB:81:84:06:77:56:6D:40:26:5F:4C:92:4F:A6:6F:B5:
  8D:41:D0:5C:F7:B8:80:EA:AD:7B:05:F0:96:DD:A6:07:4A:1C:2A:56:
  D8:76:9D:F7:97:1E:4F:48:BD:8E:08:24:79:D9:96:5B:B1:7B:07:71:
  5E:18:CF:7F:0A:DD:CD:44:1B:43:B5:46:33:D6:FF:09:CB:05:AF:CC:
  C1:77:35:59:13:0C:6F:53:AA:35:FF:B4:BE:CF:7E:DC:FD:26:81:51:
  6F:EC:15:90:FD:0C:49:17:7C:0F:4D:5E:FB:85:66:87
parm: NvSwitchRegDwords:NvSwitch regkey (charp)
parm: NvSwitchBlacklist:NvSwitchBlacklist=uuid[,uuid...] (charp)
parm: nv_cap_enable_devfs:Enable (1) or disable (0) nv-caps devfs support. Default: 1 (int)
parm: NVreg_ResmanDebugLevel:int
parm: NVreg_RmLogonRC:int
parm: NVreg_ModifyDeviceFiles:int
parm: NVreg_DeviceFileUID:int
parm: NVreg_DeviceFileGID:int
parm: NVreg_DeviceFileMode:int
parm: NVreg_InitializeSystemMemoryAllocations:int
parm: NVreg_UsePageAttributeTable:int
parm: NVreg_RegisterForACPIEvents:int
parm: NVreg_EnablePCIeGen3:int
parm: NVreg_EnableMSI:int
parm: NVreg_TCEBypassMode:int
parm: NVreg_EnableStreamMemOPs:int
parm: NVreg_EnableBacklightHandler:int
parm: NVreg_RestrictProfilingToAdminUsers:int
parm: NVreg_PreserveVideoMemoryAllocations:int
parm: NVreg_EnableS0ixPowerManagement:int
parm: NVreg_S0ixPowerManagementVideoMemoryThreshold:int
parm: NVreg_DynamicPowerManagement:int
parm: NVreg_DynamicPowerManagementVideoMemoryThreshold:int
parm: NVreg_EnableUserNUMAManagement:int
parm: NVreg_MemoryPoolSize:int
parm: NVreg_KMallocHeapMaxSize:int
parm: NVreg_VMallocHeapMaxSize:int
parm: NVreg_IgnoreMMIOCheck:int
parm: NVreg_NvLinkDisable:int
parm: NVreg_EnablePCIERelaxedOrderingMode:int
parm: NVreg_RegisterPCIDriver:int
parm: NVreg_RegistryDwords:charp
parm: NVreg_RegistryDwordsPerDevice:charp
parm: NVreg_RmMsg:charp
parm: NVreg_GpuBlacklist:charp
parm: NVreg_TemporaryFilePath:charp

nvidia-debugdump -l
Found 1 NVIDIA devices
 Device ID: 0
 Device name: GeForce RTX 3070 (*PrimaryCard)
 GPU internal ID: GPU-1fb1859c-c844-063e-e293-10ec0f968a53

Dongwon Cho (dongwoncho)
description: updated
summary: - the screen freezes randomly
+ The screen freezes randomly
Dongwon Cho (dongwoncho)
description: updated
Revision history for this message
Dongwon Cho (dongwoncho) wrote :

When it hangs, the Xorg process uses 100% CPU and the session service shows as follows.

● session-2.scope - Session 2 of user don
     Loaded: loaded (/run/systemd/transient/session-2.scope; transient)
  Transient: yes
     Active: active (running) since Sun 2021-01-31 13:39:53 KST; 1 day 22h ago
      Tasks: 11
     Memory: 117.6M
     CGroup: /user.slice/user-1000.slice/session-2.scope
             ├─5214 gdm-session-worker [pam/gdm-password]
             ├─5617 /usr/lib/gdm3/gdm-x-session --run-script env GNOME_SHELL_SESSION_MODE=ubuntu /usr/bin/gnome-session --systemd --session=ubuntu
             ├─5619 /usr/lib/xorg/Xorg vt2 -displayfd 3 -auth /run/user/1000/gdm/Xauthority -background none -noreset -keeptty -verbose 3
             ├─5908 /usr/bin/fcitx -d
             ├─5915 /usr/bin/dbus-daemon --syslog --fork --print-pid 5 --print-address 7 --config-file /usr/share/fcitx/dbus/daemon.conf
             └─5919 /usr/bin/fcitx-dbus-watcher unix:abstract=/tmp/dbus-9THsR8ilV1,guid=29ce2bb7a35a5607359c93056016349e 5915

Feb 02 12:25:10 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (II) event2 - Logitech M570: SYN_DROPPED event - some input events have been lost.
Feb 02 12:25:15 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (WW) NVIDIA: Wait for channel idle timed out.
Feb 02 12:25:18 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x0000f4b4, 0x0000f438)
Feb 02 12:25:25 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x0000f4b4, 0x0000f438)
Feb 02 12:25:28 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x0000f4b4, 0x0000f440)
Feb 02 12:25:35 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x0000f4b4, 0x0000f440)
Feb 02 12:25:38 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x0000f4b4, 0x0000f448)
Feb 02 12:25:45 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x0000f4b4, 0x0000f448)
Feb 02 12:25:48 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (EE) NVIDIA(GPU-0): WAIT (2, 8, 0x8000, 0x0000f4b4, 0x0000f450)
Feb 02 12:25:55 pc3 /usr/lib/gdm3/gdm-x-session[5619]: (EE) NVIDIA(GPU-0): WAIT (1, 8, 0x8000, 0x0000f4b4, 0x0000f450)

Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in nvidia-graphics-drivers-460 (Ubuntu):
status: New → Confirmed
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.