Nvidia module crash: GPU has fallen off the bus.
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
nvidia-graphics-drivers (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
The last two mornings, the nvidia module has crashed during the middle of the night (4 a.m.). The displays are set to turn off and lock, but the machine is not set up to go into suspend.
Here's an excerpt from the earliest error in kern.log.
kernel: [144532.511018] hda-intel: spurious response 0x1:0x0, last cmd=0x4f0700
kernel: [144532.511021] hda-intel: spurious response 0x40:0x0, last cmd=0x4f0700
kernel: [144532.511024] hda-intel: spurious response 0x0:0x0, last cmd=0x4f0700
kernel: [144532.777203] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
kernel: [144532.777214] NVRM: GPU at 0000:01:00.0 has fallen off the bus.
kernel: [144582.521101] show_signal_msg: 51 callbacks suppressed
kernel: [144582.521108] Xorg[2151]: segfault at 617461481d ip 00007fe6533bbd41 sp 00007fff94311438 error 4 in nvidia_
kernel: [144582.814351] init: lightdm main process (2129) terminated with status 1
kernel: [144597.810481] init: failsafe-x main process (28549) terminated with status 1
kernel: [144608.298745] BUG: soft lockup - CPU#1 stuck for 22s! [dconf worker:3334]
kernel: [144608.298883] Modules linked in: ip6table_filter ip6_tables ebtable_nat ebtables pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack ipt_REJECT xt_CHECKSUM iptable_mangle xt_tcpudp iptable_filter ip_tables x_tables kvm_intel kvm bnep rfcomm parport_pc ppdev binfmt_misc nfsd snd_hda_codec_hdmi snd_hda_
kernel: [144608.298925] CPU 1
kernel: [144608.298926] Modules linked in: ip6table_filter ip6_tables ebtable_nat ebtables pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack ipt_REJECT xt_CHECKSUM iptable_mangle xt_tcpudp iptable_filter ip_tables x_tables kvm_intel kvm bnep rfcomm parport_pc ppdev binfmt_misc nfsd snd_hda_codec_hdmi snd_hda_
kernel: [144608.298956]
kernel: [144608.298958] Pid: 3334, comm: dconf worker Tainted: P O 3.2.0-39-generic #62-Ubuntu BIOSTAR Group TP55/TP55
kernel: [144608.298961] RIP: 0010:[<
kernel: [144608.299043] RSP: 0018:ffff8807ed
kernel: [144608.299044] RAX: 0000000000000000 RBX: ffff8807ed7e57b8 RCX: 000000000000000d
kernel: [144608.299045] RDX: 0000000000002000 RSI: 000000000000548d RDI: ffff88080b57c034
kernel: [144608.299047] RBP: ffff8807fe03de80 R08: 0000000000070004 R09: ffff8807fe03dea8
kernel: [144608.299048] R10: 0000000000000000 R11: 0000000000000001 R12: ffffffff8103ec69
kernel: [144608.299049] R13: ffff8807ed7e5798 R14: ffffffff8103ec69 R15: ffff8807ed7e5788
kernel: [144608.299051] FS: 00007f084ad5b70
kernel: [144608.299052] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
kernel: [144608.299054] CR2: 000012367832e4f0 CR3: 0000000001c05000 CR4: 00000000000006e0
kernel: [144608.299055] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
kernel: [144608.299057] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
kernel: [144608.299058] Process dconf worker (pid: 3334, threadinfo ffff8807ed7e4000, task ffff8807fcc64500)
kernel: [144608.299059] Stack:
kernel: [144608.299081] 00000000ffffffff ffffffffa02b4a6a ffff88080b57c008 ffff88080b57c008
kernel: [144608.299084] 0000000000070004 ffffffffa0584399 ffff88080b57c008 ffffffffa05a09e4
kernel: [144608.299086] ffff88080b57c008 0000000000000045 ffff88080b57c008 0000000000070004
kernel: [144608.299089] Call Trace:
kernel: [144608.299144] [<ffffffffa02b4
kernel: [144608.299221] [<ffffffffa0584
kernel: [144608.299299] [<ffffffffa05a0
kernel: [144608.299372] [<ffffffffa0521
kernel: [144608.299446] [<ffffffffa0521
kernel: [144608.299508] [<ffffffffa0628
kernel: [144608.299569] [<ffffffffa0628
kernel: [144608.299601] [<ffffffffa02a6
kernel: [144608.299633] [<ffffffffa02a3
kernel: [144608.299664] [<ffffffffa02a3
kernel: [144608.299695] [<ffffffffa02a4
kernel: [144608.299727] [<ffffffffa02a3
kernel: [144608.299758] [<ffffffffa02a3
kernel: [144608.299789] [<ffffffffa02a3
kernel: [144608.299821] [<ffffffffa02a3
kernel: [144608.299852] [<ffffffffa02a3
kernel: [144608.299888] [<ffffffffa0756
kernel: [144608.299924] [<ffffffffa0758
kernel: [144608.299928] [<ffffffff81091
kernel: [144608.299962] [<ffffffffa0776
kernel: [144608.299996] [<ffffffffa0777
kernel: [144608.299999] [<ffffffff8117a
kernel: [144608.300001] [<ffffffff8117a
kernel: [144608.300003] [<ffffffff81177
kernel: [144608.300006] [<ffffffff8106a
kernel: [144608.300009] [<ffffffff8106c
kernel: [144608.300011] [<ffffffff8106c
kernel: [144608.300013] [<ffffffff8106c
kernel: [144608.300016] [<ffffffff8107b
kernel: [144608.300018] [<ffffffff8106c
kernel: [144608.300020] [<ffffffff8107d
kernel: [144608.300023] [<ffffffff81014
kernel: [144608.300025] [<ffffffff81179
kernel: [144608.300027] [<ffffffff81014
kernel: [144608.300030] [<ffffffff81666
kernel: [144608.300031] Code: e8 49 00 48 89 c2 be 01 00 00 00 bf 00 00 00 00 e8 64 ef 00 00 b8 00 00 00 00 eb 12 89 c0 ba 01 00 00 00 d3 e2 85 14 87 0f 95 c0 <0f> b6 c0 48 83 c4 08 c3 41 55 41 54 53 48 89 fb 49 89 f5 49 89
kernel: [144608.301447] Call Trace:
kernel: [144608.301487] [<ffffffffa02b4
kernel: [144608.301573] [<ffffffffa0584
kernel: [144608.301654] [<ffffffffa05a0
kernel: [144608.301731] [<ffffffffa0521
kernel: [144608.301807] [<ffffffffa0521
kernel: [144608.301871] [<ffffffffa0628
kernel: [144608.301935] [<ffffffffa0628
kernel: [144608.301967] [<ffffffffa02a6
kernel: [144608.302000] [<ffffffffa02a3
kernel: [144608.302033] [<ffffffffa02a3
kernel: [144608.302065] [<ffffffffa02a4
kernel: [144608.302098] [<ffffffffa02a3
kernel: [144608.302130] [<ffffffffa02a3
kernel: [144608.302163] [<ffffffffa02a3
kernel: [144608.302196] [<ffffffffa02a3
kernel: [144608.302228] [<ffffffffa02a3
kernel: [144608.302265] [<ffffffffa0756
kernel: [144608.302302] [<ffffffffa0758
kernel: [144608.302305] [<ffffffff81091
kernel: [144608.302339] [<ffffffffa0776
kernel: [144608.302375] [<ffffffffa0777
kernel: [144608.302377] [<ffffffff8117a
kernel: [144608.302379] [<ffffffff8117a
kernel: [144608.302381] [<ffffffff81177
kernel: [144608.302383] [<ffffffff8106a
kernel: [144608.302385] [<ffffffff8106c
kernel: [144608.302387] [<ffffffff8106c
kernel: [144608.302389] [<ffffffff8106c
kernel: [144608.302391] [<ffffffff8107b
kernel: [144608.302393] [<ffffffff8106c
kernel: [144608.302395] [<ffffffff8107d
kernel: [144608.302398] [<ffffffff81014
kernel: [144608.302400] [<ffffffff81179
kernel: [144608.302401] [<ffffffff81014
kernel: [144608.302404] [<ffffffff81666
kernel: [144636.226880] BUG: soft lockup - CPU#1 stuck for 22s! [dconf worker:3334]
kernel: [144636.227012] Modules linked in: ip6table_filter ip6_tables ebtable_nat ebtables pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack ipt_REJECT xt_CHECKSUM iptable_mangle xt_tcpudp iptable_filter ip_tables x_tables kvm_intel kvm bnep rfcomm parport_pc ppdev binfmt_misc nfsd snd_hda_codec_hdmi snd_hda_
kernel: [144636.227055] CPU 1
ProblemType: Bug
DistroRelease: Ubuntu 12.04
Package: xorg 1:7.6+12ubuntu2
ProcVersionSign
Uname: Linux 3.2.0-39-generic x86_64
NonfreeKernelMo
.proc.driver.
.proc.driver.
.proc.driver.
NVRM version: NVIDIA UNIX x86_64 Kernel Module 310.14 Tue Oct 9 11:52:41 PDT 2012
GCC version: gcc version 4.6.3 (Ubuntu/Linaro 4.6.3-1ubuntu5)
.tmp.unity.
ApportVersion: 2.0.1-0ubuntu17.1
Architecture: amd64
CompizPlugins: No value set for `/apps/
CompositorRunning: compiz
Date: Tue Mar 12 10:16:18 2013
DistUpgraded: Fresh install
DistroCodename: precise
DistroVariant: ubuntu
ExtraDebuggingI
GraphicsCard:
NVIDIA Corporation Device [10de:1183] (rev a1) (prog-if 00 [VGA controller])
Subsystem: Device [196e:1000]
InstallationMedia: Ubuntu 12.04 LTS "Precise Pangolin" - Release amd64 (20120425)
JockeyStatus:
xorg:nvidia_
xorg:nvidia_
xorg:nvidia_
MachineType: BIOSTAR Group TP55
MarkForUpload: True
ProcKernelCmdLine: BOOT_IMAGE=
SourcePackage: xorg
Symptom: display
Title: Xorg crash
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 06/02/2010
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 080015
dmi.board.
dmi.board.name: TP55
dmi.board.vendor: BIOSTAR Group
dmi.chassis.
dmi.chassis.type: 3
dmi.chassis.vendor: BIOSTAR Group
dmi.chassis.
dmi.modalias: dmi:bvnAmerican
dmi.product.name: TP55
dmi.product.
dmi.sys.vendor: BIOSTAR Group
version.compiz: compiz 1:0.9.7.12-0ubuntu1
version.ia32-libs: ia32-libs 20090808ubuntu36
version.libdrm2: libdrm2 2.4.39-0ubuntu0.1
version.
version.
version.
version.
version.
version.
version.
version.
version.
affects: | xorg (Ubuntu) → nvidia-graphics-drivers (Ubuntu) |
This has some interesting, and possibly related reading. http:// www.cyberciti. biz/faq/ debian- ubuntu- rhel-fedora- linux-nvidia- nvrm-gpu- fallen- off-bus/
The suggested solution is to turn on persistence for the nvidia card using
# /usr/bin/nvidia-smi -pm 1
This was never an issue with 3.2.0-38+ 310.14- 0ubuntu0. 1 *(the same version I am currently running).
This also could be as simple as a real hardware issue.