System hangs apparently randomly when disconnecting iScsi volumes
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Triaged
|
High
|
Unassigned |
Bug Description
We have several servers here which mount Ubuntu LTS Trusty Tahr 14.04. We use LVM snapshots to do backups during the night. Randomly (witha rate of 1 event every 10-12 days) the LVM snapshot creation runs in some problem. The server hangs and it must be rebooted hardly. No shell, no even any screen output, just a black screen, no ping, nothing. And Magic SysRq key doesn't help.
No logs are written either (I also tried redirect logs to another machine, just in case). This happens with different hardware, and different backup software. The only constant is LVM snapshots. Kernel is 3.13.0-49. We tried the 14.10 kernel (3.16.0-34), we had the same hangs, but this time we had something logged:
Apr 21 23:02:56 server-name kernel: [654840.108023] INFO: task kswapd0:50 blocked for more than 120 seconds.
Apr 21 23:02:56 server-name kernel: [654840.108145] Not tainted 3.16.0-34-generic #47~14.04.1-Ubuntu
Apr 21 23:02:56 server-name kernel: [654840.108245] "echo 0 > /proc/sys/
Apr 21 23:02:56 server-name kernel: [654840.108361] kswapd0 D ffff88007fc130c0 0 50 2 0x00000000
Apr 21 23:02:56 server-name kernel: [654840.108367] ffff880077bcf998 0000000000000046 ffff880077bd0000 ffff880077bcffd8
Apr 21 23:02:56 server-name kernel: [654840.108372] 00000000000130c0 00000000000130c0 ffff88007bdef010 ffffc90010d3c000
Apr 21 23:02:56 server-name kernel: [654840.108377] ffffc90010d3c0d8 0000000000000000 0000000000000000 ffffc90010d3c000
Apr 21 23:02:56 server-name kernel: [654840.108382] Call Trace:
/0x70
Apr 21 23:02:56 server-name kernel: [654840.108419] [<ffffffffc0707
Apr 21 23:02:56 server-name kernel: [654840.108426] [<ffffffff810b4
Apr 21 23:02:56 server-name kernel: [654840.108437] [<ffffffffc0709
Apr 21 23:02:56 server-name kernel: [654840.108443] [<ffffffff810a5
Apr 21 23:02:56 server-name kernel: [654840.108454] [<ffffffffc0709
Apr 21 23:02:56 server-name kernel: [654840.108464] [<ffffffffc06f6
Apr 21 23:02:56 server-name kernel: [654840.108470] [<ffffffff813ab
Apr 21 23:02:56 server-name kernel: [654840.108476] [<ffffffff81230
Apr 21 23:02:56 server-name kernel: [654840.108480] [<ffffffff81231
Apr 21 23:02:56 server-name kernel: [654840.108485] [<ffffffff81231
Apr 21 23:02:56 server-name kernel: [654840.108494] [<ffffffffc06ec
t_inode+0xa0/0x180 [reiserfs]
Apr 21 23:02:56 server-name kernel: [654840.108501] [<ffffffff811ff
Apr 21 23:02:56 server-name kernel: [654840.108507] [<ffffffff811ee
Apr 21 23:02:56 server-name kernel: [654840.108511] [<ffffffff811ef
Apr 21 23:02:56 server-name kernel: [654840.108515] [<ffffffff811ef
Apr 21 23:02:56 server-name kernel: [654840.108520] [<ffffffff811d7
Apr 21 23:02:56 server-name kernel: [654840.108526] [<ffffffff81171
Apr 21 23:02:56 server-name kernel: [654840.108532] [<ffffffff810f8
Apr 21 23:02:56 server-name kernel: [654840.108536] [<ffffffff81173
Apr 21 23:02:56 server-name kernel: [654840.108540] [<ffffffff81177
Apr 21 23:02:56 server-name kernel: [654840.108544] [<ffffffff81177
Apr 21 23:02:56 server-name kernel: [654840.108549] [<ffffffff810b4
Apr 21 23:02:56 server-name kernel: [654840.108552] [<ffffffff81177
Apr 21 23:02:56 server-name kernel: [654840.108558] [<ffffffff81091
Apr 21 23:02:56 server-name kernel: [654840.108562] [<ffffffff81091
Apr 21 23:02:56 server-name kernel: [654840.108567] [<ffffffff8176c
Apr 21 23:02:56 server-name kernel: [654840.108571] [<ffffffff81091
Apr 21 23:02:56 server-name kernel: [654840.108595] INFO: task kworker/1:1:25606 blocked for more than 120 seconds.
Apr 21 23:02:56 server-name kernel: [654840.108709] Not tainted 3.16.0-34-generic #47~14.04.1-Ubuntu
Apr 21 23:02:56 server-name kernel: [654840.108809] "echo 0 > /proc/sys/
Apr 21 23:02:56 server-name kernel: [654840.108923] kworker/1:1 D ffff88007fc530c0 0 25606 2 0x00000000
Apr 21 23:02:56 server-name kernel: [654840.108936] Workqueue: events_long flush_old_commits [reiserfs]
Apr 21 23:02:56 server-name kernel: [654840.108939] ffff88000311fc90 0000000000000046 ffff880079db65e0 ffff88000311ffd8
Apr 21 23:02:56 server-name kernel: [654840.108943] 00000000000130c0 00000000000130c0 ffff880036b065e0 ffffc90010d3c000
Apr 21 23:02:56 server-name kernel: [654840.108948] ffffc90010d3c0d8 0000000000000000 0000000000000000 ffffc90010d3c000
Apr 21 23:02:56 server-name kernel: [654840.108952] Call Trace:
Apr 21 23:02:56 server-name kernel: [654840.108956] [<ffffffff81768
Apr 21 23:02:56 server-name kernel: [654840.108967] [<ffffffffc0707
Apr 21 23:02:56 server-name kernel: [654840.108972] [<ffffffff810b4
Apr 21 23:02:56 server-name kernel: [654840.108983] [<ffffffffc0709
Apr 21 23:02:56 server-name kernel: [654840.108989] [<ffffffff8101c
Apr 21 23:02:56 server-name kernel: [654840.109000] [<ffffffffc0709
Apr 21 23:02:56 server-name kernel: [654840.109010] [<ffffffffc06f5
Apr 21 23:02:56 server-name kernel: [654840.109019] [<ffffffffc06f5
Apr 21 23:02:56 server-name kernel: [654840.109024] [<ffffffff8108a
Apr 21 23:02:56 server-name kernel: [654840.109028] [<ffffffff8108a
Apr 21 23:02:56 server-name kernel: [654840.109032] [<ffffffff8108a
Apr 21 23:02:56 server-name kernel: [654840.109036] [<ffffffff81091
Apr 21 23:02:56 server-name kernel: [654840.109040] [<ffffffff81091
Apr 21 23:02:56 server-name kernel: [654840.109044] [<ffffffff8176c
Apr 21 23:02:56 server-name kernel: [654840.109048] [<ffffffff81091
Apr 21 23:02:56 server-name kernel: [654840.109053] INFO: task lvcreate:25648 blocked for more than 120 seconds.
Apr 21 23:02:56 server-name kernel: [654840.109159] Not tainted 3.16.0-34-generic #47~14.04.1-Ubuntu
Apr 21 23:02:56 server-name kernel: [654840.109258] "echo 0 > /proc/sys/
Apr 21 23:02:56 server-name kernel: [654840.109372] lvcreate D ffff88007fc530c0 0 25648 25623 0x00000000
Apr 21 23:02:56 server-name kernel: [654840.109377] ffff880000053b60 0000000000000086 ffff8800773432f0 ffff880000053fd8
Apr 21 23:02:56 server-name kernel: [654840.109381] 00000000000130c0 00000000000130c0 ffff88007c0c9460 ffff8800773432f0
Apr 21 23:02:56 server-name kernel: [654840.109385] ffff880037a23080 ffff880037a23068 ffffffff00000000 ffff880037a23070
Apr 21 23:02:56 server-name kernel: [654840.109390] Call Trace:
Apr 21 23:02:56 server-name kernel: [654840.109394] [<ffffffff81768
Apr 21 23:02:56 server-name kernel: [654840.109398] [<ffffffff8176b
Apr 21 23:02:56 server-name kernel: [654840.109404] [<ffffffff81011
Apr 21 23:02:56 server-name kernel: [654840.109409] [<ffffffff81394
Apr 21 23:02:56 server-name kernel: [654840.109416] [<ffffffff815e6
Apr 21 23:02:56 server-name kernel: [654840.109419] [<ffffffff8176b
Apr 21 23:02:56 server-name kernel: [654840.109423] [<ffffffff811d6
Apr 21 23:02:56 server-name kernel: [654840.109428] [<ffffffff81209
Apr 21 23:02:56 server-name kernel: [654840.109432] [<ffffffff815e4
Apr 21 23:02:56 server-name kernel: [654840.109436] [<ffffffff815e6
Apr 21 23:02:56 server-name kernel: [654840.109441] [<ffffffff815eb
Apr 21 23:02:56 server-name kernel: [654840.109445] [<ffffffff815eb
Apr 21 23:02:56 server-name kernel: [654840.109449] [<ffffffff815ec
Apr 21 23:02:56 server-name kernel: [654840.109454] [<ffffffff812d7
Apr 21 23:02:56 server-name kernel: [654840.109459] [<ffffffff815ec
Apr 21 23:02:56 server-name kernel: [654840.109463] [<ffffffff811e7
Apr 21 23:02:56 server-name kernel: [654840.109468] [<ffffffff812eb
Apr 21 23:02:56 server-name kernel: [654840.109472] [<ffffffff812d6
Apr 21 23:02:56 server-name kernel: [654840.109476] [<ffffffff812d4
Apr 21 23:02:56 server-name kernel: [654840.109480] [<ffffffff811e7
Apr 21 23:02:56 server-name kernel: [654840.109484] [<ffffffff8176c
Apr 21 23:04:56 server-name kernel: [654960.108025] INFO: task kswapd0:50 blocked for more than 120 seconds.
Apr 21 23:04:56 server-name kernel: [654960.108124] Not tainted 3.16.0-34-generic #47~14.04.1-Ubuntu
Apr 21 23:04:56 server-name kernel: [654960.108224] "echo 0 > /proc/sys/
Apr 21 23:04:56 server-name kernel: [654960.108339] kswapd0 D ffff88007fc130c0 0 50 2 0x00000000
Apr 21 23:04:56 server-name kernel: [654960.108346] ffff880077bcf998 0000000000000046 ffff880077bd0000 ffff880077bcffd8
Apr 21 23:04:56 server-name kernel: [654960.108351] 00000000000130c0 00000000000130c0 ffff88007bdef010 ffffc90010d3c000
Apr 21 23:04:56 server-name kernel: [654960.108355] ffffc90010d3c0d8 0000000000000000 0000000000000000 ffffc90010d3c000
Apr 21 23:04:56 server-name kernel: [654960.108360] Call Trace:
Apr 21 23:04:56 server-name kernel: [654960.108372] [<ffffffff81768
Apr 21 23:04:56 server-name kernel: [654960.108397] [<ffffffffc0707
Apr 21 23:04:56 server-name kernel: [654960.108404] [<ffffffff810b4
Apr 21 23:04:56 server-name kernel: [654960.108416] [<ffffffffc0709
Apr 21 23:04:56 server-name kernel: [654960.108422] [<ffffffff810a5
Apr 21 23:04:56 server-name kernel: [654960.108433] [<ffffffffc0709
Apr 21 23:04:56 server-name kernel: [654960.108443] [<ffffffffc06f6
Apr 21 23:04:56 server-name kernel: [654960.108449] [<ffffffff813ab
Apr 21 23:04:56 server-name kernel: [654960.108455] [<ffffffff81230
Apr 21 23:04:56 server-name kernel: [654960.108459] [<ffffffff81231
Apr 21 23:04:56 server-name kernel: [654960.108464] [<ffffffff81231
Apr 21 23:04:56 server-name kernel: [654960.108473] [<ffffffffc06ec
Apr 21 23:04:56 server-name kernel: [654960.108480] [<ffffffff811ff
Apr 21 23:04:56 server-name kernel: [654960.108485] [<ffffffff811ee
Apr 21 23:04:56 server-name kernel: [654960.108490] [<ffffffff811ef
Apr 21 23:04:56 server-name kernel: [654960.108494] [<ffffffff811ef
Apr 21 23:04:56 server-name kernel: [654960.108499] [<ffffffff811d7
Apr 21 23:04:56 server-name kernel: [654960.108504] [<ffffffff81171
Apr 21 23:04:56 server-name kernel: [654960.108510] [<ffffffff810f8
Apr 21 23:04:56 server-name kernel: [654960.108515] [<ffffffff81173
Apr 21 23:04:56 server-name kernel: [654960.108519] [<ffffffff81177
Apr 21 23:04:56 server-name kernel: [654960.108523] [<ffffffff81177
Apr 21 23:04:56 server-name kernel: [654960.108528] [<ffffffff810b4
Apr 21 23:04:56 server-name kernel: [654960.108531] [<ffffffff81177
Apr 21 23:04:56 server-name kernel: [654960.108536] [<ffffffff81091
Apr 21 23:04:56 server-name kernel: [654960.108541] [<ffffffff81091
Apr 21 23:04:56 server-name kernel: [654960.108546] [<ffffffff8176c
Apr 21 23:04:56 server-name kernel: [654960.108550] [<ffffffff81091
Apr 21 23:04:56 server-name kernel: [654960.108573] INFO: task kworker/1:1:25606 blocked for more than 120 seconds.
Apr 21 23:04:56 server-name kernel: [654960.108688] Not tainted 3.16.0-34-generic #47~14.04.1-Ubuntu
Apr 21 23:04:56 server-name kernel: [654960.108788] "echo 0 > /proc/sys/
Apr 21 23:04:56 server-name kernel: [654960.108905] kworker/1:1 D ffff88007fc530c0 0 25606 2 0x00000000
Apr 21 23:04:56 server-name kernel: [654960.108918] Workqueue: events_long flush_old_commits [reiserfs]
Apr 21 23:04:56 server-name kernel: [654960.108920] ffff88000311fc90 0000000000000046 ffff880079db65e0 ffff88000311ffd8
Apr 21 23:04:56 server-name kernel: [654960.108925] 00000000000130c0 00000000000130c0 ffff880036b065e0 ffffc90010d3c000
Apr 21 23:04:56 server-name kernel: [654960.108929] ffffc90010d3c0d8 0000000000000000 0000000000000000 ffffc90010d3c000
Apr 21 23:04:56 server-name kernel: [654960.108934] Call Trace:
Apr 21 23:04:56 server-name kernel: [654960.108938] [<ffffffff81768
Apr 21 23:04:56 server-name kernel: [654960.108949] [<ffffffffc0707
Apr 21 23:04:56 server-name kernel: [654960.108953] [<ffffffff810b4
Apr 21 23:04:56 server-name kernel: [654960.108964] [<ffffffffc0709
Apr 21 23:04:56 server-name kernel: [654960.108971] [<ffffffff8101c
Apr 21 23:04:56 server-name kernel: [654960.108982] [<ffffffffc0709
Apr 21 23:04:56 server-name kernel: [654960.108991] [<ffffffffc06f5
Apr 21 23:04:56 server-name kernel: [654960.109001] [<ffffffffc06f5
Apr 21 23:04:56 server-name kernel: [654960.109006] [<ffffffff8108a
Apr 21 23:04:56 server-name kernel: [654960.109010] [<ffffffff8108a
Apr 21 23:04:56 server-name kernel: [654960.109014] [<ffffffff8108a
Apr 21 23:04:56 server-name kernel: [654960.109018] [<ffffffff81091
Apr 21 23:04:56 server-name kernel: [654960.109022] [<ffffffff81091
Apr 21 23:04:56 server-name kernel: [654960.109026] [<ffffffff8176c
Apr 21 23:04:56 server-name kernel: [654960.109030] [<ffffffff81091
Apr 21 23:04:56 server-name kernel: [654960.109035] INFO: task lvcreate:25648 blocked for more than 120 seconds.
Apr 21 23:04:56 server-name kernel: [654960.109141] Not tainted 3.16.0-34-generic #47~14.04.1-Ubuntu
Apr 21 23:04:56 server-name kernel: [654960.109240] "echo 0 > /proc/sys/
Apr 21 23:04:56 server-name kernel: [654960.109354] lvcreate D ffff88007fc530c0 0 25648 25623 0x00000000
Apr 21 23:04:56 server-name kernel: [654960.109359] ffff880000053b60 0000000000000086 ffff8800773432f0 ffff880000053fd8
Apr 21 23:04:56 server-name kernel: [654960.109363] 00000000000130c0 00000000000130c0 ffff88007c0c9460 ffff8800773432f0
Apr 21 23:04:56 server-name kernel: [654960.109367] ffff880037a23080 ffff880037a23068 ffffffff00000000 ffff880037a23070
Apr 21 23:04:56 server-name kernel: [654960.109372] Call Trace:
Apr 21 23:04:56 server-name kernel: [654960.109376] [<ffffffff81768
Apr 21 23:04:56 server-name kernel: [654960.109380] [<ffffffff8176b
Apr 21 23:04:56 server-name kernel: [654960.109386] [<ffffffff81011
Apr 21 23:04:56 server-name kernel: [654960.109391] [<ffffffff81394
Apr 21 23:04:56 server-name kernel: [654960.109398] [<ffffffff815e6
Apr 21 23:04:56 server-name kernel: [654960.109401] [<ffffffff8176b
Apr 21 23:04:56 server-name kernel: [654960.109405] [<ffffffff811d6
Apr 21 23:04:56 server-name kernel: [654960.109410] [<ffffffff81209
Apr 21 23:04:56 server-name kernel: [654960.109414] [<ffffffff815e4
Apr 21 23:04:56 server-name kernel: [654960.109419] [<ffffffff815e6
Apr 21 23:04:56 server-name kernel: [654960.109424] [<ffffffff815eb
Apr 21 23:04:56 server-name kernel: [654960.109428] [<ffffffff815eb
Apr 21 23:04:56 server-name kernel: [654960.109431] [<ffffffff815ec
Apr 21 23:04:56 server-name kernel: [654960.109437] [<ffffffff812d7
Apr 21 23:04:56 server-name kernel: [654960.109441] [<ffffffff815ec
Apr 21 23:04:56 server-name kernel: [654960.109446] [<ffffffff811e7
Apr 21 23:04:56 server-name kernel: [654960.109451] [<ffffffff812eb
Apr 21 23:04:56 server-name kernel: [654960.109455] [<ffffffff812d6
Apr 21 23:04:56 server-name kernel: [654960.109459] [<ffffffff812d4
Apr 21 23:04:56 server-name kernel: [654960.109462] [<ffffffff811e7
Apr 21 23:04:56 server-name kernel: [654960.109467] [<ffffffff8176c
Apr 21 23:06:10 server-name nmbd[962]: [2015/04/21 23:06:10.798735, 0] ../source3/
Apr 21 23:06:10 server-name nmbd[962]: queue_query_name: interface 0 has NULL IP address !
Apr 21 23:06:56 server-name kernel: [655080.108025] INFO: task kswapd0:50 blocked for more than 120 seconds.
Apr 21 23:06:56 server-name kernel: [655080.108123] Not tainted 3.16.0-34-generic #47~14.04.1-Ubuntu
Apr 21 23:06:56 server-name kernel: [655080.108223] "echo 0 > /proc/sys/
Apr 21 23:06:56 server-name kernel: [655080.108347] kswapd0 D ffff88007fc130c0 0 50 2 0x00000000
Apr 21 23:06:56 server-name kernel: [655080.108354] ffff880077bcf998 0000000000000046 ffff880077bd0000 ffff880077bcffd8
Apr 21 23:06:56 server-name kernel: [655080.108359] 00000000000130c0 00000000000130c0 ffff88007bdef010 ffffc90010d3c000
Apr 21 23:06:56 server-name kernel: [655080.108363] ffffc90010d3c0d8 0000000000000000 0000000000000000 ffffc90010d3c000
Apr 21 23:06:56 server-name kernel: [655080.108369] Call Trace:
Apr 21 23:06:56 server-name kernel: [655080.108379] [<ffffffff81768
_on_write_
Apr 21 23:06:56 server-name kernel: [655080.108413] [<ffffffff810b4
Apr 21 23:06:56 server-name kernel: [655080.108424] [<ffffffffc0709
Apr 21 23:06:56 server-name kernel: [655080.108430] [<ffffffff810a5
Apr 21 23:06:56 server-name kernel: [655080.108441] [<ffffffffc0709
Apr 21 23:06:56 server-name kernel: [655080.108451] [<ffffffffc06f6
Apr 21 23:06:56 server-name kernel: [655080.108457] [<ffffffff813ab
Apr 21 23:06:56 server-name kernel: [655080.108463] [<ffffffff81230
Apr 21 23:06:56 server-name kernel: [655080.108467] [<ffffffff81231
Apr 21 23:06:56 server-name kernel: [655080.108472] [<ffffffff81231
Apr 21 23:06:56 server-name kernel: [655080.108481] [<ffffffffc06ec
Apr 21 23:06:56 server-name kernel: [655080.108488] [<ffffffff811ff
Apr 21 23:06:56 server-name kernel: [655080.108498] [<ffffffff811ef
Apr 21 23:06:56 server-name kernel: [655080.108502] [<ffffffff811ef
Apr 21 23:06:56 server-name kernel: [655080.108507] [<ffffffff811d7
Apr 21 23:06:56 server-name kernel: [655080.108512] [<ffffffff81171
Apr 21 23:06:56 server-name kernel: [655080.108519] [<ffffffff810f8
Apr 21 23:06:56 server-name kernel: [655080.108523] [<ffffffff81173
Apr 21 23:06:56 server-name kernel: [655080.108527] [<ffffffff81177
Apr 21 23:06:56 server-name kernel: [655080.108531] [<ffffffff81177
Apr 21 23:06:56 server-name kernel: [655080.108536] [<ffffffff810b4
Apr 21 23:06:56 server-name kernel: [655080.108539] [<ffffffff81177
Apr 21 23:06:56 server-name kernel: [655080.108545] [<ffffffff81091
Apr 21 23:06:56 server-name kernel: [655080.108549] [<ffffffff81091
Apr 21 23:06:56 server-name kernel: [655080.108554] [<ffffffff8176c
Apr 21 23:06:56 server-name kernel: [655080.108558] [<ffffffff81091
Apr 21 23:06:56 server-name kernel: [655080.108582] INFO: task kworker/1:1:25606 blocked for more than 120 seconds.
Apr 21 23:06:56 server-name kernel: [655080.108697] Not tainted 3.16.0-34-generic #47~14.04.1-Ubuntu
Apr 21 23:06:56 server-name kernel: [655080.108796] "echo 0 > /proc/sys/
Apr 21 23:06:56 server-name kernel: [655080.108910] kworker/1:1 D ffff88007fc530c0 0 25606 2 0x00000000
Apr 21 23:06:56 server-name kernel: [655080.108923] Workqueue: events_long flush_old_commits [reiserfs]
Apr 21 23:06:56 server-name kernel: [655080.108925] ffff88000311fc90 0000000000000046 ffff880079db65e0 ffff88000311ffd8
and this keeps going for some minutes, then the next morning the server is hung.
description: | updated |
Changed in linux (Ubuntu): | |
status: | Confirmed → Triaged |
This bug is missing log files that will aid in diagnosing the problem. From a terminal window please run:
apport-collect 1449910
and then change the status of the bug to 'Confirmed'.
If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.
This change has been made by an automated script, maintained by the Ubuntu Kernel Team.