Header Shadow Image


INFO task kworker blocked for more than 120 seconds.

One day we get the following issue (INFO task kworker blocked for more than 120 seconds.):

Oct 23 20:14:51 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818:20: is_send_status=1, cmd->bufflen=0, cmd->sg_cnt=0, cmd->dma_data_direction=3 se_cmd[ffff88000a1a04f0]
Oct 23 20:14:51 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:20: qlt_free_cmd: se_cmd[ffff88000a1a04f0] ox_id 0148
Oct 23 20:14:51 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:20: qlt_24xx_atio_pkt_all_vps: qla_target(0): type 6 ox_id 0149
Oct 23 20:14:51 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e818:20: is_send_status=1, cmd->bufflen=1024, cmd->sg_cnt=0, cmd->dma_data_direction=2 se_cmd[ffff88000a19eab0]
Oct 23 20:14:51 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e874:20: qlt_free_cmd: se_cmd[ffff88000a19eab0] ox_id 0149
Oct 23 20:14:52 mbpc-pc kernel: INFO: task kworker/u16:3:9779 blocked for more than 120 seconds.
Oct 23 20:14:52 mbpc-pc kernel:      Not tainted 4.8.4 #2
Oct 23 20:14:52 mbpc-pc kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Oct 23 20:14:52 mbpc-pc kernel: kworker/u16:3   D ffff880085ad7a18     0  9779      2 0x00000080
Oct 23 20:14:52 mbpc-pc kernel: Workqueue: tmr-fileio target_tmr_work [target_core_mod]
Oct 23 20:14:52 mbpc-pc kernel: ffff880085ad7a18 0000000000000001 ffff88009e194480 ffff880085ad7998
Oct 23 20:14:52 mbpc-pc kernel: ffff880085aaa680 ffffffff81f99ca0 ffffffff81f998ef ffff880000000000
Oct 23 20:14:52 mbpc-pc kernel: ffffffff812f27d9 ffff880100000000 0000000000000092 ffff880000000000
Oct 23 20:14:52 mbpc-pc kernel: Call Trace:
Oct 23 20:14:52 mbpc-pc kernel: [] ? number+0x2e9/0x310
Oct 23 20:14:52 mbpc-pc kernel: [] schedule+0x40/0xb0
Oct 23 20:14:52 mbpc-pc kernel: [] ? start_flush_work+0x49/0x180
Oct 23 20:14:52 mbpc-pc kernel: [] schedule_timeout+0x9c/0xe0
Oct 23 20:14:52 mbpc-pc kernel: [] ? flush_work+0x1a/0x40
Oct 23 20:14:52 mbpc-pc kernel: [] ? console_unlock+0x35c/0x380
Oct 23 20:14:52 mbpc-pc kernel: [] wait_for_completion+0xc0/0xf0
Oct 23 20:14:52 mbpc-pc kernel: [] ? try_to_wake_up+0x260/0x260
Oct 23 20:14:52 mbpc-pc kernel: [] __transport_wait_for_tasks+0xb4/0x1b0 [target_core_mod]
Oct 23 20:14:52 mbpc-pc kernel: [] ? vprintk_default+0x1f/0x30
Oct 23 20:14:52 mbpc-pc kernel: [] ? printk+0x46/0x48
Oct 23 20:14:52 mbpc-pc kernel: [] transport_wait_for_tasks+0x44/0x60 [target_core_mod]
Oct 23 20:14:52 mbpc-pc kernel: [] core_tmr_abort_task+0xf2/0x160 [target_core_mod]
Oct 23 20:14:52 mbpc-pc kernel: [] target_tmr_work+0x154/0x160 [target_core_mod]
Oct 23 20:14:52 mbpc-pc kernel: [] process_one_work+0x189/0x4e0
Oct 23 20:14:52 mbpc-pc kernel: [] ? schedule+0x40/0xb0
Oct 23 20:14:52 mbpc-pc kernel: [] worker_thread+0x16d/0x520
Oct 23 20:14:52 mbpc-pc kernel: [] ? __schedule+0x2fd/0x6a0
Oct 23 20:14:52 mbpc-pc kernel: [] ? default_wake_function+0x12/0x20
Oct 23 20:14:52 mbpc-pc kernel: [] ? __wake_up_common+0x56/0x90
Oct 23 20:14:52 mbpc-pc kernel: [] ? maybe_create_worker+0x110/0x110
Oct 23 20:14:52 mbpc-pc kernel: [] ? schedule+0x40/0xb0
Oct 23 20:14:52 mbpc-pc kernel: [] ? maybe_create_worker+0x110/0x110
Oct 23 20:14:52 mbpc-pc kernel: [] kthread+0xcc/0xf0
Oct 23 20:14:52 mbpc-pc kernel: [] ? schedule_tail+0x1e/0xc0
Oct 23 20:14:52 mbpc-pc kernel: [] ret_from_fork+0x1f/0x40
Oct 23 20:14:52 mbpc-pc kernel: [] ? kthread_freezable_should_stop+0x70/0x70
Oct 23 20:15:00 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e872:20: qlt_24xx_atio_pkt_all_vps: qla_target(0): type d ox_id 0000
Oct 23 20:15:00 mbpc-pc kernel: qla2xxx [0000:04:00.0]-e82e:20: IMMED_NOTIFY ATIO
Oct 23 20:15:00 mbpc-pc kernel: qla2xxx [0000:04:00.0]-f826:20: qla_target(0): Port ID: 0x00:00:01 ELS opcode: 0x03

So what do we do to fix it?  We try the following:

[root@mbpc-pc log]# cat /proc/sys/vm/dirty_ratio
20
[root@mbpc-pc log]# cat /proc/sys/vm/dirty_background_ratio
10
[root@mbpc-pc log]#

And we set them as follows, both in the current instance and permanently:

[root@mbpc-pc log]# sysctl -w vm.dirty_ratio=10
vm.dirty_ratio = 10
[root@mbpc-pc log]# sysctl -w vm.dirty_background_ratio=5
vm.dirty_background_ratio = 5
[root@mbpc-pc log]# vi /etc/sysctl.conf
[root@mbpc-pc log]# grep -iE "vm.dirty_background_ratio|vm.dirty_ratio" /etc/sysctl.conf
vm.dirty_background_ratio = 5
vm.dirty_ratio = 10
[root@mbpc-pc log]#

But that did not work.  We also tried setting the above to higher numbers but that did not work either.  Finally we noted that one of the disks in the RAID 6 wasn't performing as well and showed smartctl errors.  We replaced the disk:

Oct 23 22:18:42 mbpc-pc kernel: ABORT_TASK: Sending TMR_FUNCTION_COMPLETE for ref_tag: 1195032
Oct 23 22:18:42 mbpc-pc kernel: ABORT_TASK: Found referenced qla2xxx task_tag: 1122276
Oct 23 22:19:35 mbpc-pc kernel: hpet1: lost 9600 rtc interrupts
Jan  7 00:42:00 192.168.0.2 syslog: dhcpfwd : dhcp forwarder daemon successfully started
Oct 23 22:20:41 mbpc-pc kernel: hpet1: lost 9600 rtc interrupts
Oct 23 22:21:07 mbpc-pc kernel: INFO: task kworker/u16:8:308 blocked for more than 120 seconds.
Oct 23 22:21:07 mbpc-pc kernel:      Not tainted 4.8.4 #2
Oct 23 22:21:07 mbpc-pc kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Oct 23 22:21:07 mbpc-pc kernel: kworker/u16:8   D ffff880111b8fa18     0   308      2 0x00000000
Oct 23 22:21:07 mbpc-pc kernel: Workqueue: tmr-fileio target_tmr_work [target_core_mod]
Oct 23 22:21:07 mbpc-pc kernel: ffff880111b8fa18 0000000000000400 ffff880112180480 ffff880111b8f998
Oct 23 22:21:07 mbpc-pc kernel: ffff88011107a380 ffffffff81f99ca0 ffffffff81f998ef ffff880100000000
Oct 23 22:21:07 mbpc-pc kernel: ffffffff812f27d9 0000000000000000 ffffe8ffffcda000 ffff880000000000
Oct 23 22:21:07 mbpc-pc kernel: Call Trace:
Oct 23 22:21:07 mbpc-pc kernel: [] ? number+0x2e9/0x310
Oct 23 22:21:07 mbpc-pc kernel: [] schedule+0x40/0xb0
Oct 23 22:21:07 mbpc-pc kernel: [] ? start_flush_work+0x49/0x180
Oct 23 22:21:07 mbpc-pc kernel: [] schedule_timeout+0x9c/0xe0
Oct 23 22:21:07 mbpc-pc kernel: [] ? flush_work+0x1a/0x40
Oct 23 22:21:07 mbpc-pc kernel: [] ? console_unlock+0x35c/0x380
Oct 23 22:21:07 mbpc-pc kernel: [] wait_for_completion+0xc0/0xf0
Oct 23 22:21:07 mbpc-pc kernel: [] ? try_to_wake_up+0x260/0x260
Oct 23 22:21:07 mbpc-pc kernel: [] __transport_wait_for_tasks+0xb4/0x1b0 [target_core_mod]
Oct 23 22:21:07 mbpc-pc kernel: [] ? vprintk_default+0x1f/0x30
Oct 23 22:21:07 mbpc-pc kernel: [] ? printk+0x46/0x48
Oct 23 22:21:07 mbpc-pc kernel: [] transport_wait_for_tasks+0x44/0x60 [target_core_mod]
Oct 23 22:21:07 mbpc-pc kernel: [] core_tmr_abort_task+0xf2/0x160 [target_core_mod]
Oct 23 22:21:07 mbpc-pc kernel: [] target_tmr_work+0x154/0x160 [target_core_mod]
Oct 23 22:21:07 mbpc-pc kernel: [] process_one_work+0x189/0x4e0
Oct 23 22:21:07 mbpc-pc kernel: [] ? del_timer_sync+0x4c/0x60
Oct 23 22:21:07 mbpc-pc kernel: [] ? maybe_create_worker+0x8e/0x110
Oct 23 22:21:07 mbpc-pc kernel: [] ? schedule+0x40/0xb0
Oct 23 22:21:07 mbpc-pc kernel: [] worker_thread+0x16d/0x520
Oct 23 22:21:07 mbpc-pc kernel: [] ? default_wake_function+0x12/0x20
Oct 23 22:21:07 mbpc-pc kernel: [] ? __wake_up_common+0x56/0x90
Oct 23 22:21:07 mbpc-pc kernel: [] ? maybe_create_worker+0x110/0x110
Oct 23 22:21:07 mbpc-pc kernel: [] ? schedule+0x40/0xb0
Oct 23 22:21:07 mbpc-pc kernel: [] ? maybe_create_worker+0x110/0x110
Oct 23 22:21:07 mbpc-pc kernel: [] kthread+0xcc/0xf0
Oct 23 22:21:07 mbpc-pc kernel: [] ? schedule_tail+0x1e/0xc0
Oct 23 22:21:07 mbpc-pc kernel: [] ret_from_fork+0x1f/0x40
Oct 23 22:21:07 mbpc-pc kernel: [] ? kthread_freezable_should_stop+0x70/0x70
Oct 23 22:21:52 mbpc-pc kernel: hpet1: lost 9600 rtc interrupts
Jan  7 00:44:00 192.168.0.2 syslog: dhcpfwd : dhcp forwarder daemon successfully started
Oct 23 22:23:03 mbpc-pc kernel: hpet1: lost 9600 rtc interrupts
Oct 23 22:23:10 mbpc-pc kernel: INFO: task kworker/u16:8:308 blocked for more than 120 seconds.
Oct 23 22:23:10 mbpc-pc kernel:      Not tainted 4.8.4 #2
Oct 23 22:23:10 mbpc-pc kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Oct 23 22:23:10 mbpc-pc kernel: kworker/u16:8   D ffff880111b8fa18     0   308      2 0x00000000
Oct 23 22:23:10 mbpc-pc kernel: Workqueue: tmr-fileio target_tmr_work [target_core_mod]
Oct 23 22:23:10 mbpc-pc kernel: ffff880111b8fa18 0000000000000400 ffff880112180480 ffff880111b8f998
Oct 23 22:23:10 mbpc-pc kernel: ffff88011107a380 ffffffff81f99ca0 ffffffff81f998ef ffff880100000000
Oct 23 22:23:10 mbpc-pc kernel: ffffffff812f27d9 0000000000000000 ffffe8ffffcda000 ffff880000000000
Oct 23 22:23:10 mbpc-pc kernel: Call Trace:
Oct 23 22:23:10 mbpc-pc kernel: [] ? number+0x2e9/0x310
Oct 23 22:23:10 mbpc-pc kernel: [] schedule+0x40/0xb0
Oct 23 22:23:10 mbpc-pc kernel: [] ? start_flush_work+0x49/0x180
Oct 23 22:23:10 mbpc-pc kernel: [] schedule_timeout+0x9c/0xe0
Oct 23 22:23:10 mbpc-pc kernel: [] ? flush_work+0x1a/0x40
Oct 23 22:23:10 mbpc-pc kernel: [] ? console_unlock+0x35c/0x380
Oct 23 22:23:10 mbpc-pc kernel: [] wait_for_completion+0xc0/0xf0
Oct 23 22:23:10 mbpc-pc kernel: [] ? try_to_wake_up+0x260/0x260
Oct 23 22:23:10 mbpc-pc kernel: [] __transport_wait_for_tasks+0xb4/0x1b0 [target_core_mod]
Oct 23 22:23:10 mbpc-pc kernel: [] ? vprintk_default+0x1f/0x30
Oct 23 22:23:10 mbpc-pc kernel: [] ? printk+0x46/0x48
Oct 23 22:23:10 mbpc-pc kernel: [] transport_wait_for_tasks+0x44/0x60 [target_core_mod]
Oct 23 22:23:10 mbpc-pc kernel: [] core_tmr_abort_task+0xf2/0x160 [target_core_mod]
Oct 23 22:23:10 mbpc-pc kernel: [] target_tmr_work+0x154/0x160 [target_core_mod]
Oct 23 22:23:10 mbpc-pc kernel: [] process_one_work+0x189/0x4e0
Oct 23 22:23:10 mbpc-pc kernel: [] ? del_timer_sync+0x4c/0x60
Oct 23 22:23:10 mbpc-pc kernel: [] ? maybe_create_worker+0x8e/0x110
Oct 23 22:23:10 mbpc-pc kernel: [] ? schedule+0x40/0xb0
Oct 23 22:23:10 mbpc-pc kernel: [] worker_thread+0x16d/0x520
Oct 23 22:23:10 mbpc-pc kernel: [] ? default_wake_function+0x12/0x20
Oct 23 22:23:10 mbpc-pc kernel: [] ? __wake_up_common+0x56/0x90
Oct 23 22:23:10 mbpc-pc kernel: [] ? maybe_create_worker+0x110/0x110
Oct 23 22:23:10 mbpc-pc kernel: [] ? schedule+0x40/0xb0
Oct 23 22:23:10 mbpc-pc kernel: [] ? maybe_create_worker+0x110/0x110
Oct 23 22:23:10 mbpc-pc kernel: [] kthread+0xcc/0xf0
Oct 23 22:23:10 mbpc-pc kernel: [] ? schedule_tail+0x1e/0xc0
Oct 23 22:23:10 mbpc-pc kernel: [] ret_from_fork+0x1f/0x40
Oct 23 22:23:10 mbpc-pc kernel: [] ? kthread_freezable_should_stop+0x70/0x70
Oct 23 22:23:10 mbpc-pc kernel: INFO: task kworker/2:4:5893 blocked for more than 120 seconds.
Oct 23 22:23:10 mbpc-pc kernel:      Not tainted 4.8.4 #2
Oct 23 22:23:10 mbpc-pc kernel: "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Oct 23 22:23:10 mbpc-pc kernel: kworker/2:4     D ffff880091487968     0  5893      2 0x00000080
Oct 23 22:23:10 mbpc-pc kernel: Workqueue: events qlt_free_session_done [qla2xxx]

 

[root@mbpc-pc ~]# iostat -tkx 1
10/23/2016 10:18:40 PM
avg-cpu:  %user   %nice %system %iowait  %steal   %idle
           0.00    0.00    0.00   25.44    0.00   74.56

Device:         rrqm/s   wrqm/s     r/s     w/s    rkB/s    wkB/s avgrq-sz avgqu-sz   await  svctm  %util
sdb               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdc               0.00     0.00    1.00    0.00     4.00     0.00     8.00     0.01   14.00  14.00   1.40
sdd               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sde               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdf               0.00     0.00    0.00    0.00     0.00     0.00     0.00     4.00    0.00   0.00 100.10
sdg               0.00     0.00    0.00    2.00     0.00     8.00     8.00     0.00    0.00   0.00   0.00
sda               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdh               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdj               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdk               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdi               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
fd0               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-0              0.00     0.00    0.00    2.00     0.00     8.00     8.00     0.00    0.00   0.00   0.00
dm-1              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-2              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
md0               0.00     0.00    1.00    0.00     4.00     0.00     8.00     0.00    0.00   0.00   0.00
dm-3              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-4              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-5              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-6              0.00     0.00    1.00    0.00     4.00     0.00     8.00     2.01   14.00 1000.00 100.00

10/23/2016 10:18:39 PM
avg-cpu:  %user   %nice %system %iowait  %steal   %idle
           0.25    0.00    0.25   25.19    0.00   74.31

Device:         rrqm/s   wrqm/s     r/s     w/s    rkB/s    wkB/s avgrq-sz avgqu-sz   await  svctm  %util
sdb               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdc               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdd               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sde               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdf               0.00     0.00    0.00    0.00     0.00     0.00     0.00     4.00    0.00   0.00 100.00
sdg               0.00     4.00    0.00    3.00     0.00    24.00    16.00     0.04   12.33  12.33   3.70
sda               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdh               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdj               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdk               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
sdi               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
fd0               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-0              0.00     0.00    0.00    6.00     0.00    24.00     8.00     0.04    6.17   6.17   3.70
dm-1              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-2              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
md0               0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-3              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-4              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-5              0.00     0.00    0.00    0.00     0.00     0.00     0.00     0.00    0.00   0.00   0.00
dm-6              0.00     0.00    0.00    0.00     0.00     0.00     0.00     2.00    0.00   0.00 100.00


[root@mbpc-pc ~]# smartctl -A /dev/sdf
smartctl 5.43 2012-06-30 r3573 [x86_64-linux-4.8.4] (local build)
Copyright (C) 2002-12 by Bruce Allen, http://smartmontools.sourceforge.net

Error SMART Values Read failed: scsi error badly formed scsi parameters
Smartctl: SMART Read Values failed.

=== START OF READ SMART DATA SECTION ===
[root@mbpc-pc ~]#


[root@mbpc-pc ~]# hdparm -i /dev/sdf

/dev/sdf:

 Model=ST2000DM001-1CH164, FwRev=CC26, SerialNo=Z240QTB6
 Config={ HardSect NotMFM HdSw>15uSec Fixed DTR>10Mbs RotSpdTol>.5% }
 RawCHS=16383/16/63, TrkSize=0, SectSize=0, ECCbytes=4
 BuffType=unknown, BuffSize=unknown, MaxMultSect=16, MultSect=off
 CurCHS=16383/16/63, CurSects=16514064, LBA=yes, LBAsects=3907029168
 IORDY=on/off, tPIO={min:120,w/IORDY:120}, tDMA={min:120,rec:120}
 PIO modes:  pio0 pio1 pio2 pio3 pio4
 DMA modes:  mdma0 mdma1 mdma2
 UDMA modes: udma0 udma1 udma2 udma3 udma4 udma5 *udma6
 AdvancedPM=yes: unknown setting WriteCache=disabled
 Drive conforms to: unknown:  ATA/ATAPI-4,5,6,7

 * signifies the current active mode

[root@mbpc-pc ~]#

 

[root@mbpc-pc ~]# hdparm -i /dev/sdf

/dev/sdf:

 Model=ST2000DM001-1CH164, FwRev=CC26, SerialNo=Z240QTB6
 Config={ HardSect NotMFM HdSw>15uSec Fixed DTR>10Mbs RotSpdTol>.5% }
 RawCHS=16383/16/63, TrkSize=0, SectSize=0, ECCbytes=4
 BuffType=unknown, BuffSize=unknown, MaxMultSect=16, MultSect=off
 CurCHS=16383/16/63, CurSects=16514064, LBA=yes, LBAsects=3907029168
 IORDY=on/off, tPIO={min:120,w/IORDY:120}, tDMA={min:120,rec:120}
 PIO modes:  pio0 pio1 pio2 pio3 pio4
 DMA modes:  mdma0 mdma1 mdma2
 UDMA modes: udma0 udma1 udma2 udma3 udma4 udma5 *udma6
 AdvancedPM=yes: unknown setting WriteCache=disabled
 Drive conforms to: unknown:  ATA/ATAPI-4,5,6,7

 * signifies the current active mode

[root@mbpc-pc ~]#


[root@mbpc-pc ~]# smartctl -A /dev/sdf
smartctl 5.43 2012-06-30 r3573 [x86_64-linux-4.8.4] (local build)
Copyright (C) 2002-12 by Bruce Allen, http://smartmontools.sourceforge.net

Error SMART Values Read failed: scsi error badly formed scsi parameters
Smartctl: SMART Read Values failed.

=== START OF READ SMART DATA SECTION ===
[root@mbpc-pc ~]# cat /proc/mdstat
Personalities : [raid6] [raid5] [raid4]
md0 : active raid6 sdb[7] sdf[6] sdd[3] sda[5] sdc[1] sde[8]
      3907045632 blocks super 1.2 level 6, 64k chunk, algorithm 2 [6/6] [UUUUUU]
      bitmap: 1/8 pages [4KB], 65536KB chunk

unused devices: <none>
[root@mbpc-pc ~]#

 


Disk SDF was going nuts.  I'm surprised how the RAID 6 did not care yet the kworker via QLA2XXX cared:

[root@mbpc-pc ~]# cat /proc/mdstat
Personalities : [raid6] [raid5] [raid4]
md0 : active raid6 sdb[7] sdf[6] sdd[3] sda[5] sdc[1] sde[8]
      3907045632 blocks super 1.2 level 6, 64k chunk, algorithm 2 [6/6] [UUUUUU]
      bitmap: 0/8 pages [0KB], 65536KB chunk

unused devices: <none>
[root@mbpc-pc ~]#

You can also set this as follows to try and see if it will help in the interim while you go shopping for a new disk:

echo 360 > /proc/sys/kernel/hung_task_timeout_secs;

Cheers,
Tom

Leave a Reply

You must be logged in to post a comment.


     
  Copyright © 2003 - 2013 Tom Kacperski (microdevsys.com). All rights reserved.

Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 Unported License