From: Michael T. Halligan (michael_at_halligan.org)
Date: Tue Nov 06 2007 - 04:21:23 CET
Message-Id: <AF891927-A9A9-4887-A153-866E7EF45799@halligan.org>
From: "Michael T. Halligan" <michael@halligan.org>
Date: Mon, 5 Nov 2007 19:21:23 -0800
Subject: [suse-sles-e] Re: BOUNTY Re: Nvidia SATA completely broken in SLES10 SP1 (HP DL145 G2 servers)

Mostly solved. Apparently the problem lies in the way that Western
Digital implemented, or half-implemented, NCQ.
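For anyone wanting to check whether NCQ is in play on a similar box, here is a minimal sketch. It assumes the kernel's libata exposes a per-disk queue depth at /sys/block/sdX/device/queue_depth, which may or may not be the case on the SLES10 SP1 kernel; it is not necessarily how the problem above was diagnosed. The function names and the sys_block parameter are made up for illustration.

```python
from pathlib import Path


def read_queue_depths(sys_block="/sys/block"):
    """Return {device: queue_depth} for every sd* disk exposing a queue_depth file.

    Assumption: the running kernel publishes the attribute under
    <sys_block>/<disk>/device/queue_depth. Disks without it are skipped.
    """
    depths = {}
    for dev in sorted(Path(sys_block).glob("sd*")):
        qd_file = dev / "device" / "queue_depth"
        try:
            depths[dev.name] = int(qd_file.read_text().strip())
        except (OSError, ValueError):
            continue  # attribute missing or unreadable on this disk
    return depths


def ncq_active(depth):
    # A depth of 1 means one command in flight at a time, so queuing is effectively off.
    return depth > 1


if __name__ == "__main__":
    for name, depth in read_queue_depths().items():
        state = "on" if ncq_active(depth) else "off"
        print(f"{name}: queue_depth={depth} queuing={state}")
```

On kernels that support it, writing 1 to the same file as root is the commonly documented way to turn queuing off for a single disk, which can help confirm or rule out NCQ as the trigger.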
On Nov 5, 2007, at 3:49 PM, Michael T. Halligan wrote:
> I will pay $250 to somebody who can conclusively solve this problem.
>
>
> On Nov 5, 2007, at 3:42 PM, Michael T. Halligan wrote:
>
>> Also, I have tried upgrading to the latest kernel, 2.6.16.53-0.16-smp,
>> and the problem still exists. This happens with both the SMP and
>> default kernels.
>>
>> On Nov 5, 2007, at 1:12 PM, Michael T. Halligan wrote:
>>
>>> SLES10 SP1 seems to have a bug in its SATA drivers. There's no
>>> way this is hardware, because when I downgrade the affected
>>> servers to SLES10, the problem goes away.
>>> This only affects SLES10 SP1, and this affects all DL145 G2s
>>> running SLES10 SP1.
>>>
>>> The symptoms are that the servers will hang for about 30 seconds
>>> every 10-20 minutes and spew out these errors to dmesg:
>>>
>>>
>>>
>>>
>>> sdb: Write Protect is off
>>> sdb: Mode Sense: 00 3a 00 00
>>> SCSI device sdb: drive cache: write back
>>> ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
>>> ata2.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0 cdb 0x0 data 0
>>> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
>>> ata2: soft resetting port
>>> ata2: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
>>> ata2.00: configured for UDMA/133
>>> ata2: EH complete
>>> SCSI device sdb: 976773168 512-byte hdwr sectors (500108 MB)
>>> sdb: Write Protect is off
>>> sdb: Mode Sense: 00 3a 00 00
>>> SCSI device sdb: drive cache: write back
>>> ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
>>> ata1.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0 cdb 0x0 data 0
>>> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
>>> ata1: soft resetting port
>>> ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
>>> ata1.00: configured for UDMA/133
>>> ata1: EH complete
>>> SCSI device sda: 976773168 512-byte hdwr sectors (500108 MB)
>>> sda: Write Protect is off
>>> sda: Mode Sense: 00 3a 00 00
>>> SCSI device sda: drive cache: write back
>>> ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
>>> ata1.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0 cdb 0x0 data 0
>>> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
>>> ata1: soft resetting port
>>> ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
>>> ata1.00: configured for UDMA/133
>>> ata1: EH complete
>>> SCSI device sda: 976773168 512-byte hdwr sectors (500108 MB)
>>> sda: Write Protect is off
>>> sda: Mode Sense: 00 3a 00 00
>>> SCSI device sda: drive cache: write back
>>> ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
>>> ata1.00: cmd ea/00:00:00:00:00/00:00:00:00:00/a0 tag 0 cdb 0x0 data 0
>>> res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
>>> ata1: soft resetting port
>>> ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
>>> ata1.00: configured for UDMA/133
>>> ata1: EH complete
>>> SCSI device sda: 976773168 512-byte hdwr sectors (500108 MB)
>>> sda: Write Protect is off
>>> sda: Mode Sense: 00 3a 00 00
>>> SCSI device sda: drive cache: write back
>>>
>>
>
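To get a feel for how often each port hits the timeout described in the quoted report, the dmesg output can be tallied per ATA port. This is a rough sketch only: the regular expression is written against the "exception Emask" lines quoted above, the function name is made up, and the sample text is a shortened copy of that log rather than new data.

```python
import re
from collections import Counter

# Matches lines such as:
#   ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
# and captures the port name ("ata2").
EXCEPTION_RE = re.compile(r"\b(ata\d+)\.\d+: exception Emask")


def count_exceptions(log_text):
    """Count libata exception events per port in a dmesg excerpt."""
    return Counter(m.group(1) for m in EXCEPTION_RE.finditer(log_text))


# Shortened excerpt of the log quoted in this thread.
SAMPLE = """\
ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
ata2: soft resetting port
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
ata1: EH complete
ata1.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x2 frozen
"""

if __name__ == "__main__":
    for port, n in sorted(count_exceptions(SAMPLE).items()):
        print(f"{port}: {n} exception(s)")
```

Feeding it the full output of dmesg instead of SAMPLE would show whether one port or drive is affected more than the others.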