* kernel crash with lite-on sata controller Dell XPS 13 9343
@ 2016-02-25 21:45 Carolin Latze
2016-02-25 21:51 ` Tejun Heo
0 siblings, 1 reply; 6+ messages in thread
From: Carolin Latze @ 2016-02-25 21:45 UTC
To: hdegoede, tj; +Cc: linux-ide
Hi all
I have been asked to post here based on the bug I filed at Ubuntu:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1516215
I have the newest kernel running:
cat /proc/version
Linux version 4.5.0-040500rc5-generic (kernel@tangerine) (gcc version
5.2.1 20151010 (Ubuntu 5.2.1-22ubuntu2) ) #201602201730 SMP Sat Feb 20
22:32:16 UTC 2016
Since the beginning (I started with 3.19.x kernels on this notebook) I have
had problems with this SATA controller:
03:00.0 SATA controller: Lite-On Technology Corporation Device 0224 (rev 10)
Very often the boot just fails with error messages like
ata1.00: exception... (I was only able to capture them by taking a
picture: https://launchpadlibrarian.net/227876135/IMG_20151128_230313.jpg)
I then keep trying, sometimes booting into Windows in between, and
eventually the kernel comes up.
Once the system is running, putting a lot of load on the disk (e.g. running
dump or a simple update via apt-get upgrade) makes the system lose its disk,
and I have to do a hard reset.
The machine runs perfectly fine under Windows, so I don't believe it is
a hardware issue.
I hope there is a solution to this issue.
best regards
Carolin
* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
2016-02-25 21:45 kernel crash with lite-on sata controller Dell XPS 13 9343 Carolin Latze
@ 2016-02-25 21:51 ` Tejun Heo
2016-02-25 21:53 ` Carolin Latze
0 siblings, 1 reply; 6+ messages in thread
From: Tejun Heo @ 2016-02-25 21:51 UTC
To: Carolin Latze; +Cc: hdegoede, linux-ide
Hello, Carolin.
On Thu, Feb 25, 2016 at 10:45:10PM +0100, Carolin Latze wrote:
...
> Since the beginning (started with 3.19.x kernels on this notebook) I had
> problems with this sata controller:
>
> 03:00.0 SATA controller: Lite-On Technology Corporation Device 0224 (rev 10)
>
> Very often during boot, the boot just failed with error messages like
>
> ata1.00: exception... (I was only able to capture them by taking a picture:
> https://launchpadlibrarian.net/227876135/IMG_20151128_230313.jpg)
>
> Then I keep trying, sometimes I boot into Windows in between, and eventually,
> the kernel comes up.
>
> Then, when running and putting a lot of load on the disk (e.g. running dump
> or a simple update via apt-get upgrade), the system loses its disk and I
> have to hard reset.
>
> The machine runs perfectly fine under Windows, so I don't believe it is a
> hardware issue.
>
> I hope there is a solution to this issue.
Can you please boot with libata.force=noncq and if that fails post the
kernel messages of the failure?
Thanks.
--
tejun
* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
2016-02-25 21:51 ` Tejun Heo
@ 2016-02-25 21:53 ` Carolin Latze
2016-02-25 22:01 ` Tejun Heo
0 siblings, 1 reply; 6+ messages in thread
From: Carolin Latze @ 2016-02-25 21:53 UTC
To: Tejun Heo; +Cc: hdegoede, linux-ide
Hi Tejun
I actually boot with "libata.force=noncq,1.5G". Sorry, forgot to mention
that. Same behavior.
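For readers who want to confirm which libata.force settings a running kernel was actually booted with, here is a minimal Python sketch. It assumes the comma-separated "[ID:]VAL" format described in the kernel parameter documentation; parse_libata_force is a hypothetical helper name, not part of any kernel tooling, and the sample command line is made up for illustration.

```python
def parse_libata_force(cmdline):
    """Return (id, value) pairs found in libata.force= on a kernel command line.

    id is None when an entry has no "ID:" prefix and so applies globally.
    """
    prefix = "libata.force="
    entries = []
    for token in cmdline.split():
        if not token.startswith(prefix):
            continue
        for item in token[len(prefix):].split(","):
            if not item:
                continue  # tolerate stray commas
            ident, sep, value = item.partition(":")
            if sep:
                entries.append((ident, value))
            else:
                entries.append((None, item))
    return entries


# Hypothetical command line; on a live system, pass the contents of
# /proc/cmdline instead.
sample = "BOOT_IMAGE=/vmlinuz ro quiet libata.force=noncq,1.5G"
print(parse_libata_force(sample))
```

The sketch only reports what was passed on the command line. It does not show whether libata accepted or applied the settings.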
best regards
Carolin
On 25.02.2016 22:51, Tejun Heo wrote:
> Hello, Carolin.
>
> On Thu, Feb 25, 2016 at 10:45:10PM +0100, Carolin Latze wrote:
> ...
>> Since the beginning (started with 3.19.x kernels on this notebook) I had
>> problems with this sata controller:
>>
>> 03:00.0 SATA controller: Lite-On Technology Corporation Device 0224 (rev 10)
>>
>> Very often during boot, the boot just failed with error messages like
>>
>> ata1.00: exception... (I was only able to capture them by taking a picture:
>> https://launchpadlibrarian.net/227876135/IMG_20151128_230313.jpg)
>>
>> Then I keep trying, sometimes I boot into Windows in between, and eventually,
>> the kernel comes up.
>>
>> Then, when running and putting a lot of load on the disk (e.g. running dump
>> or a simple update via apt-get upgrade), the system loses its disk and I
>> have to hard reset.
>>
>> The machine runs perfectly fine under Windows, so I don't believe it is a
>> hardware issue.
>>
>> I hope there is a solution to this issue.
> Can you please boot with libata.force=noncq and if that fails post the
> kernel messages of the failure?
>
> Thanks.
>
* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
2016-02-25 21:53 ` Carolin Latze
@ 2016-02-25 22:01 ` Tejun Heo
[not found] ` <56D6A86C.9070007@gmail.com>
0 siblings, 1 reply; 6+ messages in thread
From: Tejun Heo @ 2016-02-25 22:01 UTC
To: Carolin Latze; +Cc: Hans De Goede, linux-ide
Hello, Carolin.
On Thu, Feb 25, 2016 at 4:53 PM, Carolin Latze <carolin.latze@gmail.com> wrote:
> Hi Tejun
>
> I actually boot with "libata.force=noncq,1.5G". Sorry, forgot to mention
> that. Same behavior.
Can you please post screen capture from that failure?
Thanks.
--
tejun
* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
[not found] ` <56D6A86C.9070007@gmail.com>
@ 2016-03-02 15:25 ` Tejun Heo
2016-03-14 16:02 ` Carolin Latze
0 siblings, 1 reply; 6+ messages in thread
From: Tejun Heo @ 2016-03-02 15:25 UTC
To: Carolin Latze; +Cc: Hans De Goede, linux-ide
Hello, Carolin.
On Wed, Mar 02, 2016 at 09:46:36AM +0100, Carolin Latze wrote:
> please see attached screen capture. If there is any better way to capture
> those logs than taking pictures, please let me know
Heh, so there have been quite a few cases where a device locks up after
a lot of NCQ commands are issued and/or while data is being transferred
at a high rate, but a device locking up on plain WRITE_DMA_EXT is pretty
new. Hmmm... Does the following patch make any difference?
diff --git a/drivers/ata/ahci.h b/drivers/ata/ahci.h
index 45586c1..95c4d5c 100644
--- a/drivers/ata/ahci.h
+++ b/drivers/ata/ahci.h
@@ -366,8 +366,8 @@ extern struct device_attribute *ahci_sdev_attrs[];
 #define AHCI_SHT(drv_name)                          \
         ATA_NCQ_SHT(drv_name),                      \
         .can_queue      = AHCI_MAX_CMDS - 1,        \
-        .sg_tablesize   = AHCI_MAX_SG,              \
-        .dma_boundary   = AHCI_DMA_BOUNDARY,        \
+        .sg_tablesize   = LIBATA_MAX_PRD,           \
+        .dma_boundary   = ATA_DMA_BOUNDARY          \
         .shost_attrs    = ahci_shost_attrs,         \
         .sdev_attrs     = ahci_sdev_attrs
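To illustrate what the changed dma_boundary field constrains, here is a minimal Python sketch (not kernel code) of how a DMA boundary mask splits one buffer into scatter/gather segments that never cross a boundary. The constant values are assumptions recalled from kernel headers of that era and should be checked against the tree being patched. split_on_boundary is a hypothetical helper, and the buffer address and length are made up.

```python
# Assumed values, recalled from 4.5-era kernel headers; verify in your tree.
AHCI_DMA_BOUNDARY = 0xFFFFFFFF  # boundary mask before the patch
ATA_DMA_BOUNDARY = 0xFFFF       # boundary mask the patch switches to
AHCI_MAX_SG = 168               # sg_tablesize before the patch
LIBATA_MAX_PRD = 128            # sg_tablesize the patch switches to


def split_on_boundary(addr, length, boundary_mask):
    """Split [addr, addr + length) into (start, size) segments.

    No segment crosses an address that is a multiple of boundary_mask + 1.
    """
    segments = []
    while length > 0:
        # Bytes remaining before the next boundary is reached.
        room = (boundary_mask + 1) - (addr & boundary_mask)
        chunk = min(length, room)
        segments.append((addr, chunk))
        addr += chunk
        length -= chunk
    return segments


# Hypothetical 12 KiB buffer that straddles a 64 KiB boundary.
for name, mask in (("AHCI_DMA_BOUNDARY", AHCI_DMA_BOUNDARY),
                   ("ATA_DMA_BOUNDARY", ATA_DMA_BOUNDARY)):
    segs = split_on_boundary(0xF000, 0x3000, mask)
    print(name, [(hex(a), hex(n)) for a, n in segs])
```

With the smaller mask the same buffer needs more, shorter segments, and the smaller sg_tablesize caps how many segments one command may carry. Whether that is what avoids the lockup on this controller is not established in the thread.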
* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
2016-03-02 15:25 ` Tejun Heo
@ 2016-03-14 16:02 ` Carolin Latze
0 siblings, 0 replies; 6+ messages in thread
From: Carolin Latze @ 2016-03-14 16:02 UTC
To: Tejun Heo; +Cc: Hans De Goede, linux-ide
Hi Tejun
Just a quick update from my side: I patched the rc5 kernel yesterday
(I am aware that there is a newer one, but rc5 was current when you sent
the patch, so I wanted to be on the safe side). There is a small typo
on the "dma_boundary" line: the trailing comma is missing. Other than
that, the patching went smoothly. Since then I have rebooted whenever I
could, 15-20 times so far, with no issues. I will keep rebooting
frequently over the next few days and keep you posted.
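As a rough way to judge how much evidence a streak of clean reboots provides, here is a small Python sketch. It assumes each boot fails independently with the same probability, which may not hold for this hardware. The thread gives no measured failure rate, so the value used below is purely hypothetical, as are the helper names.

```python
import math


def clean_streak_probability(p_fail, boots):
    """Chance of seeing `boots` clean boots in a row if the bug is still present."""
    return (1.0 - p_fail) ** boots


def boots_needed(p_fail, alpha):
    """Smallest streak length whose chance under the old failure rate is <= alpha."""
    return math.ceil(math.log(alpha) / math.log(1.0 - p_fail))


p = 0.25  # hypothetical per-boot failure rate before the patch
print(clean_streak_probability(p, 15))
print(boots_needed(p, 0.01))
```

The lower the original failure rate, the more clean reboots are needed before a clean streak says much about the patch.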
best regards
Carolin
On 02.03.2016 16:25, Tejun Heo wrote:
> Hello, Carolin.
>
> On Wed, Mar 02, 2016 at 09:46:36AM +0100, Carolin Latze wrote:
>> please see attached screen capture. If there is any better way to capture
>> those logs than taking pictures, please let me know
> Heh, so there have been quite a few cases where device locks up after
> a lot of NCQ commands are issued and/or data transfer is taking place
> at high rate but device locking up on plain WRITE_DMA_EXT is pretty
> new. Hmmm.... Does the following patch make any difference?
>
> diff --git a/drivers/ata/ahci.h b/drivers/ata/ahci.h
> index 45586c1..95c4d5c 100644
> --- a/drivers/ata/ahci.h
> +++ b/drivers/ata/ahci.h
> @@ -366,8 +366,8 @@ extern struct device_attribute *ahci_sdev_attrs[];
> #define AHCI_SHT(drv_name) \
> ATA_NCQ_SHT(drv_name), \
> .can_queue = AHCI_MAX_CMDS - 1, \
> - .sg_tablesize = AHCI_MAX_SG, \
> - .dma_boundary = AHCI_DMA_BOUNDARY, \
> + .sg_tablesize = LIBATA_MAX_PRD, \
> + .dma_boundary = ATA_DMA_BOUNDARY \
> .shost_attrs = ahci_shost_attrs, \
> .sdev_attrs = ahci_sdev_attrs
>