linux-ide.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* kernel crash with lite-on sata controller Dell XPS 13 9343
@ 2016-02-25 21:45 Carolin Latze
  2016-02-25 21:51 ` Tejun Heo
  0 siblings, 1 reply; 6+ messages in thread
From: Carolin Latze @ 2016-02-25 21:45 UTC (permalink / raw)
  To: hdegoede, tj; +Cc: linux-ide

Hi all

I have been asked to post here based on the bug I filed at Ubuntu:

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1516215

I have the newest kernel running:

cat /proc/version
Linux version 4.5.0-040500rc5-generic (kernel@tangerine) (gcc version 
5.2.1 20151010 (Ubuntu 5.2.1-22ubuntu2) ) #201602201730 SMP Sat Feb 20 
22:32:16 UTC 2016


Since the beginning (started with 3.19.x kernels on this notebook) I had 
problems with this sata controller:

03:00.0 SATA controller: Lite-On Technology Corporation Device 0224 (rev 10)

Very often during boot, the boot just failed with error messages like

ata1.00: exception... (I was only able to capture them by taking a 
picture: https://launchpadlibrarian.net/227876135/IMG_20151128_230313.jpg)

Then I keep trying, sometimes I boot into Windows inbetween, and 
eventually, the kernel comes up.

Then, when running and putting a lot of load on the disk (e.g. running 
dump or a simple update via apt-get upgrade), the system looses its disk 
and I have to hard reset.

The machine runs perfectly fine under Windows, so I don't believe it is 
a hardware issue.

I hope there is a solution to this issue.

best regards
Carolin

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
  2016-02-25 21:45 kernel crash with lite-on sata controller Dell XPS 13 9343 Carolin Latze
@ 2016-02-25 21:51 ` Tejun Heo
  2016-02-25 21:53   ` Carolin Latze
  0 siblings, 1 reply; 6+ messages in thread
From: Tejun Heo @ 2016-02-25 21:51 UTC (permalink / raw)
  To: Carolin Latze; +Cc: hdegoede, linux-ide

Hello, Carolin.

On Thu, Feb 25, 2016 at 10:45:10PM +0100, Carolin Latze wrote:
...
> Since the beginning (started with 3.19.x kernels on this notebook) I had
> problems with this sata controller:
> 
> 03:00.0 SATA controller: Lite-On Technology Corporation Device 0224 (rev 10)
> 
> Very often during boot, the boot just failed with error messages like
> 
> ata1.00: exception... (I was only able to capture them by taking a picture:
> https://launchpadlibrarian.net/227876135/IMG_20151128_230313.jpg)
> 
> Then I keep trying, sometimes I boot into Windows inbetween, and eventually,
> the kernel comes up.
> 
> Then, when running and putting a lot of load on the disk (e.g. running dump
> or a simple update via apt-get upgrade), the system looses its disk and I
> have to hard reset.
> 
> The machine runs perfectly fine under Windows, so I don't believe it is a
> hardware issue.
> 
> I hope there is a solution to this issue.

Can you please boot with libata.force=noncq and if that fails post the
kernel messages of the failure?

Thanks.

-- 
tejun

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
  2016-02-25 21:51 ` Tejun Heo
@ 2016-02-25 21:53   ` Carolin Latze
  2016-02-25 22:01     ` Tejun Heo
  0 siblings, 1 reply; 6+ messages in thread
From: Carolin Latze @ 2016-02-25 21:53 UTC (permalink / raw)
  To: Tejun Heo; +Cc: hdegoede, linux-ide

Hi Tejun

I actually boot with "libata.force=noncq,1.5G". Sorry, forgot to mention 
that. Same behavior.

best regards
Carolin

On 25.02.2016 22:51, Tejun Heo wrote:
> Hello, Carolin.
>
> On Thu, Feb 25, 2016 at 10:45:10PM +0100, Carolin Latze wrote:
> ...
>> Since the beginning (started with 3.19.x kernels on this notebook) I had
>> problems with this sata controller:
>>
>> 03:00.0 SATA controller: Lite-On Technology Corporation Device 0224 (rev 10)
>>
>> Very often during boot, the boot just failed with error messages like
>>
>> ata1.00: exception... (I was only able to capture them by taking a picture:
>> https://launchpadlibrarian.net/227876135/IMG_20151128_230313.jpg)
>>
>> Then I keep trying, sometimes I boot into Windows inbetween, and eventually,
>> the kernel comes up.
>>
>> Then, when running and putting a lot of load on the disk (e.g. running dump
>> or a simple update via apt-get upgrade), the system looses its disk and I
>> have to hard reset.
>>
>> The machine runs perfectly fine under Windows, so I don't believe it is a
>> hardware issue.
>>
>> I hope there is a solution to this issue.
> Can you please boot with libata.force=noncq and if that fails post the
> kernel messages of the failure?
>
> Thanks.
>


^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
  2016-02-25 21:53   ` Carolin Latze
@ 2016-02-25 22:01     ` Tejun Heo
       [not found]       ` <56D6A86C.9070007@gmail.com>
  0 siblings, 1 reply; 6+ messages in thread
From: Tejun Heo @ 2016-02-25 22:01 UTC (permalink / raw)
  To: Carolin Latze; +Cc: Hans De Goede, linux-ide

Hello, Carolin.

On Thu, Feb 25, 2016 at 4:53 PM, Carolin Latze <carolin.latze@gmail.com> wrote:
> Hi Tejun
>
> I actually boot with "libata.force=noncq,1.5G". Sorry, forgot to mention
> that. Same behavior.

Can you please post screen capture from that failure?

Thanks.

-- 
tejun

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
       [not found]       ` <56D6A86C.9070007@gmail.com>
@ 2016-03-02 15:25         ` Tejun Heo
  2016-03-14 16:02           ` Carolin Latze
  0 siblings, 1 reply; 6+ messages in thread
From: Tejun Heo @ 2016-03-02 15:25 UTC (permalink / raw)
  To: Carolin Latze; +Cc: Hans De Goede, linux-ide

Hello, Carolin.

On Wed, Mar 02, 2016 at 09:46:36AM +0100, Carolin Latze wrote:
> please see attached screen capture. If there is any better way to capture
> those logs than taking pictures, please let me know

Heh, so there have been quite a few cases where device locks up after
a lot of NCQ commands are issued and/or data transfer is taking place
at high rate but device locking up on plain WRITE_DMA_EXT is pretty
new.  Hmmm.... Does the following patch make any difference?

diff --git a/drivers/ata/ahci.h b/drivers/ata/ahci.h
index 45586c1..95c4d5c 100644
--- a/drivers/ata/ahci.h
+++ b/drivers/ata/ahci.h
@@ -366,8 +366,8 @@ extern struct device_attribute *ahci_sdev_attrs[];
 #define AHCI_SHT(drv_name)						\
 	ATA_NCQ_SHT(drv_name),						\
 	.can_queue		= AHCI_MAX_CMDS - 1,			\
-	.sg_tablesize		= AHCI_MAX_SG,				\
-	.dma_boundary		= AHCI_DMA_BOUNDARY,			\
+	.sg_tablesize		= LIBATA_MAX_PRD,			\
+	.dma_boundary		= ATA_DMA_BOUNDARY			\
 	.shost_attrs		= ahci_shost_attrs,			\
 	.sdev_attrs		= ahci_sdev_attrs
 

^ permalink raw reply related	[flat|nested] 6+ messages in thread

* Re: kernel crash with lite-on sata controller Dell XPS 13 9343
  2016-03-02 15:25         ` Tejun Heo
@ 2016-03-14 16:02           ` Carolin Latze
  0 siblings, 0 replies; 6+ messages in thread
From: Carolin Latze @ 2016-03-14 16:02 UTC (permalink / raw)
  To: Tejun Heo; +Cc: Hans De Goede, linux-ide

Hi Tejun

just a quick update from my side: I did patch the rc5 kernel yesterday 
(I am aware that there is a newer one but when you sent the patch, rc5 
was recent so I wanted to be on the safe side). There is a little typo 
on the "dma_boundary" line, the trailing comma is missing. But other 
than that, the patching went smoothly. Since then I reboot whenever I 
can. So far 15-20 times and no issue up until now. I will keep rebooting 
very often for the next days and keep you posted.

best regards
Carolin

On 02.03.2016 16:25, Tejun Heo wrote:
> Hello, Carolin.
>
> On Wed, Mar 02, 2016 at 09:46:36AM +0100, Carolin Latze wrote:
>> please see attached screen capture. If there is any better way to capture
>> those logs than taking pictures, please let me know
> Heh, so there have been quite a few cases where device locks up after
> a lot of NCQ commands are issued and/or data transfer is taking place
> at high rate but device locking up on plain WRITE_DMA_EXT is pretty
> new.  Hmmm.... Does the following patch make any difference?
>
> diff --git a/drivers/ata/ahci.h b/drivers/ata/ahci.h
> index 45586c1..95c4d5c 100644
> --- a/drivers/ata/ahci.h
> +++ b/drivers/ata/ahci.h
> @@ -366,8 +366,8 @@ extern struct device_attribute *ahci_sdev_attrs[];
>   #define AHCI_SHT(drv_name)						\
>   	ATA_NCQ_SHT(drv_name),						\
>   	.can_queue		= AHCI_MAX_CMDS - 1,			\
> -	.sg_tablesize		= AHCI_MAX_SG,				\
> -	.dma_boundary		= AHCI_DMA_BOUNDARY,			\
> +	.sg_tablesize		= LIBATA_MAX_PRD,			\
> +	.dma_boundary		= ATA_DMA_BOUNDARY			\
>   	.shost_attrs		= ahci_shost_attrs,			\
>   	.sdev_attrs		= ahci_sdev_attrs
>   


^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2016-03-14 16:02 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-02-25 21:45 kernel crash with lite-on sata controller Dell XPS 13 9343 Carolin Latze
2016-02-25 21:51 ` Tejun Heo
2016-02-25 21:53   ` Carolin Latze
2016-02-25 22:01     ` Tejun Heo
     [not found]       ` <56D6A86C.9070007@gmail.com>
2016-03-02 15:25         ` Tejun Heo
2016-03-14 16:02           ` Carolin Latze

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).