linux-kernel.vger.kernel.org archive mirror
* [regression] linux318, linux41 - kernel stack is corrupted
@ 2016-04-22  6:17 Philip Müller
  2016-04-22  6:19 ` Philip Müller
  2016-04-22  6:46 ` Greg Kroah-Hartman
  0 siblings, 2 replies; 9+ messages in thread
From: Philip Müller @ 2016-04-22  6:17 UTC (permalink / raw)
  To: Greg Kroah-Hartman, Sasha Levin
  Cc: greg, manjaro-dev, Linux-Kernel@Vger. Kernel. Org

Hi Greg, hi Sasha,

It seems I found another regression in the latest point releases of the
3.18 and 4.1 kernel series. We have tested on AMD and Intel CPUs so far,
and both hit the same regression. Other kernels released on that day are
not affected. Do you have a clue what might have been missed here?

3.18.30 and 4.1.21 didn't have that issue on the same hardware.

kind regards
Philip Müller
---------------------------
Manjaro Project Lead

* Re: [regression] linux318, linux41 - kernel stack is corrupted
  2016-04-22  6:17 [regression] linux318, linux41 - kernel stack is corrupted Philip Müller
@ 2016-04-22  6:19 ` Philip Müller
  2016-04-22  6:46 ` Greg Kroah-Hartman
  1 sibling, 0 replies; 9+ messages in thread
From: Philip Müller @ 2016-04-22  6:19 UTC (permalink / raw)
  To: Greg Kroah-Hartman, Sasha Levin
  Cc: greg, manjaro-dev, Linux-Kernel@Vger. Kernel. Org

Hi Greg, hi Sasha,

It seems I found another regression in the latest point releases of the
3.18 and 4.1 kernel series. We have tested on AMD and Intel CPUs so far,
and both hit the same regression. Other kernels released on that day are
not affected. Do you have a clue what might have been missed here?

3.18.30 and 4.1.21 didn't have that issue on the same hardware.

kind regards
Philip Müller
--------------------------
Manjaro Project Lead

https://github.com/manjaro/packages-core/issues/36

* Re: [regression] linux318, linux41 - kernel stack is corrupted
  2016-04-22  6:17 [regression] linux318, linux41 - kernel stack is corrupted Philip Müller
  2016-04-22  6:19 ` Philip Müller
@ 2016-04-22  6:46 ` Greg Kroah-Hartman
  2016-04-22  7:47   ` Sebastian M. Bobrecki
  1 sibling, 1 reply; 9+ messages in thread
From: Greg Kroah-Hartman @ 2016-04-22  6:46 UTC (permalink / raw)
  To: Philip Müller
  Cc: Sasha Levin, manjaro-dev, Linux-Kernel@Vger. Kernel. Org

On Fri, Apr 22, 2016 at 08:17:58AM +0200, Philip Müller wrote:
> Hi Greg, hi Sasha,
> 
> It seems I found another regression in the latest point releases of the
> 3.18 and 4.1 kernel series. We have tested on AMD and Intel CPUs so far,
> and both hit the same regression. Other kernels released on that day are
> not affected. Do you have a clue what might have been missed here?
>
> 3.18.30 and 4.1.21 didn't have that issue on the same hardware.

You are going to have to be a bit more specific here...
What is the oops message?  How do you reproduce this?  Does it also
happen on 4.6-rc4?

Can you run 'git bisect' to find the offending patch?

thanks,

greg k-h

* Re: [regression] linux318, linux41 - kernel stack is corrupted
  2016-04-22  6:46 ` Greg Kroah-Hartman
@ 2016-04-22  7:47   ` Sebastian M. Bobrecki
  2016-04-22  7:55     ` Greg Kroah-Hartman
  0 siblings, 1 reply; 9+ messages in thread
From: Sebastian M. Bobrecki @ 2016-04-22  7:47 UTC (permalink / raw)
  To: Greg Kroah-Hartman, Philip Müller
  Cc: Sasha Levin, manjaro-dev, Linux-Kernel@Vger. Kernel. Org

Hi,

I just hit the same problem with 4.1.22 on Gentoo. 4.1.21 is working fine.

On 22.04.2016 at 08:46, Greg Kroah-Hartman wrote:
> ...
> You are going to have to be a bit more specific here...
> What is the oops message?  How do you reproduce this?  Does it also
> happen on 4.6-rc4?
>
> Can you run 'git bisect' to find the offending patch?
>
Greg, have you seen the screenshots linked by Philip?

-- 
Sebastian M. Bobrecki

* Re: [regression] linux318, linux41 - kernel stack is corrupted
  2016-04-22  7:47   ` Sebastian M. Bobrecki
@ 2016-04-22  7:55     ` Greg Kroah-Hartman
  2016-04-22  8:10       ` Sebastian M. Bobrecki
  0 siblings, 1 reply; 9+ messages in thread
From: Greg Kroah-Hartman @ 2016-04-22  7:55 UTC (permalink / raw)
  To: Sebastian M. Bobrecki
  Cc: Philip Müller, Sasha Levin, manjaro-dev,
	Linux-Kernel@Vger. Kernel. Org

On Fri, Apr 22, 2016 at 09:47:04AM +0200, Sebastian M. Bobrecki wrote:
> Hi,
> 
> I just hit the same problem with 4.1.22 on Gentoo. 4.1.21 is working fine.
> 
> On 22.04.2016 at 08:46, Greg Kroah-Hartman wrote:
> > ...
> > You are going to have to be a bit more specific here...
> > What is the oops message?  How do you reproduce this?  Does it also
> > happen on 4.6-rc4?
> > 
> > Can you run 'git bisect' to find the offending patch?
> > 
> Greg, have you seen the screenshots linked by Philip?

I saw no such screenshots in the email.

* Re: [regression] linux318, linux41 - kernel stack is corrupted
  2016-04-22  7:55     ` Greg Kroah-Hartman
@ 2016-04-22  8:10       ` Sebastian M. Bobrecki
  2016-04-22  8:23         ` Greg Kroah-Hartman
  0 siblings, 1 reply; 9+ messages in thread
From: Sebastian M. Bobrecki @ 2016-04-22  8:10 UTC (permalink / raw)
  To: Greg Kroah-Hartman
  Cc: Philip Müller, Sasha Levin, manjaro-dev,
	Linux-Kernel@Vger. Kernel. Org

On 22.04.2016 at 09:55, Greg Kroah-Hartman wrote:
> On Fri, Apr 22, 2016 at 09:47:04AM +0200, Sebastian M. Bobrecki wrote:
>> Hi,
>>
>> I just hit the same problem with 4.1.22 on Gentoo. 4.1.21 is working fine.
>>
>> On 22.04.2016 at 08:46, Greg Kroah-Hartman wrote:
>>> ...
>>> You are going to have to be a bit more specific here...
>>> What is the oops message?  How do you reproduce this?  Does it also
>>> happen on 4.6-rc4?
>>>
>>> Can you run 'git bisect' to find the offending patch?
>>>
>> Greg, have you seen the screenshots linked by Philip?
> I saw no such screenshots in the email.
They are here: https://github.com/manjaro/packages-core/issues/36

-- 
Sebastian M. Bobrecki

* Re: [regression] linux318, linux41 - kernel stack is corrupted
  2016-04-22  8:10       ` Sebastian M. Bobrecki
@ 2016-04-22  8:23         ` Greg Kroah-Hartman
  2016-04-22 10:16           ` Mike Galbraith
  0 siblings, 1 reply; 9+ messages in thread
From: Greg Kroah-Hartman @ 2016-04-22  8:23 UTC (permalink / raw)
  To: Sebastian M. Bobrecki
  Cc: Philip Müller, Sasha Levin, manjaro-dev,
	Linux-Kernel@Vger. Kernel. Org

On Fri, Apr 22, 2016 at 10:10:59AM +0200, Sebastian M. Bobrecki wrote:
> On 22.04.2016 at 09:55, Greg Kroah-Hartman wrote:
> > On Fri, Apr 22, 2016 at 09:47:04AM +0200, Sebastian M. Bobrecki wrote:
> > > Hi,
> > > 
> > > I just hit the same problem with 4.1.22 on Gentoo. 4.1.21 is working fine.
> > > 
> > > On 22.04.2016 at 08:46, Greg Kroah-Hartman wrote:
> > > > ...
> > > > You are going to have to be a bit more specific here...
> > > > What is the oops message?  How do you reproduce this?  Does it also
> > > > happen on 4.6-rc4?
> > > > 
> > > > Can you run 'git bisect' to find the offending patch?
> > > > 
> > > Greg, have you seen the screenshots linked by Philip?
> > I saw no such screenshots in the email.
> They are here: https://github.com/manjaro/packages-core/issues/36

Looks like an acpi thermal patch got backported incorrectly, again, 'git
bisect' is going to help out the best here.

thanks,

greg k-h

* Re: [regression] linux318, linux41 - kernel stack is corrupted
  2016-04-22  8:23         ` Greg Kroah-Hartman
@ 2016-04-22 10:16           ` Mike Galbraith
  2016-04-23  2:02             ` Sasha Levin
  0 siblings, 1 reply; 9+ messages in thread
From: Mike Galbraith @ 2016-04-22 10:16 UTC (permalink / raw)
  To: Greg Kroah-Hartman, Sebastian M. Bobrecki
  Cc: Philip Müller, Sasha Levin, manjaro-dev,
	Linux-Kernel@Vger. Kernel. Org

On Fri, 2016-04-22 at 17:23 +0900, Greg Kroah-Hartman wrote:
> On Fri, Apr 22, 2016 at 10:10:59AM +0200, Sebastian M. Bobrecki wrote:
> > On 22.04.2016 at 09:55, Greg Kroah-Hartman wrote:
> > > On Fri, Apr 22, 2016 at 09:47:04AM +0200, Sebastian M. Bobrecki wrote:
> > > > Hi,
> > > > 
> > > > I just hit the same problem with 4.1.22 on Gentoo. 4.1.21 is working fine.
> > > > 
> > > > On 22.04.2016 at 08:46, Greg Kroah-Hartman wrote:
> > > > > ...
> > > > > You are going to have to be a bit more specific here...
> > > > > What is the oops message?  How do you reproduce this?  Does it also
> > > > > happen on 4.6-rc4?
> > > > > 
> > > > > Can you run 'git bisect' to find the offending patch?
> > > > > 
> > > > Greg, have you seen the screenshots linked by Philip?
> > > I saw no such screenshots in the email.
> > They are here: https://github.com/manjaro/packages-core/issues/36
> 
> Looks like an acpi thermal patch got backported incorrectly, again, 'git
> bisect' is going to help out the best here.

That'll work, but requires repeatedly ignoring the big-fat-warning :)

Backport of 81ad4276b505e987dd8ebbdf63605f92cd172b52 failed to adjust
for intervening ->get_trip_temp() argument type change, thus causing
stack protector to panic.

drivers/thermal/thermal_core.c: In function ‘thermal_zone_device_register’:
drivers/thermal/thermal_core.c:1569:41: warning: passing argument 3 of
‘tz->ops->get_trip_temp’ from incompatible pointer type [-Wincompatible-pointer-types]
   if (tz->ops->get_trip_temp(tz, count, &trip_temp))
                                         ^
drivers/thermal/thermal_core.c:1569:41: note: expected ‘long unsigned int *’
but argument is of type ‘int *’

CC: <stable@vger.kernel.org> #3.18,#4.1
Signed-off-by: Mike Galbraith <umgwanakikbuti@gmail.com>
---
 drivers/thermal/thermal_core.c |    2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

--- a/drivers/thermal/thermal_core.c
+++ b/drivers/thermal/thermal_core.c
@@ -1491,7 +1491,7 @@ struct thermal_zone_device *thermal_zone
 {
 	struct thermal_zone_device *tz;
 	enum thermal_trip_type trip_type;
-	int trip_temp;
+	unsigned long trip_temp;
 	int result;
 	int count;
 	int passive = 0;
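
To make the failure mode concrete: per the warning above, the callback in
these trees still expects a 'long unsigned int *' as its third argument, so
handing it the address of an int lets it store past the end of that variable
wherever unsigned long is wider than int. What follows is a minimal userspace
sketch of that effect, not kernel code. The names frame_model,
store_trip_temp and guard_bytes_clobbered are hypothetical, and the guard
bytes only stand in for whatever sits next to trip_temp on the real stack
(the actual layout is up to the compiler).

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define GUARD_BYTE 0xA5

/* Hypothetical model of the caller's frame: an int-sized slot followed by
 * guard bytes that play the role of the stack protector canary. */
struct frame_model {
	unsigned char slot[sizeof(int)];            /* where "int trip_temp" lived */
	unsigned char guard[sizeof(unsigned long)]; /* stand-in for the canary */
};

/* Hypothetical callee: stores a full unsigned long at dst, the way a
 * callback taking "unsigned long *" would. */
static void store_trip_temp(unsigned char *dst, unsigned long temp)
{
	memcpy(dst, &temp, sizeof(temp));
}

/* Returns how many guard bytes were overwritten by the store. */
size_t guard_bytes_clobbered(void)
{
	unsigned char frame[sizeof(struct frame_model)];
	size_t guard_off = offsetof(struct frame_model, guard);
	size_t i, clobbered = 0;

	/* The model buffer is large enough that the store stays in bounds. */
	assert(sizeof(frame) >= sizeof(unsigned long));

	memset(frame, GUARD_BYTE, sizeof(frame));

	/* The caller believes only sizeof(int) bytes will be written here. */
	store_trip_temp(frame + offsetof(struct frame_model, slot), 0UL);

	for (i = 0; i < sizeof(unsigned long); i++)
		if (frame[guard_off + i] != GUARD_BYTE)
			clobbered++;

	return clobbered;
}
```

On an LP64 system guard_bytes_clobbered() returns
sizeof(unsigned long) - sizeof(int), i.e. 4. Where both types have the same
size it returns 0. Declaring trip_temp as unsigned long, as the patch above
does, gives the callee the full slot it expects.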

* Re: [regression] linux318, linux41 - kernel stack is corrupted
  2016-04-22 10:16           ` Mike Galbraith
@ 2016-04-23  2:02             ` Sasha Levin
  0 siblings, 0 replies; 9+ messages in thread
From: Sasha Levin @ 2016-04-23  2:02 UTC (permalink / raw)
  To: Mike Galbraith, Greg Kroah-Hartman, Sebastian M. Bobrecki
  Cc: Philip Müller, manjaro-dev, Linux-Kernel@Vger. Kernel. Org

On 04/22/2016 06:16 AM, Mike Galbraith wrote:
> On Fri, 2016-04-22 at 17:23 +0900, Greg Kroah-Hartman wrote:
>> On Fri, Apr 22, 2016 at 10:10:59AM +0200, Sebastian M. Bobrecki wrote:
>>> On 22.04.2016 at 09:55, Greg Kroah-Hartman wrote:
>>>> On Fri, Apr 22, 2016 at 09:47:04AM +0200, Sebastian M. Bobrecki wrote:
>>>>> Hi,
>>>>>
>>>>> I just hit the same problem with 4.1.22 on Gentoo. 4.1.21 is working fine.
>>>>>
>>>>> On 22.04.2016 at 08:46, Greg Kroah-Hartman wrote:
>>>>>> ...
>>>>>> You are going to have to be a bit more specific here...
>>>>>> What is the oops message?  How do you reproduce this?  Does it also
>>>>>> happen on 4.6-rc4?
>>>>>>
>>>>>> Can you run 'git bisect' to find the offending patch?
>>>>>>
>>>>> Greg, have you seen the screenshots linked by Philip?
>>>> I saw no such screenshots in the email.
>>> They are here: https://github.com/manjaro/packages-core/issues/36
>>
>> Looks like an acpi thermal patch got backported incorrectly, again, 'git
>> bisect' is going to help out the best here.
> 
> That'll work, but requires repeatedly ignoring the big-fat-warning :)
> 
> Backport of 81ad4276b505e987dd8ebbdf63605f92cd172b52 failed to adjust
> for intervening ->get_trip_temp() argument type change, thus causing
> stack protector to panic.
> 
> drivers/thermal/thermal_core.c: In function ‘thermal_zone_device_register’:
> drivers/thermal/thermal_core.c:1569:41: warning: passing argument 3 of
> ‘tz->ops->get_trip_temp’ from incompatible pointer type [-Wincompatible-pointer-types]
>    if (tz->ops->get_trip_temp(tz, count, &trip_temp))
>                                          ^
> drivers/thermal/thermal_core.c:1569:41: note: expected ‘long unsigned int *’
> but argument is of type ‘int *’
> 
> CC: <stable@vger.kernel.org> #3.18,#4.1
> Signed-off-by: Mike Galbraith <umgwanakikbuti@gmail.com>
> ---
>  drivers/thermal/thermal_core.c |    2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> --- a/drivers/thermal/thermal_core.c
> +++ b/drivers/thermal/thermal_core.c
> @@ -1491,7 +1491,7 @@ struct thermal_zone_device *thermal_zone
>  {
>  	struct thermal_zone_device *tz;
>  	enum thermal_trip_type trip_type;
> -	int trip_temp;
> +	unsigned long trip_temp;
>  	int result;
>  	int count;
>  	int passive = 0;
> 

Thanks!

I'll put it on both 3.18 and 4.1, and will try to ship it within a day or
two once all tests have gone through.


Thanks,
Sasha
