* [Bug 107325] New: Reported temperature of nvidia card with nouveau driver is wrong
@ 2018-07-21 20:36 bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
0 siblings, 1 reply; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2018-07-21 20:36 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 2177 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
Bug ID: 107325
Summary: Reported temperature of nvidia card with nouveau
driver is wrong
Product: xorg
Version: unspecified
Hardware: x86-64 (AMD64)
OS: Linux (All)
Status: NEW
Severity: normal
Priority: medium
Component: Driver/nouveau
Assignee: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW@public.gmane.org
Reporter: j.novak-yL21p0u6sjaEIYLD/85Ykw@public.gmane.org
QA Contact: xorg-team-go0+a7rfsptAfugRpC6u6w@public.gmane.org
Hello,
I use Dell Precision 3530 with NVIDIA Corporation GP107GLM [Quadro P600
Mobile] (rev a1). I use Fedora Core 28 with 4.17.6 x86_64 kernel.
I found that sensors tool shows wrong temperature:
$ sensors
nouveau-pci-0100
Adapter: PCI adapter
temp1: +511.0°C (high = +95.0°C, hyst = +3.0°C)
(crit = +105.0°C, hyst = +5.0°C)
(emerg = +135.0°C, hyst = +5.0°C)
Temperature is obviously wrong.
I tried to troubleshoot it on sensors side and it looks that sensors tool
receives this wrong value from driver.
I made one more observation - right after suspend/wakeup the value is
completely different:
$ sensors
nouveau-pci-0100
Adapter: PCI adapter
temp1: +511.0°C (high = +95.0°C, hyst = +3.0°C)
(crit = +105.0°C, hyst = +5.0°C)
(emerg = +135.0°C, hyst = +5.0°C)
$ sensors
nouveau-pci-0100
Adapter: PCI adapter
temp1: +43.0°C (high = +95.0°C, hyst = +3.0°C)
(crit = +105.0°C, hyst = +5.0°C)
(emerg = +135.0°C, hyst = +5.0°C)
$ sensors
nouveau-pci-0100
Adapter: PCI adapter
temp1: +511.0°C (high = +95.0°C, hyst = +3.0°C)
(crit = +105.0°C, hyst = +5.0°C)
(emerg = +135.0°C, hyst = +5.0°C)
I can provide more information when required.
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 3462 bytes --]
[-- Attachment #2: Type: text/plain, Size: 154 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
* [Bug 107325] Reported temperature of nvidia card with nouveau driver is wrong
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
@ 2018-07-22 15:08 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2018-07-22 15:46 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
` (7 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2018-07-22 15:08 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 1006 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
Rhys Kidd <rhyskidd-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |rhyskidd-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org
--- Comment #1 from Rhys Kidd <rhyskidd-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> ---
Thanks for the bug report Jirka,
It would be helpful if you could download and build a nouveau-related debug
toolkit, envytools [0], and run the following commands (inside the nva/
subfolder):
$ ./nvapeek 0x020460
$ ./nvapeek 0x020400
It would also be helpful to see a copy of your GPU's VBIOS attached here. This
can be produced by running the below command:
$ cat /sys/kernel/debug/dri/0/vbios.rom > vbios.rom
[0] https://github.com/envytools/envytools
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 2338 bytes --]
[-- Attachment #2: Type: text/plain, Size: 154 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
* [Bug 107325] Reported temperature of nvidia card with nouveau driver is wrong
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
2018-07-22 15:08 ` [Bug 107325] " bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
@ 2018-07-22 15:46 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2018-07-22 19:12 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
` (6 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2018-07-22 15:46 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 452 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
--- Comment #2 from Ilia Mirkin <imirkin-FrUbXkNCsVf2fBVCVOL8/A@public.gmane.org> ---
Perhaps when it's runtime-suspended, the readings return all 1's, and we report
511 (0x1ff). Or some variation thereof.
Jirka - if you boot with nouveau.runpm=0, I suspect the temperature will be
fine -- good to check though.
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 1238 bytes --]
[-- Attachment #2: Type: text/plain, Size: 154 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
* [Bug 107325] Reported temperature of nvidia card with nouveau driver is wrong
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
2018-07-22 15:08 ` [Bug 107325] " bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2018-07-22 15:46 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
@ 2018-07-22 19:12 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2018-07-22 19:14 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
` (5 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2018-07-22 19:12 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 1306 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
--- Comment #3 from Jirka Novak <j.novak-yL21p0u6sjaEIYLD/85Ykw@public.gmane.org> ---
Hi,
> It would be helpful if you could download and build a nouveau-related debug
> toolkit, envytools [0], and run the following commands (inside the nva/
> subfolder):
>
> $ ./nvapeek 0x020460
>
> $ ./nvapeek 0x020400
Output is there, but I see different output for subsequent calls:
# nvapeek 0x020460
00020460: 20003170
# nvapeek 0x020460
00020460: 20003180
# nvapeek 0x020460
00020460: 200031a8
# nvapeek 0x020460
00020460: 200031a0
# nvapeek 0x020460
00020460: 200031e8
# nvapeek 0x020400
00020400: 00000030
# nvapeek 0x020400
00020400: 00000031
# nvapeek 0x020400
00020400: 00000031
# nvapeek 0x020400
00020400: 00000031
# nvapeek 0x020400
00020400: 00000031
# nvapeek 0x020400
00020400: 00000032
> It would also be helpful to see a copy of your GPU's VBIOS attached here. This
> can be produced by running the below command:
>
> $ cat /sys/kernel/debug/dri/0/vbios.rom > vbios.rom
File is attached.
Best regards,
Jirka Novak
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 2184 bytes --]
[-- Attachment #2: Type: text/plain, Size: 154 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
* [Bug 107325] Reported temperature of nvidia card with nouveau driver is wrong
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
` (2 preceding siblings ...)
2018-07-22 19:12 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
@ 2018-07-22 19:14 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-03-31 11:09 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
` (4 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2018-07-22 19:14 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 661 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
--- Comment #4 from Jirka Novak <j.novak-yL21p0u6sjaEIYLD/85Ykw@public.gmane.org> ---
Hi,
> Perhaps when it's runtime-suspended, the readings return all 1's, and we report
> 511 (0x1ff). Or some variation thereof.
>
> Jirka - if you boot with nouveau.runpm=0, I suspect the temperature will be
> fine -- good to check though.
yes, you are correct. It then returns 49-50 degrees.
Best regards,
Jirka Novak
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 1490 bytes --]
[-- Attachment #2: Type: text/plain, Size: 154 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
* [Bug 107325] Reported temperature of nvidia card with nouveau driver is wrong
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
` (3 preceding siblings ...)
2018-07-22 19:14 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
@ 2019-03-31 11:09 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-04-10 14:31 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
` (3 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2019-03-31 11:09 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 589 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
Pacho Ramos <pachoramos1-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
CC| |pachoramos1-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org
--- Comment #5 from Pacho Ramos <pachoramos1-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> ---
I have the same issue with kernel 4.19.30 still
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 1875 bytes --]
[-- Attachment #2: Type: text/plain, Size: 153 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
* [Bug 107325] Reported temperature of nvidia card with nouveau driver is wrong
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
` (4 preceding siblings ...)
2019-03-31 11:09 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
@ 2019-04-10 14:31 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-04-10 14:34 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
` (2 subsequent siblings)
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2019-04-10 14:31 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 463 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
--- Comment #6 from Karol Herbst <karolherbst-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> ---
this is a runtime suspend issue. While the GPU is suspended the temperature
reading fails, but we don't actually check for that, so we return the error
value (-1 & 0x1ff = 511).
I think I had a patch for that somewhere, let me see.
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 1252 bytes --]
[-- Attachment #2: Type: text/plain, Size: 153 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
* [Bug 107325] Reported temperature of nvidia card with nouveau driver is wrong
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
` (5 preceding siblings ...)
2019-04-10 14:31 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
@ 2019-04-10 14:34 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-04-11 0:46 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-12-04 9:44 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2019-04-10 14:34 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 322 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
--- Comment #7 from Roy <nouveau-NQbd8FSOZ1kdnm+yROfE0A@public.gmane.org> ---
Will this bug interact with Lyude's recent patch, "drm/nouveau/i2c: Disable i2c
bus access after ->fini()"?
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 1109 bytes --]
[-- Attachment #2: Type: text/plain, Size: 153 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
* [Bug 107325] Reported temperature of nvidia card with nouveau driver is wrong
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
` (6 preceding siblings ...)
2019-04-10 14:34 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
@ 2019-04-11 0:46 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-12-04 9:44 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2019-04-11 0:46 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 312 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
--- Comment #8 from Karol Herbst <karolherbst-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> ---
no. This was mainly for displays afaik and we read out the temperature through
MMIO.
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 1097 bytes --]
[-- Attachment #2: Type: text/plain, Size: 153 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
* [Bug 107325] Reported temperature of nvidia card with nouveau driver is wrong
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
` (7 preceding siblings ...)
2019-04-11 0:46 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
@ 2019-12-04 9:44 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
8 siblings, 0 replies; 10+ messages in thread
From: bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ @ 2019-12-04 9:44 UTC (permalink / raw)
To: nouveau-PD4FTy7X32lNgt0PjOBp9y5qC8QIuHrW
[-- Attachment #1.1: Type: text/plain, Size: 871 bytes --]
https://bugs.freedesktop.org/show_bug.cgi?id=107325
Martin Peres <martin.peres-GANU6spQydw@public.gmane.org> changed:
What |Removed |Added
----------------------------------------------------------------------------
Resolution|--- |MOVED
Status|NEW |RESOLVED
--- Comment #9 from Martin Peres <martin.peres-GANU6spQydw@public.gmane.org> ---
-- GitLab Migration Automatic Message --
This bug has been migrated to freedesktop.org's GitLab instance and has been
closed from further activity.
You can subscribe and participate further through the new bug through this link
to our GitLab instance:
https://gitlab.freedesktop.org/xorg/driver/xf86-video-nouveau/issues/445.
--
You are receiving this mail because:
You are the assignee for the bug.
[-- Attachment #1.2: Type: text/html, Size: 2463 bytes --]
[-- Attachment #2: Type: text/plain, Size: 153 bytes --]
_______________________________________________
Nouveau mailing list
Nouveau@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/nouveau
^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2019-12-04 9:44 UTC | newest]
Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2018-07-21 20:36 [Bug 107325] New: Reported temperature of nvidia card with nouveau driver is wrong bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
[not found] ` <bug-107325-8800-V0hAGp6uBxMKqLRl/0Ahz6D7qz1kEfGD2LY78lusg7I@public.gmane.org/>
2018-07-22 15:08 ` [Bug 107325] " bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2018-07-22 15:46 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2018-07-22 19:12 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2018-07-22 19:14 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-03-31 11:09 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-04-10 14:31 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-04-10 14:34 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-04-11 0:46 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
2019-12-04 9:44 ` bugzilla-daemon-CC+yJ3UmIYqDUpFQwHEjaQ
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.