From: "Rafael J. Wysocki" <rjw@sisk.pl>
To: James Hogan <james@albanarts.com>
Cc: linux-pm@lists.linux-foundation.org,
linux-kernel@vger.kernel.org, linux-doc@vger.kernel.org,
Len Brown <len.brown@intel.com>, Pavel Machek <pavel@ucw.cz>,
Randy Dunlap <rdunlap@xenotime.net>,
"Greg Kroah-Hartman" <gregkh@suse.de>,
Alan Stern <stern@rowland.harvard.edu>,
Jesse Barnes <jbarnes@virtuousgeek.org>,
markgross <markgross@thegnar.org>
Subject: Re: [PATCH 2/2] pm_trace: Add sysfs attr for rechecking dev hash.
Date: Sun, 10 Oct 2010 01:30:47 +0200 [thread overview]
Message-ID: <201010100130.47448.rjw@sisk.pl> (raw)
In-Reply-To: <201010100013.14142.james@albanarts.com>
On Sunday, October 10, 2010, James Hogan wrote:
> Thanks for taking a look...
>
> On Saturday 09 October 2010 23:49:15 Rafael J. Wysocki wrote:
> > On Sunday, October 10, 2010, James Hogan wrote:
> > > If the device which fails to resume is part of a loadable kernel module
> > > it won't be checked at startup against the magic number stored in the
> > > RTC.
> > >
> > > Add a read-only sysfs attribute /sys/power/pm_trace_dev_hash which
> > > contains a list of newline separated devices (usually just the one)
> > > which currently match the last magic number. This allows the device
> > > which is failing to resume to be found after the modules are loaded
> > > again.
> > >
> > > Signed-off-by: James Hogan <james@albanarts.com>
> > > ---
> > >
> > > Documentation/power/s2ram.txt | 7 +++++++
> > > drivers/base/power/trace.c | 27 +++++++++++++++++++++++++++
> > > include/linux/resume-trace.h | 2 ++
> > > kernel/power/main.c | 18 ++++++++++++++++++
> > > 4 files changed, 54 insertions(+), 0 deletions(-)
> > >
> > > diff --git a/Documentation/power/s2ram.txt
> > > b/Documentation/power/s2ram.txt index 514b94f..3a2801a 100644
> > > --- a/Documentation/power/s2ram.txt
> > > +++ b/Documentation/power/s2ram.txt
> > >
> > > @@ -49,6 +49,13 @@ machine that doesn't boot) is:
> > > device (lspci and /sys/devices/pci* is your friend), and see if you
> > > can fix it, disable it, or trace into its resume function.
> > >
> > > + If no device matches the hash, it may be a device from a loadable
> > > kernel + module that is not loaded until after the hash is checked.
> > > You can check + the hash against the current devices again after more
> > > modules are loaded + using sysfs:
> > > +
> > > + cat /sys/power/pm_trace_dev_hash
> > > +
> >
> > /sys/power/pm_trace_match perhaps?
>
> The magic number stores 1 "user" number (given in a RESUME_TRACE call) and 2
> hashes (representing source file/line and device) in the RTC, but this sysfs
> attribute only returns the matches for the device part, so I think it's
> important to have dev or device in there in case we want attributes for
> file/line (which doesn't work with modules at the moment either, but the
> "user" number can be used as that's printed at boot directly), but I agree
> that match is better than hash so I'm happy to change it to pm_trace_dev_match
> or pm_trace_dev_matches.
I'd prefer pm_trace_dev_match.
> >
> > How do we ensure it prints things that make sense when the last resume was
> > successful or the system hasn't suspended at all?
>
> If the last resume was successful, then the stored magic number won't have
> changed since the original boot, since it is read from the RTC in a
> core_initcall function (early_resume_init()).
>
> The case of when pm_trace wasn't in use before boot is impossible to avoid
> when using the RTC. There is a chance that a genuine RTC value will not match
> any of the device hashes (there are 1009 possible device hash values), in
> which case nothing will be output, but the same thing happens at boot when it
> does it's first comparison against the devices and printk's any matches.
OK
I think it would make sense to document that this somehow. Like for example
in Documentation/ABI/testing/sysfs-power (you should add the new attribute to
the list in there anyway).
> >
> > > For example, the above happens to be the VGA device on my EVO, which I
> > > used to run with "radeonfb" (it's an ATI Radeon mobility). It turns out
> > > that "radeonfb" simply cannot resume that device - it tries to set the
> > >
> > > diff --git a/drivers/base/power/trace.c b/drivers/base/power/trace.c
> > > index 17e24e3..e0cdba1 100644
> > > --- a/drivers/base/power/trace.c
> > > +++ b/drivers/base/power/trace.c
> > > @@ -207,6 +207,33 @@ static int show_dev_hash(unsigned int value)
> > >
> > > static unsigned int hash_value_early_read;
> > >
> > > +int snprint_trace_dev_hash(char *buf, size_t size)
> > > +{
> > > + unsigned int value = hash_value_early_read / (USERHASH * FILEHASH);
> > > + int ret = 0;
> > > + struct list_head *entry;
> > > +
> > > + device_pm_lock();
> > > + entry = dpm_list.prev;
> > > + while (size && entry != &dpm_list) {
> > > + struct device *dev = to_device(entry);
> > > + unsigned int hash = hash_string(DEVSEED, dev_name(dev),
> > > + DEVHASH);
> > > + if (hash == value) {
> > > + int len = snprintf(buf, size, "%s\n",
> > > + dev_driver_string(dev));
> > > + if (len > size)
> > > + len = size;
> > > + buf += len;
> > > + ret += len;
> > > + size -= len;
> >
> > Don't we want to break; here and if so then why?
>
> No, because it's possible that two devices will hash to the same value, in
> which case it is better to print both out so we know that the problem could be
> in either one of them. I'll add a comment to that effect.
OK
Thanks,
Rafael
next prev parent reply other threads:[~2010-10-09 23:32 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-10-09 22:34 [PATCH 0/2] pm_trace: Add locking and add sysfs attr for rechecking dev hash James Hogan
2010-10-09 22:36 ` [PATCH 1/2] pm_trace: Lock pm device list mutex James Hogan
2010-10-09 22:43 ` Rafael J. Wysocki
2010-10-09 22:37 ` [PATCH 2/2] pm_trace: Add sysfs attr for rechecking dev hash James Hogan
2010-10-09 22:49 ` Rafael J. Wysocki
2010-10-09 23:13 ` James Hogan
2010-10-09 23:30 ` Rafael J. Wysocki [this message]
2010-10-10 18:04 ` Pavel Machek
2010-10-10 19:47 ` James Hogan
2010-10-10 20:47 ` Pavel Machek
2010-10-10 21:19 ` James Hogan
2010-10-10 22:12 ` Rafael J. Wysocki
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=201010100130.47448.rjw@sisk.pl \
--to=rjw@sisk.pl \
--cc=gregkh@suse.de \
--cc=james@albanarts.com \
--cc=jbarnes@virtuousgeek.org \
--cc=len.brown@intel.com \
--cc=linux-doc@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-pm@lists.linux-foundation.org \
--cc=markgross@thegnar.org \
--cc=pavel@ucw.cz \
--cc=rdunlap@xenotime.net \
--cc=stern@rowland.harvard.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox