xen-devel.lists.xenproject.org archive mirror
 help / color / mirror / Atom feed
* Possible error restoring machine
@ 2012-05-23  9:39 Frediano Ziglio
  2012-05-23 10:25 ` Ian Campbell
  2012-05-23 13:30 ` Shriram Rajagopalan
  0 siblings, 2 replies; 5+ messages in thread
From: Frediano Ziglio @ 2012-05-23  9:39 UTC (permalink / raw)
  To: xen-devel@lists.xensource.com

I noted a possible problem restoring a machine.

In xc_domain_restore (xc_domain_restore.c) if it's not the last
checkpoint we set O_NONBLOCK flag (search for fcntl) that we can call
pagebuf_get or just load other pages (see following "goto loadpages;"
line).
Now we could ending up calling xc_tmem_restore/xc_tmem_restore_extra
(xc_tmem.c) which call read_extract (xc_private.c) on the same non
blocking socket/file but read_extract does not handle EAGAIN/EWOULDBLOCK
(both can be returned on non blocking socket depending on file type and
Unix/Linux version) leading to a failure.
Does this make sense or is it impossible ??

Also note that rdexact (xc_domain_restore.c) handle data timeout but we
can still block in read_exact called by
xc_tmem_restore/xc_tmem_restore_extra.

Last note on rdexact, isn't 1 second (HEARTBEAT_MS) too small if there
are network problems?

Frediano

^ permalink raw reply	[flat|nested] 5+ messages in thread

end of thread, other threads:[~2012-05-23 14:15 UTC | newest]

Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-05-23  9:39 Possible error restoring machine Frediano Ziglio
2012-05-23 10:25 ` Ian Campbell
2012-05-23 11:37   ` Frediano Ziglio
2012-05-23 13:30 ` Shriram Rajagopalan
2012-05-23 14:15   ` Dan Magenheimer

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).