All of lore.kernel.org
 help / color / mirror / Atom feed
* [Ocfs2-devel] Can recovery be done in process context (as opposed to kthread)?
@ 2011-09-09 22:22 Goldwyn Rodrigues
  2011-09-10 14:29 ` Sunil Mushran
  2011-09-11  8:45 ` Joel Becker
  0 siblings, 2 replies; 3+ messages in thread
From: Goldwyn Rodrigues @ 2011-09-09 22:22 UTC (permalink / raw)
  To: ocfs2-devel

Hi,

I finally got back to improve the recovery procedure by offloading
work to work queues. However, I would like to know if we can
completely do away with ocfs2rec kthread. The process would just mark
the nodes which need recovery and offload the work on the work queues
and wait until all is over.

The reason for doing it this way is to make the mount process
killable. Currently the dlm locks are taken by ocfs2rec kthread while
the mount waits in uninterruptible sleep while the recovery happens.

This would help the High Availability software which send signals to
mount procedure if it does not complete within timeout. This usually
happens when journal takes a long time to replay; especially for nodes
waiting for recovery to complete and not doing the actual recovery.

Consider one node down procedure in the middle of I/O on a mounted
system as well.

We could keep the kthread with co-ordination as well.

-- 
Goldwyn

^ permalink raw reply	[flat|nested] 3+ messages in thread

* [Ocfs2-devel] Can recovery be done in process context (as opposed to kthread)?
  2011-09-09 22:22 [Ocfs2-devel] Can recovery be done in process context (as opposed to kthread)? Goldwyn Rodrigues
@ 2011-09-10 14:29 ` Sunil Mushran
  2011-09-11  8:45 ` Joel Becker
  1 sibling, 0 replies; 3+ messages in thread
From: Sunil Mushran @ 2011-09-10 14:29 UTC (permalink / raw)
  To: ocfs2-devel

On 09/09/2011 03:22 PM, Goldwyn Rodrigues wrote:
> Hi,
>
> I finally got back to improve the recovery procedure by offloading
> work to work queues. However, I would like to know if we can
> completely do away with ocfs2rec kthread. The process would just mark
> the nodes which need recovery and offload the work on the work queues
> and wait until all is over.
>
> The reason for doing it this way is to make the mount process
> killable. Currently the dlm locks are taken by ocfs2rec kthread while
> the mount waits in uninterruptible sleep while the recovery happens.
>
> This would help the High Availability software which send signals to
> mount procedure if it does not complete within timeout. This usually
> happens when journal takes a long time to replay; especially for nodes
> waiting for recovery to complete and not doing the actual recovery.
>
> Consider one node down procedure in the middle of I/O on a mounted
> system as well.
>
> We could keep the kthread with co-ordination as well.

I am not sure what that buys. The focus should be fixing what
ever that got the reco stuck in the first place. For the most
part, it gets stuck for reasons unrelated to ocfs2. Our focus
has been on allowing users to quickly identify the "bad" node
quickly.

^ permalink raw reply	[flat|nested] 3+ messages in thread

* [Ocfs2-devel] Can recovery be done in process context (as opposed to kthread)?
  2011-09-09 22:22 [Ocfs2-devel] Can recovery be done in process context (as opposed to kthread)? Goldwyn Rodrigues
  2011-09-10 14:29 ` Sunil Mushran
@ 2011-09-11  8:45 ` Joel Becker
  1 sibling, 0 replies; 3+ messages in thread
From: Joel Becker @ 2011-09-11  8:45 UTC (permalink / raw)
  To: ocfs2-devel

On Fri, Sep 09, 2011 at 05:22:49PM -0500, Goldwyn Rodrigues wrote:
> Hi,
> 
> I finally got back to improve the recovery procedure by offloading
> work to work queues. However, I would like to know if we can
> completely do away with ocfs2rec kthread. The process would just mark
> the nodes which need recovery and offload the work on the work queues
> and wait until all is over.
> 
> The reason for doing it this way is to make the mount process
> killable. Currently the dlm locks are taken by ocfs2rec kthread while
> the mount waits in uninterruptible sleep while the recovery happens.

	If the mount dies, but then actually succeeds in the
background...that's weird and violates the Principle of Least Surprise.

Joel

-- 

"Conservative, n.  A statesman who is enamoured of existing evils,
 as distinguished from the Liberal, who wishes to replace them
 with others."
	- Ambrose Bierce, The Devil's Dictionary

			http://www.jlbec.org/
			jlbec at evilplan.org

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2011-09-11  8:45 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2011-09-09 22:22 [Ocfs2-devel] Can recovery be done in process context (as opposed to kthread)? Goldwyn Rodrigues
2011-09-10 14:29 ` Sunil Mushran
2011-09-11  8:45 ` Joel Becker

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.