All of lore.kernel.org
 help / color / mirror / Atom feed
* recovery priority preemption
@ 2017-09-13 15:03 Sage Weil
  2017-09-13 18:02 ` Piotr Dałek
  0 siblings, 1 reply; 2+ messages in thread
From: Sage Weil @ 2017-09-13 15:03 UTC (permalink / raw)
  To: ceph-devel; +Cc: piotr.dalek

I recently observed a problem on the lab cluster while doing a log of 
rebalancing (filestore->bluestore conversion):

 - lots of pgs in backfill_wait
 - a few pgs that need pg log recovery, but these appear after backfills 
are already in progress, so they end up in backfill_wait too (confusing 
state name!)
 - ongoing write activity extents pg logs for those pgs, but they cannot 
trim
 - pg logs reach 5x-10x the max
 - OSDs OOM

I think what is needed is for the recovery priority scheduling to allow 
preemption.  If we are currently working on recovery/backfill for PG X, 
but PG Y appears with a higher priority, we should suspend work on X and 
switch to Y.

Piotr, I didn't look too closely at forced recovery changes you folks 
recently did, but I'm guessing that it was added to address this sort of 
situation, right?  Would a general solution that preempts and always works 
on the highest priority PG resolve the problem you've observed?

Thanks-
sage

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2017-09-13 18:11 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-09-13 15:03 recovery priority preemption Sage Weil
2017-09-13 18:02 ` Piotr Dałek

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.