All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH] PG: Do not discard op data too early
@ 2012-09-27 21:56 Jim Schutt
  2012-09-27 22:07 ` Gregory Farnum
  0 siblings, 1 reply; 8+ messages in thread
From: Jim Schutt @ 2012-09-27 21:56 UTC (permalink / raw)
  To: ceph-devel; +Cc: Jim Schutt

Under a sustained cephfs write load where the offered load is higher
than the storage cluster write throughput, a backlog of replication ops
that arrive via the cluster messenger builds up.  The client message
policy throttler, which should be limiting the total write workload
accepted by the storage cluster, is unable to prevent it, for any
value of osd_client_message_size_cap, under such an overload condition.

The root cause is that op data is released too early, in op_applied().

If instead the op data is released at op deletion, then the limit
imposed by the client policy throttler applies over the entire
lifetime of the op, including commits of replication ops.  That
makes the policy throttler an effective means for an OSD to
protect itself from a sustained high offered load, because it can
effectively limit the total, cluster-wide resources needed to process
in-progress write ops.

Signed-off-by: Jim Schutt <jaschut@sandia.gov>
---
 src/osd/ReplicatedPG.cc |    4 ----
 1 files changed, 0 insertions(+), 4 deletions(-)

diff --git a/src/osd/ReplicatedPG.cc b/src/osd/ReplicatedPG.cc
index a64abda..80bec2a 100644
--- a/src/osd/ReplicatedPG.cc
+++ b/src/osd/ReplicatedPG.cc
@@ -3490,10 +3490,6 @@ void ReplicatedPG::op_applied(RepGather *repop)
   dout(10) << "op_applied " << *repop << dendl;
   if (repop->ctx->op)
     repop->ctx->op->mark_event("op_applied");
-
-  // discard my reference to the buffer
-  if (repop->ctx->op)
-    repop->ctx->op->request->clear_data();
   
   repop->applying = false;
   repop->applied = true;
-- 
1.7.8.2



^ permalink raw reply related	[flat|nested] 8+ messages in thread

end of thread, other threads:[~2012-10-26 21:30 UTC | newest]

Thread overview: 8+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-09-27 21:56 [PATCH] PG: Do not discard op data too early Jim Schutt
2012-09-27 22:07 ` Gregory Farnum
2012-09-27 22:23   ` Jim Schutt
2012-09-27 22:27     ` Gregory Farnum
2012-09-27 22:36       ` Jim Schutt
2012-10-26 20:52         ` Gregory Farnum
2012-10-26 21:07           ` Jim Schutt
2012-10-26 21:30             ` Sage Weil

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.