From: "Stillwell, Bryan" <bryan.stillwell-nsCYeiwbiy9BDgjK7y7TUQ@public.gmane.org>
To: Josef Johansson <josef86-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
	Samuel Just <sjust-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>,
	ceph-devel <ceph-devel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
	"'ceph-users-idqoXFIVOFJgJs9I8MT0rw@public.gmane.org'
	(ceph-users-idqoXFIVOFJgJs9I8MT0rw@public.gmane.org)"
	<ceph-users-idqoXFIVOFJgJs9I8MT0rw@public.gmane.org>
Subject: Re: Discuss: New default recovery config settings
Date: Fri, 29 May 2015 18:56:33 -0400
Message-ID: <D18E470A.6229%bryan.stillwell@twcable.com>
In-Reply-To: <CAOnYue-zF1driSi1oxGCQ8Vh1UcG=viT5P8AeHA3R1NCca1o6w-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>


I like the idea of turning the defaults down.  During the Ceph operators session at the OpenStack conference last week, Warren described the behavior pretty accurately as "Ceph basically DOSes itself unless you reduce those settings."  Maybe this is more of a problem when the clusters are small?

Another idea: could recovery traffic be pushed to an even lower priority level by setting its ionice class to 'Idle' under the CFQ scheduler?

Bryan

From: Josef Johansson <josef86-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Date: Friday, May 29, 2015 at 4:16 PM
To: Samuel Just <sjust-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org>, ceph-devel <ceph-devel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>, "'ceph-users-idqoXFIVOFJgJs9I8MT0rw@public.gmane.org' (ceph-users-idqoXFIVOFJgJs9I8MT0rw@public.gmane.org)" <ceph-users-idqoXFIVOFJgJs9I8MT0rw@public.gmane.org>
Subject: Re: [ceph-users] Discuss: New default recovery config settings


Hi,

We did it the other way around instead: we define a period when the load is lighter and turn backfill/recovery on during that period and off outside it. In that case you want the backfill values to stay at what is the default right now.
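
A minimal sketch of the window approach described above, assuming the cluster-wide `nobackfill` and `norecover` OSD flags are what gets toggled. The 22:00 to 06:00 window and the function names are hypothetical, and the code only builds the command strings rather than running them.

```python
from datetime import time


def in_window(now, start, end):
    """Return True if `now` is in [start, end), allowing windows that cross midnight."""
    if start <= end:
        return start <= now < end
    return now >= start or now < end


def flag_commands(now, start=time(22, 0), end=time(6, 0)):
    """Build the commands that allow recovery inside the window and block it outside.

    The default window is an assumption for illustration, not a value from this thread.
    """
    # Inside the low-load window the flags are cleared so recovery can run.
    action = "unset" if in_window(now, start, end) else "set"
    return [f"ceph osd {action} {flag}" for flag in ("nobackfill", "norecover")]


if __name__ == "__main__":
    for cmd in flag_commands(time(23, 30)):
        print(cmd)
```

Something like this could be driven from cron at the two window boundaries; whether blocking recovery during the day is acceptable depends on how much redundancy the cluster has left after a failure.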

Also, someone (I think it was Greg?) said that if you have problems with backfill, your cluster's backing store is not fast enough or is under too much load.
If 10 OSDs go down at the same time, you want those values to be high to minimize the downtime.

/Josef

On Fri, 29 May 2015 at 23:47, Samuel Just <sjust-H+wXaHxf7aLQT0dZR+AlfA@public.gmane.org> wrote:
Many people have reported that they need to lower the OSD recovery config options to minimize the impact of recovery on client I/O.  We are talking about changing the defaults as follows:

osd_max_backfills to 1 (from 10)
osd_recovery_max_active to 3 (from 15)
osd_recovery_op_priority to 1 (from 10)
osd_recovery_max_single_start to 1 (from 5)
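
For anyone who wants to try the proposed values before any default changes, here is a rough sketch that renders them as a ceph.conf `[osd]` section and as a runtime `ceph tell osd.* injectargs` command. The option names and numbers are the ones listed above; the helper names are hypothetical, and the script only builds strings without touching a cluster.

```python
# (new value, current default) for each option, as proposed in this thread.
PROPOSED = {
    "osd_max_backfills": (1, 10),
    "osd_recovery_max_active": (3, 15),
    "osd_recovery_op_priority": (1, 10),
    "osd_recovery_max_single_start": (1, 5),
}


def conf_section(settings):
    """Render an [osd] section for ceph.conf using the proposed values."""
    lines = ["[osd]"]
    for name, (new, _old) in settings.items():
        lines.append(f"{name} = {new}")
    return "\n".join(lines)


def injectargs_command(settings):
    """Build (but do not run) a command that applies the values to running OSDs."""
    args = " ".join(
        f"--{name.replace('_', '-')} {new}" for name, (new, _old) in settings.items()
    )
    return f"ceph tell osd.* injectargs '{args}'"


if __name__ == "__main__":
    print(conf_section(PROPOSED))
    print(injectargs_command(PROPOSED))
```

Values injected at runtime do not persist across an OSD restart, so the ceph.conf form is the one to keep if the lower settings work out.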

We'd like a bit of feedback first though.  Is anyone happy with the current configs?  Is anyone using something between these values and the current defaults?  What kind of workload?  I'd guess that lowering osd_max_backfills to 1 is probably a good idea, but I wonder whether lowering osd_recovery_max_active and osd_recovery_max_single_start will cause small objects to recover unacceptably slowly.

Thoughts?
-Sam
_______________________________________________
ceph-users mailing list
ceph-users-idqoXFIVOFJgJs9I8MT0rw@public.gmane.org
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com



