All of lore.kernel.org
 help / color / mirror / Atom feed
From: Josef Bacik <jbacik@fusionio.com>
To: Russell Coker <russell@coker.com.au>
Cc: Btrfs BTRFS <linux-btrfs@vger.kernel.org>
Subject: Re: performance loss with lots of snapshots
Date: Wed, 10 Jul 2013 08:42:14 -0400	[thread overview]
Message-ID: <20130710124214.GA30450@localhost.localdomain> (raw)
In-Reply-To: <201307101254.44583.russell@coker.com.au>

On Wed, Jul 10, 2013 at 12:54:44PM +1000, Russell Coker wrote:
> There are two uses of backups, recovering from user errors (IE deleting the 
> wrong file) and recovering from sysadmin errors or hardware failures (IE disks 
> are dead or wiped).  For the former use I'm mainly using BTRFS snapshots on 
> many systems.
> 
> A problem that I have had on more than a few occasions (most recently on the 
> latest Debian 3.9 kernel) is of severe performance loss.  A few days ago this 
> happened on a workstation running an Intel 120G SSD device for the root 
> filesystem which was being used for basic workstation tasks (kmail, GIMP, 
> OpenOffice, etc).  The /home and / subvols had about 400 snapshots between 
> them (which doesn't seem like a huge number) when the system became unusably 
> slow while running a scrub from a cron job, programs like GIMP became stuck in 
> D state.  The system in question has 8G of RAM and very light load, there 
> shouldn't be any reason for it not giving good performance while the scrub was 
> in progress and it definitely should have performed well when the scrub was 
> cancelled.  But it didn't return to decent performance until I deleted about 
> 300 snapshots.
> 
> This has happened to me often enough that I can probably reproduce it on a VM.  
> What kernel should I use for such tests?
> 
> If I get a virtual machine in a state where it has ongoing performance 
> problems would any of the BTRFS developers like root access to debug it?
> 

There is a memory leak-ish with scrub where it doesn't free up the csums it's
looked up until after its done scrubbing an area which can lead to OOM's or
degraded performance.  Btrfs-next has the fix as well as the pull request that
just went to Linus, so pick which one you want and run again and see if that
helps?  I imagine you are probably seeing two things, first that oom'ish
behavior and then some other performance gotcha with a fair number of snapshots,
but just in case.  Thanks,

Josef

      parent reply	other threads:[~2013-07-10 12:42 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-07-10  2:54 performance loss with lots of snapshots Russell Coker
2013-07-10  4:18 ` Chris Samuel
2013-07-10 12:42 ` Josef Bacik [this message]

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20130710124214.GA30450@localhost.localdomain \
    --to=jbacik@fusionio.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=russell@coker.com.au \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.