From: Adam Litke <agl@us.ibm.com>
To: Dave Hansen <haveblue@us.ibm.com>
Cc: Andrew Morton <akpm@linux-foundation.org>,
linux-mm@kvack.org, Ken Chen <kenchen@google.com>,
Andy Whitcroft <apw@shadowen.org>
Subject: Re: [PATCH 3/3] [PATCH] hugetlb: Enforce quotas during reservation for shared mappings
Date: Wed, 24 Oct 2007 14:52:58 -0500 [thread overview]
Message-ID: <1193255578.18417.63.camel@localhost.localdomain> (raw)
In-Reply-To: <1193252821.4039.33.camel@localhost>
On Wed, 2007-10-24 at 12:07 -0700, Dave Hansen wrote:
> On Wed, 2007-10-24 at 06:24 -0700, Adam Litke wrote:
> > diff --git a/mm/hugetlb.c b/mm/hugetlb.c
> > index eaade8c..5fc075e 100644
> > --- a/mm/hugetlb.c
> > +++ b/mm/hugetlb.c
> > @@ -769,6 +769,7 @@ static int hugetlb_no_page(struct mm_struct *mm, struct vm_area_struct *vma,
> > struct page *page;
> > struct address_space *mapping;
> > pte_t new_pte;
> > + int shared_page = vma->vm_flags & VM_SHARED;
> >
> > mapping = vma->vm_file->f_mapping;
> > idx = ((address - vma->vm_start) >> HPAGE_SHIFT)
> > @@ -784,23 +785,24 @@ retry:
> > size = i_size_read(mapping->host) >> HPAGE_SHIFT;
> > if (idx >= size)
> > goto out;
> > - if (hugetlb_get_quota(mapping, 1))
> > + /* Shared pages are quota-accounted at reservation/mmap time */
> > + if (!shared_page && hugetlb_get_quota(mapping, 1))
> > goto out;
> > page = alloc_huge_page(vma, address);
>
> Since alloc_huge_page() gets the VMA it could, in theory, be doing the
> accounting. The other user, hugetlb_cow(), seems to have a similar code
> path. But, it doesn't have to worry about shared_page, right? We can
> only have COWs on MAP_PRIVATE.
>
> I'm just trying to find ways to future-proof the quotas since they
> already got screwed up once. The fewer call sites we have for them, the
> fewer places they can get screwed up. :)
Yep. Originally I wanted to put the hugetlb_get_quota() call inside
alloc_huge_page() but the devil is in the details. Failure to get quota
needs to result in a SIGBUS whereas a standard allocation failure is
OOM. Because of this, we'd still need special handling of the
alloc_huge_page() return value. While that can be done easily enough, I
didn't think it was worth it.
> > if (!page) {
> > - hugetlb_put_quota(mapping, 1);
> > + if (!shared_page)
> > + hugetlb_put_quota(mapping, 1);
> > ret = VM_FAULT_OOM;
> > goto out;
> > }
> > clear_huge_page(page, address);
> >
> > - if (vma->vm_flags & VM_SHARED) {
> > + if (shared_page) {
> > int err;
> >
> > err = add_to_page_cache(page, mapping, idx, GFP_KERNEL);
> > if (err) {
> > put_page(page);
> > - hugetlb_put_quota(mapping, 1);
> > if (err == -EEXIST)
> > goto retry;
> > goto out;
>
> To where was this quota put moved? Is it because we're in a fault path
> here, and shared pages don't modify quotas during faults, only at
> mmap/truncate() time now?
That's right.
> > backout:
> > spin_unlock(&mm->page_table_lock);
> > - hugetlb_put_quota(mapping, 1);
> > + if (!shared_page)
> > + hugetlb_put_quota(mapping, 1);
> > unlock_page(page);
> > put_page(page);
> > goto out;
> > @@ -1144,6 +1147,8 @@ int hugetlb_reserve_pages(struct inode *inode, long from, long to)
> > if (chg < 0)
> > return chg;
> >
> > + if (hugetlb_get_quota(inode->i_mapping, chg))
> > + return -ENOSPC;
> > ret = hugetlb_acct_memory(chg);
> > if (ret < 0)
> > return ret;
> > @@ -1154,5 +1159,6 @@ int hugetlb_reserve_pages(struct inode *inode, long from, long to)
> > void hugetlb_unreserve_pages(struct inode *inode, long offset, long freed)
> > {
> > long chg = region_truncate(&inode->i_mapping->private_list, offset);
> > - hugetlb_acct_memory(freed - chg);
> > + hugetlb_put_quota(inode->i_mapping, (chg - freed));
> > + hugetlb_acct_memory(-(chg - freed));
> > }
>
> Would it be any easier to just do all of the quota operations in
> _either_ truncate_hugepages() or in here? Could you skip the quota
> operation in truncate_hugepages()'s while() loop, and just put the quota
> for the entire region in hugetlb_unreserve_pages()?
Yep, we certainly could do that. I'll change it to that for the next
version so we can see how it looks.
> void hugetlb_unreserve_pages(struct inode *inode, long offset, long already_freed)
> {
> long total_truncated = region_truncate(&inode->i_mapping->private_list, offset);
> long newly_freed = total_truncated - already_freed;
> hugetlb_put_quota(inode->i_mapping, newly_freed);
> hugetlb_acct_memory(-newly_freed);
> }
>
> I do see several hugetlb_put_quota()/hugetlb_acct_memory() pairs next to
> each other. Do they deserve to be lumped together in one helper?
I don't really think putting them together in one helper would do
anything to improve readability.
--
Adam Litke - (agl at us.ibm.com)
IBM Linux Technology Center
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2007-10-24 19:52 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-10-24 13:23 [PATCH 0/3] hugetlb: Fix up filesystem quota accounting Adam Litke
2007-10-24 13:23 ` [PATCH 1/3] [FIX] hugetlb: Fix broken fs quota management Adam Litke
2007-10-24 18:43 ` Dave Hansen
2007-10-24 19:03 ` Adam Litke
2007-10-24 19:18 ` Dave Hansen
2007-10-24 19:21 ` Ken Chen
2007-10-24 20:02 ` Adam Litke
2007-10-24 22:12 ` Dave Hansen
2007-10-25 5:20 ` Ken Chen
2007-10-25 14:54 ` Adam Litke
2007-10-24 13:23 ` [PATCH 2/3] hugetlb: Allow bulk updating in hugetlb_*_quota() Adam Litke
2007-10-24 13:24 ` [PATCH 3/3] [PATCH] hugetlb: Enforce quotas during reservation for shared mappings Adam Litke
2007-10-24 19:07 ` Dave Hansen
2007-10-24 19:52 ` Adam Litke [this message]
2007-10-24 20:00 ` Dave Hansen
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1193255578.18417.63.camel@localhost.localdomain \
--to=agl@us.ibm.com \
--cc=akpm@linux-foundation.org \
--cc=apw@shadowen.org \
--cc=haveblue@us.ibm.com \
--cc=kenchen@google.com \
--cc=linux-mm@kvack.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.