From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-8.8 required=3.0 tests=BAYES_00,DKIM_INVALID, DKIM_SIGNED,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, MENTIONS_GIT_HOSTING,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id AAEBDC433E3 for ; Thu, 16 Jul 2020 10:45:14 +0000 (UTC) Received: from kanga.kvack.org (kanga.kvack.org [205.233.56.17]) by mail.kernel.org (Postfix) with ESMTP id 57FFA20760 for ; Thu, 16 Jul 2020 10:45:14 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=fail reason="signature verification failed" (1024-bit key) header.d=redhat.com header.i=@redhat.com header.b="TUi+qcnB" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 57FFA20760 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=redhat.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=owner-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix) id C4E4E8D0007; Thu, 16 Jul 2020 06:45:13 -0400 (EDT) Received: by kanga.kvack.org (Postfix, from userid 40) id C00738D0001; Thu, 16 Jul 2020 06:45:13 -0400 (EDT) X-Delivered-To: int-list-linux-mm@kvack.org Received: by kanga.kvack.org (Postfix, from userid 63042) id AEC848D0007; Thu, 16 Jul 2020 06:45:13 -0400 (EDT) X-Delivered-To: linux-mm@kvack.org Received: from forelay.hostedemail.com (smtprelay0053.hostedemail.com [216.40.44.53]) by kanga.kvack.org (Postfix) with ESMTP id 995238D0001 for ; Thu, 16 Jul 2020 06:45:13 -0400 (EDT) Received: from smtpin15.hostedemail.com (10.5.19.251.rfc1918.com [10.5.19.251]) by forelay03.hostedemail.com (Postfix) with ESMTP id 1053B824934B for ; Thu, 16 Jul 2020 10:45:13 +0000 (UTC) X-FDA: 77043606906.15.brass06_3003e5426f01 Received: from filter.hostedemail.com (10.5.16.251.rfc1918.com [10.5.16.251]) by smtpin15.hostedemail.com (Postfix) with ESMTP id AD3D31814B0C8 for ; Thu, 16 Jul 2020 10:45:12 +0000 (UTC) X-HE-Tag: brass06_3003e5426f01 X-Filterd-Recvd-Size: 10424 Received: from us-smtp-1.mimecast.com (us-smtp-delivery-1.mimecast.com [207.211.31.120]) by imf29.hostedemail.com (Postfix) with ESMTP for ; Thu, 16 Jul 2020 10:45:11 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1594896311; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=OxYWZgwLWn7Rnmoh675ZCFOcGubhbtM3QKjvJxepby8=; b=TUi+qcnBaDiD11KS7rbb3y87vAhw8El9JwyVJIMAkje2t+XHEMJvcXbXloHRn5x4nwy+Jq fLjLUPdlCfHVUb6RGzIPwSJ8kQf8lvN+bZTmpAyYUgRwMoTeSylJt81sWJtoMBUR59JOak E0tN9TP3luOOQE7RCHz7jvuNKxHsCGU= Received: from mail-wr1-f69.google.com (mail-wr1-f69.google.com [209.85.221.69]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-191-CxlTaSHZN8O68Y76w6kgCg-1; Thu, 16 Jul 2020 06:45:07 -0400 X-MC-Unique: CxlTaSHZN8O68Y76w6kgCg-1 Received: by mail-wr1-f69.google.com with SMTP id j5so5364801wro.6 for ; Thu, 16 Jul 2020 03:45:07 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:content-transfer-encoding :in-reply-to; bh=OxYWZgwLWn7Rnmoh675ZCFOcGubhbtM3QKjvJxepby8=; b=tslYqxkTGi5dfKyUvao0RJcIE1RBoIjMZfkeaPxGkEKNr2kJbgY2w0nzZ4+bAyPJTS AixSwbMroUUh7bA4XSooZ6GHOXFMcAzGBW8nwOVCVV8PsSnXHDboxJtfNS94e3OrhdsF OIwZJd3taaXhw5KdtN2xWm+ZU9YuP2/tr15AZGgoGuWrRlwqXU12jRJk2F1uCrxMwQHD mOnwBSCwKPmlHBlJHDFXi4pRbNCaLfKp/GxjQhKbHh+dNcWo+Mzm7hXs6PZN7V4h34WE Vw6c7IW8tpQEf6OGVNvvKx1SWyhsiKpEjOZnUetw0/5yvbr2l+q11+srm12rZaSV4PaT icMg== X-Gm-Message-State: AOAM531aYcb75zQkOie8wPqKdfgsaxJhhtvwetmTDs+rc+BkX8cmdDmd vnBEB5Mh4fdxjMLqKh0xjZ/GKb5uhA/sNu8oQEKpZUIj5yq5glXPejQUDhyY+sTIt0AocPCABW8 ySXlZQHi1UCk= X-Received: by 2002:a5d:55cb:: with SMTP id i11mr4277117wrw.28.1594896306112; Thu, 16 Jul 2020 03:45:06 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxYbnuKjraPbNIkft/3GTUfSsNTNTx6l/aUgESxOucWFlYdAHLRLOjvsyeHY5kY8Vrt3fD+lA== X-Received: by 2002:a5d:55cb:: with SMTP id i11mr4277090wrw.28.1594896305809; Thu, 16 Jul 2020 03:45:05 -0700 (PDT) Received: from redhat.com (bzq-79-182-31-92.red.bezeqint.net. [79.182.31.92]) by smtp.gmail.com with ESMTPSA id w128sm8996937wmb.19.2020.07.16.03.45.02 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 16 Jul 2020 03:45:04 -0700 (PDT) Date: Thu, 16 Jul 2020 06:45:00 -0400 From: "Michael S. Tsirkin" To: teawater Cc: Hui Zhu , david@redhat.com, jasowang@redhat.com, akpm@linux-foundation.org, virtualization@lists.linux-foundation.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, qemu-devel@nongnu.org, virtio-dev@lists.oasis-open.org, Andrea Arcangeli Subject: Re: [virtio-dev] [RFC for Linux v4 0/2] virtio_balloon: Add VIRTIO_BALLOON_F_CONT_PAGES to report continuous pages Message-ID: <20200716064340-mutt-send-email-mst@kernel.org> References: <1594867315-8626-1-git-send-email-teawater@gmail.com> <20200716021929-mutt-send-email-mst@kernel.org> <744230FA-78D8-4568-8188-683087065E84@linux.alibaba.com> MIME-Version: 1.0 In-Reply-To: <744230FA-78D8-4568-8188-683087065E84@linux.alibaba.com> X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Type: text/plain; charset=utf-8 Content-Disposition: inline X-Rspamd-Queue-Id: AD3D31814B0C8 X-Spamd-Result: default: False [0.00 / 100.00] X-Rspamd-Server: rspam02 Content-Transfer-Encoding: quoted-printable X-Bogosity: Ham, tests=bogofilter, spamicity=0.000000, version=1.2.4 Sender: owner-linux-mm@kvack.org Precedence: bulk X-Loop: owner-majordomo@kvack.org List-ID: On Thu, Jul 16, 2020 at 03:01:18PM +0800, teawater wrote: >=20 >=20 > > 2020=E5=B9=B47=E6=9C=8816=E6=97=A5 14:38=EF=BC=8CMichael S. Tsirkin <= mst@redhat.com> =E5=86=99=E9=81=93=EF=BC=9A > >=20 > > On Thu, Jul 16, 2020 at 10:41:50AM +0800, Hui Zhu wrote: > >> The first, second and third version are in [1], [2] and [3]. > >> Code of current version for Linux and qemu is available in [4] and [= 5]. > >> Update of this version: > >> 1. Report continuous pages will increase the speed. So added deflat= e > >> continuous pages. > >> 2. According to the comments from David in [6], added 2 new vqs infl= ate_cont_vq > >> and deflate_cont_vq to report continuous pages with format 32 bits= pfn and 32 > >> bits size. > >> Following is the introduction of the function. > >> These patches add VIRTIO_BALLOON_F_CONT_PAGES to virtio_balloon. Wit= h this > >> flag, balloon tries to use continuous pages to inflate and deflate. > >> Opening this flag can bring two benefits: > >> 1. Report continuous pages will increase memory report size of each = time > >> call tell_host. Then it will increase the speed of balloon inflat= e and > >> deflate. > >> 2. Host THPs will be splitted when qemu release the page of balloon = inflate. > >> Inflate balloon with continuous pages will let QEMU release the pa= ges > >> of same THPs. That will help decrease the splitted THPs number in > >> the host. > >> Following is an example in a VM with 1G memory 1CPU. This test se= tups an > >> environment that has a lot of fragmentation pages. Then inflate b= alloon will > >> split the THPs. >=20 >=20 > >> // This is the THP number before VM execution in the host. > >> // None use THP. > >> cat /proc/meminfo | grep AnonHugePages: > >> AnonHugePages: 0 kB > These lines are from host. >=20 > >> // After VM start, use usemem > >> // (https://git.kernel.org/pub/scm/linux/kernel/git/wfg/vm-scalabili= ty.git) > >> // punch-holes function generates 400m fragmentation pages in the gu= est > >> // kernel. > >> usemem --punch-holes -s -1 800m & > These lines are from guest. They setups the environment that has a lot= of fragmentation pages. >=20 > >> // This is the THP number after this command in the host. > >> // Some THP is used by VM because usemem will access 800M memory > >> // in the guest. > >> cat /proc/meminfo | grep AnonHugePages: > >> AnonHugePages: 911360 kB > These lines are from host. >=20 > >> // Connect to the QEMU monitor, setup balloon, and set it size to 60= 0M. > >> (qemu) device_add virtio-balloon-pci,id=3Dballoon1 > >> (qemu) info balloon > >> balloon: actual=3D1024 > >> (qemu) balloon 600 > >> (qemu) info balloon > >> balloon: actual=3D600 > These lines are from host. >=20 > >> // This is the THP number after inflate the balloon in the host. > >> cat /proc/meminfo | grep AnonHugePages: > >> AnonHugePages: 88064 kB > These lines are from host. >=20 > >> // Set the size back to 1024M in the QEMU monitor. > >> (qemu) balloon 1024 > >> (qemu) info balloon > >> balloon: actual=3D1024 > These lines are from host. >=20 > >> // Use usemem to increase the memory usage of QEMU. > >> killall usemem > >> usemem 800m > These lines are from guest. >=20 > >> // This is the THP number after this operation. > >> cat /proc/meminfo | grep AnonHugePages: > >> AnonHugePages: 65536 kB > These lines are from host. >=20 >=20 >=20 > >>=20 > >> Following example change to use continuous pages balloon. The numbe= r of > >> splitted THPs is decreased. > >> // This is the THP number before VM execution in the host. > >> // None use THP. > >> cat /proc/meminfo | grep AnonHugePages: > >> AnonHugePages: 0 kB > These lines are from host. >=20 > >> // After VM start, use usemem punch-holes function generates 400M > >> // fragmentation pages in the guest kernel. > >> usemem --punch-holes -s -1 800m & > These lines are from guest. They setups the environment that has a lot= of fragmentation pages. >=20 > >> // This is the THP number after this command in the host. > >> // Some THP is used by VM because usemem will access 800M memory > >> // in the guest. > >> cat /proc/meminfo | grep AnonHugePages: > >> AnonHugePages: 911360 kB > These lines are from host. >=20 > >> // Connect to the QEMU monitor, setup balloon, and set it size to 60= 0M. > >> (qemu) device_add virtio-balloon-pci,id=3Dballoon1,cont-pages=3Don > >> (qemu) info balloon > >> balloon: actual=3D1024 > >> (qemu) balloon 600 > >> (qemu) info balloon > >> balloon: actual=3D600 > These lines are from host. >=20 > >> // This is the THP number after inflate the balloon in the host. > >> cat /proc/meminfo | grep AnonHugePages: > >> AnonHugePages: 616448 kB > >> // Set the size back to 1024M in the QEMU monitor. > >> (qemu) balloon 1024 > >> (qemu) info balloon > >> balloon: actual=3D1024 > These lines are from host. >=20 > >> // Use usemem to increase the memory usage of QEMU. > >> killall usemem > >> usemem 800m > These lines are from guest. >=20 > >> // This is the THP number after this operation. > >> cat /proc/meminfo | grep AnonHugePages: > >> AnonHugePages: 907264 kB > These lines are from host. >=20 > >=20 > > I'm a bit confused about which of the above run within guest, > > and which run within host. Could you explain pls? > >=20 > >=20 >=20 > I added some introduction to show where these lines is get from. >=20 > Best, > Hui OK so we see host has more free THPs. But guest has presumably less now -= so the total page table depth is the same. Did we gain anything? >=20 > >=20 > >> [1] https://lkml.org/lkml/2020/3/12/144 > >> [2] https://lore.kernel.org/linux-mm/1584893097-12317-1-git-send-ema= il-teawater@gmail.com/ > >> [3] https://lkml.org/lkml/2020/5/12/324 > >> [4] https://github.com/teawater/linux/tree/balloon_conts > >> [5] https://github.com/teawater/qemu/tree/balloon_conts > >> [6] https://lkml.org/lkml/2020/5/13/1211 > >>=20 > >> Hui Zhu (2): > >> virtio_balloon: Add VIRTIO_BALLOON_F_CONT_PAGES and inflate_cont_vq > >> virtio_balloon: Add deflate_cont_vq to deflate continuous pages > >>=20 > >> drivers/virtio/virtio_balloon.c | 180 +++++++++++++++++++++++++= ++++++----- > >> include/linux/balloon_compaction.h | 12 ++ > >> include/uapi/linux/virtio_balloon.h | 1 > >> mm/balloon_compaction.c | 117 +++++++++++++++++++++-- > >> 4 files changed, 280 insertions(+), 30 deletions(-) > >=20 > >=20 > > --------------------------------------------------------------------- > > To unsubscribe, e-mail: virtio-dev-unsubscribe@lists.oasis-open.org > > For additional commands, e-mail: virtio-dev-help@lists.oasis-open.org