From mboxrd@z Thu Jan 1 00:00:00 1970
From: Dor Laor
Subject: Re: [PATCH] kvm-userspace: Make PC speaker emulation aware of in-kernel PIT
Date: Tue, 28 Apr 2009 10:02:16 +0300
Message-ID: <49F6A9F8.1040308@redhat.com>
References: <49F0CE65.4050005@web.de> <20090425001319.GB15144@amt.cnet> <49F30B55.6050207@codemonkey.ws> <49F33A1A.3060201@web.de> <49F36B8F.2090405@codemonkey.ws> <49F689B3.6090109@cisco.com>
Reply-To: dlaor@redhat.com
Mime-Version: 1.0
Content-Type: text/plain; charset=ISO-8859-1; format=flowed
Content-Transfer-Encoding: 7bit
Cc: Anthony Liguori , Jan Kiszka , Marcelo Tosatti , Avi Kivity , kvm-devel
To: "David S. Ahern"
Return-path:
Received: from mx2.redhat.com ([66.187.237.31]:33943 "EHLO mx2.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755759AbZD1HB4 (ORCPT ); Tue, 28 Apr 2009 03:01:56 -0400
In-Reply-To: <49F689B3.6090109@cisco.com>
Sender: kvm-owner@vger.kernel.org
List-ID:

David S. Ahern wrote:
> Anthony Liguori wrote:
>
>> Jan Kiszka wrote:
>>
>>> Anthony Liguori wrote:
>>>
>>>> Marcelo Tosatti wrote:
>>>>
>>>>> Jan,
>>>>>
>>>>> While the patch itself looks fine, IMO it would be better to move all
>>>>> of the timer handling to userspace, except the performance critical
>>>>> parts,
>>>>> since most of it is generic. Either periodic or one-shot timer, with:
>>>>>
>>>> The reason for having the PIT in-kernel is not performance. The PIT is
>>>> not performance sensitive.
>>>>
>>> I think that depends. Some OSes (in some configurations) use the PIT
>>> counter as clock source and/or program it regularly in one-shot mode. An
>>> aging use case, but still a valid one.
>>>
>> I can't find the thread, but this has been discussed at length before.
>> The justification has always been for time drift correction. If you
>> crunch the numbers, even at 1024 Hz, there just aren't enough exits to
>> really make a difference from a performance perspective.
>>
>> Just to state it more clearly, if you assume an additional 5us to drop
>> to userspace (which is absurdly high, but let's stick with it), 1024
>> exits per second comes out to about 5ms which is only 0.5% in terms of
>> CPU consumption.
>>
>
> You are considering timekeeping activities only.
>
> RHEL4 for example reads the PIT for each gettimeofday call. For
> applications that add timestamps to logging the PIT is a *HUGE* overhead
> (and the PMTMR for that matter). I have one example where something like
> 15% of each second is wasted handling the ioport reads and writes for
> get_offset_pit.
>
> david
>

I found the link to the previous discussion about moving the PIT to
userspace:
http://www.mail-archive.com/kvm@vger.kernel.org/msg02357.html

In that discussion Marcelo pointed out that we need the PIT in the
kernel in order to have the timer and the vcpu thread running on the
same cpu. Otherwise IPIs have to be sent from the io-thread to the vcpu
thread in order to inject the irq.
I guess we could also do it with a dedicated timer thread in userspace,
but that gets more complex.

btw: I found a typo in the patch, in the line below:
"fprintf(stderr, "Create kernel PIC irqchip failed\n");"
s/PIC/PIT/

>
>> The APIC is quite a bit more understandable because especially with SMP,
>> you can generate a very high number of interrupts per second and taking
>> a drop to userspace for every EOI can start to matter with exit rates
>> in the hundreds of thousands.
>>
>>>> It's because it was easier to do interrupt catch-up by pushing the PIT
>>>> into the kernel which IMHO was the wrong path to go down.
>>>>
>>> Pushing the emulation of port 0x61 into the kernel was a mistake we now
>>> have to deal with. I'm not that sure about the PIT itself.
>>>
>> I agree re: port 0x61. I'm just saying that there is no point in moving
>> just the non "performance critical" components to userspace as Marcelo
>> suggests because the whole thing is non "performance critical".
>>
>> Regards,
>>
>> Anthony Liguori
>>
>> --
>> To unsubscribe from this list: send the line "unsubscribe kvm" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>
>
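
For reference, the back-of-the-envelope numbers quoted in this thread can be reproduced with a few lines of Python. This is only a sketch: the 5us per-exit cost is the deliberately pessimistic figure Anthony assumes above, the helper names are made up for illustration, and applying that same 5us to David's 15% figure is an extrapolation, not a measurement reported in the thread.

```python
US_PER_SECOND = 1_000_000


def exit_overhead_fraction(exits_per_second, cost_per_exit_us):
    """Fraction of one CPU-second spent on the extra exits (hypothetical helper)."""
    return exits_per_second * cost_per_exit_us / US_PER_SECOND


def exits_for_fraction(fraction, cost_per_exit_us):
    """Exits per second needed to consume the given fraction of one CPU."""
    return fraction * US_PER_SECOND / cost_per_exit_us


# Timer interrupts only: 1024 exits per second at an assumed 5us each.
timer_only = exit_overhead_fraction(1024, 5.0)
print(f"timer only: {timer_only * 1e3:.2f} ms per second, {timer_only:.2%} of one CPU")

# Guest that reads the PIT on every gettimeofday: how many port accesses
# per second would it take to burn 15% of a CPU at the same assumed cost?
pit_reads = exits_for_fraction(0.15, 5.0)
print(f"15% of one CPU at 5us per exit is about {pit_reads:.0f} exits per second")
```

The point of the comparison is that the cost scales with the exit rate, so the outcome depends on whether the guest only takes timer interrupts or also polls the PIT counter on every time-of-day call.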