From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752620AbcBKIXI (ORCPT ); Thu, 11 Feb 2016 03:23:08 -0500 Received: from mail-lf0-f50.google.com ([209.85.215.50]:35114 "EHLO mail-lf0-f50.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751915AbcBKIXG (ORCPT ); Thu, 11 Feb 2016 03:23:06 -0500 Date: Thu, 11 Feb 2016 17:21:12 +0900 From: Sergey Senozhatsky To: Steven Rostedt Cc: Petr Mladek , Denys Vlasenko , linux-kernel@vger.kernel.org, srostedt@redhat.com, Tejun Heo , Peter Hurley , Jan Kara , Sergey Senozhatsky , Andrew Morton , Kyle McMartin , KY Srinivasan , Dave Jones , Calvin Owens Subject: Re: [PATCH] printk: avoid livelock if another CPU printks continuously Message-ID: <20160211082112.GC895@swordfish> References: <1454963703-20433-1-git-send-email-dvlasenk@redhat.com> <20160210144416.GE12548@pathway.suse.cz> <20160210161016.GF12548@pathway.suse.cz> <20160210112530.04c5512f@gandalf.local.home> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20160210112530.04c5512f@gandalf.local.home> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hello, Thanks for Cc-ing, and sorry for long reply, I'm traveling now. On (02/10/16 11:25), Steven Rostedt wrote: > On Wed, 10 Feb 2016 17:10:16 +0100 > Petr Mladek wrote: > > > > Note, it's not that performance critical, and the loop only happens if > > > someone else is adding to the console, which hopefully, should be rare. > > > > I probably used too strong words. It is possible that the performance > > impact will not be critical. But the behavior is non-deterministic. > > I think that the approach taken by Jack is more promising. > > I mean the offloading of the console stuff to a workqueue. > > My worry about that is that it never comes out. The point about printk, > is that it should pretty much be guaranteed to print. If the system is > dying, and we push it off to a work queue, and that workqueue never > runs, then we lose critical data. correct, IIRC Jan agreed to switch to 'direct' (current behaviour) printk when one of the CPUs calls panic() (we still can use that approach even with workqueue based printk) http://marc.info/?l=linux-kernel&m=145200464309562 the other thing with workqueues based approach is that all of them can be 'blocked' in some OOM cases, so sort of fallback mechanism is also needed here http://marc.info/?l=linux-kernel&m=145251885502488 -ss