From: Dave Chinner
Date: Sun, 28 Dec 2014 22:51:27 +1100
Subject: Re: XFS crashing system with general protection fault
Message-ID: <20141228115127.GN24183@dastard>
References: <20141224111403.54d7226b@neptune.home>
In-Reply-To: <20141224111403.54d7226b@neptune.home>
List-Id: XFS Filesystem from SGI
To: Bruno Prémont
Cc: xfs@oss.sgi.com

On Wed, Dec 24, 2014 at 11:14:03AM +0100, Bruno Prémont wrote:
> Hi,
>
> On a server I've got the following traces, the first on Monday, the second
> one today. On Monday kernel was 3.14.17 and 3.14.27 for today (both captured
> via netconsole).
>
> Is that fixed in a newer kernel?
>
> I've xfs_repaired one of the two XFS partitions on the server though it
> found nothing to complain about. The other partition, containing /, has
> not been explicitly checked yet.
>
> If there is some information I should gather before xfs_repairing, please
> tell as soon as possible!
>
>
> Thanks,
> Bruno
>
> [6149136.014757] general protection fault: 0000 [#1] SMP
> [6149136.022825] Modules linked in: netconsole configfs
> [6149136.028996] CPU: 4 PID: 151 Comm: kworker/4:1H Not tainted 3.14.18-x86_64 #1
> [6149136.040750] Hardware name: HP ProLiant DL360 G6, BIOS P64 07/02/2013
> [6149136.048936] Workqueue: xfslogd xfs_buf_iodone_work
> [6149136.056836] task: ffff880212c67500 ti: ffff8800def3c000 task.ti: ffff8800def3c000
> [6149136.067023] RIP: 0010:[] [] xfs_trans_ail_delete_bulk+0x87/0x1a0
> [6149136.080940] RSP: 0018:ffff8800def3dce8 EFLAGS: 00010202
> [6149136.088889] RAX: dead000000100100 RBX: ffff88000211bd10 RCX: ffff88010e23fbb1
> [6149136.098962] RDX: 6b6b6b6b6b6b6b6b RSI: 6b6b6b6b6b6b6b6b RDI: ffff88000211bd10
> [6149136.110787] RBP: ffff8800def3dd38 R08: 6b6b6b6b6b6b6b6b R09: 2900000000000000

You have memory poisoning turned on?

#define POISON_FREE 0x6b /* for use-after-free poisoning */

Did this occur at unmount? Can you reproduce it on a 3.18 kernel?

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com

_______________________________________________
xfs mailing list
xfs@oss.sgi.com
http://oss.sgi.com/mailman/listinfo/xfs