linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
@ 2013-03-27 13:19 Zheng Liu
  2013-03-27 13:41 ` Theodore Ts'o
  0 siblings, 1 reply; 14+ messages in thread
From: Zheng Liu @ 2013-03-27 13:19 UTC (permalink / raw)
  To: linux-ext4; +Cc: Zheng Liu

From: Zheng Liu <wenqing.lz@taobao.com>

After applied this commit (8e4061cb), we will get a warning from
ext4_truncate when i_mutex isn't taken.  Here the assumption is that
i_mutex should be taken when we do a truncation.  In ext4_symlink we
could need to call ext4_truncate to trim some blocks beyond i_size, but
the i_mutex isn't taken.

Signed-off-by: Zheng Liu <wenqing.lz@taobao.com>
---
 fs/ext4/namei.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/fs/ext4/namei.c b/fs/ext4/namei.c
index 3825d6a..d75f91a 100644
--- a/fs/ext4/namei.c
+++ b/fs/ext4/namei.c
@@ -2856,7 +2856,9 @@ retry:
 		ext4_journal_stop(handle);
 		if (err)
 			goto err_drop_inode;
+		mutex_lock(&inode->i_mutex);
 		err = __page_symlink(inode, symname, l, 1);
+		mutex_unlock(&inode->i_mutex);
 		if (err)
 			goto err_drop_inode;
 		/*
-- 
1.7.12.rc2.18.g61b472e


^ permalink raw reply related	[flat|nested] 14+ messages in thread

* Re: [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
  2013-03-27 13:19 [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate Zheng Liu
@ 2013-03-27 13:41 ` Theodore Ts'o
  2013-03-27 14:02   ` Zheng Liu
  0 siblings, 1 reply; 14+ messages in thread
From: Theodore Ts'o @ 2013-03-27 13:41 UTC (permalink / raw)
  To: Zheng Liu; +Cc: linux-ext4, Zheng Liu

On Wed, Mar 27, 2013 at 09:19:07PM +0800, Zheng Liu wrote:
> From: Zheng Liu <wenqing.lz@taobao.com>
> 
> After applied this commit (8e4061cb), we will get a warning from
> ext4_truncate when i_mutex isn't taken.  Here the assumption is that
> i_mutex should be taken when we do a truncation.  In ext4_symlink we
> could need to call ext4_truncate to trim some blocks beyond i_size, but
> the i_mutex isn't taken.

Hmm, and this is why I added the warning.  Even after looking your
patch, I'm having trouble finding the codepath that results in
ext4_truncate() getting called from __page_symlink().  Can you send
the stack trace from the WARN_ON, just so I can see what I missed?

Thanks,

						- Ted

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
  2013-03-27 14:02   ` Zheng Liu
@ 2013-03-27 13:51     ` Theodore Ts'o
  2013-03-27 15:07       ` Zheng Liu
  2013-03-27 14:04     ` [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate Zheng Liu
  1 sibling, 1 reply; 14+ messages in thread
From: Theodore Ts'o @ 2013-03-27 13:51 UTC (permalink / raw)
  To: linux-ext4, Zheng Liu

Ah, now I see.  Thanks for sending the stack trace.  On the failure
path, we're calling the inline function ext4_truncate_filaed_write()
and this is calling ext4_truncate().

But I'm now wondering if we need to take the i_data_sem mutex in
ext4_truncate_failed_write().

Otherwise, couldn't we end up with problems where a failed write calls
ext4_truncate() without i_data_sem(), and that races with something
else --- say, a punch or truncate call to that same inode?

     	      	      	 	       - Ted

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
  2013-03-27 13:41 ` Theodore Ts'o
@ 2013-03-27 14:02   ` Zheng Liu
  2013-03-27 13:51     ` Theodore Ts'o
  2013-03-27 14:04     ` [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate Zheng Liu
  0 siblings, 2 replies; 14+ messages in thread
From: Zheng Liu @ 2013-03-27 14:02 UTC (permalink / raw)
  To: Theodore Ts'o; +Cc: linux-ext4, Zheng Liu

On Wed, Mar 27, 2013 at 09:41:10AM -0400, Theodore Ts'o wrote:
> On Wed, Mar 27, 2013 at 09:19:07PM +0800, Zheng Liu wrote:
> > From: Zheng Liu <wenqing.lz@taobao.com>
> > 
> > After applied this commit (8e4061cb), we will get a warning from
> > ext4_truncate when i_mutex isn't taken.  Here the assumption is that
> > i_mutex should be taken when we do a truncation.  In ext4_symlink we
> > could need to call ext4_truncate to trim some blocks beyond i_size, but
> > the i_mutex isn't taken.
> 
> Hmm, and this is why I added the warning.  Even after looking your
> patch, I'm having trouble finding the codepath that results in
> ext4_truncate() getting called from __page_symlink().  Can you send
> the stack trace from the WARN_ON, just so I can see what I missed?

Here it is.

kernel: ------------[ cut here ]------------
kernel: WARNING: at fs/ext4/inode.c:3803 ext4_truncate+0x36/0x2d7 [ext4]()
kernel: Hardware name: OptiPlex 780
kernel: Modules linked in: ext4(O) jbd2 crc16 cpufreq_ondemand ipv6
dm_mirror dm_region_hash dm_log dm_mod parport_pc parport dcdbas
acpi_cpufreq mperf serio_raw button pcspkr i2c_i801 sg ehci_pci
ehci_hcd e1000e ptp pps_core ext3 jbd sd_mod ahci libahci libata
scsi_mod uhci_hcd radeon ttm drm_kms_helper drm hwmon i2c_algo_bit i2
c_core [last unloaded: crc16]
kernel: Pid: 4152, comm: fsstress Tainted: G W  O 3.9.0-rc4+ #6
kernel: Call Trace:
kernel: [<ffffffff82031e14>] warn_slowpath_common+0x85/0x9f
kernel: [<ffffffff82031e48>] warn_slowpath_null+0x1a/0x1c
kernel: [<ffffffffa044b789>] ext4_truncate+0x36/0x2d7 [ext4]
kernel: [<ffffffffa044c9a5>] ext4_write_begin+0x35f/0x3b9 [ext4]
kernel: [<ffffffff820af514>] pagecache_write_begin+0x1c/0x1e
kernel: [<ffffffff820f5f80>] __page_symlink+0x6d/0x10f
kernel: [<ffffffffa0450806>] ?  ext4_orphan_add+0x1ea/0x219 [ext4]
kernel: [<ffffffffa0451a7f>] ext4_symlink+0x1ca/0x33e [ext4]
kernel: [<ffffffff820f8c41>] vfs_symlink+0x7c/0xd6
kernel: [<ffffffff820fb634>] sys_symlinkat+0x68/0xb9
kernel: [<ffffffff820fb69b>] sys_symlink+0x16/0x18
kernel: [<ffffffff8238fac2>] system_call_fastpath+0x16/0x1b
kernel: ---[ end trace 05d179cc296c4f3a ]---

Regards,
                                                - Zheng

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
  2013-03-27 14:02   ` Zheng Liu
  2013-03-27 13:51     ` Theodore Ts'o
@ 2013-03-27 14:04     ` Zheng Liu
  1 sibling, 0 replies; 14+ messages in thread
From: Zheng Liu @ 2013-03-27 14:04 UTC (permalink / raw)
  To: Theodore Ts'o, linux-ext4, Zheng Liu

On Wed, Mar 27, 2013 at 10:02:50PM +0800, Zheng Liu wrote:
> On Wed, Mar 27, 2013 at 09:41:10AM -0400, Theodore Ts'o wrote:
> > On Wed, Mar 27, 2013 at 09:19:07PM +0800, Zheng Liu wrote:
> > > From: Zheng Liu <wenqing.lz@taobao.com>
> > > 
> > > After applied this commit (8e4061cb), we will get a warning from
> > > ext4_truncate when i_mutex isn't taken.  Here the assumption is that
> > > i_mutex should be taken when we do a truncation.  In ext4_symlink we
> > > could need to call ext4_truncate to trim some blocks beyond i_size, but
> > > the i_mutex isn't taken.
> > 
> > Hmm, and this is why I added the warning.  Even after looking your
> > patch, I'm having trouble finding the codepath that results in
> > ext4_truncate() getting called from __page_symlink().  Can you send
> > the stack trace from the WARN_ON, just so I can see what I missed?
> 
> Here it is.
> 
> kernel: ------------[ cut here ]------------
> kernel: WARNING: at fs/ext4/inode.c:3803 ext4_truncate+0x36/0x2d7 [ext4]()
> kernel: Hardware name: OptiPlex 780
> kernel: Modules linked in: ext4(O) jbd2 crc16 cpufreq_ondemand ipv6
> dm_mirror dm_region_hash dm_log dm_mod parport_pc parport dcdbas
> acpi_cpufreq mperf serio_raw button pcspkr i2c_i801 sg ehci_pci
> ehci_hcd e1000e ptp pps_core ext3 jbd sd_mod ahci libahci libata
> scsi_mod uhci_hcd radeon ttm drm_kms_helper drm hwmon i2c_algo_bit i2
> c_core [last unloaded: crc16]
> kernel: Pid: 4152, comm: fsstress Tainted: G W  O 3.9.0-rc4+ #6
> kernel: Call Trace:
> kernel: [<ffffffff82031e14>] warn_slowpath_common+0x85/0x9f
> kernel: [<ffffffff82031e48>] warn_slowpath_null+0x1a/0x1c
> kernel: [<ffffffffa044b789>] ext4_truncate+0x36/0x2d7 [ext4]
> kernel: [<ffffffffa044c9a5>] ext4_write_begin+0x35f/0x3b9 [ext4]
> kernel: [<ffffffff820af514>] pagecache_write_begin+0x1c/0x1e
> kernel: [<ffffffff820f5f80>] __page_symlink+0x6d/0x10f
> kernel: [<ffffffffa0450806>] ?  ext4_orphan_add+0x1ea/0x219 [ext4]
> kernel: [<ffffffffa0451a7f>] ext4_symlink+0x1ca/0x33e [ext4]
> kernel: [<ffffffff820f8c41>] vfs_symlink+0x7c/0xd6
> kernel: [<ffffffff820fb634>] sys_symlinkat+0x68/0xb9
> kernel: [<ffffffff820fb69b>] sys_symlink+0x16/0x18
> kernel: [<ffffffff8238fac2>] system_call_fastpath+0x16/0x1b
> kernel: ---[ end trace 05d179cc296c4f3a ]---

Sorry, maybe you want to reproduce this warning.  xfstests #083 can
trigger it.

Regards,
                                                - Zheng

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
  2013-03-27 13:51     ` Theodore Ts'o
@ 2013-03-27 15:07       ` Zheng Liu
  2013-03-27 15:19         ` Zheng Liu
  0 siblings, 1 reply; 14+ messages in thread
From: Zheng Liu @ 2013-03-27 15:07 UTC (permalink / raw)
  To: Theodore Ts'o; +Cc: linux-ext4, Zheng Liu

On Wed, Mar 27, 2013 at 09:51:55AM -0400, Theodore Ts'o wrote:
> Ah, now I see.  Thanks for sending the stack trace.  On the failure
> path, we're calling the inline function ext4_truncate_filaed_write()
> and this is calling ext4_truncate().
> 
> But I'm now wondering if we need to take the i_data_sem mutex in
> ext4_truncate_failed_write().
> 
> Otherwise, couldn't we end up with problems where a failed write calls
> ext4_truncate() without i_data_sem(), and that races with something
> else --- say, a punch or truncate call to that same inode?

I don't think we need to take i_mutex lock honestly.  In ext4_symlink
when we call __page_symlink() the new inode doesn't access yet.  So no
one can do a punching hole or truncation to this inode.  But I also
think we need to add WARN_ON in ext4_truncate because i_mutex lock is
used to serialize truncate/punch hole and buffered io.

Regards,
                                                - Zheng

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
  2013-03-27 15:19         ` Zheng Liu
@ 2013-03-27 15:12           ` Theodore Ts'o
  2013-03-27 15:35             ` Zheng Liu
  0 siblings, 1 reply; 14+ messages in thread
From: Theodore Ts'o @ 2013-03-27 15:12 UTC (permalink / raw)
  To: linux-ext4, Zheng Liu

On Wed, Mar 27, 2013 at 11:19:22PM +0800, Zheng Liu wrote:
> > > Otherwise, couldn't we end up with problems where a failed write calls
> > > ext4_truncate() without i_data_sem(), and that races with something
> > > else --- say, a punch or truncate call to that same inode?
> 
> Let me think about it.  I need to take a close look at it.

Note that I'm not so concerned when we are creating symlink --- you
are quite right in pointing out in that case the inode isn't in the
namespace yet, so that prevents races --- but also what might happen
in an ENOSPC write(2) failure racing against a punch/truncate call.

But again, this is why I added the warning --- it was to find these
edge cases that we might not have considered.  :-)

     	   	   	     	  	       - Ted

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
  2013-03-27 15:07       ` Zheng Liu
@ 2013-03-27 15:19         ` Zheng Liu
  2013-03-27 15:12           ` Theodore Ts'o
  0 siblings, 1 reply; 14+ messages in thread
From: Zheng Liu @ 2013-03-27 15:19 UTC (permalink / raw)
  To: Theodore Ts'o, linux-ext4, Zheng Liu

On Wed, Mar 27, 2013 at 11:07:35PM +0800, Zheng Liu wrote:
> On Wed, Mar 27, 2013 at 09:51:55AM -0400, Theodore Ts'o wrote:
> > Ah, now I see.  Thanks for sending the stack trace.  On the failure
> > path, we're calling the inline function ext4_truncate_filaed_write()
> > and this is calling ext4_truncate().
> > 
> > But I'm now wondering if we need to take the i_data_sem mutex in
                                                 ^^^
Sigh, I misread i_data_sem and i_mutex.  Really sorry about that. :-/

> > ext4_truncate_failed_write().
> > 
> > Otherwise, couldn't we end up with problems where a failed write calls
> > ext4_truncate() without i_data_sem(), and that races with something
> > else --- say, a punch or truncate call to that same inode?

Let me think about it.  I need to take a close look at it.

Regards,
                                                - Zheng

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
  2013-03-27 15:12           ` Theodore Ts'o
@ 2013-03-27 15:35             ` Zheng Liu
  2013-03-28 14:06               ` Theodore Ts'o
  0 siblings, 1 reply; 14+ messages in thread
From: Zheng Liu @ 2013-03-27 15:35 UTC (permalink / raw)
  To: Theodore Ts'o; +Cc: linux-ext4, Zheng Liu

On Wed, Mar 27, 2013 at 11:12:48AM -0400, Theodore Ts'o wrote:
> On Wed, Mar 27, 2013 at 11:19:22PM +0800, Zheng Liu wrote:
> > > > Otherwise, couldn't we end up with problems where a failed write calls
> > > > ext4_truncate() without i_data_sem(), and that races with something
> > > > else --- say, a punch or truncate call to that same inode?
> > 
> > Let me think about it.  I need to take a close look at it.
> 
> Note that I'm not so concerned when we are creating symlink --- you
> are quite right in pointing out in that case the inode isn't in the
> namespace yet, so that prevents races --- but also what might happen
> in an ENOSPC write(2) failure racing against a punch/truncate call.
> 
> But again, this is why I added the warning --- it was to find these
> edge cases that we might not have considered.  :-)

ext4_truncate_failed_write() is called by the following functions:
 - ext4_ind_direct_IO
 - ext4_convert_inline_data_to_extent
 - ext4_da_convert_inline_data_to_extent
 - ext4_write_begin
 - ext4_write_end
 - ext4_journalled_write_end
 - ext4_da_write_begin

All these functions are protected by i_mutex.  So we can serialize it
with truncate/punch hole.

Regards,
                                                - Zheng

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate
  2013-03-27 15:35             ` Zheng Liu
@ 2013-03-28 14:06               ` Theodore Ts'o
  2013-04-01 15:23                 ` [PATCH] fs: take i_mutex in __page_symlink() Theodore Ts'o
  0 siblings, 1 reply; 14+ messages in thread
From: Theodore Ts'o @ 2013-03-28 14:06 UTC (permalink / raw)
  To: Zheng Liu, Al Viro; +Cc: linux-ext4, linux-fsdevel

I looked more closely at the assumption that ext4_write_begin() holds
i_mutex.  This is guaranteed by Documentation/filesystems/Locking,
which notes that write_begin() and write_end() functions hold i_mutex:

			PageLocked(page)	i_mutex
write_begin:		locks the page		yes
write_end:		yes, unlocks		yes

So the bug is that ext4_symlink() calls __page_symlink();
__page_symlink() calls pagecache_write_begin() which calls
write_begin(), without taking i_mutex.

So we can fix this by taking i_mutex in ext4_symlink(), but I think it
would be better to take the i_mutex in __page_symlink(), since it
would then address a violation of the locking rules for all file
systems.

Al, do you agree?

					- Ted

^ permalink raw reply	[flat|nested] 14+ messages in thread

* [PATCH] fs: take i_mutex in __page_symlink()
  2013-03-28 14:06               ` Theodore Ts'o
@ 2013-04-01 15:23                 ` Theodore Ts'o
  2013-04-01 16:35                   ` Al Viro
  2013-04-02  8:19                   ` Dmitry Monakhov
  0 siblings, 2 replies; 14+ messages in thread
From: Theodore Ts'o @ 2013-04-01 15:23 UTC (permalink / raw)
  To: Ext4 Developers List; +Cc: Theodore Ts'o, linux-fsdevel, Al Viro

In Documentation/filesystems/Locking, it's documented that
write_begin() is guaranteed to be called with i_mutex locked.  The
function __page_symlink() was not taking i_mutex before calling
pagecache_write_begin(), which will eventually result in the file
system's write_begin()'s function getting called.

Other callers of pagecache_write_begin such as in fs/splice.c, call
pagecache_write_begin() with i_mutex locked, so fix __page_symlink()
to be consistent.

This was discovered by the addition of a new ext4 debugging assertion
which checked to make sure i_mutex was locked before calling
ext4_truncate().

Reported-by: Zheng Liu <gnehzuil.liu@gmail.com>
Signed-off-by: "Theodore Ts'o" <tytso@mit.edu>
Cc: linux-fsdevel@vger.kernel.org
Cc: Al Viro <viro@ZenIV.linux.org.uk>
---

Note: I plan to carry the following patch in the ext4 tree, unless
someone objects or Al insists on carrying this in the vfs git tree.

 fs/namei.c | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/fs/namei.c b/fs/namei.c
index 57ae9c8..548e57b 100644
--- a/fs/namei.c
+++ b/fs/namei.c
@@ -4035,8 +4035,10 @@ int __page_symlink(struct inode *inode, const char *symname, int len, int nofs)
 		flags |= AOP_FLAG_NOFS;
 
 retry:
+	mutex_lock(&inode->i_mutex);
 	err = pagecache_write_begin(NULL, mapping, 0, len-1,
 				flags, &page, &fsdata);
+	mutex_unlock(&inode->i_mutex);
 	if (err)
 		goto fail;
 
-- 
1.7.12.rc0.22.gcdd159b


^ permalink raw reply related	[flat|nested] 14+ messages in thread

* Re: [PATCH] fs: take i_mutex in __page_symlink()
  2013-04-01 15:23                 ` [PATCH] fs: take i_mutex in __page_symlink() Theodore Ts'o
@ 2013-04-01 16:35                   ` Al Viro
  2013-04-01 17:38                     ` Theodore Ts'o
  2013-04-02  8:19                   ` Dmitry Monakhov
  1 sibling, 1 reply; 14+ messages in thread
From: Al Viro @ 2013-04-01 16:35 UTC (permalink / raw)
  To: Theodore Ts'o; +Cc: Ext4 Developers List, linux-fsdevel

On Mon, Apr 01, 2013 at 11:23:42AM -0400, Theodore Ts'o wrote:
> In Documentation/filesystems/Locking, it's documented that
> write_begin() is guaranteed to be called with i_mutex locked.  The
> function __page_symlink() was not taking i_mutex before calling
> pagecache_write_begin(), which will eventually result in the file
> system's write_begin()'s function getting called.
> 
> Other callers of pagecache_write_begin such as in fs/splice.c, call
> pagecache_write_begin() with i_mutex locked, so fix __page_symlink()
> to be consistent.
> 
> This was discovered by the addition of a new ext4 debugging assertion
> which checked to make sure i_mutex was locked before calling
> ext4_truncate().

	I doubt that it's worth doing (inode has just been created and
nobody else should have references to it - it's not fully set up, after
all)...

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] fs: take i_mutex in __page_symlink()
  2013-04-01 16:35                   ` Al Viro
@ 2013-04-01 17:38                     ` Theodore Ts'o
  0 siblings, 0 replies; 14+ messages in thread
From: Theodore Ts'o @ 2013-04-01 17:38 UTC (permalink / raw)
  To: Al Viro; +Cc: Ext4 Developers List, linux-fsdevel

On Mon, Apr 01, 2013 at 05:35:38PM +0100, Al Viro wrote:
> > This was discovered by the addition of a new ext4 debugging assertion
> > which checked to make sure i_mutex was locked before calling
> > ext4_truncate().
> 
> 	I doubt that it's worth doing (inode has just been created and
> nobody else should have references to it - it's not fully set up, after
> all)...

Well, my other option is to drop the assert in ext4_truncate(), which
I thought was a good thing from a perspective of defensive
programming, or to grab the mutex in ext4_symlink() which is what
calles __page_symlink().

Would you prefer that we take the mutex in ext4_symlink() instead?

      	  	      	      	  	- Ted

^ permalink raw reply	[flat|nested] 14+ messages in thread

* Re: [PATCH] fs: take i_mutex in __page_symlink()
  2013-04-01 15:23                 ` [PATCH] fs: take i_mutex in __page_symlink() Theodore Ts'o
  2013-04-01 16:35                   ` Al Viro
@ 2013-04-02  8:19                   ` Dmitry Monakhov
  1 sibling, 0 replies; 14+ messages in thread
From: Dmitry Monakhov @ 2013-04-02  8:19 UTC (permalink / raw)
  To: Theodore Ts'o, Ext4 Developers List
  Cc: Theodore Ts'o, linux-fsdevel, Al Viro

[-- Attachment #1: Type: text/plain, Size: 1953 bytes --]

On Mon,  1 Apr 2013 11:23:42 -0400, Theodore Ts'o <tytso@mit.edu> wrote:
> In Documentation/filesystems/Locking, it's documented that
> write_begin() is guaranteed to be called with i_mutex locked.  The
> function __page_symlink() was not taking i_mutex before calling
> pagecache_write_begin(), which will eventually result in the file
> system's write_begin()'s function getting called.
> 
> Other callers of pagecache_write_begin such as in fs/splice.c, call
> pagecache_write_begin() with i_mutex locked, so fix __page_symlink()
> to be consistent.
> 
> This was discovered by the addition of a new ext4 debugging assertion
> which checked to make sure i_mutex was locked before calling
> ext4_truncate().
> 
> Reported-by: Zheng Liu <gnehzuil.liu@gmail.com>
> Signed-off-by: "Theodore Ts'o" <tytso@mit.edu>
> Cc: linux-fsdevel@vger.kernel.org
> Cc: Al Viro <viro@ZenIV.linux.org.uk>
> ---
> 
> Note: I plan to carry the following patch in the ext4 tree, unless
> someone objects or Al insists on carrying this in the vfs git tree.
> 
>  fs/namei.c | 2 ++
>  1 file changed, 2 insertions(+)
> 
> diff --git a/fs/namei.c b/fs/namei.c
> index 57ae9c8..548e57b 100644
> --- a/fs/namei.c
> +++ b/fs/namei.c
> @@ -4035,8 +4035,10 @@ int __page_symlink(struct inode *inode, const char *symname, int len, int nofs)
>  		flags |= AOP_FLAG_NOFS;
>  
>  retry:
> +	mutex_lock(&inode->i_mutex);
>  	err = pagecache_write_begin(NULL, mapping, 0, len-1,
>  				flags, &page, &fsdata);
> +	mutex_unlock(&inode->i_mutex);
Noo. Please do no do that. i_mutex should being hold from write_begin() to
write_end() because:
1) both functions contains one logical block (critical section)
2) write_end() may update i_size
3) write_end() may call truncate

And since we know that we want to lock i_mutex here only for
convention purposes (no one can access this inode yet) let's do that
correct. Original Zheng's patch was much better.
I have following patch in my queue:

[-- Warning: decoded text below may be mangled, UTF-8 assumed --]
[-- Attachment #2: 0001-patch-ext4_symlink.patch.patch --]
[-- Type: text/x-patch, Size: 887 bytes --]

>From c147d9ae5f9511f722a97179cd9f661e7e10417e Mon Sep 17 00:00:00 2001
From: Dmitry Monakhov <dmonakhov@openvz.org>
Date: Sun, 31 Mar 2013 17:35:38 +0400
Subject: [PATCH] patch ext4_symlink.patch


Signed-off-by: Dmitry Monakhov <dmonakhov@openvz.org>
---
 fs/namei.c |    3 +++
 1 files changed, 3 insertions(+), 0 deletions(-)

diff --git a/fs/namei.c b/fs/namei.c
index 57ae9c8..9dcdb27 100644
--- a/fs/namei.c
+++ b/fs/namei.c
@@ -4034,6 +4034,7 @@ int __page_symlink(struct inode *inode, const char *symname, int len, int nofs)
 	if (nofs)
 		flags |= AOP_FLAG_NOFS;
 
+	mutex_lock(&inode->i_mutex);
 retry:
 	err = pagecache_write_begin(NULL, mapping, 0, len-1,
 				flags, &page, &fsdata);
@@ -4052,8 +4053,10 @@ retry:
 		goto retry;
 
 	mark_inode_dirty(inode);
+	mutex_unlock(&inode->i_mutex);
 	return 0;
 fail:
+	mutex_unlock(&inode->i_mutex);
 	return err;
 }
 
-- 
1.7.1


[-- Attachment #3: Type: text/plain, Size: 270 bytes --]


>  	if (err)
>  		goto fail;
>  
> -- 
> 1.7.12.rc0.22.gcdd159b
> 
> --
> To unsubscribe from this list: send the line "unsubscribe linux-ext4" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply related	[flat|nested] 14+ messages in thread

end of thread, other threads:[~2013-04-02  8:19 UTC | newest]

Thread overview: 14+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2013-03-27 13:19 [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate Zheng Liu
2013-03-27 13:41 ` Theodore Ts'o
2013-03-27 14:02   ` Zheng Liu
2013-03-27 13:51     ` Theodore Ts'o
2013-03-27 15:07       ` Zheng Liu
2013-03-27 15:19         ` Zheng Liu
2013-03-27 15:12           ` Theodore Ts'o
2013-03-27 15:35             ` Zheng Liu
2013-03-28 14:06               ` Theodore Ts'o
2013-04-01 15:23                 ` [PATCH] fs: take i_mutex in __page_symlink() Theodore Ts'o
2013-04-01 16:35                   ` Al Viro
2013-04-01 17:38                     ` Theodore Ts'o
2013-04-02  8:19                   ` Dmitry Monakhov
2013-03-27 14:04     ` [PATCH] ext4: take i_mutex in ext4_symlink to eliminate a warning from ext4_truncate Zheng Liu

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).