Call trace in ext4_es_lru_add on 3.10 stable

linux-ext4.vger.kernel.org archive mirror
 help / color / mirror / Atom feed

* Call trace in ext4_es_lru_add on 3.10 stable
@ 2014-09-18 13:08 Stefan Priebe - Profihost AG
  2014-09-18 19:21 ` Theodore Ts'o
  0 siblings, 1 reply; 22+ messages in thread
From: Stefan Priebe - Profihost AG @ 2014-09-18 13:08 UTC (permalink / raw)
  To: linux-ext4
  Cc: p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable

Hi List,

I'm seeing this error pretty often at a vanilla 3.10.53 kernel.

Can anybody point me to a patch or tell me if this is a known bug?

Call Trace:
 [<ffffffffa0311e56>] ext4_es_lru_add+0x26/0x80 [ext4]
 [<ffffffffa0311f41>] ext4_es_lookup_extent+0x91/0x190 [ext4]
 [<ffffffffa02d66b3>] ext4_map_blocks+0x43/0x450 [ext4]
 [<ffffffffa02d8167>] _ext4_get_block+0x87/0x190 [ext4]
 [<ffffffffa02d82c6>] ext4_get_block+0x16/0x20 [ext4]
 [<ffffffff8117f63f>] generic_block_bmap+0x3f/0x50
 [<ffffffffa017e4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
 [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
 [<ffffffffa02d5d71>] ext4_bmap+0x91/0xf0 [ext4]
 [<ffffffff8116866e>] bmap+0x1e/0x30
 [<ffffffffa0187043>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
 [<ffffffffa01872fd>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
 [<ffffffffa017f238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
 [<ffffffff81084e03>] ? idle_balance+0xd3/0x110
 [<ffffffff8105a008>] ? lock_timer_base.isra.35+0x38/0x70
 [<ffffffffa018491a>] kjournald2+0xba/0x230 [jbd2]
 [<ffffffff81070350>] ? finish_wait+0x80/0x80
 [<ffffffffa0184860>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
 [<ffffffff8106fb50>] kthread+0xc0/0xd0
 [<ffffffff8106fa90>] ? kthread_create_on_node+0x130/0x130
 [<ffffffff815548ec>] ret_from_fork+0x7c/0xb0
 [<ffffffff8106fa90>] ? kthread_create_on_node+0x130/0x130

-- 
Mit freundlichen Grüßen
  Stefan Priebe
Bachelor of Science in Computer Science (BSCS)
Vorstand (CTO)

-------------------------------
Profihost AG
Expo Plaza 1
30539 Hannover
Deutschland

Tel.: +49 (511) 5151 8181     | Fax.: +49 (511) 5151 8282
URL: http://www.profihost.com | E-Mail: info@profihost.com

Sitz der Gesellschaft: Hannover, USt-IdNr. DE813460827
Registergericht: Amtsgericht Hannover, Register-Nr.: HRB 202350
Vorstand: Cristoph Bluhm, Sebastian Bluhm, Stefan Priebe
Aufsichtsrat: Prof. Dr. iur. Winfried Huck (Vorsitzender)

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-18 13:08 Call trace in ext4_es_lru_add on 3.10 stable Stefan Priebe - Profihost AG
@ 2014-09-18 19:21 ` Theodore Ts'o
  2014-09-18 19:29   ` Stefan Priebe
  0 siblings, 1 reply; 22+ messages in thread
From: Theodore Ts'o @ 2014-09-18 19:21 UTC (permalink / raw)
  To: Stefan Priebe - Profihost AG
  Cc: linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable

On Thu, Sep 18, 2014 at 03:08:10PM +0200, Stefan Priebe - Profihost AG wrote:
> Hi List,
> 
> I'm seeing this error pretty often at a vanilla 3.10.53 kernel.
> 
> Can anybody point me to a patch or tell me if this is a known bug?

This is just call trace; can you send the complete set of kernel
messages?  Please send  everything starting a few lines before the

---------------- [ cut here ] ----------

line, and then going all the way to the 

----------- [ end trace XXXXXXXXXX ] --------

line.

Cheers,

				- Ted

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-18 19:21 ` Theodore Ts'o
@ 2014-09-18 19:29   ` Stefan Priebe
  2014-09-18 19:43     ` Theodore Ts'o
  0 siblings, 1 reply; 22+ messages in thread
From: Stefan Priebe @ 2014-09-18 19:29 UTC (permalink / raw)
  To: Theodore Ts'o
  Cc: linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable


Am 18.09.2014 21:21, schrieb Theodore Ts'o:
> On Thu, Sep 18, 2014 at 03:08:10PM +0200, Stefan Priebe - Profihost AG wrote:
>> Hi List,
>>
>> I'm seeing this error pretty often at a vanilla 3.10.53 kernel.
>>
>> Can anybody point me to a patch or tell me if this is a known bug?
>
> This is just call trace; can you send the complete set of kernel
> messages?  Please send  everything starting a few lines before the
>
> ---------------- [ cut here ] ----------
>
> line, and then going all the way to the
>
> ----------- [ end trace XXXXXXXXXX ] --------


Sorry but whole output is:
2014-09-18 02:30:34     0000000000000000 ffff881021663b20 
ffff881021663b08 ffffffffa02d66b3
2014-09-18 02:30:34     Call Trace:
2014-09-18 02:30:34     [<ffffffffa0311e56>] ext4_es_lru_add+0x26/0x80 
[ext4]
2014-09-18 02:30:34     [<ffffffffa0311f41>] 
ext4_es_lookup_extent+0x91/0x190 [ext4]
2014-09-18 02:30:34     [<ffffffffa02d66b3>] ext4_map_blocks+0x43/0x450 
[ext4]
2014-09-18 02:30:34     [<ffffffffa02d8167>] _ext4_get_block+0x87/0x190 
[ext4]
2014-09-18 02:30:34     [<ffffffffa02d82c6>] ext4_get_block+0x16/0x20 [ext4]
2014-09-18 02:30:34     [<ffffffff8117f63f>] generic_block_bmap+0x3f/0x50
2014-09-18 02:30:34     [<ffffffffa017e4ae>] ? 
jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
2014-09-18 02:30:34     [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
2014-09-18 02:30:34     [<ffffffffa02d5d71>] ext4_bmap+0x91/0xf0 [ext4]
2014-09-18 02:30:34     [<ffffffff8116866e>] bmap+0x1e/0x30
2014-09-18 02:30:34     [<ffffffffa0187043>] jbd2_journal_bmap+0x33/0xb0 
[jbd2]
2014-09-18 02:30:34     [<ffffffffa01872fd>] 
jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
2014-09-18 02:30:34     [<ffffffffa017f238>] 
jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
2014-09-18 02:30:34     [<ffffffff81084e03>] ? idle_balance+0xd3/0x110
2014-09-18 02:30:34     [<ffffffff8105a008>] ? 
lock_timer_base.isra.35+0x38/0x70
2014-09-18 02:30:34     [<ffffffffa018491a>] kjournald2+0xba/0x230 [jbd2]
2014-09-18 02:30:34     [<ffffffff81070350>] ? finish_wait+0x80/0x80
2014-09-18 02:30:34     [<ffffffffa0184860>] ? 
jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
2014-09-18 02:30:34     [<ffffffff8106fb50>] kthread+0xc0/0xd0
2014-09-18 02:30:34     [<ffffffff8106fa90>] ? 
kthread_create_on_node+0x130/0x130
2014-09-18 02:30:34     [<ffffffff815548ec>] ret_from_fork+0x7c/0xb0
2014-09-18 02:30:34     [<ffffffff8106fa90>] ? 
kthread_create_on_node+0x130/0x130
2014-09-18 02:30:34     Code: 84 00 00 00 00 00 66 66 66 66 90 55 48 89 
e5 b8 00 00 01 00 f0 0f c1 07 89 c2 c1 ea 10 66 39 c2 74 0e 0f 1f 40 00 
f3 90 0f b7 07 <66> 39 d0 75 f6 5d c3 90 90 90 90 66 66 66 66 90 55 48 
89 e5 53
2014-09-18 02:30:34     BUG: soft lockup - CPU#1 stuck for 23s! 
[jbd2/sdb1-8:1795]
2014-09-18 02:30:34     Modules linked in: nf_conntrack_ipv4 
nf_defrag_ipv4 xt_state nf_conntrack xt_tcpudp xt_owner ipt_REJECT 
xt_multiport mpt2sas raid_class iptable_filter ip_tables x_tables 
cpufreq_userspace cpufreq_powersave cpufreq_conservative 
cpufreq_ondemand 8021q garp ext4 crc16 jbd2 mbcache ext2 mperf usbhid 
coretemp kvm_intel kvm crc32_pclmul ehci_pci ghash_clmulni_intel sb_edac 
ehci_hcd edac_core microcode usbcore i2c_i801 usb_common button 
netconsole sg sd_mod igb i2c_algo_bit isci i2c_core libsas ahci ptp 
libahci scsi_transport_sas pps_core megaraid_sas
2014-09-18 02:30:34     CPU: 1 PID: 1795 Comm: jbd2/sdb1-8 Not tainted 
3.10.53+85-ph #1
2014-09-18 02:30:34     Hardware name: XX
2014-09-18 02:30:34     task: ffff881020d90000 ti: ffff881021662000 
task.ti: ffff881021662000
2014-09-18 02:30:34     RIP: 0010:[<ffffffff81553cb5>] 
[<ffffffff81553cb5>] _raw_spin_lock+0x25/0x30
2014-09-18 02:30:34     RSP: 0018:ffff881021663a38 EFLAGS: 00000297
2014-09-18 02:30:34     RAX: 000000000000c7a2 RBX: ffff881020800310 RCX: 
0000000000000000
2014-09-18 02:30:34     RDX: 000000000000c7a3 RSI: 00000000000046b6 RDI: 
ffff8810265c9440
2014-09-18 02:30:34     RBP: ffff881021663a38 R08: ffffffffa02d82b0 R09: 
ffffea001d48dd40
2014-09-18 02:30:34     R10: 0000000000000000 R11: 0000000000000000 R12: 
0000000000000020
2014-09-18 02:30:34     R13: 000000000000c79f R14: 0000000000000000 R15: 
ffff88107fc359c0
2014-09-18 02:30:34     FS: 0000000000000000(0000) 
GS:ffff88107fc20000(0000) knlGS:0000000000000000
2014-09-18 02:30:34     CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
2014-09-18 02:30:34     CR2: 000000000509d700 CR3: 0000000001a0b000 CR4: 
00000000000407e0
2014-09-18 02:30:34     DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
2014-09-18 02:30:34     DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 
0000000000000400Stack:
2014-09-18 02:30:34     ffff881021663a58 ffffffffa0311e56 
00000000000046b6 ffff8810208000b0
2014-09-18 02:30:34     ffff881021663a88 ffffffffa0311f41 
ffff8810208000b0 0000000000000000


Stefan

>
> line.
>
> Cheers,
>
> 				- Ted
>

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-18 19:29   ` Stefan Priebe
@ 2014-09-18 19:43     ` Theodore Ts'o
  2014-09-22  6:56       ` Stefan Priebe
  0 siblings, 1 reply; 22+ messages in thread
From: Theodore Ts'o @ 2014-09-18 19:43 UTC (permalink / raw)
  To: Stefan Priebe
  Cc: linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable

On Thu, Sep 18, 2014 at 09:29:37PM +0200, Stefan Priebe wrote:
> 
> Sorry but whole output is:
> 2014-09-18 02:30:34     0000000000000000 ffff881021663b20 ffff881021663b08
> ffffffffa02d66b3
		...

That's not the whole message; you just weren't able to capture it all.
How are you capturing these messages, by the way?  Serial console?   

It this reproducible?   Can you try a newer kernel?

Cheers,

					- Ted

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-18 19:43     ` Theodore Ts'o
@ 2014-09-22  6:56       ` Stefan Priebe
  2014-09-22 16:47         ` Theodore Ts'o
  0 siblings, 1 reply; 22+ messages in thread
From: Stefan Priebe @ 2014-09-22  6:56 UTC (permalink / raw)
  To: Theodore Ts'o
  Cc: linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable

Hi Ted,

Am 18.09.2014 21:43, schrieb Theodore Ts'o:
> On Thu, Sep 18, 2014 at 09:29:37PM +0200, Stefan Priebe wrote:
>>
>> Sorry but whole output is:
>> 2014-09-18 02:30:34     0000000000000000 ffff881021663b20 ffff881021663b08
>> ffffffffa02d66b3
> 		...
>
> That's not the whole message; you just weren't able to capture it all.
> How are you capturing these messages, by the way?  Serial console?

Sorry this was an incomplete copy and paste by me.

Here is the complete output:
[1578544.839610] BUG: soft lockup - CPU#7 stuck for 22s! [mysqld:29281]
[1578544.893450] Modules linked in: nf_conntrack_ipv4 nf_defrag_ipv4 
xt_state nf_conntrack xt_tcpudp xt_owner mpt2sas raid_class ipt_REJECT 
xt_multiport iptable_filter ip_tables x_tables cpufreq_userspace 
cpufreq_powersave cpufreq_conservative cpufreq_ondemand 8021q garp ext4 
crc16 jbd2 mbcache ext2 k8temp ehci_pci mperf coretemp kvm_intel kvm 
crc32_pclmul ehci_hcd ghash_clmulni_intel sb_edac edac_core usbcore 
i2c_i801 microcode usb_common button netconsole sg sd_mod igb 
i2c_algo_bit isci i2c_core libsas ahci ptp libahci scsi_transport_sas 
megaraid_sas pps_core
[1578545.192373] CPU: 7 PID: 29281 Comm: mysqld Tainted: G        W 
3.10.53+85-ph #1
[1578545.254369] Hardware name: Supermicro 
X9SRE/X9SRE-3F/X9SRi/X9SRi-3F/X9SRE/X9SRE-3F/X9SRi/X9SRi-3F, BIOS 1.0a 
03/06/2012
[1578545.317333] task: ffff880d5bab9900 ti: ffff880048da4000 task.ti: 
ffff880048da4000
[1578545.380284] RIP: 0010:[<ffffffff81553cb2>]  [<ffffffff81553cb2>] 
_raw_spin_lock+0x22/0x30
[1578545.444138] RSP: 0000:ffff880048da5878  EFLAGS: 00000297
[1578545.507802] RAX: 000000000000f53c RBX: ffffffffa0372a69 RCX: 
000000008802cc10
[1578545.571007] RDX: 000000000000f53d RSI: 0000000000000000 RDI: 
ffff8810265a6440
[1578545.632916] RBP: ffff880048da5878 R08: 1038000000000000 R09: 
0ab3417d081c0000
[1578545.694103] R10: 0000000000000005 R11: dead000000100100 R12: 
ffffffff812ba03b
[1578545.755009] R13: ffff880048da57e8 R14: ffffffff810fa58c R15: 
ffff880048da58a8
[1578545.815734] FS:  00007f55e0d1e700(0000) GS:ffff88107fdc0000(0000) 
knlGS:0000000000000000
[1578545.877485] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[1578545.939121] CR2: 00007f5505000000 CR3: 0000001024ba0000 CR4: 
00000000000407e0
[1578546.001641] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 
0000000000000000
[1578546.064081] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 
0000000000000400
[1578546.125544] Stack:
[1578546.186027]  ffff880048da5898 ffffffffa0373350 ffff880ab3417c70 
ffff880ab3417c70
[1578546.248189]  ffff880048da58b8 ffffffffa03548b5 ffff880ab3417db8 
ffff880ab3417db8
[1578546.310187]  ffff880048da58d8 ffffffffa033cf43 ffff880ab3417c70 
ffff880ab3417d70
[1578546.371830] Call Trace:
[1578546.432274]  [<ffffffffa0373350>] ext4_es_lru_del+0x30/0x80 [ext4]
[1578546.493276]  [<ffffffffa03548b5>] ext4_clear_inode+0x45/0x90 [ext4]
[1578546.554093]  [<ffffffffa033cf43>] ext4_evict_inode+0x83/0x4d0 [ext4]
[1578546.614444]  [<ffffffff81169ef0>] evict+0xb0/0x1b0
[1578546.673944]  [<ffffffff8116a031>] dispose_list+0x41/0x50
[1578546.733061]  [<ffffffff8116ae23>] prune_icache_sb+0x183/0x340
[1578546.792425]  [<ffffffff81154c7b>] prune_super+0x17b/0x1b0
[1578546.851603]  [<ffffffff810fd0f1>] shrink_slab+0x151/0x2e0
[1578546.910609]  [<ffffffff8110dd22>] ? compact_zone+0x32/0x430
[1578546.969573]  [<ffffffff810ffc55>] do_try_to_free_pages+0x405/0x540
[1578547.028754]  [<ffffffff810fffc8>] try_to_free_pages+0xf8/0x180
[1578547.087924]  [<ffffffff810f5d63>] __alloc_pages_nodemask+0x553/0x900
[1578547.147165]  [<ffffffff81131a05>] alloc_pages_vma+0xa5/0x150
[1578547.206584]  [<ffffffff811445a4>] 
do_huge_pmd_anonymous_page+0x174/0x3d0
[1578547.265245]  [<ffffffff8111c568>] ? change_protection+0x5b8/0x670
[1578547.322947]  [<ffffffff81114a22>] handle_mm_fault+0x292/0x340
[1578547.379690]  [<ffffffff81032b68>] __do_page_fault+0x168/0x460
[1578547.434929]  [<ffffffff8111c777>] ? mprotect_fixup+0x157/0x280
[1578547.488655]  [<ffffffff8111851b>] ? remove_vma+0x5b/0x70
[1578547.541197]  [<ffffffff81032e9e>] do_page_fault+0xe/0x10
[1578547.594037]  [<ffffffff81554242>] page_fault+0x22/0x30

> It this reproducible?   Can you try a newer kernel?

I'm seeing this on various systems doing rsync backups to an ext4 
partition. I can't try a newer kernel. But i also don't have exact steps 
to reproduce. It just happens sometimes.

Greets,
Stefan

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-22  6:56       ` Stefan Priebe
@ 2014-09-22 16:47         ` Theodore Ts'o
  2014-09-22 18:29           ` Stefan Priebe
  0 siblings, 1 reply; 22+ messages in thread
From: Theodore Ts'o @ 2014-09-22 16:47 UTC (permalink / raw)
  To: Stefan Priebe
  Cc: linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable

On Mon, Sep 22, 2014 at 08:56:23AM +0200, Stefan Priebe wrote:
> >That's not the whole message; you just weren't able to capture it all.
> >How are you capturing these messages, by the way?  Serial console?
> 
> Sorry this was an incomplete copy and paste by me.
> 
> Here is the complete output:
> [1578544.839610] BUG: soft lockup - CPU#7 stuck for 22s! [mysqld:29281]
> [1578544.893450] Modules linked in: nf_conntrack_ipv4 nf_defrag_ipv4

OK, thanks, this is a known bug, where when ext4 is under heavy memory
pressure, we can end up stalling in reclaim.  This message indicates
that the system got stalled for 22 seconds, which is not good, since
it impacts the interactivity of your system, and increases the
long-tail latency of requests to servers running on your system, but
it doesn't cause any data loss or will cause any of your processes to
crash or otherwise stop functioning (except for temporarily).

It's something that we are working on, and there are patches which
Zheng Liu submitted that still need a bit of polishing, but I hope to
have it addressed soon.

Cheers,

						- Ted

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-22 16:47         ` Theodore Ts'o
@ 2014-09-22 18:29           ` Stefan Priebe
  2014-09-22 20:20             ` Theodore Ts'o
  0 siblings, 1 reply; 22+ messages in thread
From: Stefan Priebe @ 2014-09-22 18:29 UTC (permalink / raw)
  To: Theodore Ts'o
  Cc: linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable

Hi,
Am 22.09.2014 18:47, schrieb Theodore Ts'o:
> On Mon, Sep 22, 2014 at 08:56:23AM +0200, Stefan Priebe wrote:
>>> That's not the whole message; you just weren't able to capture it all.
>>> How are you capturing these messages, by the way?  Serial console?
>>
>> Sorry this was an incomplete copy and paste by me.
>>
>> Here is the complete output:
>> [1578544.839610] BUG: soft lockup - CPU#7 stuck for 22s! [mysqld:29281]
>> [1578544.893450] Modules linked in: nf_conntrack_ipv4 nf_defrag_ipv4
>
> OK, thanks, this is a known bug, where when ext4 is under heavy memory
> pressure, we can end up stalling in reclaim.  This message indicates
> that the system got stalled for 22 seconds, which is not good, since
> it impacts the interactivity of your system, and increases the
> long-tail latency of requests to servers running on your system, but
> it doesn't cause any data loss or will cause any of your processes to
> crash or otherwise stop functioning (except for temporarily).
>
> It's something that we are working on, and there are patches which
> Zheng Liu submitted that still need a bit of polishing, but I hope to
> have it addressed soon.

Thanks for your feedback. Will those patches go to stable? Any link to 
those patches?

Stefan

>
> Cheers,
>
> 						- Ted
>

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-22 18:29           ` Stefan Priebe
@ 2014-09-22 20:20             ` Theodore Ts'o
  2014-09-23  7:50               ` Stefan Priebe - Profihost AG
  0 siblings, 1 reply; 22+ messages in thread
From: Theodore Ts'o @ 2014-09-22 20:20 UTC (permalink / raw)
  To: Stefan Priebe
  Cc: linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable

On Mon, Sep 22, 2014 at 08:29:54PM +0200, Stefan Priebe wrote:
> Hi,
> Am 22.09.2014 18:47, schrieb Theodore Ts'o:
> >On Mon, Sep 22, 2014 at 08:56:23AM +0200, Stefan Priebe wrote:
> >>>That's not the whole message; you just weren't able to capture it all.
> >>>How are you capturing these messages, by the way?  Serial console?
> >>
> >>Sorry this was an incomplete copy and paste by me.
> >>
> >>Here is the complete output:
> >>[1578544.839610] BUG: soft lockup - CPU#7 stuck for 22s! [mysqld:29281]
> >>[1578544.893450] Modules linked in: nf_conntrack_ipv4 nf_defrag_ipv4
> >
> >OK, thanks, this is a known bug, where when ext4 is under heavy memory
> >pressure, we can end up stalling in reclaim.  This message indicates
> >that the system got stalled for 22 seconds, which is not good, since
> >it impacts the interactivity of your system, and increases the
> >long-tail latency of requests to servers running on your system, but
> >it doesn't cause any data loss or will cause any of your processes to
> >crash or otherwise stop functioning (except for temporarily).
> >
> >It's something that we are working on, and there are patches which
> >Zheng Liu submitted that still need a bit of polishing, but I hope to
> >have it addressed soon.
> 
> Thanks for your feedback. Will those patches go to stable? Any link to
> those patches?

I'm not sure they will go to Stable when they are ready, because the
patches are somewhat complex and so they may not apply cleanly to much
older kernels.

The patches under discussion (some have been applied, others hae been
waiting for some requested changes) can be found here:

http://patchwork.ozlabs.org/patch/377720
http://patchwork.ozlabs.org/patch/377721
http://patchwork.ozlabs.org/patch/377722
http://patchwork.ozlabs.org/patch/377723
http://patchwork.ozlabs.org/patch/377724
http://patchwork.ozlabs.org/patch/377725
http://patchwork.ozlabs.org/patch/377727

						- Ted

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-22 20:20             ` Theodore Ts'o
@ 2014-09-23  7:50               ` Stefan Priebe - Profihost AG
  2014-09-23  9:42                 ` Jan Kara
  0 siblings, 1 reply; 22+ messages in thread
From: Stefan Priebe - Profihost AG @ 2014-09-23  7:50 UTC (permalink / raw)
  To: Theodore Ts'o
  Cc: linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable


Am 22.09.2014 um 22:20 schrieb Theodore Ts'o:
> On Mon, Sep 22, 2014 at 08:29:54PM +0200, Stefan Priebe wrote:
>> Hi,
>> Am 22.09.2014 18:47, schrieb Theodore Ts'o:
>>> On Mon, Sep 22, 2014 at 08:56:23AM +0200, Stefan Priebe wrote:
>>>>> That's not the whole message; you just weren't able to capture it all.
>>>>> How are you capturing these messages, by the way?  Serial console?
>>>>
>>>> Sorry this was an incomplete copy and paste by me.
>>>>
>>>> Here is the complete output:
>>>> [1578544.839610] BUG: soft lockup - CPU#7 stuck for 22s! [mysqld:29281]
>>>> [1578544.893450] Modules linked in: nf_conntrack_ipv4 nf_defrag_ipv4
>>>
>>> OK, thanks, this is a known bug, where when ext4 is under heavy memory
>>> pressure, we can end up stalling in reclaim.  This message indicates
>>> that the system got stalled for 22 seconds, which is not good, since
>>> it impacts the interactivity of your system, and increases the
>>> long-tail latency of requests to servers running on your system, but
>>> it doesn't cause any data loss or will cause any of your processes to
>>> crash or otherwise stop functioning (except for temporarily).
>>>
>>> It's something that we are working on, and there are patches which
>>> Zheng Liu submitted that still need a bit of polishing, but I hope to
>>> have it addressed soon.
>>
>> Thanks for your feedback. Will those patches go to stable? Any link to
>> those patches?
> 
> I'm not sure they will go to Stable when they are ready, because the
> patches are somewhat complex and so they may not apply cleanly to much
> older kernels.
> 
> The patches under discussion (some have been applied, others hae been
> waiting for some requested changes) can be found here:
> 
> http://patchwork.ozlabs.org/patch/377720
> http://patchwork.ozlabs.org/patch/377721
> http://patchwork.ozlabs.org/patch/377722
> http://patchwork.ozlabs.org/patch/377723
> http://patchwork.ozlabs.org/patch/377724
> http://patchwork.ozlabs.org/patch/377725
> http://patchwork.ozlabs.org/patch/377727

hui that's a lot. Are they ALL needed to fix this? No workaround
possible? What will Redhat do with their 3.10 RHEL 7 kernel?

Stefan

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-23  7:50               ` Stefan Priebe - Profihost AG
@ 2014-09-23  9:42                 ` Jan Kara
  2014-09-23 12:23                   ` Stefan Priebe
  0 siblings, 1 reply; 22+ messages in thread
From: Jan Kara @ 2014-09-23  9:42 UTC (permalink / raw)
  To: Stefan Priebe - Profihost AG
  Cc: Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable

On Tue 23-09-14 09:50:25, Stefan Priebe - Profihost AG wrote:
> 
> Am 22.09.2014 um 22:20 schrieb Theodore Ts'o:
> > On Mon, Sep 22, 2014 at 08:29:54PM +0200, Stefan Priebe wrote:
> >> Hi,
> >> Am 22.09.2014 18:47, schrieb Theodore Ts'o:
> >>> On Mon, Sep 22, 2014 at 08:56:23AM +0200, Stefan Priebe wrote:
> >>>>> That's not the whole message; you just weren't able to capture it all.
> >>>>> How are you capturing these messages, by the way?  Serial console?
> >>>>
> >>>> Sorry this was an incomplete copy and paste by me.
> >>>>
> >>>> Here is the complete output:
> >>>> [1578544.839610] BUG: soft lockup - CPU#7 stuck for 22s! [mysqld:29281]
> >>>> [1578544.893450] Modules linked in: nf_conntrack_ipv4 nf_defrag_ipv4
> >>>
> >>> OK, thanks, this is a known bug, where when ext4 is under heavy memory
> >>> pressure, we can end up stalling in reclaim.  This message indicates
> >>> that the system got stalled for 22 seconds, which is not good, since
> >>> it impacts the interactivity of your system, and increases the
> >>> long-tail latency of requests to servers running on your system, but
> >>> it doesn't cause any data loss or will cause any of your processes to
> >>> crash or otherwise stop functioning (except for temporarily).
> >>>
> >>> It's something that we are working on, and there are patches which
> >>> Zheng Liu submitted that still need a bit of polishing, but I hope to
> >>> have it addressed soon.
> >>
> >> Thanks for your feedback. Will those patches go to stable? Any link to
> >> those patches?
> > 
> > I'm not sure they will go to Stable when they are ready, because the
> > patches are somewhat complex and so they may not apply cleanly to much
> > older kernels.
> > 
> > The patches under discussion (some have been applied, others hae been
> > waiting for some requested changes) can be found here:
> > 
> > http://patchwork.ozlabs.org/patch/377720
> > http://patchwork.ozlabs.org/patch/377721
> > http://patchwork.ozlabs.org/patch/377722
> > http://patchwork.ozlabs.org/patch/377723
> > http://patchwork.ozlabs.org/patch/377724
> > http://patchwork.ozlabs.org/patch/377725
> > http://patchwork.ozlabs.org/patch/377727
> 
> hui that's a lot. Are they ALL needed to fix this?
  Yes, all of them are needed.

> No workaround possible?
  I don't know about any.

> What will Redhat do with their 3.10 RHEL 7 kernel?
  Well, I cannot speak for RH guys but for SLES if there's a customer
request, we'll just go and backport the patches...
								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-23  9:42                 ` Jan Kara
@ 2014-09-23 12:23                   ` Stefan Priebe
  2014-09-23 14:43                     ` Jan Kara
  0 siblings, 1 reply; 22+ messages in thread
From: Stefan Priebe @ 2014-09-23 12:23 UTC (permalink / raw)
  To: Jan Kara
  Cc: Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable


Am 23.09.2014 11:42, schrieb Jan Kara:
> On Tue 23-09-14 09:50:25, Stefan Priebe - Profihost AG wrote:
>>
>> Am 22.09.2014 um 22:20 schrieb Theodore Ts'o:
>>> On Mon, Sep 22, 2014 at 08:29:54PM +0200, Stefan Priebe wrote:
>>>> Hi,
>>>> Am 22.09.2014 18:47, schrieb Theodore Ts'o:
>>>>> On Mon, Sep 22, 2014 at 08:56:23AM +0200, Stefan Priebe wrote:
>>>>>>> That's not the whole message; you just weren't able to capture it all.
>>>>>>> How are you capturing these messages, by the way?  Serial console?
>>>>>>
>>>>>> Sorry this was an incomplete copy and paste by me.
>>>>>>
>>>>>> Here is the complete output:
>>>>>> [1578544.839610] BUG: soft lockup - CPU#7 stuck for 22s! [mysqld:29281]
>>>>>> [1578544.893450] Modules linked in: nf_conntrack_ipv4 nf_defrag_ipv4
>>>>>
>>>>> OK, thanks, this is a known bug, where when ext4 is under heavy memory
>>>>> pressure, we can end up stalling in reclaim.  This message indicates
>>>>> that the system got stalled for 22 seconds, which is not good, since
>>>>> it impacts the interactivity of your system, and increases the
>>>>> long-tail latency of requests to servers running on your system, but
>>>>> it doesn't cause any data loss or will cause any of your processes to
>>>>> crash or otherwise stop functioning (except for temporarily).
>>>>>
>>>>> It's something that we are working on, and there are patches which
>>>>> Zheng Liu submitted that still need a bit of polishing, but I hope to
>>>>> have it addressed soon.
>>>>
>>>> Thanks for your feedback. Will those patches go to stable? Any link to
>>>> those patches?
>>>
>>> I'm not sure they will go to Stable when they are ready, because the
>>> patches are somewhat complex and so they may not apply cleanly to much
>>> older kernels.
>>>
>>> The patches under discussion (some have been applied, others hae been
>>> waiting for some requested changes) can be found here:
>>>
>>> http://patchwork.ozlabs.org/patch/377720
>>> http://patchwork.ozlabs.org/patch/377721
>>> http://patchwork.ozlabs.org/patch/377722
>>> http://patchwork.ozlabs.org/patch/377723
>>> http://patchwork.ozlabs.org/patch/377724
>>> http://patchwork.ozlabs.org/patch/377725
>>> http://patchwork.ozlabs.org/patch/377727
>>
>> hui that's a lot. Are they ALL needed to fix this?
>    Yes, all of them are needed.

How can i get notified when they're ready / polished?

Stefan

>> No workaround possible?
>    I don't know about any.
>
>> What will Redhat do with their 3.10 RHEL 7 kernel?
>    Well, I cannot speak for RH guys but for SLES if there's a customer
> request, we'll just go and backport the patches...
> 								Honza
>

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-23 12:23                   ` Stefan Priebe
@ 2014-09-23 14:43                     ` Jan Kara
  2014-11-26  8:06                       ` Stefan Priebe - Profihost AG
  0 siblings, 1 reply; 22+ messages in thread
From: Jan Kara @ 2014-09-23 14:43 UTC (permalink / raw)
  To: Stefan Priebe
  Cc: Jan Kara, Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu

On Tue 23-09-14 14:23:29, Stefan Priebe wrote:
> 
> Am 23.09.2014 11:42, schrieb Jan Kara:
> >On Tue 23-09-14 09:50:25, Stefan Priebe - Profihost AG wrote:
> >>
> >>Am 22.09.2014 um 22:20 schrieb Theodore Ts'o:
> >>>On Mon, Sep 22, 2014 at 08:29:54PM +0200, Stefan Priebe wrote:
> >>>>Hi,
> >>>>Am 22.09.2014 18:47, schrieb Theodore Ts'o:
> >>>>>On Mon, Sep 22, 2014 at 08:56:23AM +0200, Stefan Priebe wrote:
> >>>>>>>That's not the whole message; you just weren't able to capture it all.
> >>>>>>>How are you capturing these messages, by the way?  Serial console?
> >>>>>>
> >>>>>>Sorry this was an incomplete copy and paste by me.
> >>>>>>
> >>>>>>Here is the complete output:
> >>>>>>[1578544.839610] BUG: soft lockup - CPU#7 stuck for 22s! [mysqld:29281]
> >>>>>>[1578544.893450] Modules linked in: nf_conntrack_ipv4 nf_defrag_ipv4
> >>>>>
> >>>>>OK, thanks, this is a known bug, where when ext4 is under heavy memory
> >>>>>pressure, we can end up stalling in reclaim.  This message indicates
> >>>>>that the system got stalled for 22 seconds, which is not good, since
> >>>>>it impacts the interactivity of your system, and increases the
> >>>>>long-tail latency of requests to servers running on your system, but
> >>>>>it doesn't cause any data loss or will cause any of your processes to
> >>>>>crash or otherwise stop functioning (except for temporarily).
> >>>>>
> >>>>>It's something that we are working on, and there are patches which
> >>>>>Zheng Liu submitted that still need a bit of polishing, but I hope to
> >>>>>have it addressed soon.
> >>>>
> >>>>Thanks for your feedback. Will those patches go to stable? Any link to
> >>>>those patches?
> >>>
> >>>I'm not sure they will go to Stable when they are ready, because the
> >>>patches are somewhat complex and so they may not apply cleanly to much
> >>>older kernels.
> >>>
> >>>The patches under discussion (some have been applied, others hae been
> >>>waiting for some requested changes) can be found here:
> >>>
> >>>http://patchwork.ozlabs.org/patch/377720
> >>>http://patchwork.ozlabs.org/patch/377721
> >>>http://patchwork.ozlabs.org/patch/377722
> >>>http://patchwork.ozlabs.org/patch/377723
> >>>http://patchwork.ozlabs.org/patch/377724
> >>>http://patchwork.ozlabs.org/patch/377725
> >>>http://patchwork.ozlabs.org/patch/377727
> >>
> >>hui that's a lot. Are they ALL needed to fix this?
> >   Yes, all of them are needed.
> 
> How can i get notified when they're ready / polished?
  Watching changes to fs/ext4/extents_status.c is probably the most
reliable. Or maybe Zheng (Cced) can add you to CC list when submitting the
patch set next time.

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-09-23 14:43                     ` Jan Kara
@ 2014-11-26  8:06                       ` Stefan Priebe - Profihost AG
  2014-11-26  8:25                         ` Jan Kara
  0 siblings, 1 reply; 22+ messages in thread
From: Stefan Priebe - Profihost AG @ 2014-11-26  8:06 UTC (permalink / raw)
  To: Jan Kara
  Cc: Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu

Hi all,

i'm still getting a lot of those call traces:
"
Call Trace:
 [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
 [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
 [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
 [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
 [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
 [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
 [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
 [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
 [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
 [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
 [<ffffffff811686de>] bmap+0x1e/0x30
 [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
 [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
 [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
 [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
 [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
 [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
 [<ffffffff81070360>] ? finish_wait+0x80/0x80
 [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
 [<ffffffff8106fb60>] kthread+0xc0/0xd0
 [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
 [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
 [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
"

Is there any chance to fix them in vanilla 3.10.61?

Greets,
Stefan

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-11-26  8:06                       ` Stefan Priebe - Profihost AG
@ 2014-11-26  8:25                         ` Jan Kara
  2014-11-26 10:28                           ` Stefan Priebe - Profihost AG
  2014-11-26 15:11                           ` Stefan Priebe - Profihost AG
  0 siblings, 2 replies; 22+ messages in thread
From: Jan Kara @ 2014-11-26  8:25 UTC (permalink / raw)
  To: Stefan Priebe - Profihost AG
  Cc: Jan Kara, Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu

[-- Attachment #1: Type: text/plain, Size: 1876 bytes --]

  Hi,

On Wed 26-11-14 09:06:43, Stefan Priebe - Profihost AG wrote:
> i'm still getting a lot of those call traces:
> "
> Call Trace:
>  [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
>  [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
>  [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
>  [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
>  [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
>  [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
>  [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
>  [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
>  [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
>  [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
>  [<ffffffff811686de>] bmap+0x1e/0x30
>  [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
>  [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
>  [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
>  [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
>  [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
>  [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
>  [<ffffffff81070360>] ? finish_wait+0x80/0x80
>  [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
>  [<ffffffff8106fb60>] kthread+0xc0/0xd0
>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>  [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
> "
> 
> Is there any chance to fix them in vanilla 3.10.61?
  Ted is just testing patches to fix these. You are welcome if you can give
them a try as well (tarball attached). I'm not sure patches will be
backported as far as to 3.10-stable but when the patches get some testing
in mainline, I'll be porting them to 3.12-stable for our enterprise
kernel...

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

[-- Attachment #2: ext4_status_shrinker.tar.gz --]
[-- Type: application/x-compressed-tar, Size: 16847 bytes --]

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-11-26  8:25                         ` Jan Kara
@ 2014-11-26 10:28                           ` Stefan Priebe - Profihost AG
  2014-11-26 10:38                             ` Jan Kara
  2014-11-26 15:11                           ` Stefan Priebe - Profihost AG
  1 sibling, 1 reply; 22+ messages in thread
From: Stefan Priebe - Profihost AG @ 2014-11-26 10:28 UTC (permalink / raw)
  To: Jan Kara
  Cc: Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu

Am 26.11.2014 um 09:25 schrieb Jan Kara:
>   Hi,
> 
> On Wed 26-11-14 09:06:43, Stefan Priebe - Profihost AG wrote:
>> i'm still getting a lot of those call traces:
>> "
>> Call Trace:
>>  [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
>>  [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
>>  [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
>>  [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
>>  [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
>>  [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
>>  [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
>>  [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
>>  [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
>>  [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
>>  [<ffffffff811686de>] bmap+0x1e/0x30
>>  [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
>>  [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
>>  [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
>>  [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
>>  [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
>>  [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
>>  [<ffffffff81070360>] ? finish_wait+0x80/0x80
>>  [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
>>  [<ffffffff8106fb60>] kthread+0xc0/0xd0
>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>>  [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>> "
>>
>> Is there any chance to fix them in vanilla 3.10.61?
>   Ted is just testing patches to fix these. You are welcome if you can give
> them a try as well (tarball attached). I'm not sure patches will be
> backported as far as to 3.10-stable but when the patches get some testing
> in mainline, I'll be porting them to 3.12-stable for our enterprise
> kernel...

Thanks, on which kernel do they apply? They do not apply to a current 3.17.

Stefan

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-11-26 10:28                           ` Stefan Priebe - Profihost AG
@ 2014-11-26 10:38                             ` Jan Kara
  0 siblings, 0 replies; 22+ messages in thread
From: Jan Kara @ 2014-11-26 10:38 UTC (permalink / raw)
  To: Stefan Priebe - Profihost AG
  Cc: Jan Kara, Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu

On Wed 26-11-14 11:28:26, Stefan Priebe - Profihost AG wrote:
> Am 26.11.2014 um 09:25 schrieb Jan Kara:
> >   Hi,
> > 
> > On Wed 26-11-14 09:06:43, Stefan Priebe - Profihost AG wrote:
> >> i'm still getting a lot of those call traces:
> >> "
> >> Call Trace:
> >>  [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
> >>  [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
> >>  [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
> >>  [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
> >>  [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
> >>  [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
> >>  [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
> >>  [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
> >>  [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
> >>  [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
> >>  [<ffffffff811686de>] bmap+0x1e/0x30
> >>  [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
> >>  [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
> >>  [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
> >>  [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
> >>  [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
> >>  [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
> >>  [<ffffffff81070360>] ? finish_wait+0x80/0x80
> >>  [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
> >>  [<ffffffff8106fb60>] kthread+0xc0/0xd0
> >>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
> >>  [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
> >>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
> >> "
> >>
> >> Is there any chance to fix them in vanilla 3.10.61?
> >   Ted is just testing patches to fix these. You are welcome if you can give
> > them a try as well (tarball attached). I'm not sure patches will be
> > backported as far as to 3.10-stable but when the patches get some testing
> > in mainline, I'll be porting them to 3.12-stable for our enterprise
> > kernel...
> 
> Thanks, on which kernel do they apply? They do not apply to a current 3.17.
  They are based on current Linus' git tree. So you have to try something
like 3.18-rc5 or so.

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-11-26  8:25                         ` Jan Kara
  2014-11-26 10:28                           ` Stefan Priebe - Profihost AG
@ 2014-11-26 15:11                           ` Stefan Priebe - Profihost AG
  2014-11-26 20:26                             ` Jan Kara
  1 sibling, 1 reply; 22+ messages in thread
From: Stefan Priebe - Profihost AG @ 2014-11-26 15:11 UTC (permalink / raw)
  To: Jan Kara
  Cc: Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu

Am 26.11.2014 um 09:25 schrieb Jan Kara:
>   Hi,
> 
> On Wed 26-11-14 09:06:43, Stefan Priebe - Profihost AG wrote:
>> i'm still getting a lot of those call traces:
>> "
>> Call Trace:
>>  [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
>>  [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
>>  [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
>>  [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
>>  [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
>>  [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
>>  [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
>>  [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
>>  [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
>>  [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
>>  [<ffffffff811686de>] bmap+0x1e/0x30
>>  [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
>>  [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
>>  [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
>>  [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
>>  [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
>>  [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
>>  [<ffffffff81070360>] ? finish_wait+0x80/0x80
>>  [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
>>  [<ffffffff8106fb60>] kthread+0xc0/0xd0
>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>>  [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>> "
>>
>> Is there any chance to fix them in vanilla 3.10.61?
>   Ted is just testing patches to fix these. You are welcome if you can give
> them a try as well (tarball attached). I'm not sure patches will be
> backported as far as to 3.10-stable but when the patches get some testing
> in mainline, I'll be porting them to 3.12-stable for our enterprise
> kernel...

OK i tried to port them to 3.10 but it seems i can't handle this. There
are so many differences. Are there any workarounds possible? Currently
the 3.10 kernel is also completely crashing with this backtrace.

Greets,
Stefan

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-11-26 15:11                           ` Stefan Priebe - Profihost AG
@ 2014-11-26 20:26                             ` Jan Kara
  2014-11-26 20:33                               ` Stefan Priebe
  2014-12-04 15:06                               ` Stefan Priebe - Profihost AG
  0 siblings, 2 replies; 22+ messages in thread
From: Jan Kara @ 2014-11-26 20:26 UTC (permalink / raw)
  To: Stefan Priebe - Profihost AG
  Cc: Jan Kara, Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu

On Wed 26-11-14 16:11:37, Stefan Priebe - Profihost AG wrote:
> Am 26.11.2014 um 09:25 schrieb Jan Kara:
> >   Hi,
> > 
> > On Wed 26-11-14 09:06:43, Stefan Priebe - Profihost AG wrote:
> >> i'm still getting a lot of those call traces:
> >> "
> >> Call Trace:
> >>  [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
> >>  [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
> >>  [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
> >>  [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
> >>  [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
> >>  [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
> >>  [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
> >>  [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
> >>  [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
> >>  [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
> >>  [<ffffffff811686de>] bmap+0x1e/0x30
> >>  [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
> >>  [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
> >>  [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
> >>  [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
> >>  [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
> >>  [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
> >>  [<ffffffff81070360>] ? finish_wait+0x80/0x80
> >>  [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
> >>  [<ffffffff8106fb60>] kthread+0xc0/0xd0
> >>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
> >>  [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
> >>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
> >> "
> >>
> >> Is there any chance to fix them in vanilla 3.10.61?
> >   Ted is just testing patches to fix these. You are welcome if you can give
> > them a try as well (tarball attached). I'm not sure patches will be
> > backported as far as to 3.10-stable but when the patches get some testing
> > in mainline, I'll be porting them to 3.12-stable for our enterprise
> > kernel...
> 
> OK i tried to port them to 3.10 but it seems i can't handle this. There
> are so many differences. Are there any workarounds possible? Currently
> the 3.10 kernel is also completely crashing with this backtrace.
  No workarounds I'm aware of. Sorry. When I have patches for 3.12, you can
try porting them to 3.10. That should be an easier task...

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-11-26 20:26                             ` Jan Kara
@ 2014-11-26 20:33                               ` Stefan Priebe
  2014-12-04 15:06                               ` Stefan Priebe - Profihost AG
  1 sibling, 0 replies; 22+ messages in thread
From: Stefan Priebe @ 2014-11-26 20:33 UTC (permalink / raw)
  To: Jan Kara
  Cc: Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu


Am 26.11.2014 21:26, schrieb Jan Kara:
> On Wed 26-11-14 16:11:37, Stefan Priebe - Profihost AG wrote:
>> Am 26.11.2014 um 09:25 schrieb Jan Kara:
>>>    Hi,
>>>
>>> On Wed 26-11-14 09:06:43, Stefan Priebe - Profihost AG wrote:
>>>> i'm still getting a lot of those call traces:
>>>> "
>>>> Call Trace:
>>>>   [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
>>>>   [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
>>>>   [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
>>>>   [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
>>>>   [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
>>>>   [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
>>>>   [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
>>>>   [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
>>>>   [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
>>>>   [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
>>>>   [<ffffffff811686de>] bmap+0x1e/0x30
>>>>   [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
>>>>   [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
>>>>   [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
>>>>   [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
>>>>   [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
>>>>   [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
>>>>   [<ffffffff81070360>] ? finish_wait+0x80/0x80
>>>>   [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
>>>>   [<ffffffff8106fb60>] kthread+0xc0/0xd0
>>>>   [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>>>>   [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
>>>>   [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>>>> "
>>>>
>>>> Is there any chance to fix them in vanilla 3.10.61?
>>>    Ted is just testing patches to fix these. You are welcome if you can give
>>> them a try as well (tarball attached). I'm not sure patches will be
>>> backported as far as to 3.10-stable but when the patches get some testing
>>> in mainline, I'll be porting them to 3.12-stable for our enterprise
>>> kernel...
>>
>> OK i tried to port them to 3.10 but it seems i can't handle this. There
>> are so many differences. Are there any workarounds possible? Currently
>> the 3.10 kernel is also completely crashing with this backtrace.
>    No workarounds I'm aware of. Sorry. When I have patches for 3.12, you can
> try porting them to 3.10. That should be an easier task...

Yes i think i'll be able todo this. Any idea when you'll have patches 
around?

Thanks,
Stefan

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-11-26 20:26                             ` Jan Kara
  2014-11-26 20:33                               ` Stefan Priebe
@ 2014-12-04 15:06                               ` Stefan Priebe - Profihost AG
  2014-12-04 18:35                                 ` Jan Kara
  1 sibling, 1 reply; 22+ messages in thread
From: Stefan Priebe - Profihost AG @ 2014-12-04 15:06 UTC (permalink / raw)
  To: Jan Kara
  Cc: Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu


Am 26.11.2014 um 21:26 schrieb Jan Kara:
> On Wed 26-11-14 16:11:37, Stefan Priebe - Profihost AG wrote:
>> Am 26.11.2014 um 09:25 schrieb Jan Kara:
>>>   Hi,
>>>
>>> On Wed 26-11-14 09:06:43, Stefan Priebe - Profihost AG wrote:
>>>> i'm still getting a lot of those call traces:
>>>> "
>>>> Call Trace:
>>>>  [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
>>>>  [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
>>>>  [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
>>>>  [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
>>>>  [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
>>>>  [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
>>>>  [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
>>>>  [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
>>>>  [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
>>>>  [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
>>>>  [<ffffffff811686de>] bmap+0x1e/0x30
>>>>  [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
>>>>  [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
>>>>  [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
>>>>  [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
>>>>  [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
>>>>  [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
>>>>  [<ffffffff81070360>] ? finish_wait+0x80/0x80
>>>>  [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
>>>>  [<ffffffff8106fb60>] kthread+0xc0/0xd0
>>>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>>>>  [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
>>>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>>>> "
>>>>
>>>> Is there any chance to fix them in vanilla 3.10.61?
>>>   Ted is just testing patches to fix these. You are welcome if you can give
>>> them a try as well (tarball attached). I'm not sure patches will be
>>> backported as far as to 3.10-stable but when the patches get some testing
>>> in mainline, I'll be porting them to 3.12-stable for our enterprise
>>> kernel...
>>
>> OK i tried to port them to 3.10 but it seems i can't handle this. There
>> are so many differences. Are there any workarounds possible? Currently
>> the 3.10 kernel is also completely crashing with this backtrace.
>   No workarounds I'm aware of. Sorry. When I have patches for 3.12, you can
> try porting them to 3.10. That should be an easier task...
> 
> 								Honza

those patches work absolutely fine on a 3.16 kernel. Do you have any
idea, when your 3.12 backport is done?


Thanks!

Greets,
Stefan Priebe

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-12-04 15:06                               ` Stefan Priebe - Profihost AG
@ 2014-12-04 18:35                                 ` Jan Kara
  2014-12-15  9:48                                   ` Stefan Priebe - Profihost AG
  0 siblings, 1 reply; 22+ messages in thread
From: Jan Kara @ 2014-12-04 18:35 UTC (permalink / raw)
  To: Stefan Priebe - Profihost AG
  Cc: Jan Kara, Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu

On Thu 04-12-14 16:06:40, Stefan Priebe - Profihost AG wrote:
> 
> Am 26.11.2014 um 21:26 schrieb Jan Kara:
> > On Wed 26-11-14 16:11:37, Stefan Priebe - Profihost AG wrote:
> >> Am 26.11.2014 um 09:25 schrieb Jan Kara:
> >>>   Hi,
> >>>
> >>> On Wed 26-11-14 09:06:43, Stefan Priebe - Profihost AG wrote:
> >>>> i'm still getting a lot of those call traces:
> >>>> "
> >>>> Call Trace:
> >>>>  [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
> >>>>  [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
> >>>>  [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
> >>>>  [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
> >>>>  [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
> >>>>  [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
> >>>>  [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
> >>>>  [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
> >>>>  [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
> >>>>  [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
> >>>>  [<ffffffff811686de>] bmap+0x1e/0x30
> >>>>  [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
> >>>>  [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
> >>>>  [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
> >>>>  [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
> >>>>  [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
> >>>>  [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
> >>>>  [<ffffffff81070360>] ? finish_wait+0x80/0x80
> >>>>  [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
> >>>>  [<ffffffff8106fb60>] kthread+0xc0/0xd0
> >>>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
> >>>>  [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
> >>>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
> >>>> "
> >>>>
> >>>> Is there any chance to fix them in vanilla 3.10.61?
> >>>   Ted is just testing patches to fix these. You are welcome if you can give
> >>> them a try as well (tarball attached). I'm not sure patches will be
> >>> backported as far as to 3.10-stable but when the patches get some testing
> >>> in mainline, I'll be porting them to 3.12-stable for our enterprise
> >>> kernel...
> >>
> >> OK i tried to port them to 3.10 but it seems i can't handle this. There
> >> are so many differences. Are there any workarounds possible? Currently
> >> the 3.10 kernel is also completely crashing with this backtrace.
> >   No workarounds I'm aware of. Sorry. When I have patches for 3.12, you can
> > try porting them to 3.10. That should be an easier task...
> > 
> > 								Honza
> 
> those patches work absolutely fine on a 3.16 kernel. Do you have any
> idea, when your 3.12 backport is done?
  Thanks for confirmation. I hope to backport the patches sometime next
week.

								Honza
-- 
Jan Kara <jack@suse.cz>
SUSE Labs, CR

^ permalink raw reply	[flat|nested] 22+ messages in thread

* Re: Call trace in ext4_es_lru_add on 3.10 stable
  2014-12-04 18:35                                 ` Jan Kara
@ 2014-12-15  9:48                                   ` Stefan Priebe - Profihost AG
  0 siblings, 0 replies; 22+ messages in thread
From: Stefan Priebe - Profihost AG @ 2014-12-15  9:48 UTC (permalink / raw)
  To: Jan Kara
  Cc: Theodore Ts'o, linux-ext4,
	p.herz@profihost.ag >> Philipp Herz - Profihost AG, stable,
	Zheng Liu

Hi,
Am 04.12.2014 um 19:35 schrieb Jan Kara:
> On Thu 04-12-14 16:06:40, Stefan Priebe - Profihost AG wrote:
>>
>> Am 26.11.2014 um 21:26 schrieb Jan Kara:
>>> On Wed 26-11-14 16:11:37, Stefan Priebe - Profihost AG wrote:
>>>> Am 26.11.2014 um 09:25 schrieb Jan Kara:
>>>>>   Hi,
>>>>>
>>>>> On Wed 26-11-14 09:06:43, Stefan Priebe - Profihost AG wrote:
>>>>>> i'm still getting a lot of those call traces:
>>>>>> "
>>>>>> Call Trace:
>>>>>>  [<ffffffffa01d7006>] ext4_es_lru_add+0x26/0x80 [ext4]
>>>>>>  [<ffffffffa01d7286>] ext4_es_insert_extent+0x96/0x100 [ext4]
>>>>>>  [<ffffffffa01c3fd3>] ? ext4_find_delalloc_range+0x23/0x60 [ext4]
>>>>>>  [<ffffffffa019b781>] ext4_map_blocks+0x111/0x450 [ext4]
>>>>>>  [<ffffffffa019d167>] _ext4_get_block+0x87/0x190 [ext4]
>>>>>>  [<ffffffffa019d2c6>] ext4_get_block+0x16/0x20 [ext4]
>>>>>>  [<ffffffff8117f73f>] generic_block_bmap+0x3f/0x50
>>>>>>  [<ffffffffa013f4ae>] ? jbd2_journal_file_buffer+0x4e/0x80 [jbd2]
>>>>>>  [<ffffffff810f6242>] ? mapping_tagged+0x12/0x20
>>>>>>  [<ffffffffa019ad71>] ext4_bmap+0x91/0xf0 [ext4]
>>>>>>  [<ffffffff811686de>] bmap+0x1e/0x30
>>>>>>  [<ffffffffa0148063>] jbd2_journal_bmap+0x33/0xb0 [jbd2]
>>>>>>  [<ffffffffa014831d>] jbd2_journal_next_log_block+0x7d/0x90 [jbd2]
>>>>>>  [<ffffffffa0140238>] jbd2_journal_commit_transaction+0x7f8/0x1ae0 [jbd2]
>>>>>>  [<ffffffff81084e13>] ? idle_balance+0xd3/0x110
>>>>>>  [<ffffffff8105a018>] ? lock_timer_base.isra.35+0x38/0x70
>>>>>>  [<ffffffffa014593a>] kjournald2+0xba/0x230 [jbd2]
>>>>>>  [<ffffffff81070360>] ? finish_wait+0x80/0x80
>>>>>>  [<ffffffffa0145880>] ? jbd2_journal_release_jbd_inode+0x130/0x130 [jbd2]
>>>>>>  [<ffffffff8106fb60>] kthread+0xc0/0xd0
>>>>>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>>>>>>  [<ffffffff81554c2c>] ret_from_fork+0x7c/0xb0
>>>>>>  [<ffffffff8106faa0>] ? kthread_create_on_node+0x130/0x130
>>>>>> "
>>>>>>
>>>>>> Is there any chance to fix them in vanilla 3.10.61?
>>>>>   Ted is just testing patches to fix these. You are welcome if you can give
>>>>> them a try as well (tarball attached). I'm not sure patches will be
>>>>> backported as far as to 3.10-stable but when the patches get some testing
>>>>> in mainline, I'll be porting them to 3.12-stable for our enterprise
>>>>> kernel...
>>>>
>>>> OK i tried to port them to 3.10 but it seems i can't handle this. There
>>>> are so many differences. Are there any workarounds possible? Currently
>>>> the 3.10 kernel is also completely crashing with this backtrace.
>>>   No workarounds I'm aware of. Sorry. When I have patches for 3.12, you can
>>> try porting them to 3.10. That should be an easier task...
>>>
>>> 								Honza
>>
>> those patches work absolutely fine on a 3.16 kernel. Do you have any
>> idea, when your 3.12 backport is done?
>   Thanks for confirmation. I hope to backport the patches sometime next
> week.

Do you had some time last week?

Stefan

^ permalink raw reply	[flat|nested] 22+ messages in thread

end of thread, other threads:[~2014-12-15  9:48 UTC | newest]

Thread overview: 22+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2014-09-18 13:08 Call trace in ext4_es_lru_add on 3.10 stable Stefan Priebe - Profihost AG
2014-09-18 19:21 ` Theodore Ts'o
2014-09-18 19:29   ` Stefan Priebe
2014-09-18 19:43     ` Theodore Ts'o
2014-09-22  6:56       ` Stefan Priebe
2014-09-22 16:47         ` Theodore Ts'o
2014-09-22 18:29           ` Stefan Priebe
2014-09-22 20:20             ` Theodore Ts'o
2014-09-23  7:50               ` Stefan Priebe - Profihost AG
2014-09-23  9:42                 ` Jan Kara
2014-09-23 12:23                   ` Stefan Priebe
2014-09-23 14:43                     ` Jan Kara
2014-11-26  8:06                       ` Stefan Priebe - Profihost AG
2014-11-26  8:25                         ` Jan Kara
2014-11-26 10:28                           ` Stefan Priebe - Profihost AG
2014-11-26 10:38                             ` Jan Kara
2014-11-26 15:11                           ` Stefan Priebe - Profihost AG
2014-11-26 20:26                             ` Jan Kara
2014-11-26 20:33                               ` Stefan Priebe
2014-12-04 15:06                               ` Stefan Priebe - Profihost AG
2014-12-04 18:35                                 ` Jan Kara
2014-12-15  9:48                                   ` Stefan Priebe - Profihost AG

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).