* A oops occur when it calls kmem_cache_alloc
@ 2016-03-04 8:36 zhong jiang
[not found] ` <56D96734.6050108@I-love.SAKURA.ne.jp>
0 siblings, 1 reply; 2+ messages in thread
From: zhong jiang @ 2016-03-04 8:36 UTC (permalink / raw)
To: LKML, Linux Memory Management List, Xishi Qiu
The vmcore file show the collapse reason that the page had been removed
when we acqure the page and prepare to remove the page from the slub
partial list.
The list is protected by the spin_lock from concurrent operation. And I find
that other core is wating the lock to alloc memory. Therefore , The concurrent
access should be impossible.
what situatios can happen ? or it is a kernel bug potentially. This question
almost impossible to produce again. The following is the call statck belonging to
the module.
PID: 114258 TASK: ffff8806aafc3300 CPU: 11 COMMAND: "JEM_UserTpool"
#0 [ffff8806aafc9050] machine_kexec at ffffffff810286ea
#1 [ffff8806aafc90b0] crash_kexec at ffffffff810a2503
#2 [ffff8806aafc9180] oops_end at ffffffff814593c8
#3 [ffff8806aafc91b0] die at ffffffff81005623
#4 [ffff8806aafc91e0] do_general_protection at ffffffff81458ed2
#5 [ffff8806aafc9210] general_protection at ffffffff814587c5
[exception RIP: get_partial_node+511]
RIP: ffffffff81450426 RSP: ffff8806aafc92c0 RFLAGS: 00010002
RAX: dead000000200200 RBX: 0000000000000000 RCX: 0000000180100010
RDX: dead000000100100 RSI: ffff8807e24e4000 RDI: dead000000100100
RBP: ffff8806aafc9370 R8: dead000000200200 R9: ffff880c41817000
R10: 20c49ba5e353f7cf R11: 0000000000000000 R12: ffff880bf90bb900
R13: ffffea0031060400 R14: 0000000000000000 R15: ffff8807e24e4000
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#6 [ffff8806aafc9378] __slab_alloc at ffffffff814508da
#7 [ffff8806aafc9478] kmem_cache_alloc at ffffffff8114a827
#8 [ffff8806aafc94c8] LIB_MemCacheAlloc at ffffffffa2998136 [libcfs]
#9 [ffff8806aafc94f8] RPC_PrepReqPool at ffffffffa5844949 [ptlrpc]
#10 [ffff8806aafc9578] RPC_PrepReq at ffffffffa5844b85 [ptlrpc]
#11 [ffff8806aafc95a8] Base_RpcPrepReq at ffffffffa5844c69 [ptlrpc]
#12 [ffff8806aafc9618] FC_MsgSendRequest at ffffffffa5cab26c [snas_fc]
#13 [ffff8806aafc96b8] FCC_DiskTrySendAllocWalReq at ffffffffa5caf47e [snas_fc]
#14 [ffff8806aafc96f8] FCC_DiskTrySendRequest at ffffffffa5cafe08 [snas_fc]
#15 [ffff8806aafc9768] FCC_AdClientSendRequest at ffffffffa5cb04e6 [snas_fc]
#16 [ffff8806aafc9828] Base_RpcAsynSendReq at ffffffffa5cabc2e [snas_fc]
#17 [ffff8806aafc9848] DS_RpcAsynSendReq at ffffffffa6330c80 [snas_ds]
#18 [ffff8806aafc9898] OPM_SendTransmitReqMsg at ffffffffa62f9c8c [snas_ds]
#19 [ffff8806aafc9958] OPM_SendWriteOthersMsg at ffffffffa63256de [snas_ds]
#20 [ffff8806aafc9988] OPM_WriteOthers at ffffffffa63259ba [snas_ds]
#21 [ffff8806aafc9a98] OPM_HandleWriteOthers at ffffffffa6325c09 [snas_ds]
#22 [ffff8806aafc9ae8] OPM_WriteHandle at ffffffffa632a8f8 [snas_ds]
#23 [ffff8806aafc9c08] OPM_Write at ffffffffa632b39f [snas_ds]
#24 [ffff8806aafc9e08] JEM_HandleUserRequest at ffffffffa62f2387 [snas_ds]
#25 [ffff8806aafc9e98] TP_WorkThreadMain at ffffffffa103cd6d [snas_base]
#26 [ffff8806aafc9ee8] kthread at ffffffff81060aee
#27 [ffff8806aafc9f48] kernel_thread_helper at ffffffff814613b4
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: A oops occur when it calls kmem_cache_alloc
[not found] ` <56D96734.6050108@I-love.SAKURA.ne.jp>
@ 2016-03-04 12:22 ` zhong jiang
0 siblings, 0 replies; 2+ messages in thread
From: zhong jiang @ 2016-03-04 12:22 UTC (permalink / raw)
To: Tetsuo Handa, LKML, Xishi Qiu, Linux Memory Management List
On 2016/3/4 18:45, Tetsuo Handa wrote:
> On 2016/03/04 17:36, zhong jiang wrote:
>> The vmcore file show the collapse reason that the page had been removed
>> when we acqure the page and prepare to remove the page from the slub
>> partial list.
>>
>> The list is protected by the spin_lock from concurrent operation. And I find
>> that other core is wating the lock to alloc memory. Therefore , The concurrent
>> access should be impossible.
>>
>> what situatios can happen ? or it is a kernel bug potentially. This question
>> almost impossible to produce again. The following is the call statck belonging to
>> the module.
> No kernel version, no clue.
>
>
>
Sorry, The kernel version is 3.4 stable version. And The mainline , by comparison, have no relative
modification.
Thanks
zhongjiang
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2016-03-04 12:26 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-03-04 8:36 A oops occur when it calls kmem_cache_alloc zhong jiang
[not found] ` <56D96734.6050108@I-love.SAKURA.ne.jp>
2016-03-04 12:22 ` zhong jiang
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).