* git-blame segfault
@ 2013-12-02 12:57 Markus Trippelsdorf
2013-12-02 14:15 ` Antoine Pelisse
0 siblings, 1 reply; 5+ messages in thread
From: Markus Trippelsdorf @ 2013-12-02 12:57 UTC (permalink / raw)
To: git
When git is compiled with current gcc and "-march=native"
git-blame segfaults:
For example:
% gdb --args /var/tmp/git/git-blame gcc/tree-object-size.c
...
Program received signal SIGSEGV, Segmentation fault.
0x0000000000000000 in ?? ()
(gdb) bt
#0 0x0000000000000000 in ?? ()
#1 0x000000000051240d in xdl_emit_hunk_hdr (s1=s1@entry=30, c1=<optimized out>, s2=s2@entry=30, c2=c2@entry=6, func=func@entry=0x7fffffffd2d8 "", funclen=0,
ecb=ecb@entry=0x7fffffffd580) at xdiff/xutils.c:460
#2 0x0000000000512af7 in xdl_emit_diff (xe=0x7fffffffd390, xscr=<optimized out>, ecb=0x7fffffffd580, xecfg=0x7fffffffd590) at xdiff/xemit.c:237
#3 0x0000000000510a5d in xdl_diff (mf1=mf1@entry=0x7fffffffd510, mf2=mf2@entry=0x7fffffffd520, xpp=xpp@entry=0x7fffffffd570, xecfg=xecfg@entry=0x7fffffffd590,
ecb=ecb@entry=0x7fffffffd580) at xdiff/xdiffi.c:601
#4 0x000000000050b005 in xdi_diff (mf1=<optimized out>, mf2=<optimized out>, xpp=xpp@entry=0x7fffffffd570, xecfg=xecfg@entry=0x7fffffffd590, xecb=xecb@entry=0x7fffffffd580)
at xdiff-interface.c:136
#5 0x00000000004104df in diff_hunks (file_a=<optimized out>, file_b=<optimized out>, ctxlen=ctxlen@entry=0, hunk_func=hunk_func@entry=0x411320 <blame_chunk_cb>,
cb_data=cb_data@entry=0x7fffffffd830) at builtin/blame.c:105
#6 0x0000000000412b54 in pass_blame_to_parent (parent=0x11da810, target=0x11dab50, sb=0x7fffffffd6e0) at builtin/blame.c:815
#7 pass_blame (opt=0, origin=0x11dab50, sb=0x7fffffffd6e0) at builtin/blame.c:1281
#8 assign_blame (opt=<optimized out>, sb=0x7fffffffd6e0) at builtin/blame.c:1559
#9 cmd_blame (argc=<optimized out>, argv=<optimized out>, prefix=<optimized out>) at builtin/blame.c:2523
#10 0x00000000004060b5 in run_builtin (argv=0x7fffffffe528, argc=2, p=0x578bd8 <commands.22612+120>) at git.c:314
#11 handle_internal_command (argc=2, argv=0x7fffffffe528) at git.c:478
#12 0x0000000000405772 in main (argc=2, av=<optimized out>) at git.c:575
(gdb) up
#1 0x000000000051240d in xdl_emit_hunk_hdr (s1=s1@entry=30, c1=<optimized out>, s2=s2@entry=30, c2=c2@entry=6, func=func@entry=0x7fffffffd2d8 "", funclen=0,
ecb=ecb@entry=0x7fffffffd580) at xdiff/xutils.c:460
460 if (ecb->outf(ecb->priv, &mb, 1) < 0)
(gdb) l
455 }
456 buf[nb++] = '\n';
457
458 mb.ptr = buf;
459 mb.size = nb;
460 if (ecb->outf(ecb->priv, &mb, 1) < 0)
461 return -1;
462
463 return 0;
464 }
(gdb) p *ecb
$1 = {
priv = 0x7fffffffd830,
outf = 0x0
}
If I leave xecfg uninitialized in the following function the issue goes
away.
>From builtin/blame.c:
94 static int diff_hunks(mmfile_t *file_a, mmfile_t *file_b, long ctxlen,
95 xdl_emit_hunk_consume_func_t hunk_func, void *cb_data)
96 {
97 xpparam_t xpp = {0};
98 xdemitconf_t xecfg = {0};
99 xdemitcb_t ecb = {NULL};
100
101 xpp.flags = xdl_opts;
102 xecfg.ctxlen = ctxlen;
103 xecfg.hunk_func = hunk_func;
104 ecb.priv = cb_data;
105 return xdi_diff(file_a, file_b, &xpp, &xecfg, &ecb);
106 }
107
I'm not sure if this a git bug or a gcc bug. In any case I've opened a
gcc bug-report here: http://gcc.gnu.org/bugzilla/show_bug.cgi?id=59363
--
Markus
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: git-blame segfault 2013-12-02 12:57 git-blame segfault Markus Trippelsdorf @ 2013-12-02 14:15 ` Antoine Pelisse 2013-12-02 15:05 ` Markus Trippelsdorf 0 siblings, 1 reply; 5+ messages in thread From: Antoine Pelisse @ 2013-12-02 14:15 UTC (permalink / raw) To: Markus Trippelsdorf; +Cc: git On Mon, Dec 2, 2013 at 1:57 PM, Markus Trippelsdorf <markus@trippelsdorf.de> wrote: > When git is compiled with current gcc and "-march=native" > git-blame segfaults: > > For example: > > % gdb --args /var/tmp/git/git-blame gcc/tree-object-size.c > ... > Program received signal SIGSEGV, Segmentation fault. > 0x0000000000000000 in ?? () > (gdb) bt > #0 0x0000000000000000 in ?? () > #1 0x000000000051240d in xdl_emit_hunk_hdr (s1=s1@entry=30, c1=<optimized out>, s2=s2@entry=30, c2=c2@entry=6, func=func@entry=0x7fffffffd2d8 "", funclen=0, > ecb=ecb@entry=0x7fffffffd580) at xdiff/xutils.c:460 > #2 0x0000000000512af7 in xdl_emit_diff (xe=0x7fffffffd390, xscr=<optimized out>, ecb=0x7fffffffd580, xecfg=0x7fffffffd590) at xdiff/xemit.c:237 xdl_emit_diff() should not be called, because xecfg->hunk_func should be non-null and called instead. xld_emit_diff() is the one that needs outf to be set. > #3 0x0000000000510a5d in xdl_diff (mf1=mf1@entry=0x7fffffffd510, mf2=mf2@entry=0x7fffffffd520, xpp=xpp@entry=0x7fffffffd570, xecfg=xecfg@entry=0x7fffffffd590, > ecb=ecb@entry=0x7fffffffd580) at xdiff/xdiffi.c:601 Here we decide that xecfg->hunk_func is empty, and ef is set to xdl_emit_diff. > #4 0x000000000050b005 in xdi_diff (mf1=<optimized out>, mf2=<optimized out>, xpp=xpp@entry=0x7fffffffd570, xecfg=xecfg@entry=0x7fffffffd590, xecb=xecb@entry=0x7fffffffd580) > at xdiff-interface.c:136 > #5 0x00000000004104df in diff_hunks (file_a=<optimized out>, file_b=<optimized out>, ctxlen=ctxlen@entry=0, hunk_func=hunk_func@entry=0x411320 <blame_chunk_cb>, > cb_data=cb_data@entry=0x7fffffffd830) at builtin/blame.c:105 As we can see in your code below, ecb.outf is not set here, because it's not needed by hunk_func. hunk_func is set to blame_chunck_cb, passed by the function below. > #6 0x0000000000412b54 in pass_blame_to_parent (parent=0x11da810, target=0x11dab50, sb=0x7fffffffd6e0) at builtin/blame.c:815 diff_hunks() is called with blame_chunck_cb as hunk_func. I think the best thing to do is to find out where xecfg->hunk_func loses the blame_chunck_cb value to NULL. You could start with a watchpoint. >[...] > 460 if (ecb->outf(ecb->priv, &mb, 1) < 0) > (gdb) l > 455 } > 456 buf[nb++] = '\n'; > 457 > 458 mb.ptr = buf; > 459 mb.size = nb; > 460 if (ecb->outf(ecb->priv, &mb, 1) < 0) > 461 return -1; > 462 > 463 return 0; > 464 } > (gdb) p *ecb > $1 = { > priv = 0x7fffffffd830, > outf = 0x0 > } > > If I leave xecfg uninitialized in the following function the issue goes > away. Would that mean that gcc is doing some steps in the wrong order ? That is setting xecfg.hunk_func and then emptying the structure ? I've already had a similar bug, but that's very unfortunate. > From builtin/blame.c: > 94 static int diff_hunks(mmfile_t *file_a, mmfile_t *file_b, long ctxlen, > 95 xdl_emit_hunk_consume_func_t hunk_func, void *cb_data) > 96 { > 97 xpparam_t xpp = {0}; > 98 xdemitconf_t xecfg = {0}; > 99 xdemitcb_t ecb = {NULL}; > 100 > 101 xpp.flags = xdl_opts; > 102 xecfg.ctxlen = ctxlen; > 103 xecfg.hunk_func = hunk_func; > 104 ecb.priv = cb_data; > 105 return xdi_diff(file_a, file_b, &xpp, &xecfg, &ecb); > 106 } > 107 ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: git-blame segfault 2013-12-02 14:15 ` Antoine Pelisse @ 2013-12-02 15:05 ` Markus Trippelsdorf 2013-12-03 8:45 ` Markus Trippelsdorf 0 siblings, 1 reply; 5+ messages in thread From: Markus Trippelsdorf @ 2013-12-02 15:05 UTC (permalink / raw) To: Antoine Pelisse; +Cc: git On 2013.12.02 at 15:15 +0100, Antoine Pelisse wrote: > On Mon, Dec 2, 2013 at 1:57 PM, Markus Trippelsdorf > <markus@trippelsdorf.de> wrote: > > When git is compiled with current gcc and "-march=native" > > git-blame segfaults: > > > > For example: > > > > % gdb --args /var/tmp/git/git-blame gcc/tree-object-size.c > > ... > > Program received signal SIGSEGV, Segmentation fault. > > 0x0000000000000000 in ?? () > > (gdb) bt > > #0 0x0000000000000000 in ?? () > > #1 0x000000000051240d in xdl_emit_hunk_hdr (s1=s1@entry=30, c1=<optimized out>, s2=s2@entry=30, c2=c2@entry=6, func=func@entry=0x7fffffffd2d8 "", funclen=0, > > ecb=ecb@entry=0x7fffffffd580) at xdiff/xutils.c:460 > > #2 0x0000000000512af7 in xdl_emit_diff (xe=0x7fffffffd390, xscr=<optimized out>, ecb=0x7fffffffd580, xecfg=0x7fffffffd590) at xdiff/xemit.c:237 > > xdl_emit_diff() should not be called, because xecfg->hunk_func should > be non-null and called instead. xld_emit_diff() is the one that needs > outf to be set. > > > #3 0x0000000000510a5d in xdl_diff (mf1=mf1@entry=0x7fffffffd510, mf2=mf2@entry=0x7fffffffd520, xpp=xpp@entry=0x7fffffffd570, xecfg=xecfg@entry=0x7fffffffd590, > > ecb=ecb@entry=0x7fffffffd580) at xdiff/xdiffi.c:601 > > Here we decide that xecfg->hunk_func is empty, and ef is set to xdl_emit_diff. > > > #4 0x000000000050b005 in xdi_diff (mf1=<optimized out>, mf2=<optimized out>, xpp=xpp@entry=0x7fffffffd570, xecfg=xecfg@entry=0x7fffffffd590, xecb=xecb@entry=0x7fffffffd580) > > at xdiff-interface.c:136 > > #5 0x00000000004104df in diff_hunks (file_a=<optimized out>, file_b=<optimized out>, ctxlen=ctxlen@entry=0, hunk_func=hunk_func@entry=0x411320 <blame_chunk_cb>, > > cb_data=cb_data@entry=0x7fffffffd830) at builtin/blame.c:105 > > As we can see in your code below, ecb.outf is not set here, because > it's not needed by hunk_func. > hunk_func is set to blame_chunck_cb, passed by the function below. > > > #6 0x0000000000412b54 in pass_blame_to_parent (parent=0x11da810, target=0x11dab50, sb=0x7fffffffd6e0) at builtin/blame.c:815 > > diff_hunks() is called with blame_chunck_cb as hunk_func. > > I think the best thing to do is to find out where xecfg->hunk_func > loses the blame_chunck_cb value to NULL. You could start with a > watchpoint. It happens in diff_hunks (from builtin/blame.c): (gdb) up #5 0x00000000004104df in diff_hunks (file_a=<optimized out>, file_b=<optimized out>, ctxlen=ctxlen@entry=0, hunk_func=hunk_func@entry=0x411320 <blame_chunk_cb>, cb_data=cb_data@entry=0x7fffffffd830) at builtin/blame.c:105 105 return xdi_diff(file_a, file_b, &xpp, &xecfg, &ecb); (gdb) l 100 101 xpp.flags = xdl_opts; 102 xecfg.ctxlen = ctxlen; 103 xecfg.hunk_func = hunk_func; 104 ecb.priv = cb_data; 105 return xdi_diff(file_a, file_b, &xpp, &xecfg, &ecb); 106 } 107 108 /* 109 * Prepare diff_filespec and convert it using diff textconv API (gdb) p hunk_func $1 = (xdl_emit_hunk_consume_func_t) 0x411320 <blame_chunk_cb> (gdb) p xecfg.hunk_func $2 = (xdl_emit_hunk_consume_func_t) 0x0 (gdb) p xecfg $3 = { ctxlen = 0, interhunkctxlen = 0, flags = 0, find_func = 0x0, find_func_priv = 0x0, hunk_func = 0x0 } > > Would that mean that gcc is doing some steps in the wrong order ? That > is setting xecfg.hunk_func and then emptying the structure ? I've > already had a similar bug, but that's very unfortunate. Yes. I think this might be the case: (gdb) disass Dump of assembler code for function diff_hunks: 0x0000000000410460 <+0>: sub $0x58,%rsp 0x0000000000410464 <+4>: xor %eax,%eax 0x0000000000410466 <+6>: mov %eax,%r9d 0x0000000000410469 <+9>: add $0x20,%eax 0x000000000041046c <+12>: cmp $0x20,%eax 0x000000000041046f <+15>: movq $0x0,0x20(%rsp,%r9,1) 0x0000000000410478 <+24>: movq $0x0,0x28(%rsp,%r9,1) 0x0000000000410481 <+33>: movq $0x0,0x30(%rsp,%r9,1) 0x000000000041048a <+42>: movq $0x0,0x38(%rsp,%r9,1) 0x0000000000410493 <+51>: jb 0x410466 <diff_hunks+6> 0x0000000000410495 <+53>: lea 0x20(%rsp),%r10 0x000000000041049a <+58>: mov %rdx,0x20(%rsp) 0x000000000041049f <+63>: mov %rcx,0x48(%rsp) 0x00000000004104a4 <+68>: add %r10,%rax 0x00000000004104a7 <+71>: mov %r8,0x10(%rsp) 0x00000000004104ac <+76>: mov %rsp,%rdx 0x00000000004104af <+79>: movq $0x0,(%rax) 0x00000000004104b6 <+86>: movq $0x0,0x8(%rax) 0x00000000004104be <+94>: lea 0x10(%rsp),%r8 0x00000000004104c3 <+99>: movslq 0x171882(%rip),%rax # 0x581d4c <xdl_opts> 0x00000000004104ca <+106>: mov %r10,%rcx 0x00000000004104cd <+109>: movq $0x0,0x18(%rsp) 0x00000000004104d6 <+118>: mov %rax,(%rsp) 0x00000000004104da <+122>: callq 0x50aee0 <xdi_diff> => 0x00000000004104df <+127>: add $0x58,%rsp 0x00000000004104e3 <+131>: retq End of assembler dump. -- Markus ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: git-blame segfault 2013-12-02 15:05 ` Markus Trippelsdorf @ 2013-12-03 8:45 ` Markus Trippelsdorf 2013-12-03 9:04 ` Antoine Pelisse 0 siblings, 1 reply; 5+ messages in thread From: Markus Trippelsdorf @ 2013-12-03 8:45 UTC (permalink / raw) To: Antoine Pelisse; +Cc: git On 2013.12.02 at 16:05 +0100, Markus Trippelsdorf wrote: > On 2013.12.02 at 15:15 +0100, Antoine Pelisse wrote: > > Would that mean that gcc is doing some steps in the wrong order ? That > > is setting xecfg.hunk_func and then emptying the structure ? I've > > already had a similar bug, but that's very unfortunate. > > Yes. I think this might be the case: > > (gdb) disass > Dump of assembler code for function diff_hunks: > 0x0000000000410460 <+0>: sub $0x58,%rsp > 0x0000000000410464 <+4>: xor %eax,%eax > 0x0000000000410466 <+6>: mov %eax,%r9d > 0x0000000000410469 <+9>: add $0x20,%eax > 0x000000000041046c <+12>: cmp $0x20,%eax > 0x000000000041046f <+15>: movq $0x0,0x20(%rsp,%r9,1) > 0x0000000000410478 <+24>: movq $0x0,0x28(%rsp,%r9,1) > 0x0000000000410481 <+33>: movq $0x0,0x30(%rsp,%r9,1) > 0x000000000041048a <+42>: movq $0x0,0x38(%rsp,%r9,1) > 0x0000000000410493 <+51>: jb 0x410466 <diff_hunks+6> > 0x0000000000410495 <+53>: lea 0x20(%rsp),%r10 > 0x000000000041049a <+58>: mov %rdx,0x20(%rsp) > 0x000000000041049f <+63>: mov %rcx,0x48(%rsp) > 0x00000000004104a4 <+68>: add %r10,%rax > 0x00000000004104a7 <+71>: mov %r8,0x10(%rsp) > 0x00000000004104ac <+76>: mov %rsp,%rdx > 0x00000000004104af <+79>: movq $0x0,(%rax) > 0x00000000004104b6 <+86>: movq $0x0,0x8(%rax) > 0x00000000004104be <+94>: lea 0x10(%rsp),%r8 > 0x00000000004104c3 <+99>: movslq 0x171882(%rip),%rax # 0x581d4c <xdl_opts> > 0x00000000004104ca <+106>: mov %r10,%rcx > 0x00000000004104cd <+109>: movq $0x0,0x18(%rsp) > 0x00000000004104d6 <+118>: mov %rax,(%rsp) > 0x00000000004104da <+122>: callq 0x50aee0 <xdi_diff> > => 0x00000000004104df <+127>: add $0x58,%rsp > 0x00000000004104e3 <+131>: retq > End of assembler dump. Should be fixed in gcc soon. For the curious, here is the assembler diff (bad vs. good): .type diff_hunks, @function diff_hunks: .LFB104: .cfi_startproc subq $88, %rsp .cfi_def_cfa_offset 96 xorl %eax, %eax .L31: movl %eax, %r9d addl $32, %eax cmpl $32, %eax movq $0, 32(%rsp,%r9) movq $0, 40(%rsp,%r9) movq $0, 48(%rsp,%r9) movq $0, 56(%rsp,%r9) jb .L31 leaq 32(%rsp), %r10 movq %rdx, 32(%rsp) - movq %rcx, 72(%rsp) - addq %r10, %rax movq %r8, 16(%rsp) + addq %r10, %rax + leaq 16(%rsp), %r8 movq %rsp, %rdx - movq $0, (%rax) movq $0, 8(%rax) - leaq 16(%rsp), %r8 + movq $0, (%rax) movslq xdl_opts(%rip), %rax + movq %rcx, 72(%rsp) movq %r10, %rcx movq $0, 24(%rsp) movq %rax, (%rsp) call xdi_diff addq $88, %rsp .cfi_def_cfa_offset 8 -- Markus ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: git-blame segfault 2013-12-03 8:45 ` Markus Trippelsdorf @ 2013-12-03 9:04 ` Antoine Pelisse 0 siblings, 0 replies; 5+ messages in thread From: Antoine Pelisse @ 2013-12-03 9:04 UTC (permalink / raw) To: Markus Trippelsdorf; +Cc: git On Tue, Dec 3, 2013 at 9:45 AM, Markus Trippelsdorf <markus@trippelsdorf.de> wrote: > Should be fixed in gcc soon. For the curious, here is the assembler diff > (bad vs. good): Cool, Thanks. Good to know this has nothing to do with Git :-) ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2013-12-03 9:05 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2013-12-02 12:57 git-blame segfault Markus Trippelsdorf 2013-12-02 14:15 ` Antoine Pelisse 2013-12-02 15:05 ` Markus Trippelsdorf 2013-12-03 8:45 ` Markus Trippelsdorf 2013-12-03 9:04 ` Antoine Pelisse
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).