* [Qemu-devel] [PATCH v4] thread: move detach_thread from creating thread to created thread
@ 2017-11-28 4:46 linzhecheng
2017-11-28 6:27 ` Fam Zheng
2017-11-29 16:18 ` Eric Blake
0 siblings, 2 replies; 6+ messages in thread
From: linzhecheng @ 2017-11-28 4:46 UTC (permalink / raw)
To: qemu-devel
Cc: arei.gonglei, wangxinxin.wang, pbonzini, famz, aliguori,
linzhecheng
If we create a thread with QEMU_THREAD_DETACHED mode, QEMU may get a segfault in a low probability.
The backtrace is:
#0 0x00007f46c60291d7 in __GI_raise (sig=sig@entry=6) at ../nptl/sysdeps/unix/sysv/linux/raise.c:56
#1 0x00007f46c602a8c8 in __GI_abort () at abort.c:90
#2 0x00000000008543c9 in PAT_abort ()
#3 0x000000000085140d in patchIllInsHandler ()
#4 <signal handler called>
#5 pthread_detach (th=139933037614848) at pthread_detach.c:50
#6 0x0000000000829759 in qemu_thread_create (thread=thread@entry=0x7ffdaa8205e0, name=name@entry=0x94d94a "io-task-worker", start_routine=start_routine@entry=0x7eb9a0 <qio_task_thread_worker>,
arg=arg@entry=0x3f5cf70, mode=mode@entry=1) at util/qemu_thread_posix.c:512
#7 0x00000000007ebc96 in qio_task_run_in_thread (task=0x31db2c0, worker=worker@entry=0x7e7e40 <qio_channel_socket_connect_worker>, opaque=0xcd23380, destroy=0x7f1180 <qapi_free_SocketAddress>)
at io/task.c:141
#8 0x00000000007e7f33 in qio_channel_socket_connect_async (ioc=ioc@entry=0x626c0b0, addr=<optimized out>, callback=callback@entry=0x55e080 <qemu_chr_socket_connected>, opaque=opaque@entry=0x42862c0,
destroy=destroy@entry=0x0) at io/channel_socket.c:194
#9 0x000000000055bdd1 in socket_reconnect_timeout (opaque=0x42862c0) at qemu_char.c:4744
#10 0x00007f46c72483b3 in g_timeout_dispatch () from /usr/lib64/libglib-2.0.so.0
#11 0x00007f46c724799a in g_main_context_dispatch () from /usr/lib64/libglib-2.0.so.0
#12 0x000000000076c646 in glib_pollfds_poll () at main_loop.c:228
#13 0x000000000076c6eb in os_host_main_loop_wait (timeout=348000000) at main_loop.c:273
#14 0x000000000076c815 in main_loop_wait (nonblocking=nonblocking@entry=0) at main_loop.c:521
#15 0x000000000056a511 in main_loop () at vl.c:2076
#16 0x0000000000420705 in main (argc=<optimized out>, argv=<optimized out>, envp=<optimized out>) at vl.c:4940
The root cause of this problem is a bug of glibc(version 2.17,the latest version has the same bug),
let's see what happened in glibc's code.
Here is the code slice of pthread_detach.c
25 int
26 pthread_detach (pthread_t th)
27 {
28 struct pthread *pd = (struct pthread *) th;
29
30 /* Make sure the descriptor is valid. */
31 if (INVALID_NOT_TERMINATED_TD_P (pd))
32 /* Not a valid thread handle. */
34 return ESRCH;
35
36 int result = 0;
37 /* Mark the thread as detached. */
38 if (atomic_compare_and_exchange_bool_acq (&pd->joinid, pd, NULL))
39 {
40 /* There are two possibilities here. First, the thread might
41 already be detached. In this case we return EINVAL.
42 Otherwise there might already be a waiter. The standard does
43 not mention what happens in this case. */
44 if (IS_DETACHED (pd))
45 result = EINVAL;
46 }
47 else
48 /* Check whether the thread terminated meanwhile. In this case we
49 will just free the TCB. */
50 if ((pd->cancelhandling & EXITING_BITMASK) != 0)
51 /* Note that the code in __free_tcb makes sure each thread
52 control block is freed only once. */
53 __free_tcb (pd);
54 return result;
55}
QEMU get a segfault at line 50, becasue pd is an invalid address.
pd is still valid at line 38 when set pd->joinid = pd, at this moment,
created thread is just exiting(only keeps runing for a short time),
created thread is running in code of start_thread:
404 /* If the thread is detached free the TCB. */
405 if (IS_DETACHED (pd))
406 /* Free the TCB. */
407 __free_tcb (pd);
created thread found that pd is detached, so it freed pd, in this case,
pd became an invalid address.
I rewrite qemu_thread_create to move detach_thread from creating thread to created
to avoid this concurrency problem.
Change-Id: I2293d5be1526241cf58785d701b922f2ffc6491b
Signed-off-by: linzhecheng <linzhecheng@huawei.com>
diff --git a/include/qemu/thread-posix.h b/include/qemu/thread-posix.h
index f3f47e4..4c6dbb8 100644
--- a/include/qemu/thread-posix.h
+++ b/include/qemu/thread-posix.h
@@ -44,4 +44,12 @@ struct QemuThread {
pthread_t thread;
};
+typedef struct {
+ void *(*start_routine)(void *);
+ void *arg;
+ char *name;
+ int mode;
+} QemuThreadArgs;
+
+
#endif
diff --git a/util/qemu-thread-posix.c b/util/qemu-thread-posix.c
index 7306475..e5bdb6b 100644
--- a/util/qemu-thread-posix.c
+++ b/util/qemu-thread-posix.c
@@ -489,6 +489,30 @@ static void qemu_thread_set_name(QemuThread *thread, const char *name)
#endif
}
+static void *qemu_thread_start(void *args)
+{
+ QemuThreadArgs *qemu_thread_args;
+ void *ret;
+ QemuThread qemu_thread;
+
+ qemu_thread_args = args;
+ qemu_thread_get_self(&qemu_thread);
+
+ if (qemu_thread_args->name) {
+ qemu_thread_set_name(&qemu_thread, qemu_thread_args->name);
+ g_free(qemu_thread_args->name);
+ }
+
+ if (qemu_thread_args->mode == QEMU_THREAD_DETACHED) {
+ pthread_detach(qemu_thread.thread);
+ }
+ ret = qemu_thread_args->start_routine(qemu_thread_args->arg);
+
+ g_free(qemu_thread_args);
+ return ret;
+}
+
+
void qemu_thread_create(QemuThread *thread, const char *name,
void *(*start_routine)(void*),
void *arg, int mode)
@@ -496,6 +520,7 @@ void qemu_thread_create(QemuThread *thread, const char *name,
sigset_t set, oldset;
int err;
pthread_attr_t attr;
+ QemuThreadArgs *qemu_thread_args;
err = pthread_attr_init(&attr);
if (err) {
@@ -505,20 +530,18 @@ void qemu_thread_create(QemuThread *thread, const char *name,
/* Leave signal handling to the iothread. */
sigfillset(&set);
pthread_sigmask(SIG_SETMASK, &set, &oldset);
- err = pthread_create(&thread->thread, &attr, start_routine, arg);
+
+ qemu_thread_args = g_new0(QemuThreadArgs, 1);
+ qemu_thread_args->mode = mode;
+ qemu_thread_args->name = name_threads ? g_strdup_printf("%s", name) : NULL;
+ qemu_thread_args->start_routine = start_routine;
+ qemu_thread_args->arg = arg;
+
+ err = pthread_create(&thread->thread, &attr,
+ qemu_thread_start, qemu_thread_args);
if (err)
error_exit(err, __func__);
- if (name_threads) {
- qemu_thread_set_name(thread, name);
- }
-
- if (mode == QEMU_THREAD_DETACHED) {
- err = pthread_detach(thread->thread);
- if (err) {
- error_exit(err, __func__);
- }
- }
pthread_sigmask(SIG_SETMASK, &oldset, NULL);
pthread_attr_destroy(&attr);
--
1.8.3.1
^ permalink raw reply related [flat|nested] 6+ messages in thread
* Re: [Qemu-devel] [PATCH v4] thread: move detach_thread from creating thread to created thread
2017-11-28 4:46 [Qemu-devel] [PATCH v4] thread: move detach_thread from creating thread to created thread linzhecheng
@ 2017-11-28 6:27 ` Fam Zheng
2017-11-29 16:18 ` Eric Blake
1 sibling, 0 replies; 6+ messages in thread
From: Fam Zheng @ 2017-11-28 6:27 UTC (permalink / raw)
To: linzhecheng; +Cc: qemu-devel, aliguori, wangxinxin.wang, arei.gonglei, pbonzini
On Tue, 11/28 12:46, linzhecheng wrote:
> If we create a thread with QEMU_THREAD_DETACHED mode, QEMU may get a segfault in a low probability.
>
> The backtrace is:
> #0 0x00007f46c60291d7 in __GI_raise (sig=sig@entry=6) at ../nptl/sysdeps/unix/sysv/linux/raise.c:56
> #1 0x00007f46c602a8c8 in __GI_abort () at abort.c:90
> #2 0x00000000008543c9 in PAT_abort ()
> #3 0x000000000085140d in patchIllInsHandler ()
> #4 <signal handler called>
> #5 pthread_detach (th=139933037614848) at pthread_detach.c:50
> #6 0x0000000000829759 in qemu_thread_create (thread=thread@entry=0x7ffdaa8205e0, name=name@entry=0x94d94a "io-task-worker", start_routine=start_routine@entry=0x7eb9a0 <qio_task_thread_worker>,
> arg=arg@entry=0x3f5cf70, mode=mode@entry=1) at util/qemu_thread_posix.c:512
> #7 0x00000000007ebc96 in qio_task_run_in_thread (task=0x31db2c0, worker=worker@entry=0x7e7e40 <qio_channel_socket_connect_worker>, opaque=0xcd23380, destroy=0x7f1180 <qapi_free_SocketAddress>)
> at io/task.c:141
> #8 0x00000000007e7f33 in qio_channel_socket_connect_async (ioc=ioc@entry=0x626c0b0, addr=<optimized out>, callback=callback@entry=0x55e080 <qemu_chr_socket_connected>, opaque=opaque@entry=0x42862c0,
> destroy=destroy@entry=0x0) at io/channel_socket.c:194
> #9 0x000000000055bdd1 in socket_reconnect_timeout (opaque=0x42862c0) at qemu_char.c:4744
> #10 0x00007f46c72483b3 in g_timeout_dispatch () from /usr/lib64/libglib-2.0.so.0
> #11 0x00007f46c724799a in g_main_context_dispatch () from /usr/lib64/libglib-2.0.so.0
> #12 0x000000000076c646 in glib_pollfds_poll () at main_loop.c:228
> #13 0x000000000076c6eb in os_host_main_loop_wait (timeout=348000000) at main_loop.c:273
> #14 0x000000000076c815 in main_loop_wait (nonblocking=nonblocking@entry=0) at main_loop.c:521
> #15 0x000000000056a511 in main_loop () at vl.c:2076
> #16 0x0000000000420705 in main (argc=<optimized out>, argv=<optimized out>, envp=<optimized out>) at vl.c:4940
>
> The root cause of this problem is a bug of glibc(version 2.17,the latest version has the same bug),
> let's see what happened in glibc's code.
> Here is the code slice of pthread_detach.c
>
> 25 int
> 26 pthread_detach (pthread_t th)
> 27 {
> 28 struct pthread *pd = (struct pthread *) th;
> 29
> 30 /* Make sure the descriptor is valid. */
> 31 if (INVALID_NOT_TERMINATED_TD_P (pd))
> 32 /* Not a valid thread handle. */
> 34 return ESRCH;
> 35
> 36 int result = 0;
> 37 /* Mark the thread as detached. */
> 38 if (atomic_compare_and_exchange_bool_acq (&pd->joinid, pd, NULL))
> 39 {
> 40 /* There are two possibilities here. First, the thread might
> 41 already be detached. In this case we return EINVAL.
> 42 Otherwise there might already be a waiter. The standard does
> 43 not mention what happens in this case. */
> 44 if (IS_DETACHED (pd))
> 45 result = EINVAL;
> 46 }
> 47 else
> 48 /* Check whether the thread terminated meanwhile. In this case we
> 49 will just free the TCB. */
> 50 if ((pd->cancelhandling & EXITING_BITMASK) != 0)
> 51 /* Note that the code in __free_tcb makes sure each thread
> 52 control block is freed only once. */
> 53 __free_tcb (pd);
> 54 return result;
> 55}
>
> QEMU get a segfault at line 50, becasue pd is an invalid address.
> pd is still valid at line 38 when set pd->joinid = pd, at this moment,
> created thread is just exiting(only keeps runing for a short time),
> created thread is running in code of start_thread:
>
> 404 /* If the thread is detached free the TCB. */
> 405 if (IS_DETACHED (pd))
> 406 /* Free the TCB. */
> 407 __free_tcb (pd);
> created thread found that pd is detached, so it freed pd, in this case,
> pd became an invalid address.
>
> I rewrite qemu_thread_create to move detach_thread from creating thread to created
> to avoid this concurrency problem.
>
> Change-Id: I2293d5be1526241cf58785d701b922f2ffc6491b
> Signed-off-by: linzhecheng <linzhecheng@huawei.com>
>
> diff --git a/include/qemu/thread-posix.h b/include/qemu/thread-posix.h
> index f3f47e4..4c6dbb8 100644
> --- a/include/qemu/thread-posix.h
> +++ b/include/qemu/thread-posix.h
> @@ -44,4 +44,12 @@ struct QemuThread {
> pthread_t thread;
> };
>
> +typedef struct {
> + void *(*start_routine)(void *);
> + void *arg;
> + char *name;
> + int mode;
> +} QemuThreadArgs;
> +
> +
> #endif
> diff --git a/util/qemu-thread-posix.c b/util/qemu-thread-posix.c
> index 7306475..e5bdb6b 100644
> --- a/util/qemu-thread-posix.c
> +++ b/util/qemu-thread-posix.c
> @@ -489,6 +489,30 @@ static void qemu_thread_set_name(QemuThread *thread, const char *name)
> #endif
> }
>
> +static void *qemu_thread_start(void *args)
> +{
> + QemuThreadArgs *qemu_thread_args;
> + void *ret;
> + QemuThread qemu_thread;
> +
> + qemu_thread_args = args;
> + qemu_thread_get_self(&qemu_thread);
> +
> + if (qemu_thread_args->name) {
> + qemu_thread_set_name(&qemu_thread, qemu_thread_args->name);
> + g_free(qemu_thread_args->name);
> + }
> +
> + if (qemu_thread_args->mode == QEMU_THREAD_DETACHED) {
> + pthread_detach(qemu_thread.thread);
> + }
> + ret = qemu_thread_args->start_routine(qemu_thread_args->arg);
> +
> + g_free(qemu_thread_args);
> + return ret;
> +}
> +
> +
> void qemu_thread_create(QemuThread *thread, const char *name,
> void *(*start_routine)(void*),
> void *arg, int mode)
> @@ -496,6 +520,7 @@ void qemu_thread_create(QemuThread *thread, const char *name,
> sigset_t set, oldset;
> int err;
> pthread_attr_t attr;
> + QemuThreadArgs *qemu_thread_args;
>
> err = pthread_attr_init(&attr);
> if (err) {
> @@ -505,20 +530,18 @@ void qemu_thread_create(QemuThread *thread, const char *name,
> /* Leave signal handling to the iothread. */
> sigfillset(&set);
> pthread_sigmask(SIG_SETMASK, &set, &oldset);
> - err = pthread_create(&thread->thread, &attr, start_routine, arg);
> +
> + qemu_thread_args = g_new0(QemuThreadArgs, 1);
> + qemu_thread_args->mode = mode;
> + qemu_thread_args->name = name_threads ? g_strdup_printf("%s", name) : NULL;
> + qemu_thread_args->start_routine = start_routine;
> + qemu_thread_args->arg = arg;
> +
> + err = pthread_create(&thread->thread, &attr,
> + qemu_thread_start, qemu_thread_args);
> if (err)
> error_exit(err, __func__);
>
> - if (name_threads) {
> - qemu_thread_set_name(thread, name);
> - }
> -
> - if (mode == QEMU_THREAD_DETACHED) {
> - err = pthread_detach(thread->thread);
> - if (err) {
> - error_exit(err, __func__);
> - }
> - }
> pthread_sigmask(SIG_SETMASK, &oldset, NULL);
>
> pthread_attr_destroy(&attr);
> --
> 1.8.3.1
>
>
>
Reviewed-by: Fam Zheng <famz@redhat.com>
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [Qemu-devel] [PATCH v4] thread: move detach_thread from creating thread to created thread
2017-11-28 4:46 [Qemu-devel] [PATCH v4] thread: move detach_thread from creating thread to created thread linzhecheng
2017-11-28 6:27 ` Fam Zheng
@ 2017-11-29 16:18 ` Eric Blake
2017-11-29 16:28 ` Gonglei (Arei)
1 sibling, 1 reply; 6+ messages in thread
From: Eric Blake @ 2017-11-29 16:18 UTC (permalink / raw)
To: linzhecheng, qemu-devel
Cc: aliguori, famz, wangxinxin.wang, arei.gonglei, pbonzini
On 11/27/2017 10:46 PM, linzhecheng wrote:
> If we create a thread with QEMU_THREAD_DETACHED mode, QEMU may get a segfault in a low probability.
>
>
> The root cause of this problem is a bug of glibc(version 2.17,the latest version has the same bug),
> let's see what happened in glibc's code.
Have you reported this bug to the glibc folks, and if so, can we include
a URL to the glibc bugzilla?
Working around the glibc bug is nice, but glibc should really be fixed
so that other projects do not have to continue working around it.
>
> QEMU get a segfault at line 50, becasue pd is an invalid address.
> pd is still valid at line 38 when set pd->joinid = pd, at this moment,
> created thread is just exiting(only keeps runing for a short time),
s/runing/running/
--
Eric Blake, Principal Software Engineer
Red Hat, Inc. +1-919-301-3266
Virtualization: qemu.org | libvirt.org
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [Qemu-devel] [PATCH v4] thread: move detach_thread from creating thread to created thread
2017-11-29 16:18 ` Eric Blake
@ 2017-11-29 16:28 ` Gonglei (Arei)
2017-11-29 16:38 ` Paolo Bonzini
0 siblings, 1 reply; 6+ messages in thread
From: Gonglei (Arei) @ 2017-11-29 16:28 UTC (permalink / raw)
To: Eric Blake, linzhecheng, qemu-devel@nongnu.org
Cc: famz@redhat.com, wangxin (U), pbonzini@redhat.com
> -----Original Message-----
> From: Eric Blake [mailto:eblake@redhat.com]
> Sent: Thursday, November 30, 2017 12:19 AM
> To: linzhecheng; qemu-devel@nongnu.org
> Cc: aliguori@us.ibm.com; famz@redhat.com; wangxin (U); Gonglei (Arei);
> pbonzini@redhat.com
> Subject: Re: [Qemu-devel] [PATCH v4] thread: move detach_thread from
> creating thread to created thread
>
> On 11/27/2017 10:46 PM, linzhecheng wrote:
> > If we create a thread with QEMU_THREAD_DETACHED mode, QEMU may
> get a segfault in a low probability.
> >
>
> >
> > The root cause of this problem is a bug of glibc(version 2.17,the latest version
> has the same bug),
> > let's see what happened in glibc's code.
>
> Have you reported this bug to the glibc folks, and if so, can we include
> a URL to the glibc bugzilla?
>
No, we didn't do that yet. :(
> Working around the glibc bug is nice, but glibc should really be fixed
> so that other projects do not have to continue working around it.
>
>
Yes, agree.
Regards,
-Gonglei
> >
> > QEMU get a segfault at line 50, becasue pd is an invalid address.
> > pd is still valid at line 38 when set pd->joinid = pd, at this moment,
> > created thread is just exiting(only keeps runing for a short time),
>
> s/runing/running/
>
> --
> Eric Blake, Principal Software Engineer
> Red Hat, Inc. +1-919-301-3266
> Virtualization: qemu.org | libvirt.org
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [Qemu-devel] [PATCH v4] thread: move detach_thread from creating thread to created thread
2017-11-29 16:28 ` Gonglei (Arei)
@ 2017-11-29 16:38 ` Paolo Bonzini
2017-11-29 16:41 ` Gonglei (Arei)
0 siblings, 1 reply; 6+ messages in thread
From: Paolo Bonzini @ 2017-11-29 16:38 UTC (permalink / raw)
To: Gonglei (Arei), Eric Blake, linzhecheng, qemu-devel@nongnu.org
Cc: famz@redhat.com, wangxin (U)
On 29/11/2017 17:28, Gonglei (Arei) wrote:
>>> The root cause of this problem is a bug of glibc(version 2.17,the latest version
>> has the same bug),
>>> let's see what happened in glibc's code.
>> Have you reported this bug to the glibc folks, and if so, can we include
>> a URL to the glibc bugzilla?
>>
> No, we didn't do that yet. :(
It's here:
https://sourceware.org/bugzilla/show_bug.cgi?id=19951.
I've added a note to the commit message.
Paolo
^ permalink raw reply [flat|nested] 6+ messages in thread
* Re: [Qemu-devel] [PATCH v4] thread: move detach_thread from creating thread to created thread
2017-11-29 16:38 ` Paolo Bonzini
@ 2017-11-29 16:41 ` Gonglei (Arei)
0 siblings, 0 replies; 6+ messages in thread
From: Gonglei (Arei) @ 2017-11-29 16:41 UTC (permalink / raw)
To: Paolo Bonzini, Eric Blake, linzhecheng, qemu-devel@nongnu.org
Cc: famz@redhat.com, wangxin (U)
> -----Original Message-----
> From: Paolo Bonzini [mailto:pbonzini@redhat.com]
> Sent: Thursday, November 30, 2017 12:39 AM
> To: Gonglei (Arei); Eric Blake; linzhecheng; qemu-devel@nongnu.org
> Cc: famz@redhat.com; wangxin (U)
> Subject: Re: [Qemu-devel] [PATCH v4] thread: move detach_thread from
> creating thread to created thread
>
> On 29/11/2017 17:28, Gonglei (Arei) wrote:
> >>> The root cause of this problem is a bug of glibc(version 2.17,the latest
> version
> >> has the same bug),
> >>> let's see what happened in glibc's code.
> >> Have you reported this bug to the glibc folks, and if so, can we include
> >> a URL to the glibc bugzilla?
> >>
> > No, we didn't do that yet. :(
>
> It's here:
>
> https://sourceware.org/bugzilla/show_bug.cgi?id=19951.
>
> I've added a note to the commit message.
>
Nice~ :)
Thanks,
-Gonglei
^ permalink raw reply [flat|nested] 6+ messages in thread
end of thread, other threads:[~2017-11-29 16:41 UTC | newest]
Thread overview: 6+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-11-28 4:46 [Qemu-devel] [PATCH v4] thread: move detach_thread from creating thread to created thread linzhecheng
2017-11-28 6:27 ` Fam Zheng
2017-11-29 16:18 ` Eric Blake
2017-11-29 16:28 ` Gonglei (Arei)
2017-11-29 16:38 ` Paolo Bonzini
2017-11-29 16:41 ` Gonglei (Arei)
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).