Commit 81769990 authored by Tetsuo Handa's avatar Tetsuo Handa Committed by Tim Gardner

UBUNTU: SAUCE: kthread: Do not leave kthread_create() immediately upon SIGKILL.

Commit 786235ee "kthread: make kthread_create() killable" changed to
leave kthread_create() as soon as receiving SIGKILL. But this change
caused boot failures if systemd-udevd worker process received SIGKILL
due to systemd's hardcoded 30 seconds timeout while loading fusion
driver using finit_module() [1].

Linux kernel people think that the systemd's hardcoded timeout is a
systemd bug. But systemd people think that loading of kernel module
needs more than 30 seconds is a kernel module's bug.

Although Linux kernel people are expecting fusion driver module not
to take more than 30 seconds, it will definitely not in time for
trusty kernel. Also, nobody can prove that fusion driver module is
the only case which is affected by commit 786235ee.

Therefore, this patch changes kthread_create() to wait for up to 10
seconds after receiving SIGKILL, unless chosen by the OOM killer,
in order to give the kthreadd a chance to complete the request.
The side effect of this patch is that current thread's response to
SIGKILL is delayed for a bit (likely less than a second, unlikely
10 seconds).

  [1] https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1276705Reported-by: default avatarPierre Fersing <pierref@pierref.org>
Reported-by: default avatarFaidon Liambotis <paravoid@debian.org>
Signed-off-by: default avatarTetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>
Signed-off-by: default avatarTim Gardner <tim.gardner@canonical.com>
parent f0c41d20
...@@ -295,6 +295,17 @@ struct task_struct *kthread_create_on_node(int (*threadfn)(void *data), ...@@ -295,6 +295,17 @@ struct task_struct *kthread_create_on_node(int (*threadfn)(void *data),
* new kernel thread. * new kernel thread.
*/ */
if (unlikely(wait_for_completion_killable(&done))) { if (unlikely(wait_for_completion_killable(&done))) {
int i = 0;
/*
* I got SIGKILL, but wait for 10 more seconds for completion
* unless chosen by the OOM killer. This delay is there as a
* workaround for boot failure caused by SIGKILL upon device
* driver initialization timeout.
*/
while (i++ < 10 && !test_tsk_thread_flag(current, TIF_MEMDIE))
if (wait_for_completion_timeout(&done, HZ))
goto ready;
/* /*
* If I was SIGKILLed before kthreadd (or new kernel thread) * If I was SIGKILLed before kthreadd (or new kernel thread)
* calls complete(), leave the cleanup of this structure to * calls complete(), leave the cleanup of this structure to
...@@ -308,6 +319,7 @@ struct task_struct *kthread_create_on_node(int (*threadfn)(void *data), ...@@ -308,6 +319,7 @@ struct task_struct *kthread_create_on_node(int (*threadfn)(void *data),
*/ */
wait_for_completion(&done); wait_for_completion(&done);
} }
ready:
task = create->result; task = create->result;
if (!IS_ERR(task)) { if (!IS_ERR(task)) {
static const struct sched_param param = { .sched_priority = 0 }; static const struct sched_param param = { .sched_priority = 0 };
......
Markdown is supported
0%
or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment