mm, oom: do not fail __GFP_NOFAIL allocation if oom killer is disabled (e009d5dc) · Commits · nexedi / linux

Commit e009d5dc authored Mar 12, 2015 by

Michal Hocko Committed by Linus Torvalds Mar 12, 2015

mm, oom: do not fail __GFP_NOFAIL allocation if oom killer is disabled

Tetsuo Handa has pointed out that __GFP_NOFAIL allocations might fail
after OOM killer is disabled if the allocation is performed by a kernel
thread.  This behavior was introduced from the very beginning by
7f33d49a ("mm, PM/Freezer: Disable OOM killer when tasks are frozen").
 This means that the basic contract for the allocation request is broken
and the context requesting such an allocation might blow up unexpectedly.

There are basically two ways forward.

1) move oom_killer_disable after kernel threads are frozen.  This has a
   risk that the OOM victim wouldn't be able to finish because it would
   depend on an already frozen kernel thread.  This would be really tricky
   to debug.

2) do not fail GFP_NOFAIL allocation no matter what and risk a
   potential Freezable kernel threads will loop and fail the suspend.
   Incidental allocations after kernel threads are frozen will at least
   dump a warning - if we are lucky and the serial console is still active
   of course...

This patch implements the later option because it is safer.  We would see
warning rather than allocation failures for the kernel threads which would
blow up otherwise and have a higher chances to identify __GFP_NOFAIL users
from deeper pm code.
Signed-off-by: Michal Hocko <mhocko@suse.cz>
Acked-by: David Rientjes <rientjes@gooogle.com>
Cc: Johannes Weiner <hannes@cmpxchg.org>
Cc: Tetsuo Handa <penguin-kernel@i-love.sakura.ne.jp>
Cc: "Rafael J. Wysocki" <rjw@rjwysocki.net>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>

parent 8792f777

Hide whitespace changes

Inline Side-by-side

View file @ e009d5dc

...	@@ -2373,7 +2373,8 @@ __alloc_pages_may_oom(gfp_t gfp_mask, unsigned int order,	...	@@ -2373,7 +2373,8 @@ __alloc_pages_may_oom(gfp_t gfp_mask, unsigned int order,
	goto out;		goto out;
	}		}
	/* Exhausted what can be done so it's blamo time */		/* Exhausted what can be done so it's blamo time */
	if (out_of_memory(ac->zonelist, gfp_mask, order, ac->nodemask, false))		if (out_of_memory(ac->zonelist, gfp_mask, order, ac->nodemask, false)
			\|\| WARN_ON_ONCE(gfp_mask & __GFP_NOFAIL))
	*did_some_progress = 1;		*did_some_progress = 1;
	out:		out:
	oom_zonelist_unlock(ac->zonelist, gfp_mask);		oom_zonelist_unlock(ac->zonelist, gfp_mask);
...		...

Please register or to comment