Commits · bc5184b165b855bb02e22e0b09cf195b3e37059e · Kirill Smelkov / linux

27 Aug, 2004 38 commits

[PATCH] Unaccount VM_DONTCOPY vmas properly · bc5184b1

William Lee Irwin III authored Aug 26, 2004

Unaccount VM_DONTCOPY vmas properly; the child inherits the whole of the
parent's virtual accounting from the memcpy() in copy_mm(), but the
VM_DONTCOPY check here is where a decision is made for the child not to
inherit the vmas corresponding to some accounted memory usages.  Hence,
unaccount them when skipping over them here.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

bc5184b1

[PATCH] /proc/pid/statm accounting fixes · 591e0fdb

William Lee Irwin III authored Aug 26, 2004

Account reserved memory properly as per acahalan's speecified semantics.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

591e0fdb

[PATCH] speed up /proc/pid/statm for !CONFIG_PROC_FS · 323d129f

William Lee Irwin III authored Aug 26, 2004

Remove the accounting overhead when CONFIG_PROC_FS is not defined.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

323d129f

[PATCH] fix text reporting in O(1) proc_pid_statm() · 9683ede1

William Lee Irwin III authored Aug 26, 2004

Some kind of brainfart happened here, though it's not visible on the
default display from top(1) etc.  This patch fixes up the gibberish I
mistakenly put down for text with the proper text size, and subtracts it
from data as per the O(vmas) code beforehand.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

9683ede1

[PATCH] O(1) proc_pid_statm() · 6ac0a8d7

William Lee Irwin III authored Aug 26, 2004

Merely removing down_read(&mm->mmap_sem) from task_vsize() is too
half-assed to let stand. The following patch removes the vma iteration
as well as the down_read(&mm->mmap_sem) from both task_mem() and
task_statm() and callers for the CONFIG_MMU=y case in favor of
accounting the various stats reported at the times of vma creation,
destruction, and modification. Unlike the 2.4.x patches of the same
name, this has no per-pte-modification overhead whatsoever.

This patch quashes end user complaints of top(1) being slow as well as
kernel hacker complaints of per-pte accounting overhead simultaneously.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

6ac0a8d7

[PATCH] task_vsize() locking cleanup · 384aabda

William Lee Irwin III authored Aug 26, 2004

task_vsize() doesn't need mm->mmap_sem for the CONFIG_MMU case; the
semaphore doesn't prevent mm->total_vm from going stale or getting
inconsistent with other numbers regardless. Also, KSTK_EIP() and
KSTK_ESP() don't want or need protection from mm->mmap_sem either. So this
pushes mm->mmap_sem to task_vsize() in the CONFIG_MMU=n task_vsize().

Also, hoist the prototype of task_vsize() into proc_fs.h

The net result of this is a small speedup of procps for CONFIG_MMU.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

384aabda

[PATCH] include asm/page.h for virt_to_page() · 64bb92da

Dave Hansen authored Aug 26, 2004

asm/page.h seems to be the accepted place to declare virt_to_page() on a vast
majority of architectures.  This patch makes sure that a few files which use
that function also directly include the header.
Signed-off-by: Dave Hansen <haveblue@us.ibm.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

64bb92da

[PATCH] don't align virt_to_page() args · 7b35738e

Dave Hansen authored Aug 26, 2004

__pa() is always be consistent inside of a single page.  The next thing
virt_to_page() does after that is shift down the address, killing the bits
that __change_page_attr() just masked off.

Remove the superfluous masking.
Signed-off-by: Dave Hansen <haveblue@us.ibm.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

7b35738e

[PATCH] vmalloc_fault() cleanup · d1563e85

Dave Hansen authored Aug 26, 2004

Store the physical pgd address in a different variable than the virtual
address.

There's no real reason to only use 1 variable here, other than saving a
line of code.  But, the types really are different and we might as well
just spell that out explicitly.
Signed-off-by: Dave Hansen <haveblue@us.ibm.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

d1563e85

[PATCH] call virt_to_page() with void*, not UL · 3d5dc24c

Dave Hansen authored Aug 26, 2004

I'm sure there's a good reason for these functions to take virtual addresses
as unsigned longs, so suppress the warnings and cast them to the proper types
before calling the virt/phys conversion functions

A perfectly acceptable alternative would be to go and change free_pages() to
stop taking unsigned longs for virtual addresses, but this has a much smaller
impact.
Signed-off-by: Dave Hansen <haveblue@us.ibm.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

3d5dc24c

[PATCH] cast PAGE_OFFSET math to void* in early printk · 0d62f010

Dave Hansen authored Aug 26, 2004

__pa() should take a void*.  This adds the proper cast.
Signed-off-by: Dave Hansen <haveblue@us.ibm.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

0d62f010

[PATCH] reduce casting in sysenter.c · 14be1ebc

Dave Hansen authored Aug 26, 2004

Ran across this because it's another place where an unsigned long is passed
directly to __pa(). Making the "page" variable a void* seems a bit more
natural than an unsigned long and reduces the net number of casts by 1.
Without it, we probably need another (void *) cast in the __pa() call.

For more explanation as to why this was probably done originally, see this
post: http://marc.theaimsgroup.com/?l=linux-mm&m=109155379124628&w=2Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

14be1ebc

[PATCH] Split timer resources · b7515483

Pierre Ossman authored Aug 26, 2004

The kernel currently allocates the range 0x40-0x5f for timer calls.  This
causes conflicts with other hardware using these ports (In my case a
Winbond W83L519D SD/MMC card reader).  This patch splits the resource into
the ports actually needed.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

b7515483

[PATCH] improve OProfile on many-way systems · ddebb8b6

John Levon authored Aug 26, 2004

Anton prompted me to get this patch merged.  It changes the core buffer
sync algorithm of OProfile to avoid global locks wherever possible.  Anton
tested an earlier version of this patch with some success.  I've lightly
tested this applied against 2.6.8.1-mm3 on my two-way machine.

The changes also have the happy side-effect of losing less samples after
munmap operations, and removing the blind spot of tasks exiting inside the
kernel.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

ddebb8b6

[PATCH] copy_mount_options size fix · 6e7ca99d

Andrew Morton authored Aug 26, 2004

davem says that copy_mount_options is failing in obscure ways if the
architecture's copy_from_user() doesn't return an exact count of the number of
uncopied bytes.

Fixing that up in each architecture is a pain - it involves falling back to
byte-at-a-time copies.

It's simple to open-code this in namespace.c.  If we find other places in the
kernel which care about this we can promote this to a global function.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

6e7ca99d

[PATCH] fix MT reparenting when thread group leader dies · d00e2ab5

Roland McGrath authored Aug 26, 2004

When the initial thread in a multi-threaded program dies (the thread group
leader), its child processes are wrongly orphaned, and thereafter when
other threads die their child processes are also orphaned even though live
threads remain in the parent process that can call wait. I have a small
(under 100 lines), POSIX-compliant test program that demonstrates this
using -lpthread (NPTL) if anyone is interested in seeing it.

The bug is that forget_original_parent moves children to the dead parent's
group leader if it's alive, but if not it orphans them. I've changed it so
it instead reparents children to any other live thread in the dead parent's
group (not even preferring the group leader). Children go to init only if
there are no live threads in the parent's group at all. These are the
correct semantics for fork children of POSIX threads.

The second part of the change is to do the CLONE_PARENT behavior always for
CLONE_THREAD, i.e. make sure that each new thread's parent link points to
the real parent of the process and never another thread in its own group.
Without this, when the group leader dies leaving a sole live thread in the
group, forget_original_parent will try to reparent that thread to itself
because it's a child of the dying group leader. Rather handling this case
specially to reparent to the group leader's parent instead, it's more
efficient just to make sure that noone ever has a parent link to inside his
own thread group. Now the reparenting work never needs to be done for
threads created in the same group when their creator thread dies. The only
change from losing the who-created-whom information is when you look at
"PPid:" in /proc/PID/task/TID/status. For purposes of all direct system
calls, it was already as if CLONE_THREAD threads had the parent of the
group leader. (POSIX provides no way to keep track of which thread created
which other thread with pthread_create.)
Signed-off-by: Roland McGrath <roland@redhat.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

d00e2ab5

[PATCH] mostly remove module_parm() · 37440b38

Rusty Russell authored Aug 26, 2004

MODULE_PARM() was marked obsolete.  Remove it from everything except
drivers/ and arch/.

Naturally, such a widespread change may introduce bugs for some of the
non-trivial cases, and where in doubt I used "0" as permissions arg (ie.
won't appear in sysfs).  Individual authors should think about whether that
would be useful.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

37440b38

[PATCH] dnotify + autofs may create signal/restart syscall loop · 3a7d4fac

Jeff Mahoney authored Aug 26, 2004

I saw a recent bug report that showed when a process set up a dnotify against
the autofs root and then attempted an access(2) call inside the autofs
namespace on a mount that would fail, it would create a signal/restart loop.

The cause is that the autofs code checks to see if any signals are pending
after it waits on a response from the autofs daemon. If it finds any, it
assumes that autofs_wait was interrupted, and that it should return
-ERESTARTNOINTR. The problem with this is that a signal_pending(current)
check will return true if *any* signals were received, not just if a signal
that interrupted the wait was received. autofs_wait explicitly blocks all
signals except for SIGKILL, SIGQUIT, and SIGINT before calling
interruptible_sleep_on.

The effect is that if a dnotify is set against the autofs root, when the
autofs daemon creates the directory, a dnotify event will be sent to the
originating process. Since the code in autofs_root_lookup doesn't check to
see what signals are actually pending, it bails early, telling the caller to
try again. The loop goes on forever until interrupted via one of the actual
interrupting signals.

The following patch makes both autofs_root_lookup and autofs4_root_lookup
verify that one of its defined "shutdown" signals are pending before bailing
out early. Any other signal should be delivered later, as expected. It
doesn't matter if the signal occured outside of the sleep in autofs_wait. The
calling process will either go away or try again.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

3a7d4fac

[PATCH] tmpfs atomicity fix · 8843170e

Hugh Dickins authored Aug 26, 2004

tmpfs must use __copy_from_user_inatomic now, to avoid might_sleep warning,
when knowingly using __copy_from_user with an atomic kmap.
Signed-off-by: Hugh Dickins <hugh@veritas.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

8843170e

[PATCH] Add a few might_sleep() checks · 026a14f0

Ingo Molnar authored Aug 26, 2004

Add a whole bunch more might_sleep() checks.  We also enable might_sleep()
checking in copy_*_user().  This was non-trivial because of the "copy_*_user()
in atomic regions" trick would generate false positives.  Fix that up by
adding a new __copy_*_user_inatomic(), which avoids the might_sleep() check.

Only i386 is supported in this patch.

With: Arjan van de Ven <arjanv@redhat.com>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

026a14f0

[PATCH] ppc64: clean up unused macro · c46cde9d

David Gibson authored Aug 26, 2004

After the recent SLB and STAB cleanups, the ppc64 KERNEL_CONTEXT() macro is
no longer used anywhere.  This patch removes it.
Signed-off-by: David Gibson <dwg@au1.ibm.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

c46cde9d

[PATCH] ppc32: refactor common Book-E exception handling macros · 3c2dde6d

Kumar Gala authored Aug 26, 2004

Refefactor common Book-E exception handling macros into a single file to
reduce code duplication.
Signed-off-by: Kumar Gala <kumar.gala@freescale.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

3c2dde6d

[PATCH] reduce size of struct inode on 64bit · a4eadb56

Anton Blanchard authored Aug 26, 2004

Reduce the size of struct inode on 64bit architectures by reducing padding.
This assumes spinlocks are 32bit or less which is the case on most
architectures.

This reduces inode structs by 24 bytes on ppc64, and on ext2 increases the
number of inodes in a 4kB slab from 5 to 6.
Signed-off-by: Anton Blanchard <anton@samba.org>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

a4eadb56

[PATCH] [un]register_ioctl32_conversion() stubs · b04f69c9

Andrew Morton authored Aug 26, 2004

The megaraid driver is calling these, but they don't exist if !CONFIG_COMPAT. 
Add the necessary stubs, and clean a few things up.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

b04f69c9

[PATCH] ppc64: remove iseries profiling · 94ebcd44

William Lee Irwin III authored Aug 26, 2004

From: Anton Blanchard <anton@samba.org>

- Remove iseries specific profiling, there were no complaints when I
  suggested removal on the linuxppc64 list a few weeks ago.

- Also remove another instance of that pesky abs() function.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

94ebcd44

[PATCH] make prof_buffer atomic_t · 0794bff0

William Lee Irwin III authored Aug 26, 2004

Convert prof_buffer to an array of atomic_t instead of sometimes atomic_t,
sometimes unsigned int.  Also, bootmem rounds up internally, so blow away some
crap code there.
Signed-off-by: William Irwin <wli@holomorphy.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

0794bff0

[PATCH] make private profile state static · 9e22a072

William Lee Irwin III authored Aug 26, 2004

Make the various bits of state no longer used anywhere else static to
kernel/profile.c
Signed-off-by: William Irwin <wli@holomorphy.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

9e22a072

[PATCH] move profile_operations · 374053e1

William Lee Irwin III authored Aug 26, 2004

proc_misc.c is a trainwreck.  Move the file_operations for /proc/profile into
kernel/profile.c and call the profiling setup via initcall.
Signed-off-by: William Irwin <wli@holomorphy.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

374053e1

[PATCH] consolidate hit count increments in profile_tick() · f6853ce5

William Lee Irwin III authored Aug 26, 2004

With prof_cpu_mask and profile_pc() in hand, the core is now able to perform
all the profile accounting work on behalf of arches.  Consolidate the profile
accounting and convert all arches to call the core function.
Signed-off-by: William Irwin <wli@holomorphy.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

f6853ce5

[PATCH] introduce profile_pc() · f3549b15

William Lee Irwin III authored Aug 26, 2004

The program counter calculation from pt_regs is the only portion of profile
accounting that differs across various architectures.  This is usually
instruction_pointer(regs), but to handle the few arches where it isn't,
introduce profile_pc().
Signed-off-by: William Irwin <wli@holomorphy.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

f3549b15

[PATCH] consolidate prof_cpu_mask · 423284ee

William Lee Irwin III authored Aug 26, 2004

Handling of prof_cpu_mask is grossly inconsistent.  Some arches have it as a
cpumask_t, others unsigned long, and even within arches it's treated
inconsistently.  This makes it cpumask_t across the board, and consolidates
the handling in kernel/profile.c
Signed-off-by: William Irwin <wli@holomorphy.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

423284ee

[PATCH] schedule profileing · f0fd8a8d

Arjan van de Ven authored Aug 26, 2004

From: William Lee Irwin III <wli@holomorphy.com>

The patch (from Ingo) below is quite interesting, it allows the use of
readprofile not for statistical tine sampling, but for seeing where calls to
schedule() come from, so it can give some insight to the "where do my context
switches come from" question.

Boot with `profile=schedul2' to activate this feature.
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

f0fd8a8d

[PATCH] Hotplug CPU vs TASK_ZOMBIEs: The Sequel to Hotplug CPU vs TASK_DEAD · e271fc85

Rusty Russell authored Aug 26, 2004

release_task can sleep.  Sleeping allows a CPU to go down underneath you.
release_task removes you from the tasklist, so you don't get migrated off the
CPU: BUG() in sched.c.

In last week's episode, our dashing hero (Ingo Molnar) solved this for
self-reaping tasks by grabbing the hotplug cpu lock to prevent this. 
However, in an unexpected twist, the problem remains for tasks whose
parents call release_task on them: the zombies are off the task list, and
lurk on the dead CPU.

Fortunately, the comedic sidekick (Rusty Russell) has an answer: let's make
the hotplug callback walk the runqueue of the dead CPU as well, taking care
of the zombies.

1) Restore exit.c to its former form.  The comment is incorrect: sched.c
   checks PF_DEAD, not the state, to decide to do the final
   put_task_struct(), and it does it for all tasks, self-reaping or no.

2) Implement migrate_dead_tasks() in the sched.c hotplug CPU callback.

3) Rename migrate_all_tasks() to migrate_live_tasks().
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

e271fc85

[PATCH] Fix CPU Hotplug: neaten migrate_all_tasks. · a5212682

Rusty Russell authored Aug 26, 2004

A followup patch wants to do forced migration, so separate that part of the
code out of migrate_all_tasks().
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

a5212682

[PATCH] md: fix problems with checksum handling in MD superblocks. · e50873c0

Neil Brown authored Aug 26, 2004

md currently uses csum_partial to calculate checksums for superblocks.
However this function is not consistent across all architectures.  Some
(i386) to a 32bit csum.  Some (alpha) do a 16 bit csum.  This makes it hard
for userspace to keep up.

So we provide a generic routine (that does exactly what the i386
csum_partial does) and:

- When setting the csum, use csum_partial so that old kernels will still
  recognise the superblock

- When checking the csum, allow either csum_partial or the new generic
  code to provide the right csum.  This allows user-space to just use the
  common code and always work.

Also modify the csum for version-1 superblock (which currently aren't being
used) to always user a predictable checksum algorithm.

Thanks to Mike Tran <mhtran@us.ibm.com> for noticing this.
Signed-off-by: Neil Brown <neilb@cse.unsw.edu.au>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

e50873c0

[PATCH] fix sysrq support in sn_console.c · 9c9481ab

Jesse Barnes authored Aug 26, 2004

In porting the sn_console driver to the serial core, we lost sysrq support.
 This patch fixes it and removes a few unncessary #ifdefs.  Can you please
send it on to Linus asap?  sysrq is a *really* nice thing to have.
Signed-off-by: Jesse Barnes <jbarnes@sgi.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

9c9481ab

[PATCH] fix show_mem on discontig machines · 1685a771

Jesse Barnes authored Aug 26, 2004

Dave Hansen recently did some bootmem and paging init cleanups, but I
missed this little bit when I tested his original patches.  We need to
initialize pgdat->node_mem_map correctly since a) we're using vmem_map, and
b) the core won't do it for us since we have a valid node_start_pfn I
believe.
Signed-off-by: Jesse Barnes <jbarnes@sgi.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

1685a771

[PATCH] Use fixed size buffer instead of kmalloc for m_class in ip_map · 0a453383

Neil Brown authored Aug 26, 2004

This avoids lots of bothersome memory management and is generally
cleaner. 
Signed-off-by: Neil Brown <neilb@cse.unsw.edu.au>
Signed-off-by: J. Bruce Fields <bfields@citi.umich.edu>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>

0a453383

26 Aug, 2004 2 commits
- Merge bk://kernel.bkbits.net/davem/net-2.6 · 1e632d02
  Linus Torvalds authored Aug 26, 2004
```
into ppc970.osdl.org:/home/torvalds/v2.6/linux
```
  1e632d02
- Merge davem@nuts.davemloft.net:/disk1/BK/net-2.6 · d2de2127
  David S. Miller authored Aug 26, 2004
```
into kernel.bkbits.net:/home/davem/net-2.6
```
  d2de2127