Commits · c8c69dc68792e85b14646e8a8219dae923b34feb · mirror / ccan

31 Mar, 2010 1 commit
- Joey's charset validation module. · c8c69dc6
  Joseph Adams authored Mar 31, 2010
  
  c8c69dc6
24 Feb, 2010 5 commits

asort: fix gcc warning. · e96830ae
Rusty Russell authored Feb 24, 2010

e96830ae
tools: fix warnings from Ubuntu strict compiler. · b7c94538
Rusty Russell authored Feb 24, 2010

b7c94538

tdb: handle processes dying during transaction commit. · 729fc5b9

Rusty Russell authored Feb 24, 2010

tdb transactions were designed to be robust against the machine
powering off, but interestingly were never designed to handle the case
where an administrator kill -9's a process during commit.  Because
recovery is only done on tdb_open, processes with the tdb already
mapped will simply use it despite it being corrupt and needing
recovery.

The solution to this is to check for recovery every time we grab a
data lock: we could have gained the lock because a process just died.
This has no measurable cost: here is the time for tdbtorture -s 0 -n 1
-l 10000:

Before:
	2.75 2.50 2.81 3.19 2.91 2.53 2.72 2.50 2.78 2.77 = Avg 2.75

After:
	2.81 2.57 3.42 2.49 3.02 2.49 2.84 2.48 2.80 2.43 = Avg 2.74
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

729fc5b9

tdb: cleanup: tdb_lock_list helper to cover tdb_lock and tdb_lock_nonblock · 497e23b2

Rusty Russell authored Feb 24, 2010

Reduce code duplication, and also gives us a central point for the next
patch which wants to cover all list locks.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

497e23b2

tdb: remove lock ops · d665ff82

Rusty Russell authored Feb 24, 2010

Now the transaction code uses the standard allrecord lock, that stops
us from trying to grab any per-record locks anyway.  We don't need to
have special noop lock ops for transactions.

This is a nice simplification: if you see brlock, you know it's really
going to grab a lock.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

d665ff82

23 Feb, 2010 3 commits

tdb: more testcase fixup. · d0b36a7c
Rusty Russell authored Feb 24, 2010

d0b36a7c

tdb: suppress record write locks when allrecord lock is taken. · f4ed2127

Rusty Russell authored Feb 24, 2010

Records themselves get (read) locked by the traversal code against delete.
Interestingly, this locking isn't done when the allrecord lock has been
taken, though the allrecord lock until recently didn't cover the actual
records (it now goes to end of file).

The write record lock, grabbed by the delete code, is not suppressed by
the allrecord lock, which causes us to punch a hole in that lock when we
release the write record lock. Make this consistent: *no* record locks
of any kind when the allrecord lock is taken.

f4ed2127

tdb: new test, cleanup old tests by centralizing lock tracking. · c4a9fd1b
Rusty Russell authored Feb 24, 2010

c4a9fd1b

22 Feb, 2010 21 commits

tdb: fix test to remove warning, and don't fail when tdb_check() barfs. · 26da0601
Rusty Russell authored Feb 22, 2010

26da0601

tdb: don't reduce file size on transaction recovery. · 3497b4d4

Rusty Russell authored Feb 22, 2010

There's little point in ever shrinking the file, and it definitely breaks in the case where a process has died during a transaction commit and other processes have the tdb mapped.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

3497b4d4

tdb: fix recovery reuse after crash (from SAMBA) · 8321967a

Rusty Russell authored Feb 22, 2010

commit b37b452cb8c1f56b37b04abe7bffdede371ca361
Author: Rusty Russell <rusty@rustcorp.com.au>
Date:   Thu Feb 4 23:59:54 2010 +1030

    tdb: fix recovery reuse after crash
    
    If a process (or the machine) dies after just after writing the
    recovery head (pointing at the end of file), the recovery record will filled
    with 0x42.  This will not invoke a recovery on open, since rec.magic
    != TDB_RECOVERY_MAGIC.
    
    Unfortunately, the first transaction commit will happily reuse that
    area: tdb_recovery_allocate() doesn't check the magic.  The recovery
    record has length 0x42424242, and it writes that back into the
    now-valid-looking transaction header) for the next comer (which
    happens to be tdb_wipe_all in my tests).
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

8321967a

tdb: fix tdbtorture seed printing, plus remove CLEAR_IF_FIRST · 6ad04b02
Rusty Russell authored Feb 22, 2010
```
With killing children, CLEAR_IF_FIRST can happen quite a bit.
```
6ad04b02

tdb: cleanup: remove ltype argument from _tdb_transaction_cancel. · f0d790f1

Rusty Russell authored Feb 22, 2010

Now the transaction allrecord lock the standard one, and thus is cleaned
in tdb_release_extra_locks(), _tdb_transaction_cancel() doesn't need to
know what type it is.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

f0d790f1

tdb: tdb_allrecord_lock/tdb_allrecord_unlock/tdb_allrecord_upgrade · 88541fe7

Rusty Russell authored Feb 22, 2010

Centralize locking of all chains of the tdb; rename _tdb_lockall to
tdb_allrecord_lock and _tdb_unlockall to tdb_allrecord_unlock, and
tdb_brlock_upgrade to tdb_allrecord_upgrade.

Then we use this in the transaction code. Unfortunately, if the transaction
code records that it has grabbed the allrecord lock read-only, write locks
will fail, so we treat this upgradable lock as a write lock, and mark it
as upgradable using the otherwise-unused offset field.

One subtlety: now the transaction code is using the allrecord_lock, the
tdb_release_extra_locks() function drops it for us, so we no longer need
to do it manually in _tdb_transaction_cancel.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

88541fe7

tdb: cleanup: always grab allrecord lock to infinity. · bbf2f9ce

Rusty Russell authored Feb 22, 2010

We were previously inconsistent with our "global" lock: the
transaction code grabbed it from FREELIST_TOP to end of file, and the
rest of the code grabbed it from FREELIST_TOP to end of the hash
chains. Change it to always grab to end of file for simplicity.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

bbf2f9ce

tdb: remove num_locks · 5945c4dc

Rusty Russell authored Feb 22, 2010

This was redundant before this patch series: it mirrored num_lockrecs
exactly.  It still does.

Also, skip useless branch when locks == 1: unconditional assignment is
cheaper anyway.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

5945c4dc

tdb: use tdb_nest_lock() for seqnum lock. · 564427be

Rusty Russell authored Feb 22, 2010

This is pure overhead, but it centralizes the locking. Realloc (esp. as
most implementations are lazy) is fast compared to the fnctl anyway.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

564427be

tdb: use tdb_nest_lock() for active lock. · a879e3e2

Rusty Russell authored Feb 22, 2010

Rather than a boutique lock and a separate nest count, use our
newly-generic nested lock tracking for the active lock.

Note that the tdb_have_extra_locks() and tdb_release_extra_locks()
functions have to skip over this lock now it is tracked.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

a879e3e2

tdb: use tdb_nest_lock() for open lock. · e9989ef0

Rusty Russell authored Feb 22, 2010

This never nests, so it's overkill, but it centralizes the locking into
lock.c and removes the ugly flag in the transaction code to track whether
we have the lock or not.

Note that we have a temporary hack so this places a real lock, despite
the fact that we are in a transaction.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

e9989ef0

tdb: use tdb_nest_lock() for transaction lock. · 99e576ac

Rusty Russell authored Feb 22, 2010

Rather than a boutique lock and a separate nest count, use our
newly-generic nested lock tracking for the transaction lock.

Note that the tdb_have_extra_locks() and tdb_release_extra_locks()
functions have to skip over this lock now it is tracked.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

99e576ac

tdb: cleanup: find_nestlock() helper. · bc1c29d5

Rusty Russell authored Feb 22, 2010

Factor out two loops which find locks; we are going to introduce a couple
more so a helper makes sense.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

bc1c29d5

tdb: cleanup: tdb_release_extra_locks() helper · c730c0d2

Rusty Russell authored Feb 22, 2010

Move locking intelligence back into lock.c.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

c730c0d2

tdb: cleanup: tdb_have_extra_locks() helper · d92b8a7d

Rusty Russell authored Feb 22, 2010

In many places we check whether locks are held: add a helper to do this.

The _tdb_lockall() case has already checked for the allrecord lock, so
the extra work done by tdb_have_extra_locks() is merely redundant.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

d92b8a7d

tdb: don't suppress the transaction lock because of the allrecord lock. · 57c618f3

Rusty Russell authored Feb 22, 2010

tdb_transaction_lock() and tdb_transaction_unlock() do nothing if we
hold the allrecord lock.  However, the two locks don't overlap, so
this is wrong.

This simplification makes the transaction lock a straight-forward nested
lock.

There are two callers for these functions:
1) The transaction code, which already makes sure the allrecord_lock
   isn't held.
2) The traverse code, which wants to stop transactions whether it has the
   allrecord lock or not.  There have been deadlocks here before, however
   this should not bring them back (I hope!)
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

57c618f3

tdb: cleanup: tdb_nest_lock/tdb_nest_unlock · acf1a21e

Rusty Russell authored Feb 22, 2010

Because fcntl locks don't nest, we track them in the tdb->lockrecs array
and only place/release them when the count goes to 1/0.  We only do this
for record locks, so we simply place the list number (or -1 for the free
list) in the structure.

To generalize this:

1) Put the offset rather than list number in struct tdb_lock_type.
2) Rename _tdb_lock() to tdb_nest_lock, make it non-static and move the
   allrecord check out to the callers (except the mark case which doesn't
   care).
3) Rename _tdb_unlock() to tdb_nest_unlock(), make it non-static and
   move the allrecord out to the callers (except mark again).
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

acf1a21e

tdb: cleanup: rename global_lock to allrecord_lock. · 1d03cc6d

Rusty Russell authored Feb 22, 2010

The word global is overloaded in tdb.  The global_lock inside struct
tdb_context is used to indicate we hold a lock across all the chains.

Rename it to allrecord_lock.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

1d03cc6d

tdb: cleanup: rename GLOBAL_LOCK to OPEN_LOCK. · 05f69a47

Rusty Russell authored Feb 22, 2010

The word global is overloaded in tdb.  The GLOBAL_LOCK offset is used at
open time to serialize initialization (and by the transaction code to block
open).

Rename it to OPEN_LOCK.
Signed-off-by: Rusty Russell <rusty@rustcorp.com.au>

05f69a47

tdb: commit 9b987e3923f7e8714acec6a2929435b1e6016645 from SAMBA · ec4174af

Rusty Russell authored Feb 22, 2010

Author: Volker Lendecke <vl@samba.org>
Date:   Tue Aug 12 22:31:52 2008 +0200

    Attempt to fix bug 5684
    
    With the ctdb checkin dde9f3f006 tdb optimized out write lock checks for
    write-enabled transaction. Sadly, this also removed the possibility to ever
    remove dead records left over from tdb_delete calls within a transaction.
    
    Tridge, please check this! Did dde9f3f006 have any reason beyond performance
    optimizations?
    
    Thanks,
    
    Volker
    (This used to be commit 3f884c4ae36f3260e63626bdd4989d9258ae6497)

ec4174af

tdb: port trivial differences across from SAMBA · d6b6780d
Rusty Russell authored Feb 22, 2010
```
Whitespace and C++-compatibility mainly.
```
d6b6780d

11 Feb, 2010 1 commit
- Fix Joey's report of rename failing across moint points. · 354c772e
  Rusty Russell authored Feb 11, 2010
  
  354c772e
04 Feb, 2010 4 commits
- tdb: use TDB_RECOVERY_INVALID_MAGIC rather than 0 · f7b3eb1e
  Rusty Russell authored Feb 04, 2010
  
  f7b3eb1e
- Add -k option to tdbtorture, run tdb_check at end. · 5ebc1701
  Rusty Russell authored Feb 04, 2010
```
This patch is made larger by moving the work out to a separate function: with -k, the parent can't run the ops itself but the children must do it.
```
  5ebc1701
- Don't use ~ in path, seems less reliable? · aae6674a
  Rusty Russell authored Feb 04, 2010
  
  aae6674a
- tdb: new test for dying during a transaction. · b86f8688
  Rusty Russell authored Feb 04, 2010
```
This demonstrates some serious failings, which get cleaned up in
following patches.
```
  b86f8688
02 Feb, 2010 5 commits

ccanlint: fix scoring, add score details to --verbose. · 39323f37
Rusty Russell authored Feb 02, 2010

39323f37
Use raw .o files rather than -lccan · 1f2fc897
Rusty Russell authored Feb 02, 2010

1f2fc897

Import 898b5edfe757cb145960b8f3631029bfd5592119 from ctdb: · 176844c8

Rusty Russell authored Feb 02, 2010

Author: Volker Lendecke <vl@samba.org>  2010-01-30 03:51:09

    tdb: fix an early release of the global lock that can cause data corruption
    
    There was a bug in tdb where the
    
                    tdb_brlock(tdb, GLOBAL_LOCK, F_UNLCK, F_SETLKW, 0, 1);
    
    (ending the transaction-"mutex") was done before the
    
                            /* remove the recovery marker */
    
    This means that when a transaction is committed there is a window where another
    opener of the file sees the transaction marker while the transaction committer
    is still fully functional and working on it. This led to transaction being
    rolled back by that second opener of the file while transaction_commit() gave
    no error to the caller.
    
    This patch moves the F_UNLCK to after the recovery marker was removed, closing
    this window.

176844c8

Fix bug where we used old clear_if_first when doing I/O in open. · 9d045ca0
Rusty Russell authored Feb 02, 2010
```
Add more logging.
```
9d045ca0
Change default behaviour: disallow nesting (safer). · c92d0d63
Rusty Russell authored Feb 02, 2010
```
Unlike the upstream, we don't have a stable ABI/API.
```
c92d0d63