• Kent Overstreet's avatar
    bcachefs: Fix bch2_extents_match() false positive · d2693569
    Kent Overstreet authored
    This was caught as a very rare nonce inconsistency, on systems with
    encryption and replication (and tiering, or some form of rebalance
    operation running):
    
    [Wed Jul 17 13:30:03 2024] about to insert invalid key in data update path
    [Wed Jul 17 13:30:03 2024] old: u64s 10 type extent 671283510:6392:U32_MAX len 16 ver 106595503: durability: 2 crc: c_size 8 size 16 offset 0 nonce 0 csum chacha20_poly1305_80 compress zstd ptr: 3:355968:104 gen 7 ptr: 4:513244:48 gen 6 rebalance: target hdd compression zstd
    [Wed Jul 17 13:30:03 2024] k:   u64s 10 type extent 671283510:6400:U32_MAX len 16 ver 106595508: durability: 2 crc: c_size 8 size 16 offset 0 nonce 0 csum chacha20_poly1305_80 compress zstd ptr: 3:355968:112 gen 7 ptr: 4:513244:56 gen 6 rebalance: target hdd compression zstd
    [Wed Jul 17 13:30:03 2024] new: u64s 14 type extent 671283510:6392:U32_MAX len 8 ver 106595508: durability: 2 crc: c_size 8 size 16 offset 0 nonce 0 csum chacha20_poly1305_80 compress zstd ptr: 3:355968:112 gen 7 cached ptr: 4:513244:56 gen 6 cached rebalance: target hdd compression zstd crc: c_size 8 size 16 offset 8 nonce 0 csum chacha20_poly1305_80 compress zstd ptr: 1:10860085:32 gen 0 ptr: 0:17285918:408 gen 0
    [Wed Jul 17 13:30:03 2024] bcachefs (cca5bc65-fe77-409d-a9fa-465a6e7f4eae): fatal error - emergency read only
    
    bch2_extents_match() was reporting true for extents that did not
    actually point to the same data.
    
    bch2_extent_match() iterates over pairs of pointers, looking for
    pointers that point to the same location on disk (with matching
    generation numbers). However one or both extents may have been trimmed
    (or merged) and they might not have the same disk offset: it corrects
    for this by subtracting the key offset and the checksum entry offset.
    
    However, this failed when an extent was immediately partially
    overwritten, and the new overwrite was allocated the next adjacent disk
    space.
    
    Normally, with compression off, this would never cause a bug, since the
    new extent would have to be immediately after the old extent for the
    pointer offsets to match, and the rebalance index update path is not
    looking for an extent outside the range of the extent it moved.
    
    However with compression enabled, extents take up less space on disk
    than they do in the btree index space - and spuriously matching after
    partial overwrite is possible.
    
    To fix this, add a secondary check, that strictly checks that the
    regions pointed to on disk overlap.
    
    https://github.com/koverstreet/bcachefs/issues/717Signed-off-by: default avatarKent Overstreet <kent.overstreet@linux.dev>
    d2693569
extents.c 42.1 KB