• Naohiro Aota's avatar
    btrfs: zoned: mark block groups to copy for device-replace · 78ce9fc2
    Naohiro Aota authored
    This is the 1/4 patch to support device-replace on zoned filesystems.
    
    We have two types of IOs during the device replace process. One is an IO
    to "copy" (by the scrub functions) all the device extents from the source
    device to the destination device. The other one is an IO to "clone" (by
    handle_ops_on_dev_replace()) new incoming write IOs from users to the
    source device into the target device.
    
    Cloning incoming IOs can break the sequential write rule in on target
    device. When a write is mapped in the middle of a block group, the IO is
    directed to the middle of a target device zone, which breaks the
    sequential write requirement.
    
    However, the cloning function cannot be disabled since incoming IOs
    targeting already copied device extents must be cloned so that the IO is
    executed on the target device.
    
    We cannot use dev_replace->cursor_{left,right} to determine whether a bio
    is going to a not yet copied region. Since we have a time gap between
    finishing btrfs_scrub_dev() and rewriting the mapping tree in
    btrfs_dev_replace_finishing(), we can have a newly allocated device extent
    which is never cloned nor copied.
    
    So the point is to copy only already existing device extents. This patch
    introduces mark_block_group_to_copy() to mark existing block groups as a
    target of copying. Then, handle_ops_on_dev_replace() and dev-replace can
    check the flag to do their job.
    
    Also, btrfs_finish_block_group_to_copy() will check if the copied stripe
    is the last stripe in the block group. With the last stripe copied,
    the to_copy flag is finally disabled. Afterwards we can safely clone
    incoming IOs on this block group.
    Reviewed-by: default avatarJosef Bacik <josef@toxicpanda.com>
    Signed-off-by: default avatarNaohiro Aota <naohiro.aota@wdc.com>
    Signed-off-by: default avatarDavid Sterba <dsterba@suse.com>
    78ce9fc2
block-group.h 10.3 KB