• Keith Busch's avatar
    PCI/ERR: Run error recovery callbacks for all affected devices · 9de276a8
    Keith Busch authored
    [ Upstream commit bfcb79fc ]
    
    If an Endpoint reported an error with ERR_FATAL, we previously ran driver
    error recovery callbacks only for the Endpoint's driver.  But if we reset a
    Link to recover from the error, all downstream components are affected,
    including the Endpoint, any multi-function peers, and children of those
    peers.
    
    Initiate the Link reset from the deepest Downstream Port that is
    reliable, and call the error recovery callbacks for all its children.
    
    If a Downstream Port (including a Root Port) reports an error, we assume
    the Port itself is reliable and we need to reset its downstream Link.  In
    all other cases (Switch Upstream Ports, Endpoints, Bridges, etc), we assume
    the Link leading to the component needs to be reset, so we initiate the
    reset at the parent Downstream Port.
    
    This allows two other clean-ups.  First, we currently only use a Link
    reset, which can only be initiated using a Downstream Port, so we can
    remove checks for Endpoints.  Second, the Downstream Port where we initiate
    the Link reset is reliable (unlike components downstream from it), so the
    special cases for error detect and resume are no longer necessary.
    Signed-off-by: default avatarKeith Busch <keith.busch@intel.com>
    [bhelgaas: changelog]
    Signed-off-by: default avatarBjorn Helgaas <bhelgaas@google.com>
    Reviewed-by: default avatarSinan Kaya <okaya@kernel.org>
    Signed-off-by: default avatarSasha Levin <sashal@kernel.org>
    9de276a8
err.c 9.18 KB