• Tomer Tayar's avatar
    habanalabs/gaudi2: mark PCIE access error as fatal · 21fc7933
    Tomer Tayar authored
    F/W events are enabled in a late phase of the device init, so an event
    for a PCIE access error during the init, can be received after the init
    is already done and considered as successful.
    A resulting device reset, which does the same H/W init, can end
    similarly with this event right after the reset is done and considered
    as successful, and a loop of this sequence can continue.
    
    To avoid it mark the PCIE access error as a fatal event, so after 2
    consecutive events no more resets will be done.
    Signed-off-by: default avatarTomer Tayar <ttayar@habana.ai>
    Reviewed-by: default avatarOded Gabbay <ogabbay@kernel.org>
    Signed-off-by: default avatarOded Gabbay <ogabbay@kernel.org>
    21fc7933
gaudi2.c 327 KB