• Hawking Zhang's avatar
    drm/amdgpu: allow query error counters for specific IP block · 4a1c9a44
    Hawking Zhang authored
    amdgpu_ras_block_late_init will be invoked in IP
    specific ras_late_init call as a common helper for
    all the IP blocks.
    
    However, when amdgpu_ras_block_late_init call
    amdgpu_ras_query_error_count to query ras error
    counters, amdgpu_ras_query_error_count queries
    all the IP blocks that support ras query interface.
    
    This results to wrong error counters cached in
    software copies when there are ras errors detected
    at time zero or warm reset procedure. i.e., in
    sdma_ras_late_init phase, it counts on sdma/mmhub
    errors, while, in mmhub_ras_late_init phase, it
    still counts on sdma/mmhub errors.
    
    The change updates amdgpu_ras_query_error_count
    interface to allow query specific ip error counter.
    It introduces a new input parameter: query_info. if
    query_info is NULL,  it means query all the IP blocks,
    otherwise, only query the ip block specified by
    query_info.
    Signed-off-by: default avatarHawking Zhang <Hawking.Zhang@amd.com>
    Reviewed-by: default avatarTao Zhou <tao.zhou1@amd.com>
    Signed-off-by: default avatarAlex Deucher <alexander.deucher@amd.com>
    4a1c9a44
amdgpu_ras.h 20.8 KB