Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git

drm/amdgpu: add uncorrectable error count print in UMC ecc irq cb

The uncorrectable error (UE) count is not printed when a UMC UE is
injected. By the time the GPU recovery work thread reaches the error
count log function, it can no longer read and print the count from the
last injection, because the error status register is cleared
automatically when it is read in the UMC ECC irq callback. So print
the count in the UMC ECC irq callback, consistent with the other RAS
error interrupt cases.

Signed-off-by: Guchun Chen <guchun.chen@amd.com>
Reviewed-by: Tao Zhou <tao.zhou1@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

Authored by Guchun Chen and committed by Alex Deucher
1f3ef0ef 7adf5619

 drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c | 3 +++

--- a/drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c
+++ b/drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c
@@ -120,6 +120,9 @@
 
 	/* only uncorrectable error needs gpu reset */
 	if (err_data->ue_count) {
+		dev_info(adev->dev, "%ld uncorrectable errors detected in UMC block\n",
+				err_data->ue_count);
+
 		if (err_data->err_addr_cnt &&
 		    amdgpu_ras_add_bad_pages(adev, err_data->err_addr,
 						err_data->err_addr_cnt))
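
The commit message's reasoning rests on the error status register being cleared when it is read. The userspace sketch below illustrates only that behavior. It is not driver code: `struct fake_umc_status` and the `fake_*` functions are hypothetical names invented for this example, and the register is modeled as a plain struct field rather than real hardware.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical model of a clear-on-read error status register. */
struct fake_umc_status {
	unsigned long ue_count;	/* uncorrectable errors latched by the "hardware" */
};

/* Return the latched count and reset it, mimicking a clear-on-read register. */
static unsigned long fake_read_ue_count(struct fake_umc_status *reg)
{
	unsigned long count;

	assert(reg != NULL);
	count = reg->ue_count;
	reg->ue_count = 0;	/* cleared as a side effect of the read */
	return count;
}

/* Stand-in for the ECC irq callback: the first reader, so it sees the count. */
static unsigned long fake_ecc_irq_cb(struct fake_umc_status *reg)
{
	unsigned long ue_count = fake_read_ue_count(reg);

	if (ue_count)
		printf("%lu uncorrectable errors detected in UMC block\n", ue_count);
	return ue_count;
}

/* Stand-in for a later query from recovery work: the register is already clear. */
static unsigned long fake_recovery_query(struct fake_umc_status *reg)
{
	return fake_read_ue_count(reg);
}
```

With a register initialized to two latched errors, `fake_ecc_irq_cb()` reports 2 and a subsequent `fake_recovery_query()` reports 0. Under this model, any log message has to be emitted by the first reader, which is the situation the patch addresses.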