Linux kernel mirror (for testing) git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git
kernel os linux

drm/amdgpu: removed GFX RAS support check in UMC ECC callback

enable GPU recovery in event of uncorrectable UMC error

Signed-off-by: John Clements <john.clements@amd.com>
Reviewed-by: Hawking Zhang <Hawking.Zhang@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

authored by

John Clements and committed by
Alex Deucher
61130c74 097dc53e

+1 -7
+1 -7
drivers/gpu/drm/amd/amdgpu/amdgpu_umc.c
··· 95 95 { 96 96 struct ras_err_data *err_data = (struct ras_err_data *)ras_error_status; 97 97 98 - /* When “Full RAS” is enabled, the per-IP interrupt sources should 99 - * be disabled and the driver should only look for the aggregated 100 - * interrupt via sync flood 101 - */ 102 - if (amdgpu_ras_is_supported(adev, AMDGPU_RAS_BLOCK__GFX)) 103 - return AMDGPU_RAS_SUCCESS; 104 - 105 98 kgd2kfd_set_sram_ecc_flag(adev->kfd.dev); 106 99 if (adev->umc.funcs && 107 100 adev->umc.funcs->query_ras_error_count) ··· 106 113 err_data->err_addr = 107 114 kcalloc(adev->umc.max_ras_err_cnt_per_query, 108 115 sizeof(struct eeprom_table_record), GFP_KERNEL); 116 + 109 117 /* still call query_ras_error_address to clear error status 110 118 * even NOMEM error is encountered 111 119 */