mirror_ubuntu-kernels/drivers/gpu/drm/amd
Philip Yang 373e3ccd85 drm/amdkfd: handle stale retry fault
Retry fault interrupt maybe pending in IH ring after GPU page table
is updated to recover the vm fault, because each page of the range
generate retry fault interrupt. There is race if application unmap
range to remove and free the range first and then retry fault work
restore_pages handle the retry fault interrupt, because range can not be
found, this vm fault can not be recovered and report incorrect GPU vm
fault to application.

Before unmap to remove and free range, drain retry fault interrupt
from IH ring1 to ensure no retry fault comes after the range is removed.

Drain retry fault interrupt skip the range which is on deferred list
to remove, or the range is child range, which is split by unmap, does
not add to svms and have interval notifier.

Signed-off-by: Philip Yang <Philip.Yang@amd.com>
Reviewed-by: Felix Kuehling <Felix.Kuehling@amd.com>
Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
2021-04-28 23:36:05 -04:00
..
acp drm: amd/acp: fix broken menu structure 2020-03-13 11:52:35 -04:00
amdgpu drm/amdgpu: return IH ring drain finished if ring is empty 2021-04-28 23:36:05 -04:00
amdkfd drm/amdkfd: handle stale retry fault 2021-04-28 23:36:05 -04:00
display drm/amd/display: 3.2.133 2021-04-28 23:35:51 -04:00
include drm/amdgpu: only harvest gcea/mmea error status in arcturus 2021-04-20 21:35:45 -04:00
pm amdgpu/pm: set pp_dpm_dcefclk to readonly on NAVI10 and newer gpus 2021-04-28 23:35:49 -04:00