From mboxrd@z Thu Jan 1 00:00:00 1970 From: Joerg Roedel Subject: Re: [PATCH v1 3/3] iommu/amd: Optimize the IOMMU queue flush Date: Mon, 26 Jun 2017 14:14:30 +0200 Message-ID: <20170626121430.GX30388@8bytes.org> References: <20170605195235.11512.52995.stgit@tlendack-t1.amdoffice.net> <1496954035.4188.1.camel@rutgers.edu> <1498062018.17007.6.camel@rutgers.edu> <1498079371.17007.18.camel@rutgers.edu> <20170622092053.GV30388@8bytes.org> <1498144389.17007.25.camel@rutgers.edu> <20170622215735.GW30388@8bytes.org> <1498227647.17007.31.camel@rutgers.edu> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Return-path: Content-Disposition: inline In-Reply-To: <1498227647.17007.31.camel-kgbqMDwikbSVc3sceRu5cw@public.gmane.org> List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: iommu-bounces-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org Errors-To: iommu-bounces-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org To: Jan Vesely Cc: Tom Lendacky , "Nath, Arindam" , "iommu-cunTk1MwBs9QetFLy7KEm3xJsTq8ys+cHZ5vskTnxNA@public.gmane.org" , Craig Stein List-Id: iommu@lists.linux-foundation.org On Fri, Jun 23, 2017 at 10:20:47AM -0400, Jan Vesely wrote: > I was able to trigger "Completion-Wait loop timed out" messages in the > following situation: > Hung OpenCL task running on dGPU. > dGPU goes to sleep. > sigterm to hung task. > it seems to recover OK after the dGPU is powered back on How does that 'dGPU goes to sleep' work? Do you put it to sleep manually via sysfs or something? Or is that something that amdgpu does on its own? It looks like the GPU just switches the ATS unit off when it goes to sleep and doesn't answer the invalidation anymore, which explains the completion-wait timeouts. Joerg