From mboxrd@z Thu Jan  1 00:00:00 1970
From: =?ISO-8859-1?Q?Christian_K=F6nig?= <deathsimple@vodafone.de>
Subject: Re: [PATCH 14/20] drm/radeon: multiple ring allocator v2
Date: Mon, 07 May 2012 22:38:48 +0200
Message-ID: <4FA832D8.3090205@vodafone.de>
References: <1336390975-30945-1-git-send-email-deathsimple@vodafone.de>
	<1336390975-30945-15-git-send-email-deathsimple@vodafone.de>
	<CAH3drwbQJnaEyFV09QKifhrgAtVt+E2QAYUQPTNFf3b7vWeYOg@mail.gmail.com>
	<4FA7FC26.7040504@vodafone.de>
	<CAH3drwb=qkpeLrSL9dejg0WEt_bCk98TkR0vtggpn3_maN6gZg@mail.gmail.com>
	<CAH3drwaSnSXdzm_q0-PkK5J3BBeHAzrEURBFiUpyMWC=ZBTrQg@mail.gmail.com>
	<CAH3drwZSYiZhquE2weS+cEu8td_WqPbzTrW7dtO-=e4hwctcMA@mail.gmail.com>
Mime-Version: 1.0
Content-Type: text/plain; charset="iso-8859-1"; Format="flowed"
Content-Transfer-Encoding: quoted-printable
Return-path: <dri-devel-bounces+sf-dri-devel=m.gmane.org@lists.freedesktop.org>
Received: from outgoing.email.vodafone.de (outgoing.email.vodafone.de
	[139.7.28.128])
	by gabe.freedesktop.org (Postfix) with ESMTP id A49999E936
	for <dri-devel@lists.freedesktop.org>;
	Mon,  7 May 2012 13:38:52 -0700 (PDT)
In-Reply-To: <CAH3drwZSYiZhquE2weS+cEu8td_WqPbzTrW7dtO-=e4hwctcMA@mail.gmail.com>
List-Unsubscribe: <http://lists.freedesktop.org/mailman/options/dri-devel>,
	<mailto:dri-devel-request@lists.freedesktop.org?subject=unsubscribe>
List-Archive: <http://lists.freedesktop.org/archives/dri-devel>
List-Post: <mailto:dri-devel@lists.freedesktop.org>
List-Help: <mailto:dri-devel-request@lists.freedesktop.org?subject=help>
List-Subscribe: <http://lists.freedesktop.org/mailman/listinfo/dri-devel>,
	<mailto:dri-devel-request@lists.freedesktop.org?subject=subscribe>
Sender: dri-devel-bounces+sf-dri-devel=m.gmane.org@lists.freedesktop.org
Errors-To: dri-devel-bounces+sf-dri-devel=m.gmane.org@lists.freedesktop.org
To: Jerome Glisse <j.glisse@gmail.com>
Cc: dri-devel@lists.freedesktop.org
List-Id: dri-devel@lists.freedesktop.org

On 07.05.2012 20:52, Jerome Glisse wrote:
> On Mon, May 7, 2012 at 1:59 PM, Jerome Glisse<j.glisse@gmail.com>  wrote:
>>> On 07.05.2012 17:23, Jerome Glisse wrote:
>>>> On Mon, May 7, 2012 at 7:42 AM, Christian K=F6nig<deathsimple@vodafone=
.de>
>>>>   wrote:
>>>>> A startover with a new idea for a multiple ring allocator.
>>>>> Should perform as well as a normal ring allocator as long
>>>>> as only one ring does somthing, but falls back to a more
>>>>> complex algorithm if more complex things start to happen.
>>>>>
>>>>> We store the last allocated bo in last, we always try to allocate
>>>>> after the last allocated bo. Principle is that in a linear GPU ring
>>>>> progression was is after last is the oldest bo we allocated and thus
>>>>> the first one that should no longer be in use by the GPU.
>>>>>
>>>>> If it's not the case we skip over the bo after last to the closest
>>>>> done bo if such one exist. If none exist and we are not asked to
>>>>> block we report failure to allocate.
>>>>>
>>>>> If we are asked to block we wait on all the oldest fence of all
>>>>> rings. We just wait for any of those fence to complete.
>>>>>
>>>>> v2: We need to be able to let hole point to the list_head, otherwise
>>>>>     try free will never free the first allocation of the list. Also
>>>>>     stop calling radeon_fence_signalled more than necessary.
>>>>>
>>>>> Signed-off-by: Christian K=F6nig<deathsimple@vodafone.de>
>>>>> Signed-off-by: Jerome Glisse<jglisse@redhat.com>
>>>> This one is NAK please use my patch. Yes in my patch we never try to
>>>> free anything if there is only on sa_bo in the list if you really care
>>>> about this it's a one line change:
>>>>
>>>> http://people.freedesktop.org/~glisse/reset5/0001-drm-radeon-multiple-=
ring-allocator-v2.patch
>>> Nope that won't work correctly, "last" is pointing to the last allocati=
on
>>> and that's the most unlikely to be freed at this time. Also in this ver=
sion
>>> (like in the one before) radeon_sa_bo_next_hole lets hole point to the
>>> "prev" of the found sa_bo without checking if this isn't the lists head.
>>> That might cause a crash if an to be freed allocation is the first one =
in
>>> the buffer.
>>>
>>> What radeon_sa_bo_try_free would need to do to get your approach workin=
g is
>>> to loop over the end of the buffer and also try to free at the beginnin=
g,
>>> but saying that keeping the last allocation results in a whole bunch of
>>> extra cases and "if"s, while just keeping a pointer to the "hole" (e.g.
>>> where the next allocation is most likely to succeed) simplifies the code
>>> quite a bit (but I agree that on the down side it makes it harder to
>>> understand).
>>>
>>>> Your patch here can enter in infinite loop and never return holding
>>>> the lock. See below.
>>>>
>>>> [SNIP]
>>>>
>>>>> +               } while (radeon_sa_bo_next_hole(sa_manager, fences));
>>>> Here you can infinite loop, in the case there is a bunch of hole in
>>>> the allocator but none of them allow to full fill the allocation.
>>>> radeon_sa_bo_next_hole will keep returning true looping over and over
>>>> on all the all. That's why i only restrict my patch to 2 hole skeeping
>>>> and then fails the allocation or try to wait. I believe sadly we need
>>>> an heuristic and 2 hole skeeping at most sounded like a good one.
>>> Nope, that can't be an infinite loop, cause radeon_sa_bo_next_hole in
>>> conjunction with radeon_sa_bo_try_free are eating up the opportunities =
for
>>> holes.
>>>
>>> Look again, it probably will never loop more than RADEON_NUM_RINGS + 1,=
 with
>>> the exception for allocating in a complete scattered buffer, and even t=
hen
>>> it will never loop more often than halve the number of current allocati=
ons
>>> (and that is really really unlikely).
>>>
>>> Cheers,
>>> Christian.
>> I looked again and yes it can loop infinitly, think of hole you can
>> never free ie radeon_sa_bo_try_free can't free anything. This
>> situation can happen if you have several thread allocating sa bo at
>> the same time while none of them are yet done with there sa_bo (ie
>> none have call sa_bo_free yet). I updated a v3 that track oldest and
>> fix all things you were pointing out above.
No that isn't a problem, radeon_sa_bo_next_hole takes the firsts entries =

of the flist, so it only considers holes that have a signaled fence and =

so can be freed.

Having multiple threads allocate objects that can't be freed yet will =

just result in empty flists, and so radeon_sa_bo_next_hole will return =

false, resulting in calling radeon_fence_wait_any with an empty fence =

list, which in turn will result in an ENOENT and abortion of allocation =

(ok maybe we should catch that and return -ENOMEM instead).

So even the corner cases should now be handled fine.

>>
>> http://people.freedesktop.org/~glisse/reset5/0001-drm-radeon-multiple-ri=
ng-allocator-v3.patch
>>
>> Cheers,
>> Jerome
> Of course by tracking oldest it defeat the algo so updated patch :
>   http://people.freedesktop.org/~glisse/reset5/0001-drm-radeon-multiple-r=
ing-allocator-v3.patch
>
> Just fix the corner case of list of single entry.
That still won't work correctly, cause the corner case isn't that there =

is just one allocation left on the list, the corner case is that we need =

to be able to allocate something before the first sa_bo, just consider =

the following with your current implementation:

B F F F F 1 2 3 4 F E

B is the beginning of the buffer.
F is free space.
1,2,3,4 are allocations.
E is the end of the buffer.

So lets say that we have an allocation that won't fit in the free space =

between "4" and "E", now even if if radeon_sa_next_hole sets "last" to =

1, we aren't able to allocate anything at the beginning of the buffer...

Christian.