From mboxrd@z Thu Jan 1 00:00:00 1970
From: jassi brar
Subject: Re: [PATCH 0/7] DMAENGINE: fixes and PrimeCells
Date: Sun, 9 May 2010 19:06:17 +0900
Message-ID: 
References: <1272848060-28049-1-git-send-email-linus.walleij@stericsson.com>
 <20100507093256.GB19936@n2100.arm.linux.org.uk>
Mime-Version: 1.0
Content-Type: text/plain; charset=UTF-8
In-Reply-To: 
Sender: linux-mmc-owner@vger.kernel.org
List-Id: linux-mmc@vger.kernel.org
To: Dan Williams
Cc: Linus Walleij, Russell King - ARM Linux, Ben Dooks,
 linux-mmc@vger.kernel.org, linux-kernel@vger.kernel.org,
 linux-arm-kernel@lists.infradead.org

On Sun, May 9, 2010 at 4:47 PM, Dan Williams wrote:
> On Sat, May 8, 2010 at 8:48 PM, jassi brar wrote:
>> On Sun, May 9, 2010 at 7:24 AM, Dan Williams wrote:
>>> On Fri, May 7, 2010 at 7:37 PM, jassi brar wrote:
>>>> IMHO, a DMA api should be as quick as possible - callbacks done
>>>> in IRQ context. But since there may be clients that need to do
>>>> sleepable stuff in
>>>
>>> None of the current clients sleep in the callback, it's done in
>>> soft-irq context. The only expectation is that hard-irqs are
>>> enabled during the callback, just like timer callbacks. I also
>>> would like to see numbers to quantify the claims of slowness.
>>
>> The clients evolve around the API, so they don't do what the API
>> doesn't allow. An API should impose as few constraints as
>> possible - you never know what kind of clients will arise.
>
> Running a callback in hard-irq context definitely puts constraints on
> the callback implementation to be as minimal as possible... and there
> is nothing stopping you from doing that today with the existing
> dmaengine interface: see idmac_interrupt.

We must plan for SoCs that have the same peripheral IPs but different
DMACs. In that case we have one client driver and two DMAC drivers
behind the same DMA API. So, if the DMA API doesn't pin down such
assumptions, we can't have one client driver that works on both SoCs.

>> Let's say a protocol requires a 'quick' ACK (within a few usecs) on
>> the control bus after xfer'ing a large packet on the data bus. All
>> the client needs is to be able to toggle some bit of the device
>> controller after the DMA is done, which can very well be done in
>> IRQ context but may be too late if the callback is done from a
>> tasklet scheduled from the DMAC ISR.
>> The point being, a DMA API should be able to do callbacks from IRQ
>> context too. That is, assuming the clients know what they do.
>
> You are confusing async_tx constraints and dmaengine. If your driver
> is providing the backend of an async_tx operation (currently only
> md-raid acceleration) then md-raid can assume that the callback is
> being performed in an irq-enabled non-sleepable context. If you are
> not providing an async_tx backend service then those constraints are
> lifted. I think I would like to make this explicit with a
> CONFIG_DMA_SUPPORTS_ASYNC_TX option to clearly mark the intended use
> model of the dma controller.

Again, the client shouldn't need to know about the backend DMAC
driver. If some client can't stand 'async' ops, there should be some
way for it to ask the DMA API and find out. IMHO, the DMA API should
see async_tx as a special 'relaxed' case; adding new flags to
differentiate only complicates the client drivers.
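To make the point concrete, here is roughly the kind of thing I have
in mind - just a sketch, none of these names exist in dmaengine today
(the 'quick'/'lazy' split is elaborated further below):

#include <linux/interrupt.h>	/* irqreturn_t, tasklet_schedule() */

/* Hypothetical, NOT in dmaengine: a pair of done callbacks */
struct dma_done_callbacks {
	/*
	 * 'quick': called straight from the DMAC ISR, hard-irq
	 * context. Must not sleep; meant for a few time-critical
	 * register pokes, e.g. the control-bus ACK above.
	 */
	void (*quick)(void *param);
	/*
	 * 'lazy': called from a tasklet scheduled by the ISR; does
	 * the bulk of the completion work. Most clients would set
	 * only this one.
	 */
	void (*lazy)(void *param);
	void *param;
};

/* DMAC driver side (struct my_dmac and my_dmac_pop_done() are
 * hypothetical driver internals): */
static irqreturn_t my_dmac_isr(int irq, void *data)
{
	struct my_dmac *dmac = data;
	struct dma_done_callbacks *cb = my_dmac_pop_done(dmac);

	if (cb->quick)
		cb->quick(cb->param);	/* time-critical part, now */
	if (cb->lazy)
		tasklet_schedule(&dmac->done_task); /* the rest, later */

	return IRQ_HANDLED;
}

A client that sets only 'lazy' gets exactly today's behaviour; one
that needs the usec-level ACK sets 'quick' and keeps it minimal.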
>> Also, I think it is possible to have an API that allows request
>> submission from callbacks, which would be a very useful feature.
>> Of course, assuming the clients know what they can/can't do (just
>> like with the current DMA API or any other API).
>
> It's a driver specific implementation detail if it supports
> submission from the callback. As a "general" rule clients should not
> assume that all drivers support this, but in the architecture
> specific case you know which driver you are talking to, so this
> should not be an issue.

Again, please look at the "same client driver for different SoCs with
different DMACs" situation. The DMA API needs to take a stand.

>>>> callbacks, the API may do two callbacks - 'quick' in irq context
>>>> and 'lazy' from tasklets scheduled from the IRQ. Most clients
>>>> will provide either, while some may provide both callback
>>>> functions.
>>>>
>>>> b) There seems to be no clear way of reporting failed transfers.
>>>> device_tx_status can return FAIL/SUCCESS, but the call is
>>>> open-ended and can be performed without any time bound after
>>>> tx_submit. It is not very optimal for DMAC drivers to save the
>>>> descriptors of all failed transactions until the channel is
>>>> released. IMHO, providing status checking by two mechanisms,
>>>> cookies and dma-done callbacks, is a complication more than a
>>>> feature. Perhaps the dma engine could provide a default callback,
>>>> should the client not provide one, and track done/pending xfers
>>>> for such requests?
>>>
>>> I agree the error handling was designed around mem-to-mem
>>> assumptions where failures are due to double-bit ECC errors and
>>> other rare events.
>>
>> Well, neither have I ever seen a DMA failure, but a good API
>> shouldn't count upon h/w perfection.
>
> It doesn't count on perfection, it treats failures the same way the
> cpu would react to an unhandled data abort, i.e. panic. I was
> thinking of a case like sata where you might see dma errors on a
> daily basis.

panic'ing is an extreme reaction, especially when the DMA API doesn't
provide any guarantee of time-bound operations - the client or the API
could simply retry after taking appropriate action.
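With today's cookie interface, such a retry would look something like
the sketch below. prepare_and_submit() is a hypothetical helper
standing in for re-running the driver's device_prep_*() + tx_submit()
+ dma_async_issue_pending() for the same buffers, and it assumes the
driver keeps reporting DMA_ERROR for the failed cookie:

#include <linux/dmaengine.h>
#include <linux/errno.h>

static int xfer_with_retry(struct dma_chan *chan, dma_addr_t dst,
			   dma_addr_t src, size_t len)
{
	dma_cookie_t cookie = prepare_and_submit(chan, dst, src, len);
	int retries = 3;

	for (;;) {
		enum dma_status st =
			dma_async_is_tx_complete(chan, cookie,
						 NULL, NULL);

		if (st == DMA_SUCCESS)
			return 0;
		if (st == DMA_ERROR) {
			if (!retries--)
				return -EIO;	/* give up, report it */
			/* take appropriate action, then retry */
			cookie = prepare_and_submit(chan, dst,
						    src, len);
		}
		cpu_relax();	/* still DMA_IN_PROGRESS; busy-polling
				 * only to keep the sketch short */
	}
}

A real client would sleep or use the done callback instead of
polling; the point is only that DMA_ERROR can be handled less
drastically than a panic.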
>>>> c) Conceptually, the channels are tightly coupled with the DMACs;
>>>> there seems to be no way to schedule a channel among more than
>>>> one DMAC at runtime, that is, if more than one DMAC supports the
>>>> same channel/peripheral.
>>>> For example, Samsung's S5Pxxxx have many channels available on
>>>> more than one DMAC, but with this dma api we have to statically
>>>> assign channels to DMACs, which may result in a channel acquire
>>>> request being rejected just because the DMAC we chose for it is
>>>> already fully busy while another DMAC, which also supports the
>>>> channel, is idling. Unless we treat the same peripheral as, say,
>>>> I2STX_viaDMAC1 and I2STX_viaDMAC2 and allocate double resources
>>>> for these "mutually exclusive" channels.
>>>
>>> I am not understanding this example. If both DMACs are registered,
>>> the dma_filter function to dma_request_channel() can select
>>> between them, right?
>>
>> Let me be precise. The I2S_Tx fifo (I2S peripheral/channel) can be
>> reached by two DMACs but, of course, the channel can only be active
>> on exactly one DMAC at a time.
>> So, it is desirable to be able to reach the peripheral via the
>> second DMAC should the first one be too busy to handle the request.
>> Clearly this is a runtime decision.
>> As things stand, I can associate the channel with either of the
>> DMACs, and if that DMAC can't handle the I2S_Tx request (say,
>> because all its h/w threads are allocated to other requests), I
>> can't play audio even though the other DMAC may be simply idling.
>
> Ah ok, you want load balancing between channels. In that case the
> 1:1 nature of dma_request_channel() is not the right interface. We
> would need to develop something like an architecture specific
> implementation of dma_find_channel() to allow dynamic channel
> allocation at runtime. But at that point we will have written
> something that is very architecture specific; how could we implement
> that in a generic api?

In the S3C DMA API driver for the PL330 DMAC, I add all unique
channels and DMACs to a system-wide channel and DMAC pool. Each DMAC
has a capability mask that tells which channels can be reached via
that DMAC. That way it becomes possible to do runtime mapping of
channels onto DMACs. With the current DMA API? I don't know of any
easy way, but it is surely possible for a generic API to support this.

> Basically if the driver does not want to present resources to
> generic clients, doesn't want to use any of the existing generic
> channel allocation mechanisms, and has narrow platform-specific
> needs then why code to/extend a generic api?

Sometimes there is a limit on the number of concurrent channels that
the DMAC can handle. For example, PL330 has only 8 h/w threads but 32
possible peripherals/channels, allowing at most 8 channels to be
active at any time. In such situations, the DMAC may refuse channel
allocation until some other client releases a channel, thereby
freeing up a h/w thread.

>>>> e) There seems to be no ScatterGather support for Mem to Mem
>>>> transfers.
>>>
>>> There has never been a use case, what did you have in mind? If
>>> multiple prep_memcpy commands are too inefficient we could always
>>> add another operation.
>>
>> Just that I believe any API should be as exhaustive and generic as
>> possible. I see it as possible for multimedia devices/drivers to
>> evolve to start needing such capabilities.
>> Also, the way the DMA API treats memcpy/memset and assumes SG reqs
>> to be equivalent to MEM<=>DEV requests is not very impressive.
>> IMHO, any submitted request should be a list of xfers, and an xfer
>> is a 'memset': 'src_len' bytes from 'src_addr' to be copied 'n'
>> times at 'dst_addr'. Memcpy is just a special case of memset, where
>> n := 1. This covers most possible use cases while being more
>> compact and future-proof.
>
> No, memset is an operation that does not have a source address and
> instead writes a pattern.

That sounds like assuming memset means writing a value multiple
times. Most DMACs come with optional SRC/DST increment bits to
control the copy operation; that makes memset blend nicely with
memcpy requests. Memset might not have a source address at the client
level, but on most DMACs the driver would have to allocate 4
dma-coherent bytes (if the memset unit is an int) and program what is
effectively a memcpy with src_inc := 0 and dst_inc := 1.
For DMACs that do support a direct memset operation, there would be a
limit on the size of the pattern, making them 'less' flexible than
those without direct memset support.
So, IMHO, memset is better seen as writing a data pattern multiple
times rather than writing a value multiple times, because across
clients and over time the unit size may vary. Let's not put a limit
on the unit size of the memset operation. What if I want to fill 1800
bytes of memory with a 9-byte pattern? 200 memcpy requests, OR
allocate 9 dma-coherent bytes and do 1 memset?

> As for the sg support for mem-to-mem operations... like most things
> in Linux it was designed around its users, and none of the users at
> the time (md-raid, net-dma) required scatter gather support.

Yes, but if a request is defined as an 'SG list of memset ops', we
pay no extra price for the Mem to Mem SG capability, and one doesn't
need to look at the potential users of the API. All DMAC drivers
would have had just one 'prepare' rather than three, as the sketch
below shows.
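To put the idea in code - a sketch only, nothing here is in
dmaengine, and the names are made up for illustration:

#include <linux/list.h>
#include <linux/types.h>

/*
 * Hypothetical unified xfer: everything is a 'memset' of a
 * src_len-byte pattern written n times at dst_addr.
 */
struct xfer {
	dma_addr_t	 src_addr;	/* pattern / source buffer */
	dma_addr_t	 dst_addr;
	size_t		 src_len;	/* pattern size in bytes */
	unsigned int	 n;		/* repeat count */
	struct list_head node;	/* a request = a list of xfers => SG */
};

/*
 * memcpy of 'len' bytes is then the special case n := 1:
 *	{ .src_addr = src, .dst_addr = dst, .src_len = len, .n = 1 }
 *
 * and filling 1800 bytes with a 9-byte pattern is ONE xfer instead
 * of 200 memcpy requests:
 *	{ .src_addr = pat, .dst_addr = dst, .src_len = 9, .n = 200 }
 */

A DMAC with src/dst increment control replays the pattern with
src_inc := 0 while incrementing the destination; chaining the nodes
gives Mem to Mem SG for free; and a single (hypothetical)
device_prep_xfer() could replace the three prepares drivers
implement today.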
> Without seeing code it's hard to make a judgment on what can and
> cannot fit in dmaengine, but it needs to be judged on what fits in a
> generic api and the feasibility of forcing mem-to-mem, device-to-mem
> and device-to-device dma into one api. I am skeptical we can address
> all those concerns, but we at least have something passably
> functional for the first two.

I admit I have little idea about a Dev->Dev implementation, but the
other two should be possible to implement in a generic way.
This discussion is purely about what the current DMA API misses and
what a generic DMA API should do, so that the current DMA API can
fill those gaps, if possible.
I would love to get started implementing the generic DMA API for
reference, but my priorities are decided by my employer.

> On the other hand, it's perfectly sane for subarchs like pxa to have
> their own dma api. If at the end of the day all that matters is
> $arch-specific-dma then why mess around with a generic api?

This is unlikely to hold for long. SoCs are more and more becoming a
cocktail of off-the-shelf third-party device IPs, where the device
IPs may be the same but the DMAC IP different.

Regards.