About buffer sychronization mechanism and cache operation

dri-devel.lists.freedesktop.org archive mirror
 help / color / mirror / Atom feed

From: Inki Dae <inki.dae@samsung.com>
To: linux-fbdev@vger.kernel.org, dri-devel@lists.freedesktop.org,
	'Linux ARM Kernel' <linux-arm-kernel@lists.infradead.org>,
	linux-media@vger.kernel.org, linaro-kernel@lists.linaro.org
Subject: About buffer sychronization mechanism and cache operation
Date: Mon, 12 Aug 2013 18:55:25 +0900	[thread overview]
Message-ID: <000001ce9742$11b5dcd0$35219670$%dae@samsung.com> (raw)
In-Reply-To: 

Hello all,

The purpose of this email is to get other opinions and advices to buffer synchronization mechanism, and coupling cache operation feature with the buffer synchronization mechanism. First of all, I am not a native English speaker so I'm not sure that I can convey my intention to you. And I'm not a specialist in Linux than other people so also there might be my missing points. 

I had posted the buffer synchronization mechanism called dmabuf sync framework like below,
http://lists.infradead.org/pipermail/linux-arm-kernel/2013-June/177045.html

And I'm sending this email before posting next version with more stable patch set and features. The purpose of this framework is to provide not only buffer access control to CPU and DMA but also easy-to-use interfaces for device drivers and user application. This framework can be used for all DMA devices using system memory as DMA buffer, especially for most ARM based SoCs.

There are two cases we are using this buffer synchronization framework. One is to primarily enhance GPU rendering performance on Tizen platform in case of 3d app with compositing mode that the 3d app draws something in off-screen buffer.

And other is to couple buffer access control and cache operation between CPU and DMA; the cache operation is done by the dmabuf sync framework in kernel side.

Why do we need buffer access control between CPU and DMA?
---------------------------------------------------------

The below shows simple 3D software layers,

                    3D Application
      -----------------------------------------
        Standard OpenGL ES and EGL Interfaces  ------- [A]
      -----------------------------------------
      MALI OpenGL ES and EGL Native modules --------- [B]
      -----------------------------------------
         Exynos DRM Driver    |    GPU Driver ---------- [C]

3d application requests 3d rendering through A. And then B fills a 3d command buffer with the requests from A. And then 3D application calls glFlush so that the 3d command buffer can be submitted to C, and rendered by GPU hardware. Internally, glFlush(also glFinish) will call a function of native module[B] glFinish blocks caller's task until all GL execution is complete. On the other hand, glFlush forces execution of GL commands but doesn't block the caller's task until the completion.

In composting mode, in case of using glFinish, the 3d rendering performance is quite lower than using glFlush because glFinish makes CPU blocked until the execution of the 3d commands is completed. However, the use of glFlush has one issue that the shared buffer with GPU could be broken when CPU accesses the shared buffer at once after glFlush because CPU cannot be aware of the completion of GPU access to the shared buffer: actually, Tizen platform internally calls only eglSwapBuffer instead of glFlush and glFinish, and whether flushing or finishing is decided according to compositing mode or not. So in such case, we would need buffer access control between CPU and DMA for more performance.

About cache operation
---------------------

The dmabuf sync framework can include cache operation feature, and the below shows how the cache operation based on dmabuf sync framework is performed,
   device driver in kernel or fctrl in user land
          dmabuf_sync_lock or dmabuf_sync_single_lock
               check before and after buffer access
                  dma_buf_begin_cpu_access or dma_buf_end_cpu_access
                         begin_cpu_access or end_cpu_access of exporter
                                dma_sync_sg_for_device or dma_sync_sg_for_cpu

In case that using dmabuf sync framework, kernel can be aware of when CPU and DMA access to a shared buffer is completed so we can do cache operation in kernel so that way, we can couple buffer access control and cache operation. So with this, we can avoid that user land overuses cache operation.

I guess most Linux based platforms are using cachable mapped buffer for more performance: in case that CPU frequently accesses the shared buffer which is shared with DMA, the use of cachable mapped buffer is more fast than the use of non-cachable. However, this way could make cache operation overused by user land because only user land can be aware of the completion of CPU or DMA access to the shared buffer so user land could request cache operations every time it wants even the cache operation is unnecessary. That is how user land could overuse cache operations.

To Android, Chrome OS, and other forks,

Are there other cases that buffer access control between CPU and DMA is needed? I know Android sync driver and KDS are already being used for Android, Chrome OS, and so on.
How does your platform do cache operation? And How do you think about coupling buffer access control and cache operation between CPU and DMA?.

Lastly, I think we may need Linux generic buffer synchronization mechanism that uses only Linux standard interfaces (dmabuf) including user land interfaces (fcntl and select system calls), and the dmabuf sync framework could meet it.

Thanks,
Inki Dae

next prev parent reply	other threads:[~2013-08-12  9:55 UTC|newest]

Thread overview: 50+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2013-06-13  8:28 [RFC PATCH] dmabuf-sync: Introduce buffer synchronization framework Inki Dae
2013-06-13 11:25 ` Inki Dae
2013-06-13 17:26 ` Russell King - ARM Linux
2013-06-14  2:32   ` Inki Dae
2013-06-17 11:15 ` [RFC PATCH v2] " Inki Dae
2013-06-17 11:34   ` Maarten Lankhorst
2013-06-17 13:04     ` Inki Dae
2013-06-17 13:31       ` Russell King - ARM Linux
2013-06-17 15:03         ` Inki Dae
2013-06-17 15:42           ` Russell King - ARM Linux
2013-06-17 16:01             ` Russell King - ARM Linux
2013-06-17 17:19             ` Inki Dae
2013-06-17 18:21               ` Russell King - ARM Linux
2013-06-18  5:27                 ` Inki Dae
2013-06-18  8:43                   ` Russell King - ARM Linux
2013-06-18  9:04                     ` Inki Dae
2013-06-18  9:38                       ` Russell King - ARM Linux
2013-06-18  9:47                       ` Lucas Stach
2013-06-19  5:45                         ` Inki Dae
2013-06-19 10:22                           ` Lucas Stach
2013-06-19 10:44                             ` Inki Dae
2013-06-19 12:34                               ` Lucas Stach
2013-06-19 15:10                                 ` Inki Dae
2013-06-19 18:29                                   ` Russell King - ARM Linux
2013-06-20  6:43                                     ` Inki Dae
2013-06-20  7:47                                       ` Lucas Stach
2013-06-20  8:17                                         ` Russell King - ARM Linux
2013-06-20  8:26                                           ` Lucas Stach
2013-06-20  8:24                                         ` Inki Dae
2013-06-20 10:11                                           ` Lucas Stach
2013-06-20 11:15                                             ` Inki Dae
2013-06-21  8:54                                               ` Lucas Stach
2013-06-21 11:01                                                 ` Inki Dae
2013-06-21 12:27                                                   ` Lucas Stach
2013-06-21 16:55                                                     ` Inki Dae
2013-06-21 19:02                                                       ` Jerome Glisse
2013-06-22  1:36                                                         ` Inki Dae
2013-06-25  9:09                                                         ` [RFC PATCH] " Inki Dae
2013-06-25 11:32                                                           ` Rob Clark
2013-06-25 14:17                                                             ` Inki Dae
2013-06-25 14:49                                                               ` Jerome Glisse
2013-06-26 16:06                                                                 ` Inki Dae
2013-08-12  9:55                                             ` Inki Dae [this message]
2013-06-18  7:00             ` [RFC PATCH v2] " Daniel Vetter
2013-06-18 10:46               ` Russell King - ARM Linux
2013-06-25  9:23                 ` Daniel Vetter
2013-06-26 17:18                   ` Russell King - ARM Linux
2013-06-17 13:31       ` Maarten Lankhorst
2013-06-17 15:20         ` Inki Dae
2013-06-19  9:10   ` [RFC PATCH v3] " Inki Dae

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='000001ce9742$11b5dcd0$35219670$%dae@samsung.com' \
    --to=inki.dae@samsung.com \
    --cc=dri-devel@lists.freedesktop.org \
    --cc=linaro-kernel@lists.linaro.org \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=linux-fbdev@vger.kernel.org \
    --cc=linux-media@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link

Be sure your reply has a Subject: header at the top and a blank line before the message body.

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).