Re: Query: Multiple Mappings to Mem and ARMV6+

20 Feb 2012


      On Mon, Feb 20, 2012 at 09:48:56AM -0800, viresh kumar wrote:
...
On Feb 20, 2012 4:31 PM, "Catalin Marinas" catalin.marinas@arm.com wrote:
...
On 16 February 2012 18:14, viresh kumar viresh.linux@gmail.com wrote:
...
On Thu, Feb 16, 2012 at 9:48 AM, Catalin Marinas
catalin.marinas@arm.com wrote:
...
The DMA API implementation on ARM takes care of the cache cleaning and
invalidating.
I believe that this is the reason why we have cache re-invalidation
(we invalidated
it in dma_map_*() earlier) in dma_unmap_*() calls for ARMv6+ for
DMA_FROM_DEVICE.
Am i Correct?
Yes.
But why isn't keeping only the second one sufficient? Why don't we remove
it from dma_map_* routines?
Please don't think for a moment that anyone likes the idea of having to
walk over the cache twice for every DMA operation.  We don't.  But I can
assure you that it's very very necessary.
The first run through the affected cache lines on dma_map_*() is there
to get rid of any cache lines for the buffer which may be marked 'dirty'.
A dirty cache line can be evicted (written back) to memory at _any_ time.
If this occurs while the DMA controller is reading data from a device,
the results depend on the order which the particular cache line is written
by the DMA controller, and by the cache line eviction.
If the cache line eviction happens after the DMA controller has written,
the DMA'd data will be overwritten by old stale data.
So, we must get rid of these dirty cache lines.  We can do that either
by cleaning the cache lines or invalidating them.  We chose to invalidate
them because it makes very little difference.
For the case where the DMA controller needs to read from memory, we
obviously have to ensure that all data is pushed out of the cache for it
to be visible to the DMA controller.  For this case, merely cleaning the
cache is sufficient.
The second run through the affected cache lines is to prevent speculative
prefetches occuring while the DMA controller is on operation, which could
result in old data (before the DMA controller has written its data) being
read by the CPU, resulting again in old stale data.
I hope it's now clear why we need to run over the buffer twice.

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

Re: Query: Multiple Mappings to Mem and ARMV6+