On Wed, Jul 7, 2021 at 10:06 AM Christian König ckoenig.leichtzumerken@gmail.com wrote:
Am 06.07.21 um 12:12 schrieb Daniel Vetter:
Specifically document the new/clarified rules around how the shared fences do not have any ordering requirements against the exclusive fence.
But also document all the things a bit better, given how central struct dma_resv to dynamic buffer management the docs have been very inadequat.
Lots more links to other pieces of the puzzle. Unfortunately ttm_buffer_object has no docs, so no links :-(
Explain/complain a bit about dma_resv_locking_ctx(). I still don't like that one, but fixing the ttm call chains is going to be horrible. Plus we want to plug in real slowpath locking when we do that anyway.
Main part of the patch is some actual docs for struct dma_resv.
Overall I think we still have a lot of bad naming in this area (e.g. dma_resv.fence is singular, but contains the multiple shared fences), but I think that's more indicative of how the semantics and rules are just not great.
Another thing that's real awkard is how chaining exclusive fences right now means direct dma_resv.exclusive_fence pointer access with an rcu_assign_pointer. Not so great either.
Signed-off-by: Daniel Vetter daniel.vetter@intel.com Cc: Sumit Semwal sumit.semwal@linaro.org Cc: "Christian König" christian.koenig@amd.com Cc: linux-media@vger.kernel.org Cc: linaro-mm-sig@lists.linaro.org
drivers/dma-buf/dma-resv.c | 22 ++++++-- include/linux/dma-resv.h | 104 +++++++++++++++++++++++++++++++++++-- 2 files changed, 116 insertions(+), 10 deletions(-)
diff --git a/drivers/dma-buf/dma-resv.c b/drivers/dma-buf/dma-resv.c index f26c71747d43..898f8d894bbd 100644 --- a/drivers/dma-buf/dma-resv.c +++ b/drivers/dma-buf/dma-resv.c @@ -48,6 +48,8 @@
- write operations) or N shared fences (read operations). The RCU
- mechanism is used to protect read access to fences from locked
- write-side updates.
- See struct dma_resv for more details.
*/
DEFINE_WD_CLASS(reservation_ww_class);
@@ -137,7 +139,11 @@ EXPORT_SYMBOL(dma_resv_fini);
- @num_fences: number of fences we want to add
- Should be called before dma_resv_add_shared_fence(). Must
- be called with obj->lock held.
- be called with @obj locked through dma_resv_lock().
- Note that the preallocated slots need to be re-reserved if @obj is unlocked
- at any time before callind dma_resv_add_shared_fence(). This is validate when
- CONFIG_DEBUG_MUTEXES is enabled.
- RETURNS
- Zero for success, or -errno
@@ -234,8 +240,10 @@ EXPORT_SYMBOL(dma_resv_reset_shared_max);
- @obj: the reservation object
- @fence: the shared fence to add
- Add a fence to a shared slot, obj->lock must be held, and
- Add a fence to a shared slot, @obj must be locked with dma_resv_lock(), and
- dma_resv_reserve_shared() has been called.
*/ void dma_resv_add_shared_fence(struct dma_resv *obj, struct dma_fence *fence) {
- See also &dma_resv.fence for a discussion of the semantics.
@@ -280,7 +288,9 @@ EXPORT_SYMBOL(dma_resv_add_shared_fence);
- @obj: the reservation object
- @fence: the shared fence to add
- Add a fence to the exclusive slot. The obj->lock must be held.
- Add a fence to the exclusive slot. @obj must be locked with dma_resv_lock().
- Note that this function replaces all fences attached to @obj, see also
*/ void dma_resv_add_excl_fence(struct dma_resv *obj, struct dma_fence *fence) {
- &dma_resv.fence_excl for a discussion of the semantics.
@@ -609,9 +619,11 @@ static inline int dma_resv_test_signaled_single(struct dma_fence *passed_fence)
- fence
- Callers are not required to hold specific locks, but maybe hold
- dma_resv_lock() already
- dma_resv_lock() already.
- RETURNS
- true if all fences signaled, else false
*/ bool dma_resv_test_signaled(struct dma_resv *obj, bool test_all) {
- True if all fences signaled, else false.
diff --git a/include/linux/dma-resv.h b/include/linux/dma-resv.h index e1ca2080a1ff..c77fd54d033f 100644 --- a/include/linux/dma-resv.h +++ b/include/linux/dma-resv.h @@ -62,16 +62,90 @@ struct dma_resv_list {
/**
- struct dma_resv - a reservation object manages fences for a buffer
- @lock: update side lock
- @seq: sequence count for managing RCU read-side synchronization
- @fence_excl: the exclusive fence, if there is one currently
- @fence: list of current shared fences
- There are multiple uses for this, with sometimes slightly different rules in
- how the fence slots are used.
- One use is to synchronize cross-driver access to a struct dma_buf, either for
- dynamic buffer management or just to handle implicit synchronization between
- different users of the buffer in userspace. See &dma_buf.resv for a more
- in-depth discussion.
- The other major use is to manage access and locking within a driver in a
- buffer based memory manager. struct ttm_buffer_object is the canonical
- example here, since this is were reservation objects originated from. But use
- in drivers is spreading and some drivers also manage struct
- drm_gem_object with the same scheme.
I would still make that even harder, e.g. mentioning that you run into use after free and the resulting memory corruption if you don't obey the rules.
Hm I think that's best documented in dma_buf.resv kerneldoc, pointing at the rules here and explaining what can go wrong on the other driver if we just overwrite fences and breakt the DAG. I'll add something there. -Daniel
Apart from that with the spelling stuff pointed out by others fixed the patch is Reviewed-by: Christian König christian.koenig@amd.com
Regards, Christian.
*/ struct dma_resv {
/**
* @lock:
*
* Update side lock. Don't use directly, instead use the wrapper
* functions like dma_resv_lock() and dma_resv_unlock().
*
* Drivers which use the reservation object to manage memory dynamically
* also use this lock to protect buffer object state like placement,
* allocation policies or throughout command submission.
*/ struct ww_mutex lock;
/**
* @seq:
*
* Sequence count for managing RCU read-side synchronization, allows
* read-only access to @fence_excl and @fence while ensuring we take a
* consistent snapshot.
*/ seqcount_ww_mutex_t seq;
/**
* @fence_excl:
*
* The exclusive fence, if there is one currently.
*
* There are two was to update this fence:
*
* - First by calling dma_resv_add_excl_fence(), which replaces all
* fences attached to the reservation object. To guarantee that no
* fences are lost this new fence must signal only after all previous
* fences, both shared and exclusive, have signalled. In some cases it
* is convenient to achieve that by attaching a struct dma_fence_array
* with all the new and old fences.
*
* - Alternatively the fence can be set directly, which leaves the
* shared fences unchanged. To guarantee that no fences are lost this
* new fence must signale only after the previous exclusive fence has
* singalled. Since the shared fences are staying intact, it is not
* necessary to maintain any ordering against those. If semantically
* only a new access is added without actually treating the previous
* one as a dependency the exclusive fences can be strung together
* using struct dma_fence_chain.
*
* Note that actual semantics of what an exclusive or shared fence mean
* is defined by the user, for reservation objects shared across drivers
* see &dma_buf.resv.
*/ struct dma_fence __rcu *fence_excl;
/**
* @fence:
*
* List of current shared fences.
*
* There are no ordering constraints of shared fences against the
* exclusive fence slot. If a waiter needs to wait for all access, it
* has to wait for both set of fences to signal.
*
* A new fence is added by calling dma_resv_add_shared_fence(). Since
* this often needs to be done past the point of no return in command
* submission it cannot fail, and therefor sufficient slots need to be
* reserved by calling dma_resv_reserve_shared().
*
* Note that actual semantics of what an exclusive or shared fence mean
* is defined by the user, for reservation objects shared across drivers
* see &dma_buf.resv.
*/ struct dma_resv_list __rcu *fence;
};
@@ -98,6 +172,13 @@ static inline void dma_resv_reset_shared_max(struct dma_resv *obj) {}
- undefined order, a #ww_acquire_ctx is passed to unwind if a cycle
- is detected. See ww_mutex_lock() and ww_acquire_init(). A reservation
- object may be locked by itself by passing NULL as @ctx.
- When a die situation is indicated by returning -EDEADLK all locks held by
- @ctx must be unlocked and then dma_resv_lock_slow() called on @obj.
- Unlocked by calling dma_resv_lock().
*/ static inline int dma_resv_lock(struct dma_resv *obj, struct ww_acquire_ctx *ctx)
- See also dma_resv_lock_interruptible() for the interruptible variant.
@@ -119,6 +200,12 @@ static inline int dma_resv_lock(struct dma_resv *obj,
- undefined order, a #ww_acquire_ctx is passed to unwind if a cycle
- is detected. See ww_mutex_lock() and ww_acquire_init(). A reservation
- object may be locked by itself by passing NULL as @ctx.
- When a die situation is indicated by returning -EDEADLK all locks held by
- @ctx must be unlocked and then dma_resv_lock_slow_interruptible() called on
- @obj.
*/ static inline int dma_resv_lock_interruptible(struct dma_resv *obj, struct ww_acquire_ctx *ctx)
- Unlocked by calling dma_resv_lock().
@@ -134,6 +221,8 @@ static inline int dma_resv_lock_interruptible(struct dma_resv *obj,
- Acquires the reservation object after a die case. This function
- will sleep until the lock becomes available. See dma_resv_lock() as
- well.
*/ static inline void dma_resv_lock_slow(struct dma_resv *obj, struct ww_acquire_ctx *ctx)
- See also dma_resv_lock_slow_interruptible() for the interruptible variant.
@@ -167,7 +256,7 @@ static inline int dma_resv_lock_slow_interruptible(struct dma_resv *obj,
- if they overlap with a writer.
- Also note that since no context is provided, no deadlock protection is
- possible.
*/
- possible, which is also not needed for a trylock.
- Returns true if the lock was acquired, false otherwise.
@@ -193,6 +282,11 @@ static inline bool dma_resv_is_locked(struct dma_resv *obj)
- Returns the context used to lock a reservation object or NULL if no context
- was used or the object is not locked at all.
- WARNING: This interface is pretty horrible, but TTM needs it because it
- doesn't pass the struct ww_acquire_ctx around in some very long callchains.
- Everyone else just uses it to check whether they're holding a reservation or
*/ static inline struct ww_acquire_ctx *dma_resv_locking_ctx(struct dma_resv *obj) {
- not.