Consistency Rules for cudaHostAllocMapped

Question

Consistency Rules for cudaHostAllocMapped

557 views Asked by agrippa At 05 April 2013 at 16:01

Does anyone know of documentation on the memory consistency model guarantees for a memory region allocated with cudaHostAlloc(..., cudaHostAllocMapped)? For instance, when writes from the device become visible to reads from the host would be useful (could be after the kernel completes, at earliest possible time during kernel execution, etc).

Original Q&A

There are 1 answers

**tera** · Accepted Answer · 2013-04-05T17:05:24+00:00

Writes from the device are guaranteed to be visible on the host (or on peer devices) after the performing thread has executed a __threadfence_system() call (which is only available on compute capability 2.0 or higher).
They are also visible after the kernel has finished, i.e. after a cudaDeviceSynchronize() or after one of the other synchronization methods listed in the "Explicit Synchronization" section of the Programming Guide has been successfully completed.

Mapped memory should never be modified from the host while a kernel using it is or could be running, as CUDA currently does not provide any way of synchronization in that direction.

TechQA.

Consistency Rules for cudaHostAllocMapped

There are 1 answers

Related Questions in MEMORY

Related Questions in CUDA

Related Questions in GPGPU

Related Questions in CONSISTENCY

Related Questions in MAPPED-MEMORY

Popular Questions

Trending Questions