Although using polling is sufficient to learn the mapping, it can be expensive in terms of time.
To this end, we first discuss common notions of instruction throughput and port usage, and introduce a more precise definition of latency that, in contrast to previous definitions, considers dependencies between different pairs of input and output operands.
This paper explores how datacenters can exploit PIM architectures in the context of latency-critical applications.