In Table 1 and Figure 4, the runtime comparisons between SMC and MCMC methods do not explicitly address potential differences in hardware utilization efficiency across shared and distributed memory architectures. Could the authors clarify whether variations in cache coherence, memory contention, or inter-node communication latency were accounted for during the runtime measurements?