Proceedings paper
Title:
	On the Influence of Thread Allocation for Irregular Codes in NUMA Systems
Authors:
	J. Lorenzo, F. Rivera, P. Tůma, J. Pichel
Publication:
	
		
			Proc. 10th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)
		
	
Year:
	2009
Abstract:
	This work presents a study undertaken to characterise the FINISTERRAE supercomputer, one of the biggest NUMA systems in Europe. The main objective was to determine the performance effect of bus contention and cache coherency as well as the suitability of porting strategies regarding irregular codes in such a complex architecture. Results show that: (1) cores which share a socket can be considered as independent processors in this context; (2) for big data sizes, the effect of sharing a bus degrades the final performance but masks the cache coherency effects; (3) the NUMA factor (remote to local memory latency ratio) is an important factor on irregular codes and (4) the default kernel allocation policy is not optimal in this system. These results allow us to understand the behaviour of thread-to-core mappings and memory allocation policies.
BibTeX:
	@inproceedings{lorenzo_influence_2009,
    title = {{On the Influence of Thread Allocation for Irregular Codes in NUMA Systems}},
    author = {Lorenzo, J.A. and Rivera, F.F. and Tuma, P. and Pichel, J.C.},
    year = {2009},
    booktitle = {{Proc. 10th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT)}},
    doi = {10.1109/PDCAT.2009.42},
    pages = {146--153},
}