"A Lightweight OpenMP4 Run-time for Embedded Systems", In the 21st Asia and South Pacific Design Automation Conference (ASP-DAC), Jan, 25-28, 2016, Macao (China)
OpenMP is increasingly being adopted by current many-core embedded processors to exploit their parallel computation capabilities. Unfortunately, existing run-time implementations of the latest specification (v4.0) are not suitable for processors that rely on small, fast on-chip memories, because of their memory consumption. This paper proposes an OpenMP4 run-time that reduces memory consumption while providing the same performance. Our run-time relies on a new compiler pass capable of generating the task dependency graph of OpenMP programs, which is then efficiently stored in memory.
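To give a rough idea of what a task dependency graph stored compactly in memory could look like, the following is a minimal sketch in C. It is not the run-time or the compiler output described in the paper. The compressed sparse row (CSR) layout, the names `succ_start`, `succ`, `num_preds`, `tdg_run` and `tdg_static_bytes`, and the four-task diamond graph are all assumptions made for illustration. The sketch walks the graph sequentially, where a real run-time would dispatch ready tasks to worker threads.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical graph, as it might arise from four OpenMP 4.0 tasks:
 *   T0: depend(out: a)
 *   T1: depend(in: a) depend(out: b)
 *   T2: depend(in: a) depend(out: c)
 *   T3: depend(in: b, c)
 * Edges: T0 -> T1, T0 -> T2, T1 -> T3, T2 -> T3.
 */
enum { NUM_TASKS = 4, NUM_EDGES = 4 };

/* CSR layout (an assumption of this sketch): the successors of task t
 * are succ[succ_start[t]] .. succ[succ_start[t + 1] - 1]. */
static const uint8_t succ_start[NUM_TASKS + 1] = { 0, 2, 3, 4, 4 };
static const uint8_t succ[NUM_EDGES]           = { 1, 2, 3, 3 };

/* Number of predecessors of each task, known before execution starts. */
static const uint8_t num_preds[NUM_TASKS]      = { 0, 1, 1, 2 };

/* Walk the graph in dependency order. Records the execution order in
 * order[] and returns the number of tasks executed. */
size_t tdg_run(uint8_t order[NUM_TASKS])
{
    uint8_t pending[NUM_TASKS]; /* unresolved predecessors per task */
    uint8_t ready[NUM_TASKS];   /* FIFO queue of runnable tasks     */
    size_t head = 0, tail = 0, done = 0;

    for (size_t t = 0; t < NUM_TASKS; t++) {
        pending[t] = num_preds[t];
        if (pending[t] == 0)
            ready[tail++] = (uint8_t)t;
    }

    while (head < tail) {
        uint8_t t = ready[head++];
        order[done++] = t; /* a real run-time would run the task body here */

        /* Release every successor whose last dependency was just met. */
        for (size_t e = succ_start[t]; e < succ_start[t + 1]; e++) {
            uint8_t s = succ[e];
            if (--pending[s] == 0) {
                assert(tail < NUM_TASKS);
                ready[tail++] = s;
            }
        }
    }
    return done;
}

/* Size of the read-only graph description in bytes. */
size_t tdg_static_bytes(void)
{
    return sizeof succ_start + sizeof succ + sizeof num_preds;
}
```

In this sketch the graph description is constant data of fixed size, so no dependency-tracking structures need to be allocated while the program runs; only the small `pending` and `ready` arrays are mutable. Whether the paper's run-time uses a comparable layout is not stated in the abstract.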