Data Cluster Detection for Low Power Embedded Memory Subsystems

Article Preview

Abstract:

This work proposes a technique for optimizing data placement of application-wide reused data so that it resides in scratchpad memory of processing elements in multiprocessor system on chips. The proposed technique identifies data elements with fine granularity that can profitably be placed in the scratchpad memories to maximize performance and energy gains. We present a heuristic approach that efficiently exploits the scratchpad memories using memory access footprint. Our experimental results indicate that our approach is able to reduce energy consumption by 30% over cache based memory subsystems.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

833-836

Citation:

Online since:

September 2015

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2015 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] R. Banakar, S. Steinke, and P. Marwedel, Scratchpad memory: a design alternative for cache on-chip memory in embedded systems, Proceedings of the international workshop on hardware/software codesign (2002).

DOI: 10.1145/774789.774805

Google Scholar

[2] M.T. Kandemir, M. J. Irwin, N. Vijaykrishnan, I. Kadayif, and A. Parikh, Dynamic management of scratchpad memory space, Design Automation Conference (2001) 690-695.

DOI: 10.1145/378239.379049

Google Scholar

[3] M. Verma, L. Wehmeyer, and P. Marwedel, Dynamic overlay of scratchpad memory for energy minimization, proceeding of the ACM international conference on hardware/software codesign and system synthesis (2004) 104-109.

DOI: 10.1145/1016720.1016748

Google Scholar

[4] M. Gschwind, H. P. Hofstee, B. K. Flachs, and T. Yamazaki, Synergistic processing in cell's multicore architecture, IEEE Micro, Vol. 26, No. 2 (2006) 10-24.

DOI: 10.1109/mm.2006.41

Google Scholar

[5] D. C. Burger and T. M. Austin, The simplescalar tool set version 2. 0, Technical Report CS-TR-1997-1342 (1997).

Google Scholar

[6] P. Shivakumar and N. P. Jouppi, Cacti 3. 0: an integrated cache timing power and area model, HP Labs Technical Report WRL-2001-2 (2001).

Google Scholar

[7] C. Lee, M. Potkonjak, and W. H. Mangione Smith, Mediabench: A tool for evaluating and synthesizing multimedia and communications systems, International Symposium on Microarchitecture (1997) 330-335.

DOI: 10.1109/micro.1997.645830

Google Scholar