Customized MMRF: Efficient Matrix Operations on SIMD Processors

Kai Zhang; Yao Hua Wang; Shu Ming Chen; Zhen Tao Li; Liang Wen

doi:10.4028/www.scientific.net/AMM.347-350.1727

Paper Titles

Single Channel Signal Separation of GMSK Signals Based on MLP
p.1705

Development of Embedded Practice Platform for Control System Based on ARM
p.1711

Cognitive Radio as Enabling Technology for Dynamic Spectrum Access
p.1716

A Study on Turbo Code Performance Based on AWGN Channel
p.1720

Customized MMRF: Efficient Matrix Operations on SIMD Processors
p.1727

Optimal Power Consumption Analysis of Two-Level Hierarchical Model and Non-Hierarchical Model
p.1732

Improvement on LEACH Protocol of Wireless Sensor Network
p.1738

Research on Signal "Batch-Increasing" Phenomenon of Superheterodyne Reconnaissance Receiver
p.1743

Analysis of MMSE-Based Anti-Interference Performance for MIMO-UWB Systems
p.1748

HomeApplied Mechanics and MaterialsApplied Mechanics and Materials Vols. 347-350Customized MMRF: Efficient Matrix Operations on...

Customized MMRF: Efficient Matrix Operations on SIMD Processors

Abstract:

Wireless communication and multimedia applications feature a large amount of matrix operations with different matrix size. These operations require accessing matrix in column order. This paper implements a Multi-Grained Matrix Register File (MMRF) that supports multi-grained parallel row-wise and column-wise access. We implement a 4*4 MIMO decoding with the help of MMRF to illustrate the efficient matrix operations on SIMD processors. Experimental results show that, compared with TMS320C64x+, our SIMD processor can achieve about 5.65x to 7.71x performance improvement by employing the MMRF. By customized design technology, we reduce the area and critical-path delay of MMRF by 17.9% and 39.1% respectively.

You might also be interested in these eBooks

View Preview

Info:

Periodical:

Applied Mechanics and Materials (Volumes 347-350)

Pages:

1727-1731

DOI:

https://doi.org/10.4028/www.scientific.net/AMM.347-350.1727

Citation:

Cite this paper

Online since:

August 2013

Authors:

Kai Zhang, Yao Hua Wang, Shu Ming Chen, Zhen Tao Li, Liang Wen

Keywords:

Customize, Matrix Operations, SIMD

Export:

RIS, BibTeX

Price:

Permissions CCC:

Request Permissions

Permissions PLS:

Request Permissions

Сopyright:

Citation:

References

[1] Samsung. Downlink MIMO for EUTRA. 3GPP TSG RAN WG1 meeting #44, 2006. 3GPP R1-060335.

Google Scholar

[2] J. Andrews, A. Ghosh, R. Muhamed, Fundamentals of WiMAX: Understanding Broadband Wireless Networking, Prentice Hall, Mar. (2007).

Google Scholar

[3] Jesus Corbal, Roger Espasa, and Mateo Valero, MOM: a Matrix SIMD Instruction Set Architecture for Multimedia Applications, In Proceedings of the ACM/IEEE SC99 Conference, p.1–12, (1999).

DOI: 10.1145/331532.331547

Google Scholar

[4] Asadollah Shahbahrami, Ben Juurlink, and Stamatis Vassiliadis, Versatility of Extended Subwords and the Matrix Register File, ACM Transactions on Architecture and Code Optimization, Vol. 5, No. 1, Article 5, Publication date: May. (2008).

DOI: 10.1145/1369396.1369401

Google Scholar

[5] Mark Woh, Sangwon Seo, Scott Mahlke, Trevor Mudge, Chaitali Chakrabarti and Krisztian Flautner, AnySP: Anytime Anywhere Anyway Signal Processing, " ISCA, 09, June 20–24, (2009).

DOI: 10.1145/1555754.1555773

Google Scholar

[6] Brian Flachs, Shigehiro Asano, Sang H. Dhong, et al, The Microarchitecture of the Synergistic Processor for a Cell Processor, IEEE Journal of Solid-State Circuits, Vol. 41, NO. 1, Jan. (2006).

DOI: 10.1109/jssc.2005.859332

Google Scholar

[7] Ronny Krashinsky et al, The Vector-Thread Architecture, In Proceedings of the 31st Annual International Symposium on Computer Architecture, 2004., pp.52-63, Jun. (2004).

DOI: 10.1109/isca.2004.1310763

Google Scholar

[8] Catalin Ciobanu, Georgi Kuzmanov, Georgi Gaydadjiev, Alex Ramirez, A Polymorphic Register File for Matrix Operations, International Conference on Embedded Systems: Architectures, Modeling and Simulation, July. (2006).

DOI: 10.1109/icsamos.2010.5642059

Google Scholar

[9] Kai Zhang, Shuming Chen, Hu Chen, Yaohua Wang, Xiaowen Chen, Sheng Liu and Wei Liu, CMRF: a Configurable Matrix Register File for accelerating matrix operations on SIMD processors, IEICE Electron. Express, Vol. 9, No. 4, pp.283-289, (2012).

DOI: 10.1587/elex.9.283

Google Scholar