32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors

Nowadays mobile devices represent a significant portion of the market for embedded systems, and are continuously demanded in daily life. From the end-user perspective size, weight, features are the key quality criteria. These benchmarks criteria became the usual design constraints in the embedded...

Full description

Saved in:
Bibliographic Details
Main Authors: Mohamed, Izzeldin Ibrahim, Chin, Fatt Chay, Marsono, Muhammad Nadzir
Format: Article
Published: Foundation of Computer Science 2012
Subjects:
Online Access:http://eprints.utm.my/id/eprint/30383/
Tags: Add Tag
No Tags, Be the first to tag this record!
id my.utm.30383
record_format eprints
spelling my.utm.303832019-07-23T09:01:24Z http://eprints.utm.my/id/eprint/30383/ 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors Mohamed, Izzeldin Ibrahim Chin, Fatt Chay Marsono, Muhammad Nadzir TK Electrical engineering. Electronics Nuclear engineering Nowadays mobile devices represent a significant portion of the market for embedded systems, and are continuously demanded in daily life. From the end-user perspective size, weight, features are the key quality criteria. These benchmarks criteria became the usual design constraints in the embedded systems design process and put a high impact on the power consumption. This paper survey and explore different low power design techniques for FPGA and processors. We compare, evaluate, and analyze, the power and energy consumption in three different designs namely, Altera FPGA Cyclone II which has a systolic array matrix multiplication implemented, i5 Clarkdale, and Atom Pineview-D Intel general purpose processors, which multiply two nxn 32-bit matrices and produce a 64-bit matrix as an output. We concluded that FPGA is a more power and energy efficient on low matrix size. However, general purpose processor performance is close to FPGA on larger matrix size as the larger cache size in general purpose processor help in reducing latency. We also concluded that the performance of FPGA can be improved in terms of latency if more systolic array processing elements are implemented in parallel to allow more concurrency. Foundation of Computer Science 2012-08 Article PeerReviewed Mohamed, Izzeldin Ibrahim and Chin, Fatt Chay and Marsono, Muhammad Nadzir (2012) 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors. International Journal of Computer Applications, 52 (11). pp. 17-23. ISSN 0975-8887
institution Universiti Teknologi Malaysia
building UTM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Teknologi Malaysia
content_source UTM Institutional Repository
url_provider http://eprints.utm.my/
topic TK Electrical engineering. Electronics Nuclear engineering
spellingShingle TK Electrical engineering. Electronics Nuclear engineering
Mohamed, Izzeldin Ibrahim
Chin, Fatt Chay
Marsono, Muhammad Nadzir
32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors
description Nowadays mobile devices represent a significant portion of the market for embedded systems, and are continuously demanded in daily life. From the end-user perspective size, weight, features are the key quality criteria. These benchmarks criteria became the usual design constraints in the embedded systems design process and put a high impact on the power consumption. This paper survey and explore different low power design techniques for FPGA and processors. We compare, evaluate, and analyze, the power and energy consumption in three different designs namely, Altera FPGA Cyclone II which has a systolic array matrix multiplication implemented, i5 Clarkdale, and Atom Pineview-D Intel general purpose processors, which multiply two nxn 32-bit matrices and produce a 64-bit matrix as an output. We concluded that FPGA is a more power and energy efficient on low matrix size. However, general purpose processor performance is close to FPGA on larger matrix size as the larger cache size in general purpose processor help in reducing latency. We also concluded that the performance of FPGA can be improved in terms of latency if more systolic array processing elements are implemented in parallel to allow more concurrency.
format Article
author Mohamed, Izzeldin Ibrahim
Chin, Fatt Chay
Marsono, Muhammad Nadzir
author_facet Mohamed, Izzeldin Ibrahim
Chin, Fatt Chay
Marsono, Muhammad Nadzir
author_sort Mohamed, Izzeldin Ibrahim
title 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors
title_short 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors
title_full 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors
title_fullStr 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors
title_full_unstemmed 32-bit NxN matrix multiplication: performance evaluation for altera FPGA, i5 Clarkdale, and atom pineview-D intel general purpose processors
title_sort 32-bit nxn matrix multiplication: performance evaluation for altera fpga, i5 clarkdale, and atom pineview-d intel general purpose processors
publisher Foundation of Computer Science
publishDate 2012
url http://eprints.utm.my/id/eprint/30383/
_version_ 1643648539803254784
score 13.211869