Access Restriction Subscribed

Author Zekri, A.S. ♦ Sedukhin, S.G.
Source IEEE Xplore Digital Library
Content type Text
Publisher Institute of Electrical and Electronics Engineers, Inc. (IEEE)
File Format PDF
Copyright Year ©2007
Language English
Subject Domain (in DDC) Computer science, information & general works ♦ Data processing & computer science
Subject Keyword Concurrent computing ♦ Graphics ♦ Costs ♦ Algorithms ♦ High performance computing ♦ Linear algebra ♦ Vectors ♦ Registers ♦ Application software ♦ Coprocessors
Abstract The basic linear algebra subroutines (BLAS) are standard operations for efficiently solving linear algebra problems on high-performance and parallel systems. In this paper, we study the implementation of some important BLAS operations on an N×N torus array processor. We show that the performance of Level-3 BLAS, represented by the n×n matrix multiply-add operation with n > N, approaches the theoretical peak as n increases because the degree of data reuse is high, whereas the performance of Level-1 and Level-2 BLAS operations is low as a result of low data reuse. Fortunately, many applications rely on intensive use of Level-3 BLAS with only a small percentage of Level-1 and Level-2 BLAS.
Description Author affiliation: Univ. of Aizu, Aizuwakamatsu (Zekri, A.S.; Sedukhin, S.G.)
ISBN 9780769529837
Educational Role Student ♦ Teacher
Age Range Above 22 years
Educational Use Research ♦ Reading
Education Level UG and PG
Learning Resource Type Article
Publisher Date 2007-10-16
Publisher Place Japan
Rights Holder Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Size 288.28 kB
Page Count 6
Starting Page 300
Ending Page 305

