Home         Authors   Papers   Year of conference   Themes   Organizations        To MES conference

The Results of the Implementation of the Copy Function on a Vector Coprocessor  

Authors
 Aryashev S.I.
 Zubkovskiy P.S.
 Tsvetkov V.V.
Date of publication
 2018
DOI
 10.31114/2078-7707-2018-3-144-147

Abstract
 In this paper we present the results of the implementation of the copy function on a vector coprocessor. The proposed implementation supports the execution of copying arrays with different level of address alignment and automatically selects loading/saving instructions corresponding to the level of alignment of arrays. So on the arrays aligned on the boundary of the 256-bit word (align32), vector instructions vldq/vsdq are used that load/store two vectors using CACHE memory of the second level, on arrays aligned along the 128-bit word boundary (align16), vector instructions vldm/vsdm are used that load/store one vector using the first and second level cached memory. On arrays aligned on the boundary of a 64-bit or 32-bit word (align8 or align4), copying is performed through real-arithmetic coprocessor (FPU) registers using the ldc1/sdc1 or lwc1/swc1 instructions that load/store FPU registers. The testing of the copy program was performed on different versions of models and different versions of the hardware implementation of the vector coprocessor. Based on the measurement of the number of processor cycles spent on the execution of the program, the acceleration factor of the execution of the copy program on the vector coprocessor (CPV) was evaluated relative to the implementation of the program implementation on the coprocessor for real operations (FPU). The greatest acceleration is observed when using the vldq/vsdq commands on float arrays aligned on the 32-byte boundary, the maximum value of the acceleration coefficient is K = 5.5. For arrays double, the corresponding value of the acceleration coefficient is about one and a half times smaller.
Keywords
 vector coprocessor, coprocessor of real arithmetic, acceleration factor, copy function, loading instructions, save instructions.
Library reference
 Aryashev S.I., Zubkovskiy P.S., Tsvetkov V.V. The Results of the Implementation of the Copy Function on a Vector Coprocessor // Problems of Perspective Micro- and Nanoelectronic Systems Development - 2018. Issue 3. P. 144-147. doi:10.31114/2078-7707-2018-3-144-147
URL of paper
 http://www.mes-conference.ru/data/year2018/pdf/D080.pdf

Copyright © 2009-2024 IPPM RAS. All Rights Reserved.

Design of site: IPPM RAS