JOURNAL ARTICLE

An efficient stream memory architecture for heterogeneous multicore processor

Abstract

It is challenging to design a high performance memory sub-system for heterogeneous multicore processor FT64-3, which features 18 on chip 64-bit float function units. In this paper, we propose a parallel stream memory architecture that can greatly leverage the design idea of exploiting memory level parallelism for higher memory throughput., Experimental results and analysis for kernel algorithms are presented in the paper to show the efficiency and rationale of our design. By employing our parallel stream memory architercture, the performance of FT64-3 with a is 2–3 orders better than FT64-2 when running at the same clock frequency of 500 MHz, and is comparable to Itanium2 running at 1.6GHz but with less hardware cost.

Keywords:
Computer science Parallel computing Multi-core processor Memory architecture Clock rate Interleaved memory Leverage (statistics) Kernel (algebra) Throughput Uniform memory access Memory management Computer architecture Embedded system Computer hardware Chip Semiconductor memory Operating system Wireless

Metrics

3
Cited By
0.24
FWCI (Field Weighted Citation Impact)
7
Refs
0.70
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Parallel Computing and Optimization Techniques
Physical Sciences →  Computer Science →  Hardware and Architecture
Advanced Data Storage Technologies
Physical Sciences →  Computer Science →  Computer Networks and Communications
Interconnection Networks and Systems
Physical Sciences →  Computer Science →  Computer Networks and Communications
© 2026 ScienceGate Book Chapters — All rights reserved.