For the increasing demands of embedded computation, hardware accelerators are widely used with processors. FPGA provides flexibility to design such accelerators because it is a programmable device. But developing a custom accelerator for each application is time-consuming and not reusable. On the other hand, vector processing brings the opportunity to accelerate computation by taking advantage of data-level parallelism.
Ahmed KamaleldinSalma HeshamDiana Göhringer
Hong GuanYichuan GaoChenlu MiaoHaoyang WuHaiyan ZhuM LinHailian Liang
Quan ZhangYujie HuangYujie CaiYalong PangJun Han