JOURNAL ARTICLE

Latency-Critical Quantized Inference With Transformer Decoders on ARM and RISC-V CPUs

Hèctor MartínezSandra CatalánAdrián CastellóJose I. MestreEnrique S. Quintana–Ort́ı

Year: 2025 Journal:   IEEE Internet of Things Journal Vol: 12 (13)Pages: 25676-25690   Publisher: Institute of Electrical and Electronics Engineers
Keywords:
Computer science Reduced instruction set computing Latency (audio) Transformer Parallel computing Inference Embedded system Instruction set Electrical engineering Voltage Telecommunications Engineering

Metrics

2
Cited By
9.64
FWCI (Field Weighted Citation Impact)
22
Refs
0.96
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Neural Networks and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Seismic Imaging and Inversion Techniques
Physical Sciences →  Earth and Planetary Sciences →  Geophysics
Geophysical Methods and Applications
Physical Sciences →  Engineering →  Ocean Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.