JOURNAL ARTICLE

A Transformer-based Neural Architecture Search Method

Abstract

This paper presents a neural architecture search method based on Transformer architecture, searching cross multihead attention computation ways for different number of encoder and decoder combinations. In order to search for neural network structures with better translation results, we considered perplexity as an auxiliary evaluation metric for the algorithm in addition to BLEU scores and iteratively improved each individual neural network within the population by a multi-objective genetic algorithm. Experimental results show that the neural network structures searched by the algorithm outperform all the baseline models, and that the introduction of the auxiliary evaluation metric can find better models than considering only the BLEU score as an evaluation metric.

Keywords:
Perplexity Computer science Transformer Metric (unit) Artificial intelligence Artificial neural network Encoder Genetic algorithm Computation Population Architecture Beam search Machine translation Search algorithm Machine learning Algorithm Language model Engineering

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
5
Refs
0.08
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Neural Networks and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Multi-Objective Optimization Algorithms
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Fault Detection and Control Systems
Physical Sciences →  Engineering →  Control and Systems Engineering

Related Documents

JOURNAL ARTICLE

EEG-based Emotion Recognition via Transformer Neural Architecture Search

Chang LiZhongzhen ZhangXiaodong ZhangGuoning HuangYü LiuXun Chen

Journal:   IEEE Transactions on Industrial Informatics Year: 2022 Vol: 19 (4)Pages: 6016-6025
BOOK-CHAPTER

NASformer: Neural Architecture Search for Vision Transformer

Bolin NiGaofeng MengShiming XiangChunhong Pan

Lecture notes in computer science Year: 2022 Pages: 47-61
JOURNAL ARTICLE

Semantic Segmentation Method Based on Neural Architecture Search

烜 朱

Journal:   Advances in Applied Mathematics Year: 2023 Vol: 12 (08)Pages: 3587-3597
© 2026 ScienceGate Book Chapters — All rights reserved.