Long Sequence Time-Series Forecasting (LSTF) is an important and challenging research problem with broad applications. Recent studies have shown that Transformer-based models are effective at capturing correlations in time-series data, but their quadratic time and memory complexity makes them unsuitable for LSTF. In response, we investigate the impact of the long-tail distribution of attention scores on prediction accuracy and propose a Bis-Attention mechanism that uses a mean-based measurement to bi-directionally sparsify the self-attention matrix, enhancing the differentiation of attention scores and reducing the complexity of Transformer-based models from $O(L^{2})$ to $O((\log L)^{2})$. Moreover, we reduce memory consumption and simplify the model architecture through a shared-QK scheme. The effectiveness of the proposed method is verified by theoretical analysis and visualisation. Extensive experiments on three benchmarks demonstrate that our method outperforms other state-of-the-art methods, achieving an average reduction of 19.2% in MSE and 12% in MAE compared with Informer.
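The abstract's core idea, bi-directionally sparsifying the attention matrix with a mean-based measurement, can be sketched as follows. This is a minimal NumPy illustration under assumed details: the measurement (max score minus mean score per row/column, in the spirit of Informer's ProbSparse criterion), the keep-above-average selection rule, and the uniform fallback for dropped queries are all assumptions, not the paper's exact formulation.

```python
import numpy as np

def bis_attention(Q, K, V):
    """Sketch of bi-directionally sparsified attention (assumed details).

    Queries (rows) and keys (columns) whose mean-based importance
    measurement exceeds the average are kept; the rest are masked out
    before the softmax, so only a small score sub-matrix is normalised.
    """
    d = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d)  # raw attention scores, shape (L, L)

    # Mean measurement: the gap between a row's (or column's) max score
    # and its mean score; a large gap marks a "dominant" query or key.
    q_meas = scores.max(axis=1) - scores.mean(axis=1)
    k_meas = scores.max(axis=0) - scores.mean(axis=0)

    # Keep only rows/columns whose measurement exceeds the average -- a
    # stand-in for the sub-quadratic selection behind the O((log L)^2)
    # complexity claimed in the abstract.
    keep_q = q_meas >= q_meas.mean()
    keep_k = k_meas >= k_meas.mean()

    sparse = np.full_like(scores, -np.inf)
    sparse[np.ix_(keep_q, keep_k)] = scores[np.ix_(keep_q, keep_k)]

    # Softmax over the surviving entries; dropped queries fall back to a
    # uniform average of V (as Informer handles "lazy" queries).
    out = np.empty_like(V, dtype=float)
    active = sparse[keep_q]
    weights = np.exp(active - active.max(axis=1, keepdims=True))
    weights /= weights.sum(axis=1, keepdims=True)
    out[keep_q] = weights @ V
    out[~keep_q] = V.mean(axis=0)
    return out
```

With the shared-QK scheme the abstract mentions, one would additionally set `K = Q` (a single projection serving both roles), which halves the projection parameters and makes the score matrix symmetric up to scaling.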
Yang Li, Chunhua Tian, Yuyan Lan, Chentao Yu, Keqiang Xie