JOURNAL ARTICLE

Fine-grained Multi-user Device-Free Gesture Tracking on Today’s Smart Speakers

Abstract

Smart speakers play an important role in smart home envision. Active acoustic sensing can enable convenient gesture interaction on smart speakers to complement voice interaction in mandatory quiet scenarios and address privacy concerns. However, existing solutions did not consider the impact of the widely adopted uniform circular geometry of commercial smart speakers on gesture tracking. To fill this gap, we propose SparseTrack to achieve fine-grained multi-user device-free gesture tracking on commercial smart speakers. We cast gesture tracking to sparse recovery intuition to address signal coherence issue on uniform circular mic-array. We then synthesize wideband measurement to eliminate spatial ambiguity caused by the insufficient spatial sampling rate of today's smart speakers in the ultrasonic frequency band. We further design a robust trace extraction approach and properly handle the impact of the doppler effect on gesture tracking. We implement SparseTrack on COTS circular mic-array and conduct extensive evaluations. The results show that our system can simultaneously track up to 4 users' gestures with a mean tracking error of 2.66 cm.

Keywords:
Gesture Computer science Gesture recognition Ambiguity Tracking (education) Computer vision Speech recognition Artificial intelligence

Metrics

4
Cited By
0.58
FWCI (Field Weighted Citation Impact)
32
Refs
0.68
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Speech and Audio Processing
Physical Sciences →  Computer Science →  Signal Processing
Indoor and Outdoor Localization Technologies
Physical Sciences →  Engineering →  Electrical and Electronic Engineering
Advanced Adaptive Filtering Techniques
Physical Sciences →  Engineering →  Computational Mechanics

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.