Bibliographic Details
Title: |
ED-sKWS: Early-Decision Spiking Neural Networks for Rapid,and Energy-Efficient Keyword Spotting |
Authors: |
Song, Zeyang, Liu, Qianhui, Yang, Qu, Peng, Yizhou, Li, Haizhou |
Publication Year: |
2024 |
Collection: |
Computer Science |
Subject Terms: |
Computer Science - Sound, Computer Science - Artificial Intelligence, Electrical Engineering and Systems Science - Audio and Speech Processing |
More Details: |
Keyword Spotting (KWS) is essential in edge computing requiring rapid and energy-efficient responses. Spiking Neural Networks (SNNs) are well-suited for KWS for their efficiency and temporal capacity for speech. To further reduce the latency and energy consumption, this study introduces ED-sKWS, an SNN-based KWS model with an early-decision mechanism that can stop speech processing and output the result before the end of speech utterance. Furthermore, we introduce a Cumulative Temporal (CT) loss that can enhance prediction accuracy at both the intermediate and final timesteps. To evaluate early-decision performance, we present the SC-100 dataset including 100 speech commands with beginning and end timestamp annotation. Experiments on the Google Speech Commands v2 and our SC-100 datasets show that ED-sKWS maintains competitive accuracy with 61% timesteps and 52% energy consumption compared to SNN models without early-decision mechanism, ensuring rapid response and energy efficiency. Comment: Accepted by INTERSPEECH2024 |
Document Type: |
Working Paper |
Access URL: |
http://arxiv.org/abs/2406.12726 |
Accession Number: |
edsarx.2406.12726 |
Database: |
arXiv |