RADE: A Neural Codec for Transmitting Speech over HF Radio Channels
By: David Rowe, Jean-Marc Valin
Potential Business Impact:
Makes radio voices clearer, even with bad signals.
Speech compression is commonly used to send voice over radio channels in applications such as mobile telephony and two-way push-to-talk (PTT) radio. In classical systems, the speech codec is combined with forward error correction, modulation and radio hardware. In this paper we describe an autoencoder that replaces many of the traditional signal processing elements with a neural network. The encoder takes a vocoder feature set (short term spectrum, pitch, voicing), and produces discrete time, but continuously valued quadrature amplitude modulation (QAM) symbols. We use orthogonal frequency domain multiplexing (OFDM) to send and receive these symbols over high frequency (HF) radio channels. The decoder converts received QAM symbols to vocoder features suitable for synthesis. The autoencoder has been trained to be robust to additive Gaussian noise and multipath channel impairments while simultaneously maintaining a Peak To Average Power Ratio (PAPR) of less than 1 dB. Over simulated and real world HF radio channels we have achieved output speech intelligibility that clearly surpasses existing analog and digital radio systems over a range of SNRs.
Similar Papers
RADE for Land Mobile Radio: A Neural Codec for Transmission of Speech over Baseband FM Radio Channels
Audio and Speech Processing
Makes old radios send clearer voices.
Neural Network-based Information-Theoretic Transceivers for High-Order Modulation Schemes
Signal Processing
Makes computers send and get messages better.
High Performance Signal Design for Optical OFDM Systems using Variational Autoencoder
Systems and Control
Makes internet signals send more data, faster, and clearer.