VoiceFilter-Lite is an on-device speech separation model that enhances speech recognition in noisy environments by filtering out non-target speech.
VoiceFilter-Lite is an advancement in on-device speech recognition technology, designed to improve the accuracy of voice assistive technologies, especially in scenarios with overlapping speech. It leverages a speaker's enrolled voice (via Google's Voice Match) to personalize interaction and isolate the target speaker's voice from background noise or other speakers. The system is optimized for mobile devices, addressing constraints like model size, CPU/memory limitations, battery usage, and latency. Unlike its predecessor, VoiceFilter-Lite processes log Mel-filterbanks directly, rather than audio waveforms, and enhances these features by filtering out components not belonging to the target speaker in real-time. It features a compact 2.2 MB model size after quantization with TensorFlow Lite, making it suitable for on-device applications even without an internet connection. The model also incorporates novel approaches, such as asymmetric loss during training and adaptive suppression strength, to mitigate over-suppression errors, which are particularly problematic for modern speech recognition models.
Integrations: TensorFlow Lite, Google Voice Match
Platforms: Android, iOS
View full VoiceFilter-Lite profile on Tools-Radar | Browse Voice & Audio tools | Alternatives to VoiceFilter-Lite
Tools-Radar is a free directory of 10,000+ AI tools — discover, compare, and choose the right AI software for your needs. Visit tools-radar.com