Speech Detection Using Multiple Acoustic Sensors
DRIVE
August 17, 2023
Aspects of the disclosure relate to voice activity detection (VAD) on wearable and other resource-constrained devices, to classify speech recorded by a microphone of the device as belonging to a wearer of the device versus another speech source. A computing device can include a microphone and an inertial measurement unit (IMU). The wearable device can use signals measured by the IMU for providing motion-tracking features, such as head tracking for augmented reality or virtual reality applications. Aspects of the disclosure provide for a device for leveraging existing data collected for these motion-tracking features for use in VAD. A device can pre-process data streamed from an IMU to use only signals predetermined to be indicative of whether or not a wearer of the device is speaking.
Discussion in the ATmosphere