MULTI-MODAL COGNITIVE MECHANISM FOR ROAD SECTION RECOGNITION
DRIVE
June 1, 2023
In an approach for road section recognition using a multi-modal cognitive mechanism, a processor receives an audio signal from a road test. In a first mode, a processor processes the audio signal to generate an acoustic spectrum density distribution map and to identify at least one road section switching point. In a second mode, a processor processes a spectrogram of the audio signal to identify the at least one road section switching point. In a third mode, a processor uses a machine learning model to predict an expected sound at each frame of the audio signal, to calculate a similarity between the expected sound and the actual sound, and to identify the at least one road section switching point when the similarity is lower than a pre-set similarity threshold. A processor combines the results of the three modes to obtain a final set of road section switching points.
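As a rough illustration of the third mode and the final combination step, the sketch below flags frames whose similarity to the predicted sound falls under a threshold, then merges candidate points from the three modes. The abstract does not state which similarity measure or combination rule is used, so cosine similarity, the majority vote, the frame tolerance, and all function and parameter names here (`third_mode_points`, `combine_modes`, `threshold`, `tolerance`, `min_votes`) are assumptions made only for this example. The predicted frames are taken as given inputs; no machine learning model is implemented.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two feature vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def third_mode_points(expected_frames, actual_frames, threshold=0.8):
    """Return indices of frames whose similarity is below the threshold.

    expected_frames stands in for the model's per-frame predictions;
    the threshold value is a placeholder, not taken from the source.
    """
    return [
        i
        for i, (exp, act) in enumerate(zip(expected_frames, actual_frames))
        if cosine_similarity(exp, act) < threshold
    ]


def combine_modes(mode_results, tolerance=2, min_votes=2):
    """Merge candidate switching points from several modes.

    Hypothetical rule: candidates within `tolerance` frames of the
    previous candidate form one cluster; a cluster is kept when at
    least `min_votes` distinct modes contributed to it, and it is
    reported as its median frame index.
    """
    tagged = sorted(
        (point, mode)
        for mode, points in enumerate(mode_results)
        for point in points
    )
    clusters = []
    for point, mode in tagged:
        if clusters and point - clusters[-1][-1][0] <= tolerance:
            clusters[-1].append((point, mode))
        else:
            clusters.append([(point, mode)])

    final = []
    for cluster in clusters:
        if len({mode for _, mode in cluster}) >= min_votes:
            points = sorted(point for point, _ in cluster)
            final.append(points[len(points) // 2])
    return final
```

For instance, `combine_modes([first_mode, second_mode, third_mode])` takes one list of frame indices per mode. Under this voting rule, a point reported by only one mode is dropped, which is one possible way to suppress spurious detections; a union or weighted scheme would be equally consistent with the abstract.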