|
arduino-audio-tools
|
Template-based wake word detector for microcontrollers using dominant frequency patterns. More...
#include <WakeWordDetector.h>
Classes | |
| struct | Template |
Public Types | |
| using | WakeWordCallback = void(*)(const char *name) |
Public Member Functions | |
| WakeWordDetector (AudioFFTBase &fft) | |
| virtual void | addNotifyAudioChange (AudioInfoSupport &bi) |
| Adds target to be notified about audio changes. | |
| void | addTemplate (const Vector< FrequencyFrame< N > > &frames, float threshold_percent, const char *name) |
| virtual AudioInfo | audioInfo () override |
| provides the actual input AudioInfo | |
| virtual AudioInfo | audioInfoOut () |
| virtual int | availableForWrite () override |
| virtual bool | begin () |
| virtual bool | begin (AudioInfo info) |
| virtual void | clearNotifyAudioChange () |
| Deletes all change notify subscriptions. | |
| virtual void | end () |
| virtual void | flush () |
| virtual bool | isDeletable () |
| If true we need to release the related memory in the destructor. | |
| bool | isNotifyActive () |
| Checks if the automatic AudioInfo update is active. | |
| bool | isRecording () const |
| virtual | operator bool () |
| virtual bool | removeNotifyAudioChange (AudioInfoSupport &bi) |
| Removes a target in order not to be notified about audio changes. | |
| virtual void | setAudioInfo (AudioInfo newInfo) override |
| Defines the input AudioInfo. | |
| void | setNotifyActive (bool flag) |
| Deactivate/Reactivate automatic AudioInfo updates: (default is active) | |
| void | setWakeWordCallback (WakeWordCallback cb) |
| void | startRecording () |
| Vector< FrequencyFrame< N > > | stopRecording () |
| size_t | write (const uint8_t *buf, size_t size) override |
| virtual size_t | write (uint8_t ch) override |
| virtual void | writeSilence (size_t len) |
Static Public Member Functions | |
| static void | fftResult (AudioFFTBase &fft) |
Protected Member Functions | |
| float | matchTemplate (Template &tmpl) |
| void | notifyAudioChange (AudioInfo info) |
Protected Attributes | |
| Vector< T > | _buffer |
| Buffer for incoming PCM samples. | |
| WakeWordCallback | _callback = nullptr |
| size_t | _frame_pos |
| Current position in frame buffer. | |
| bool | _is_recording = false |
| True if currently recording a template. | |
| size_t | _max_template_len = 0 |
| Length of the longest template. | |
| Vector< FrequencyFrame< N > > | _recent_frames |
| Recent frames for comparison. | |
| Vector< Template > | _templates |
| List of wake word templates. | |
| int | _timeout = 10 |
| AudioInfo | cfg |
| bool | is_active = false |
| bool | is_notify_active = true |
| Vector< AudioInfoSupport * > | notify_vector |
| AudioFFTBase * | p_fft = nullptr |
| SingleBuffer< uint8_t > | tmp {MAX_SINGLE_CHARS} |
| int | tmpPos = 0 |
Template-based wake word detector for microcontrollers using dominant frequency patterns.
This class detects wake words by comparing the sequence of the top N dominant frequencies in each audio frame to stored templates for each wake word. When the percentage of matching frames exceeds a configurable threshold, the corresponding wake word is considered detected.
| N | Number of dominant frequencies to track per frame (default: 3) |
Usage:
Example:
|
inlinevirtualinherited |
Adds target to be notified about audio changes.
Reimplemented in CodecNOP, EncodedAudioOutput, EncodedAudioStream, AACDecoderFDK, DecoderBasic, CodecChain, MP3DecoderHelix, MP3DecoderMAD, OggContainerDecoder, RTSPClient< TcpClient, UdpSocket >, Pipeline, and Pipeline::ModifyingStreamAdapter.
|
inlineoverridevirtualinherited |
provides the actual input AudioInfo
Implements AudioInfoSupport.
Reimplemented in AdapterPrintToAudioOutput, EncodedAudioOutput, and AdapterAudioStreamToAudioOutput.
|
inlinevirtualinherited |
provides the actual output AudioInfo: this is usually the same as audioInfo() unless we use a transforming stream
Reimplemented in PureDataStream, PWMAudioOutput, ChannelFormatConverterStreamT< T >, ChannelFormatConverterStream, NumberFormatConverterStreamT< TFrom, TTo >, NumberFormatConverterStream, FormatConverterStream, Pipeline, ResampleStream, and ResampleStreamT< TInterpolator >.
|
inlineoverridevirtualinherited |
Reimplemented from Print.
Reimplemented in RTSPOutput< Platform >.
|
inlinevirtualinherited |
|
inlinevirtualinherited |
Reimplemented in AudioESP32ULP, RTSPOutput< Platform >, CsvOutput< T >, and EncodedAudioOutput.
|
inlinevirtualinherited |
Deletes all change notify subscriptions.
Reimplemented in RTSPClient< TcpClient, UdpSocket >.
|
inlinevirtualinherited |
Reimplemented in RTSPOutput< Platform >, PitchShiftOutput< T, BufferT >, EncodedAudioOutput, and OggContainerOutput.
|
inlinevirtualinherited |
Reimplemented from Print.
|
inlinevirtualinherited |
If true we need to release the related memory in the destructor.
Reimplemented in AdapterPrintToAudioOutput, and AdapterAudioStreamToAudioOutput.
|
inlinevirtualinherited |
Reimplemented in RTSPOutput< Platform >, and EncodedAudioOutput.
|
inlinevirtualinherited |
Removes a target in order not to be notified about audio changes.
Reimplemented in RTSPClient< TcpClient, UdpSocket >.
|
inlineoverridevirtualinherited |
Defines the input AudioInfo.
Implements AudioInfoSupport.
Reimplemented in ChannelsSelectOutput, AdapterPrintToAudioOutput, MultiOutput, AdapterAudioStreamToAudioOutput, CsvOutput< T >, PWMAudioOutput, and EncodedAudioOutput.
|
inlineoverridevirtual |
Implements AudioOutput.
|
inlinevirtualinherited |
Writes n 0 values (= silence)
| len |