|
arduino-audio-tools
|
Template-based wake word detector for microcontrollers using dominant frequency patterns. More...
#include <WakeWordDetector.h>
Classes | |
| struct | Template |
Public Types | |
| using | WakeWordCallback = void(*)(const char *name) |
Static Public Member Functions | |
| static void | fftResult (AudioFFTBase &fft) |
Protected Member Functions | |
| float | matchTemplate (Template &tmpl) |
| void | notifyAudioChange (AudioInfo info) |
Protected Attributes | |
| Vector< T > | _buffer |
| Buffer for incoming PCM samples. | |
| WakeWordCallback | _callback = nullptr |
| size_t | _frame_pos |
| Current position in frame buffer. | |
| bool | _is_recording = false |
| True if currently recording a template. | |
| size_t | _max_template_len = 0 |
| Length of the longest template. | |
| Vector< FrequencyFrame< N > > | _recent_frames |
| Recent frames for comparison. | |
| Vector< Template > | _templates |
| List of wake word templates. | |
| int | _timeout = 10 |
| AudioInfo | cfg |
| bool | is_active = false |
| bool | is_notify_active = true |
| Vector< AudioInfoSupport * > | notify_vector |
| AudioFFTBase * | p_fft = nullptr |
| SingleBuffer< uint8_t > | tmp {MAX_SINGLE_CHARS} |
| int | tmpPos = 0 |
Template-based wake word detector for microcontrollers using dominant frequency patterns.
This class detects wake words by comparing the sequence of the top N dominant frequencies in each audio frame to stored templates for each wake word. When the percentage of matching frames exceeds a configurable threshold, the corresponding wake word is considered detected.
| N | Number of dominant frequencies to track per frame (default: 3) |
Usage:
Example:
|
inline |
|
inlinevirtualinherited |
Adds target to be notified about audio changes.
Reimplemented in CodecNOP, EncodedAudioOutput, EncodedAudioStream, AACDecoderFDK, DecoderBasic, CodecChain, MP3DecoderHelix, MP3DecoderMAD, OggContainerDecoder, RTSPClient< TcpClient, UdpSocket >, Pipeline, and Pipeline::ModifyingStreamAdapter.
|
inline |
provides the actual input AudioInfo
Implements AudioInfoSupport.
Reimplemented in AdapterPrintToAudioOutput, EncodedAudioOutput, and AdapterAudioStreamToAudioOutput.
provides the actual output AudioInfo: this is usually the same as audioInfo() unless we use a transforming stream
Reimplemented in PureDataStream, PWMAudioOutput, ChannelFormatConverterStreamT< T >, ChannelFormatConverterStream, NumberFormatConverterStreamT< TFrom, TTo >, NumberFormatConverterStream, FormatConverterStream, Pipeline, ResampleStream, and ResampleStreamT< TInterpolator >.
Reimplemented from Print.
Reimplemented in AudioESP32ULP, RTSPOutput< Platform >, EncodedAudioOutput, AudioServerEx, AudioSyncWriter, AdapterAudioStreamToAudioOutput, CsvOutput< T >, MemoryOutput, and PWMAudioOutput.
Reimplemented in OSCContainerEncoder::OSCOutput, AudioOutputWithCallback, AudioServerEx, RTSPOutput< Platform >, PitchShiftOutput< T, BufferT >, PWMAudioOutput, RTTTLOutput< T >, EncodedAudioOutput, OggContainerOutput, FFTEffect, FFTPitchShift, R2ROutput, AdapterAudioStreamToAudioOutput, ChannelsSelectOutput, MetaDataFilter, CsvOutput< T >, HexDumpOutput, MemoryOutput, and RTTTLOutput< T >.
Reimplemented in AudioESP32ULP, RTSPOutput< Platform >, RTTTLOutput< T >, ChannelsSelectOutput, CsvOutput< T >, and EncodedAudioOutput.
Deletes all change notify subscriptions.
Reimplemented in RTSPClient< TcpClient, UdpSocket >.
|
inlinestatic |
Reimplemented from Print.
Reimplemented in MultiOutput, and HexDumpOutput.
If true we need to release the related memory in the destructor.
Reimplemented in AdapterPrintToAudioOutput, and AdapterAudioStreamToAudioOutput.
Reimplemented in RTSPOutput< Platform >, EncodedAudioOutput, and AdapterAudioStreamToAudioOutput.
|
inlinevirtualinherited |
Removes a target in order not to be notified about audio changes.
Reimplemented in RTSPClient< TcpClient, UdpSocket >.
Defines the input AudioInfo.
Implements AudioInfoSupport.
Reimplemented in ChannelsSelectOutput, AdapterPrintToAudioOutput, MultiOutput, AdapterAudioStreamToAudioOutput, CsvOutput< T >, PWMAudioOutput, and EncodedAudioOutput.
Deactivate/Reactivate automatic AudioInfo updates: (default is active)
|
inline |
|
inline |
Implements AudioOutput.
Reimplemented in MetaDataOutput, and MultiOutput.
Writes n 0 values (= silence)
| len |
Buffer for incoming PCM samples.
|
protected |
Current position in frame buffer.
True if currently recording a template.
Length of the longest template.
|
protected |
Recent frames for comparison.
List of wake word templates.
|
protectedinherited |
|
protectedinherited |
|
protectedinherited |
|
protected |
|
protectedinherited |
|
protectedinherited |