Tiny LLM inference for ESP32 microcontrollers with INT8/INT4 quantization, multi-chip federation, RuVector semantic memory, and SNN-gated energy optimization
An adapter for various NLU web services like Aws Lex, Google Dialogflow etc.