Button Press, ESP32 starts recording via the INMP441 microphone. I2S Audio Capture, 16 kHz samples are streamed in real-time over WebSocket. AI Processing (Server) Whisper converts audio → text Gemini ...