v0.8
Docker Runtime & Voice Mode
January 10, 2025
Major architecture overhaul - fully Dockerized deployment with one-click setup. Local voice mode with Whisper speech-to-text and browser text-to-speech. SearXNG search engine for private, unlimited web search.
✨ New Features
- Docker runtime - fully containerized deployment, no more Python/conda setup required
- Local speech-to-text - Whisper models (base, small, large, turbo) run in Docker container
- Browser text-to-speech - native browser API for low-latency voice output
- SearXNG search engine - local, private, unlimited web search aggregating Google, Brave, DuckDuckGo
- Knowledge tool - unified search combining web results, memory, and future RAG sources
- File browser - integrated UI for browsing Docker container files
- Attachments support - upload and work with files in the UI
⚡ Improvements
- Revamped UI with settings panel and modern design
- Context window management system for efficient token usage
- Unified environment - agent knows Docker filesystem and capabilities
- Pre-installed software in Docker image - complex tools ready out-of-the-box
- Voice configuration - silence threshold, duration, language selection
- Privacy-focused - all speech processing local, no data sent to OpenAI
- Stop speech button - interrupt long agent responses anytime
- Docker Desktop support - easy one-click deployment with UI
- Port mapping and volume mounting for persistent storage