v0.8

Docker Runtime & Voice Mode

January 10, 2025

Major architecture overhaul - fully Dockerized deployment with one-click setup. Local voice mode with Whisper speech-to-text and browser text-to-speech. SearXNG search engine for private, unlimited web search.

✨ New Features

  • Docker runtime - fully containerized deployment, no more Python/conda setup required
  • Local speech-to-text - Whisper models (base, small, large, turbo) run in Docker container
  • Browser text-to-speech - native browser API for low-latency voice output
  • SearXNG search engine - local, private, unlimited web search aggregating Google, Brave, DuckDuckGo
  • Knowledge tool - unified search combining web results, memory, and future RAG sources
  • File browser - integrated UI for browsing Docker container files
  • Attachments support - upload and work with files in the UI

⚡ Improvements

  • Revamped UI with settings panel and modern design
  • Context window management system for efficient token usage
  • Unified environment - agent knows Docker filesystem and capabilities
  • Pre-installed software in Docker image - complex tools ready out-of-the-box
  • Voice configuration - silence threshold, duration, language selection
  • Privacy-focused - all speech processing local, no data sent to OpenAI
  • Stop speech button - interrupt long agent responses anytime
  • Docker Desktop support - easy one-click deployment with UI
  • Port mapping and volume mounting for persistent storage