v0.8

Docker Runtime & Voice Mode

January 10, 2025

Major architecture overhaul - fully Dockerized deployment with one-click setup. Local voice mode with Whisper speech-to-text and browser text-to-speech. SearXNG search engine for private, unlimited web search.

✨ New Features

Docker runtime - fully containerized deployment, no more Python/conda setup required
Local speech-to-text - Whisper models (base, small, large, turbo) run in Docker container
Browser text-to-speech - native browser API for low-latency voice output
SearXNG search engine - local, private, unlimited web search aggregating Google, Brave, DuckDuckGo
Knowledge tool - unified search combining web results, memory, and future RAG sources
File browser - integrated UI for browsing Docker container files
Attachments support - upload and work with files in the UI

⚡ Improvements

Revamped UI with settings panel and modern design
Context window management system for efficient token usage
Unified environment - agent knows Docker filesystem and capabilities
Pre-installed software in Docker image - complex tools ready out-of-the-box
Voice configuration - silence threshold, duration, language selection
Privacy-focused - all speech processing local, no data sent to OpenAI
Stop speech button - interrupt long agent responses anytime
Docker Desktop support - easy one-click deployment with UI
Port mapping and volume mounting for persistent storage

Use Cases arrow_forward

Token arrow_forward

Architecture arrow_forward

Docs arrow_forward

Platform arrow_forward

Docker Runtime & Voice Mode

✨ New Features

⚡ Improvements

Use Cases arrow_forward expand_more

Token arrow_forward

Architecture arrow_forward

Docs arrow_forward expand_more

Platform arrow_forward expand_more

About

My

Proposals

Signatures

Docker Runtime & Voice Mode

✨ New Features

⚡ Improvements

Use Cases arrow_forward

Docs arrow_forward

Platform arrow_forward