v0.8.1

Browser Agent & UX

January 19, 2025

Browser agent powered by browser-use framework enables autonomous web browsing. Real-time screenshots, interactive element highlighting, and major file handling improvements.

✨ New Features

  • Browser agent - autonomous web browsing using browser-use framework as subordinate agent
  • Real-time screenshot updates - watch browser agent work with periodic progress reports
  • Path recognition UI - clickable Linux paths for easy file downloads and navigation
  • Integrated file browser - open folders from any path segment

⚡ Improvements

  • Browser-use framework integration - 100% LangChain compatible, easy plug-and-play
  • Optional vision mode - bounding boxes highlight interactive elements on web pages
  • File attachment handling - upload, compress, merge PDFs, combine images
  • Clickable file paths - download files directly from terminal output
  • File conversion capabilities - convert, compress, and sort files without external tools
  • Privacy-focused file processing - code-based operations keep data local