v0.8.1
Browser Agent & UX
January 19, 2025
Browser agent powered by browser-use framework enables autonomous web browsing. Real-time screenshots, interactive element highlighting, and major file handling improvements.
✨ New Features
- Browser agent - autonomous web browsing using browser-use framework as subordinate agent
- Real-time screenshot updates - watch browser agent work with periodic progress reports
- Path recognition UI - clickable Linux paths for easy file downloads and navigation
- Integrated file browser - open folders from any path segment
⚡ Improvements
- Browser-use framework integration - 100% LangChain compatible, easy plug-and-play
- Optional vision mode - bounding boxes highlight interactive elements on web pages
- File attachment handling - upload, compress, merge PDFs, combine images
- Clickable file paths - download files directly from terminal output
- File conversion capabilities - convert, compress, and sort files without external tools
- Privacy-focused file processing - code-based operations keep data local