Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This is why other "AI browsers" that parse and simplify the DOM, then invoke a tool-calling LLM over text are at EOL.

Once Chrome integrates Gemini Live amd treats your browser as a video input stream, it's pixels all the way. No lag, no incorrect clicks on hidden elements.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: