Skip to content
forked from nat/natbot

Globot is an agent that controls your browser using playwright and GPT-4V.

License

Notifications You must be signed in to change notification settings

Globe-Engineer/globot

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

30 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

globot

With GPT-4V, we can finally complete the original vision of natbot.

Help solve general agents by contributing to this repo!

Ideas for Improvement

  • Scrolling (easy to add, but likely to cause divergence)
  • Better context management (learning from mistakes, more descriptive history)
  • Masking the image with node IDs
  • Better DOM parsing (please submit issues/PRs!)
  • More explicit planning
  • Data collection and fine-tuning

NOTE: Remember to use the latest release of the openai API for the vision model:

pip install --upgrade openai

made by Ivan Yevenko

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%