Good idea! But you didn't talk about what actually happened. How many iterations? How long did it take? What are the exit criteria? Would have liked to see the final output. Also, what if the Web page is longer than the screen? Will a screenshot capture a long page?
Thank you for sharing this experiment! I have an idea about using claude models with swarm. OperRouter service wraps different providers (including anthropic) and gives us API casted to the OpenAI API (with custom endpoint). I hope, it works with images too.
I checked the documentation for model claude-3.5-sonnet:beta on OpenRouter (sorry, I can't give the link, because youtube hides comments with links) and find in the API tab example with image. So, it should work.
Nice content , I want to ask what is happening to coding now since current ai tools able to code by their own, so as an software developer we have to provide pseudo code to these agents or something else ?
Good idea! But you didn't talk about what actually happened. How many iterations? How long did it take? What are the exit criteria? Would have liked to see the final output. Also, what if the Web page is longer than the screen? Will a screenshot capture a long page?
Thank you for sharing this experiment! I have an idea about using claude models with swarm. OperRouter service wraps different providers (including anthropic) and gives us API casted to the OpenAI API (with custom endpoint). I hope, it works with images too.
I checked the documentation for model claude-3.5-sonnet:beta on OpenRouter (sorry, I can't give the link, because youtube hides comments with links) and find in the API tab example with image. So, it should work.
Oh, thats very nice, thanks a lot for the input, I definitely gonna try that 👍🙏🏻
Nice content , I want to ask what is happening to coding now since current ai tools able to code by their own, so as an software developer we have to provide pseudo code to these agents or something else ?