An honest review of Devin AI

zack

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 16 гру 2024

КОМЕНТАРІ • 60

@danecjensen 6 місяців тому ⁺²
Chapters (Powered by ChapterMe) -
00:00 - Devon AI agent that claims to be worlds first fully autonomous
02:38 - Devons app turns world into museum
03:30 - Devon App builder with questions, planning, updates
05:20 - Android app asks users for help
06:08 - Deployment time 134
08:29 - Twohour CSS change to add features
08:51 - Devon walkthrough reveals power of commands
09:25 - Devon Excellent prototyping agent, impeccable UX
12:49 - AI software engineer Devons slow performance
14:59 - Devons powerful features, sign up now
15:22 - Lifted access for players
@indigo1417 4 місяці тому ⁺⁷
I just received my invite to Devin. The cheapest plan offered "Personal (Devin Lite) users receive early access to Devin Lite for $50 / month, which includes 65 Devin Lite ACU / month built in. Additional Devin Lite ACUs can be purchased at our standard unit rate of $0.8 / ACU. Currently, each ACU is approximately equivalent to 10 minutes of active Devin Lite work." I'm having difficulty finding more information, but it seems to me, for $50 I get 650 minutes of computing. Looking at the lengths of time reported by Zack this seems like a very poor offer.
@yoshua9676 11 днів тому ⁺²
This sounds like a scam. You can hire engineers in Africa and India for $3 an hour. The founder Scott's previous app Lunchclub pretended it used AI matchmaking when in fact it didn't. I know because I'm friends with a lot of his friends.
@o__sama 6 місяців тому ⁺⁴²
Are you sure it is not like the Amazon AI, a bunch of real people behind the scenes hahaha, it seems too slow for an AI
@noway8233 6 місяців тому ⁺²
Good point , Amazong ciborgs 😅😅
@GrowAndScaleSOLUTION 5 місяців тому ⁺²
if it's a fraud, no venture capitalist would invest in it. Thus it's just a matter of time till it crash. And if it's the case, in the future, other people would try to get this idea off the ground because it's not impossible to do. Coding is less complex than human emotions so this is not impossible to do
@vishnu2407 4 місяці тому
@@GrowAndScaleSOLUTIONhave you not seen the Elizabeth Holmes case? VCs invest in fraud and scams all the time
@rjackstheartofwealth6152 4 місяці тому
@@GrowAndScaleSOLUTION human emotions are not complex, they are insanely easy to manipulate
@GrowAndScaleSOLUTION 4 місяці тому
@@rjackstheartofwealth6152 not all people are easy to manipulate. Try doing sales and you will understand. Try both inbound and outbound sales
@ryanlisse 6 місяців тому ⁺²
Awesome to see you're back, love the overview
@arvinddhindsa2547 5 місяців тому ⁺²
I would love to see a video of Devin working in an existing project.
@EddyLeeKhane 4 дні тому ⁺¹
hasnt aged well?
so far nobody could really show it doing anything really
@youssefwalid2655 6 місяців тому ⁺⁵
I'd love to see it contribute to an existing code base maybe try it on projects of different sizes/complexities
@oo--7714 5 місяців тому ⁺²
Thank you, hojestly wild how this is the only review on devin, the rest are just predicitions lol.
@nustaniel 3 місяці тому ⁺⁵
Did you review the code to look for flaws in its functions logic? Did you extensively test the results it produced to make sure it was not bugged? I have yet to see a LLM produce code that isn't flawed. Either by being it being stupidly overengineered, really poorly structured in terms of performance hits (like for loops meant for huge lists with DOM calls within things like dragover, not using any cached data), or simply not covering all use cases. I also see LLMs typically get stuck on a "solution" they think is correct, even if you tell them to start from scratch, and your only option is to initiate a new instance of chatting with them to make it change their approach. LLMs also seem to lack the capabilities to grap simple logic that is observable to us humans, like try to ask it to figure out the next number in a sequence of numbers and sometimes it will get it right, but once you start getting more complicated, it will go completely off track and not understand the basic observable logic. This is obviously why ChatGPT and so on produce such flawed code usually. How did Devin perform in terms of actual good code? With good I mean stable, bug free and performance-focused. I couldn't care less about readable code if AI is writing and able to parse it. Speaking of, can it parse complex code and understand how it works? LLMs usually in my experience can manage to grasp the overall use for the code it is asked to analyze, but won't understand (again) some of the logical reasons for certain parts of the code. Unless Devin can produce good code, I see it as no better than any other LLM option.
@TenseiCho 6 місяців тому ⁺¹
fingers crossed that I get in and try it out.
@lilian-u5d 4 місяці тому
btw,i am still have no chance to use it,could you tell me how do you get this access to use?
@Danefrak 6 місяців тому ⁺¹
Great overview. I really appreciated this video.
@Qi5_ 5 місяців тому
How install devon on win
@sauravbhagat4150 5 місяців тому
Can it Code games ?
@akaalkripal5724 4 місяці тому
Have you tried Devika?
@raunakchhatwal5350 6 місяців тому ⁺¹
Did this use gpt-4o?
@kubakakauko 6 місяців тому ⁺²³
Devin uses Indian developers in the backend to confirm the output. This is why it took like 3hours
@wenquai 6 місяців тому
not sure what model they're using! they dont seem to disclose it anywhere
@raunakchhatwal5350 6 місяців тому ⁺²
@@wenquai Can you ask Devin? The API Gpt-4o answers that it is gpt-4-turbo while the previous versions don't know their model version.
@slavaprotv 6 місяців тому ⁺¹
I'm the only one who doesn't have access?
@wenquai 5 місяців тому
it looks like they're slowly starting to let more people in via the waitlist
@МаксБоровой-ф6о 6 місяців тому
Hi, Zack! How quickly did you get off the waiting list?
@wenquai 5 місяців тому
about 4 months
@wonderfulworldofmarkets9033 6 місяців тому ⁺⁴
So maybe it felt different using it, but it looked pretty horrible. I think most devs with Co-Pilot could do this infinitely faster. Not to mention remember the solution and re-impelement similar projects very easy in the future.
The entire promise of Devin was a done for you software engineer. This looked horribly ineffficient. And if it needs this much input from a dev... Then why can't the dev just use co-pilot to implement it himself?
@obsoleteProblems 6 місяців тому ⁺¹
Sure, but non technical managers and execs would view this as a one less team to pay salaries and benefits on. Sure, only awful managers and execs would think this way, but let's be real, that's the majority of managers and execs.
@spaceowl5957 6 місяців тому
The big advantage is it mostly works by itself and you don't have to pay it a salary.
Alsooo, real devs also can take a long time to do seemingly simple things, especially when it involves weird bugs, strange css, or they have to figure out the design mostly by themselves (which was the case in this video from what I could tell)
@wonderfulworldofmarkets9033 6 місяців тому
@@spaceowl5957 lmao you do have to pay it a salary. Running models aren't free bro. At least as of now, its causing a lot more issues than its solving.
@spaceowl5957 6 місяців тому ⁺³
@@wonderfulworldofmarkets9033 I mean I don't know the numbers but I think this will be magnitudes cheaper per hour of work compared to a human
@wonderfulworldofmarkets9033 5 місяців тому
@@spaceowl5957Maybe you should learn the numbers. To run this bad model 24/7 for a year costs ~ $2,628,000. To execute a LangChain prompt (chains together multiple models and double checks work similar to what Devin is doing) a prompt of "What is the 23rd episode of Spongebob" just cost $4.50 on AWS Bedrock and took 1 minute and 20 seconds. Imagine how much money its costing to run this thing for 8 hours per task on a lot more complicated data. (And then get it wrong lol).
@s8x. 6 місяців тому ⁺³
Time to flip burgers now
@vectorhacker-r2 5 місяців тому ⁺¹
Nope.
@cubed.public 5 місяців тому ⁺²
There were 2 things I wanted to see about Devin:
How smart - which I guess is not amazing? I'm not sure how buggy the final product is, but it seemed to me like it ran into issues and required intervention.
In the end, the main thing I guess is that it can do stuff, but if it's not as smart as Claude or GPT, then I might as well just copy and paste from the smarter llm instead of waiting for a dumber one automatically do it for me
Basically, if you need to solve a coding problem, it does not seem like Devin is the way to do it
How big is the context window - not sure from this I guess. Current problem with llm is that it's hard to have an entire project as context, so you have to find where to fix/add something and give them the info. I doubt Devin solved this, so I kind of want to see it given/generate a big project (at least bigger than the regular llm context windows) and told to fix something and see how it handles that.
If those 2 things fail, then Devin is more or less a convenience thing - an AI that automatically runs what it generates, reads the error, and reprompts itself. I mean, these things already existed with AutoGPT and other stuff for a while now, so I'm not too invested, especially considering I can just run the llm generated code myself and give them the errors.
So basically, it seems to boil down to convenience. As you said, it could take hours and give out a terrible buggy mess, but at least you didn't spend time on it. But if you truly want an actual product, it seems using smarter llms is still the way to go.
@wenquai 5 місяців тому ⁺²
really great points. i agree - if you're stuck on a specific coding problem, it's probably a waste of time to use Devin versus trying to debug it yourself with Claude/ChatGPT/Cursor. after having access to Devin for a few weeks, I actually found myself using it less and less, which reinforces your point about convenience. it was also just hard to make a habit of opening Devin anytime I wanted to work on my projects.
@umairaliism Місяць тому ⁺¹
@@wenquai then why did you say it is legit? I think you should take down this video or make another video about this. you shouldn't be misguiding people like this.
@rediffusion7996 9 днів тому ⁺¹
@@umairaliism Indeed! Unsubscribed!
@sepiaflux123 6 місяців тому ⁺³
great video, the fan noise is slightly annoying though :-)
@wenquai 6 місяців тому ⁺¹
ty for the kind words! sorry about the fan - will fix in future vids!
@Coder.tahsin 6 місяців тому ⁺¹
Why everyone who got access to Devin never shows real time interaction with Devin? Probably because it will revile how capable this ChatGPT wrapper is........
@wenquai 5 місяців тому ⁺¹
will make a follow up vid that goes through a full run!
@fintech1378 2 місяці тому
Oh no somebody is so insecure and being self denial here
@axelvirtus2514 6 місяців тому ⁺¹
So if i wanna be developer I'm fucked,ai can do all for me
@wenquai 6 місяців тому ⁺²
Actually i think the opposite! I think developers/aspiring developers can benefit from Devin the most. Plus, observing Devin as it works is a great way to learn a new programming language or framework
@axelvirtus2514 6 місяців тому ⁺¹
@@wenquai maby it's a plus for experienced developers,no one want juniors now
@kubakakauko 6 місяців тому ⁺¹
@@axelvirtus2514 no one needs juniors, before and now with AI, companies employ juniors in hopes that they educate them and they stay in the company for a long time. So nothing changes.
@Dom-zy1qy 6 місяців тому ⁺³
@kubakakauko Tool make developer faster -> More work done with same amount of developer -> Fire excess developer to save money -> Less job for developer
@fintech1378 2 місяці тому
Its ok bro just do it for free if its your passion
@SiyaMoni-q7v 3 місяці тому
Davis Michael Martinez Margaret Thomas Barbara
@s8x. 6 місяців тому
I’m cooked
@vectorhacker-r2 5 місяців тому ⁺²
No, you're not.

Наступне

Автоматичне відтворення

Devin review: is it a better AI coding agent than Cursor?