Coding With OpenAI-o1 in Cursor - Can We Replace Claude 3.5 Now?

Поділитися
Вставка
  • Опубліковано 19 гру 2024

КОМЕНТАРІ • 43

  • @eposnix5223
    @eposnix5223 9 годин тому +15

    Using o1 for setting up a project and Claude for doing the small edits is the best of both worlds. I love how cleanly formatted o1's code is!

    • @19LloydG
      @19LloydG 6 годин тому +2

      Indeed! The 01 model is definitely superior for coding, so far it has been flawless and surpassed the preview, as expected. The latency is a small price to pay. However, I am keen to see how Anthropic respond 🙂

    • @Kiki-qh7xk
      @Kiki-qh7xk 4 години тому

      @@19LloydG in my experience, claude is still better.

    • @suicidequad
      @suicidequad 2 години тому

      i still get better results with claude. however i agree on the clean formatting.
      a really underrated practice is to generate examples with whatever model is best at that and feed it as context to your workhorse model.
      having a dataset of good examples works wonders for any llm application.

  • @fabobg
    @fabobg 12 годин тому +27

    So, how much did that o1 usage cost you?

    • @multi_variate
      @multi_variate 11 годин тому +6

      Cursor o1 charges is 40 cents per request.

    • @0xunknown336
      @0xunknown336 11 годин тому +1

      that's the right question.

    • @ArcanoIncantatore
      @ArcanoIncantatore 9 годин тому

      You have 10 01-mini requests/day with Pro subscription, dunno about 01

    • @onlybryanliu
      @onlybryanliu 4 години тому

      @@ArcanoIncantatoreisn’t it unlimited?

    • @samuellembke4565
      @samuellembke4565 3 години тому

      @@multi_variateif true that is actually insanely expensive hahahaha

  • @earthAP
    @earthAP 12 годин тому +10

    what was the API cost for these chats, or its it included in Cursor subscription?

  • @davefellows
    @davefellows 50 хвилин тому

    Looks very impressive. Would be interesting to see a direct comparison with claude with the same prompts

  • @satrianaliqa
    @satrianaliqa 10 годин тому +2

    OHH WOW THE AUDIO TRACK TOO??? 🎉🎉😮

  • @StefanoKocka
    @StefanoKocka 7 годин тому +1

    10:45 You opened the HTML file directly from the drive instead accessing the node server. IMHO the Oneshot worked.

  • @fabiankliebhan
    @fabiankliebhan 2 години тому

    BTW you can also use gemini-2.0-flash-exp and gemini-2.0-flash-thinking-exp-1219 in Cursor (you need to add a Google API Key and add them manually to models)
    In my first tests these models perform very very good :)

  • @mostafaelbarrak6468
    @mostafaelbarrak6468 3 години тому

    I think you should compare it with Claude for the same tasks. Giving Claude the same tasks would allow you to compare both models in terms of response quality, speed, cost, and overall performance.
    However, I don't think the two tasks you provided are very representative, as the documentation appears to be well-structured and comprehensive. These tasks seem to mainly involve reading the documentation and combining the provided examples.
    It might also be valuable to test the same tasks on GPT-4 and even less advanced models, to better evaluate the actual level of difficulty.
    And what would be even more interesting is to assign O1 and Claude more complex real-world tasks, such as working within an existing codebase to add a new feature or solve a specific problem. Most models are quite good at generating standalone files like HTML, JS, or CSS, but they tend to struggle when dealing with an existing codebase.

  • @davefellows
    @davefellows 51 хвилина тому

    You should use Composer rather than Chat. There's also an agent option there which may've worked better for one-shot.
    Also, you can add docs in Cursor itself under Features in the settings.

    • @witchcraft8118
      @witchcraft8118 37 хвилин тому

      can only use gpt4 or sonnet 3.5 in Composer

    • @davefellows
      @davefellows 28 хвилин тому

      @ ah! Good to know, thanks. Wonder if that will change in the future.

  • @alx8439
    @alx8439 Годину тому

    Do you always have to scroll through the chat to find different code snippets and apply the individually one by one? Doesn't Cursor provide any better experience for dealing with this?

    • @ontheruntonowhere
      @ontheruntonowhere Годину тому

      Instead of Chat, he could have used Composer, which will edit multiple files per prompt. All you have to is approve or reject the changes. Sometimes Composer updates files when you're not expecting it, and I often find myself telling it 'don't change anything' when iterating a strategy for building a feature. I probably could get the same result by switching to Chat, but Composer feels smarter, dunno if that's true. Anyone have thoughts on that?

  • @fabiankliebhan
    @fabiankliebhan 10 годин тому

    You really should use the composer with the agent option enabled in Cursor.
    Much better for tasks like the one you did in the video.

  • @kritikusi-666
    @kritikusi-666 6 годин тому +2

    isn't o1 like super expensive?

  • @yurijmikhassiak7342
    @yurijmikhassiak7342 11 годин тому +1

    It's following cursor rules much better than any other model. But cost is high. It's easy 10$/ day

  • @FrozzenFreak
    @FrozzenFreak 8 годин тому +1

    Do you know that you can just paste the URL of the website in the cursor chat? No need to copy the entire content over.

  • @lilan1
    @lilan1 3 години тому

    thanks. was this costly?

  • @_MrGameplay_
    @_MrGameplay_ 7 годин тому

    Does anyone know whether o1 pro mode also will be available through the API and if so how much Cursor might charge for it?

  • @micbab-vg2mu
    @micbab-vg2mu 13 годин тому +3

    Kris I do not have access as well - Maybe because we are from EU.

    • @superlama6452
      @superlama6452 12 годин тому

      i have the o1 model available in the models options. i am from germany

    • @multi_variate
      @multi_variate 11 годин тому

      @@superlama6452 Woke regulators are going to keep you away from innovation. Bring the change.

    • @ontheruntonowhere
      @ontheruntonowhere Годину тому

      @@multi_variate Define woke?

  • @UselessDuckCompany
    @UselessDuckCompany 8 годин тому

    Is cursor much better than just VS code with roo-cline

    • @UselessDuckCompany
      @UselessDuckCompany 8 годин тому

      For example with roo-cline you can connect an Obsidian MCP so it can search and take in knowledge during its work

  • @CLSGL
    @CLSGL 3 години тому

    You know I really liked the idea of Claude, and I even coded with it for a time. But holy shit, their business model is annoying as hell within the actual Claude app. Even as a paying customer I had a limit per chat. Meaning I’d get really far into a project, get a cutoff mid-project, and have to start a new conversation with zero context.
    I use chats as projects. Funnel my entire limit into one chat for Pete’s sake.

  • @m4RIK
    @m4RIK 4 години тому

    thats impressive

  • @SimonNgai-d3u
    @SimonNgai-d3u 9 годин тому +1

    I think it would be better to use sonnet on a regular basis and solve the harder problems with o1 otherwise your waller is gonna be broken 😂😂

  • @marcusfdm6111
    @marcusfdm6111 11 годин тому

    Amazing

  • @Sofiane.ti2.0
    @Sofiane.ti2.0 3 години тому

  • @ThomasConover
    @ThomasConover 5 годин тому

    You’re all doing AI coding completely wrong! 😂😂😂😂

  • @hammeedabdo.82
    @hammeedabdo.82 12 годин тому

    Please, add arabic to audio track.

  • @applego6290
    @applego6290 8 годин тому

    All ai are useless just for boilerplate
    If one that train with an open source repo then u will get these type of ai
    dev u will get what I'm saying if ur not copy-paste

    • @okaydetar821
      @okaydetar821 3 години тому

      Just for boiler plate doesn't sound useless, boilerplates are very useful.