Deepseek AI's R1 vs.

Поділитися
Вставка
  • Опубліковано 9 лют 2025
  • How does R1 stack up against O1? Not just in terms of evals, but in everyday usage?
    With all the hype around R1, I couldn’t resist testing it out. And to make things fair, I brought in Anthropic's Claude to act as a judge for the outputs (yes, LLM as a judge-how meta is that?).
    💡 Check out the video recording where I gave both R1 and O1 the same prompt and had Claude evaluate the results.
    Initial Impressions:
    ✅ R1’s unique feature of showcasing its thought process is super interesting and adds a new layer of transparency to how it works.
    ✅ O1 delivers slightly better outputs in terms of detail and clarity, but…
    ✅ R1 at its price point is an incredible value-no complaints here. (For Consumers its free to use @ chat.deepseek....)
    The Prompt I Used:
    "You are tasked to enter the Indian OTT market. How would you go about it? What aspects would you consider? Come up with a detailed strategy and steps for execution."
    Both LLMs approached the task differently, which made the comparison even more exciting.
    ✨ What are your thoughts? Have you tested R1 or O1 yet? What’s your go-to LLM and why? Let’s discuss in the comments!

КОМЕНТАРІ •