Testing Frontier LLMs (GPT4) on ARC-AGI

Поділитися
Вставка
  • Опубліковано 27 лип 2024
  • Template: www.kaggle.com/code/gregkamra...
    arcprize.org/leaderboard
    arcprize.org/arc-agi-pub
    ARC Prize is a $1,000,000+ public competition to beat and open source a solution to the ARC-AGI benchmark.
    Hosted by Mike Knoop (Co-founder, Zapier) and François Chollet (Creator of ARC-AGI, Keras).
    --
    Website: arcprize.org/
    Twitter/X: / arcprize
    Newsletter: Signup @ arcprize.org/
    Discord: / discord
    Try your first ARC-AGI tasks: arcprize.org/play

КОМЕНТАРІ • 16

  • @jackq2331
    @jackq2331 22 дні тому

    Excellent.

  • @MarkoTManninen
    @MarkoTManninen Місяць тому +1

    I understand retries, but I am confuced with the two attempts. Do you always need to provide two? In which case they would have different data and both would be required for 100% correct prediction? I also missed the part in which the prediction and correct answers are matched and prounounced.

    • @ARCprize
      @ARCprize  29 днів тому +3

      Sorry this isn't more clear on the video!
      You get two tried at each task. Old competitions had 3 tries. So you can basically give two attempts. If either are correct you pass the task.
      Under scoring methodology there is more information: arcprize.org/guide#submissions

  • @LimeTubeH
    @LimeTubeH 29 днів тому

    I'm confused...what are we supposed to attach with our API add-on secret?

    • @ARCprize
      @ARCprize  28 днів тому

      What do you mean attach? That’s where you put your API key and then reference it in your code

  • @conformist
    @conformist Місяць тому +6

    first.

    • @cyb3rvoid
      @cyb3rvoid Місяць тому +2

      That was unreal!

    • @conformist
      @conformist Місяць тому +2

      @@cyb3rvoid for my next magic trick, i will solve the agi price first

    • @wwkk4964
      @wwkk4964 Місяць тому +4

      ​@@conformistsolve it backwards!

    • @filipgara3444
      @filipgara3444 Місяць тому +2

      Ensure diversity in your model

  • @johnkintner
    @johnkintner 23 дні тому

    third since no one called it :kappa:

  • @aluphshahim5808
    @aluphshahim5808 Місяць тому

    Second 😂

  • @sp3ct3rgaming46
    @sp3ct3rgaming46 19 днів тому

    i might be tripping but i think this dude cloned his own voice and then layered it into the video. you can hear the typical elevenlabs lisp

    • @ARCprize
      @ARCprize  19 днів тому +1

      @@sp3ct3rgaming46 you’re tripping. I did the video and no voice dub used