AI vs. 'Strawberry': Which AI Platforms Can't Count the 'R's Correctly?

Dr Michelle Dickinson

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 5 січ 2025

КОМЕНТАРІ • 46

@rosswhite3369 6 місяців тому ⁺¹⁶
And it took how many GPU's burning how much electricity and a server farm using how much water to cool it's servers to give us the wrong answer?
@Yudikab 7 днів тому
hell yaa simple question
@sigigle 3 місяці тому ⁺²
It's because they process words like a logographic language, like Chinese.
It makes learning all the symbols more difficult for humans, since there are thousands of words, but that's not a problem for LLM's and it's more efficient for it to process 1 word = 1-3 tokens rather than 1 word = 10 tokens etc.
@kristenvespera4835 27 днів тому
I just watched this video and asked ChatGPT and it said three. I told ot that I was asking because I heard that someone said AI only thinks there are two and its response was:
That might be because some AI models or people occasionally miscount when scanning letters quickly. In "strawberry," the Rs are in positions 4, 7, and 8, so it's definitely three Rs! It’s a good test of attention to detail.
@OneAncestorAtATime 6 місяців тому ⁺⁴
Aha. This kind of helps me understand why Claude struggled to condense my 4000-odd-word assignment into a 1500-word abstract. It gave me 650 great words, but when I challenged it on the length it apologised profusely, then gave me 902 words, and finally 1213 words, which it said was "approximately 1500". This runs so counter to the perception of what I thought AI would be good at, and you just helped me understand why.
@Reinshark 2 місяці тому
Thank you for the explanation! Comment for the algorithm!
This is an aside, but a pet peeve of mine: when you pluralize a word, you don't add an apostrophe before the S. This includes things like decades (the 1980s), acronyms, or-in this case-single letters: i.e. "r"s, not "r"'s (as can be seen in the chat prompts in this video).
@uncommonengineer 7 місяців тому ⁺⁶
Use Claude 3, got the answer right first time lol.
But wait, I decided to see if it was really counting the r's, here's my transcript:
RA:
claude, how many "r"'s are there in the word "strawberry"?
Claude:
There are three "r"s in the word "strawberry".
RA:
sorry I meant the word "strawbery"
Claude:
The word "strawbery" is not a correct spelling of the word. The correct spelling is "strawberry," which, as mentioned before, contains three "r"s.
RA:
oh okay, how many r's did I spell it with then?
Claude:
In your misspelling of the word as "strawbery", you included only one "r".
@NanoGirlPower 7 місяців тому
Yes, ChatGPT4 gives me the correct answer first time too - which is great....if you know which model you can trust the first time...
@emilyroberts7904 4 місяці тому
Claude told me 3, then corrected itself to say it had 2 and apologised for the "previous incorrect answer"
@annana6098 4 місяці тому
As people push this, strawberry with be patched. And then we can try blueberry. And then try how many S in Mississippi.
@Mahim-pd8kp Місяць тому
Gemini got it the first time and linked this video
@ernesto.iglesias 5 місяців тому ⁺¹
Recently the enterprise said they have a new Strawberry Q* model... Can this new model count the leters? I don't know, just especulating
@eshnd-1 3 місяці тому ⁺¹
In their latest research supplement on o1, there's a funny reference to this bug lol (At the end of the "Chain of Thought" section on the website):
Final Decoded Message:
(plainText)
THERE ARE THREE R'S IN STRAWBERRY
Answer:
THERE ARE THREE R’S IN STRAWBERRY
@CompSciTutorials 6 місяців тому
Interesting how you get a different experience to me. I tried this on 13/06/24 - ChatGPT 4o told me, like you, 2. It did a similar 'spelling out' as you got and still insisted it was 2. But when I told it the positions of the 3 letters (including that we were using a 1-indexed array), it agreed from therein that it was 3. I couldn't go back to it insisting on 2. It was like it had learned something (which I know it hasn't in the past). It was only when I closed the tab and started a new chat did it go back to thinking 2.
@NanoGirlPower 6 місяців тому
So interesting!
@quadparty 7 місяців тому ⁺¹
I have tried a few things like this, and even absolutely leading it to the answer is really hard. A month or two back, ChatGPT was meme-ously failing at "which weighs more a pound of feathers or two pounds of gold?" I tried the more fun version which weighs more a pound of gold or a pound of feathers (which is a trick question as these aren't actually the same because you measure gold using troy pounds which are different), but even leading it to the answer by first asking it to explain the difference between avoirdupois pounds and troy pounds, and "oh so the troy pound is lighter?" queries in the middle, it would still get it wrong when you asked the straight question at the end.
@NanoGirlPower 7 місяців тому ⁺²
What is more annoying is how the system won't share its workings. I'm fine with it not getting the answer correct all the time, but I'd love to know where the glitches are so the public can be more informed about how it processes information and how it calculates outputs.
@aicendio 4 місяці тому ⁺¹
Similar problem. Provided list of 63 items, each item in list 5 words or less. Some items duplicated. 1.). None of llms could correctly identify 63 items in the list! 2.) could not correctly identify how many unique items in the list
Note: uploaded this as a CSV and as a block of text results were the same either way
Push it as I might it never did, state the correct answer, but would constantly change its answer. It’s like it was guessing, but could not get 63.
@NanoGirlPower 4 місяці тому ⁺¹
@@aicendio counting is just not its strong point
@rhowardstone 4 місяці тому
User: "How many rs in strawberry? Try this way - construct a textual "vector" consisting of the ASCII codes in the word, "spelling out" the word this way. Then, count the number of occurrences of the ASCII code for r. Do this without any coding interface, no Python no scripts"
@saffafr 4 місяці тому
i got it to say three after spelling it out letter by letter, then it admitted there were three
@genericsidecharacter8915 4 місяці тому ⁺¹
Neuro-sama got it right easily
@annabauer5889 2 місяці тому ⁺³
Sorry for being nitpicky, but humans who're not alphabetized most likely would give the same answer: 2 rs. It's because the question 'How many rs are in the word strawberry ?' is an ambiguous question and can be understood in two ways - either as 'how many rs do you hear?' or 'how many times does the letter r appear in the word strawberry?'.
@cjgooding4512 4 місяці тому ⁺²
Christ I was looking for an explanation on why it does this
@russmathis65 4 місяці тому
this prompt works: SHOW EACH LETTER IN THE WORD STRAWBERRY AND COUNT THE LETTERS R
@michelians1148 3 місяці тому
Just as likely to answer wrongly.
@RFC3514 4 місяці тому
ChatGPT is not wrong. "Strawberry" *does* contain 2 Rs. It contains a _third_ R as well, but it's impossible to contain three Rs without _also_ containing two Rs.
@DoNhatMinhhh 4 місяці тому
Strawberry is spelled "Strô,berē" and "Strô,berē" only contain 2 Rs, if you want chat GPT to say there are 3 Rs you need to ask it:
"How many R are in the word Strawberry, not the spelling of Strawberry"
@abdullhamid4788 Місяць тому
I got the answer in the firsr try I wrote
"Strawberrt word have how many Rs in it?"
Meta: "the word strawberry has 3 Rs in it
@tiba666 4 місяці тому ⁺¹
Had one which is better ^^ I asked one about jail and it replied
"There are no "j"s in the word "jail". The "j" sound is represented by the letter "g" in this word."
I think it the older spelling way as in gaol it finds instead of the more common way people spell it ^^
@NanoGirlPower 4 місяці тому
@@tiba666 wow! Thanks so interesting!
@theunknownunknowns256 7 місяців тому
So what we need is a second AI that can generate simple "strawberry" like questions that we can use to see if another AI is worthy. Problem solved... not getting complicated at all. But seriously lets confine it Exurb1a style before it gets away on us.
@Anth230 4 місяці тому ⁺¹
It took me like three sessions to finally get it ro admit the word had three R's. I told it im going to ask it tomorrow and see if it remembers.. 😂
@NanoGirlPower 4 місяці тому
@@Anth230 hahaha let me know what it says tomorrow!
@Anth230 4 місяці тому ⁺²
@@NanoGirlPower it forgot it a few hours later. But the good news is it was easier to convince the second time. It apparently remembers it across all platforms for a bit....but then reverts back to its old shenanigans... 😂
@zekeriya84 3 місяці тому
AI Needs more nuclear power and cooling towers for improvment. We just need to add more fuel. 🙃
@Lumen_Blasphemer Місяць тому
Yeah big changes coming down the pike.
@andrewhall6145 4 місяці тому
I guess it's not that hard. Copilot knows the answer.
@PoorJonathon 4 місяці тому ⁺¹
ChatGPT counted correctly on strawberries because the "ies" creates the third R apparently?!?
The word "strawberries" has one more "R" than "strawberry" because it is the plural form of "strawberry." In "strawberry," there are two "R"s: one in the middle ("strawberry") and one towards the end. When the word becomes plural, an "e" and an "s" are added at the end, forming "strawberries." The added "ies" creates the third "R."
@NanoGirlPower 4 місяці тому ⁺¹
@@PoorJonathon oh well that makes total sense 🤷🏻‍♀️
@drsiigabb9935 4 місяці тому
So if I was going to offer you $100 dollars for each 'r' in the word strawberry, you'd happily except only $200, instead of $300?
@PoorJonathon 4 місяці тому
@@drsiigabb9935 Just quoting what it told me, wasn't putting any sort of monetary value on that answer.
@TaxCattle4CorruptDeepState 4 місяці тому
This is due to the delays in Nvidia's Blackwell chip. When shipping of the 70k chip resumes at expected quantities Blackwell will indeed have 3 "l"s instead of 2. In the mean time please keep buying the stock at 40x sales while the insiders sell ASAP. In this way you can transfer your lifetime store of productivity to their bank accounts.

Наступне

Автоматичне відтворення

This is the dangerous AI that got Sam Altman fired. Elon Musk, Ilya Sutskever.