It's because they process words like a logographic language, like Chinese. It makes learning all the symbols more difficult for humans, since there are thousands of words, but that's not a problem for LLM's and it's more efficient for it to process 1 word = 1-3 tokens rather than 1 word = 10 tokens etc.
I just watched this video and asked ChatGPT and it said three. I told ot that I was asking because I heard that someone said AI only thinks there are two and its response was: That might be because some AI models or people occasionally miscount when scanning letters quickly. In "strawberry," the Rs are in positions 4, 7, and 8, so it's definitely three Rs! It’s a good test of attention to detail.
Aha. This kind of helps me understand why Claude struggled to condense my 4000-odd-word assignment into a 1500-word abstract. It gave me 650 great words, but when I challenged it on the length it apologised profusely, then gave me 902 words, and finally 1213 words, which it said was "approximately 1500". This runs so counter to the perception of what I thought AI would be good at, and you just helped me understand why.
Thank you for the explanation! Comment for the algorithm! This is an aside, but a pet peeve of mine: when you pluralize a word, you don't add an apostrophe before the S. This includes things like decades (the 1980s), acronyms, or-in this case-single letters: i.e. "r"s, not "r"'s (as can be seen in the chat prompts in this video).
Use Claude 3, got the answer right first time lol. But wait, I decided to see if it was really counting the r's, here's my transcript: RA: claude, how many "r"'s are there in the word "strawberry"? Claude: There are three "r"s in the word "strawberry". RA: sorry I meant the word "strawbery" Claude: The word "strawbery" is not a correct spelling of the word. The correct spelling is "strawberry," which, as mentioned before, contains three "r"s. RA: oh okay, how many r's did I spell it with then? Claude: In your misspelling of the word as "strawbery", you included only one "r".
In their latest research supplement on o1, there's a funny reference to this bug lol (At the end of the "Chain of Thought" section on the website): Final Decoded Message: (plainText) THERE ARE THREE R'S IN STRAWBERRY Answer: THERE ARE THREE R’S IN STRAWBERRY
Interesting how you get a different experience to me. I tried this on 13/06/24 - ChatGPT 4o told me, like you, 2. It did a similar 'spelling out' as you got and still insisted it was 2. But when I told it the positions of the 3 letters (including that we were using a 1-indexed array), it agreed from therein that it was 3. I couldn't go back to it insisting on 2. It was like it had learned something (which I know it hasn't in the past). It was only when I closed the tab and started a new chat did it go back to thinking 2.
I have tried a few things like this, and even absolutely leading it to the answer is really hard. A month or two back, ChatGPT was meme-ously failing at "which weighs more a pound of feathers or two pounds of gold?" I tried the more fun version which weighs more a pound of gold or a pound of feathers (which is a trick question as these aren't actually the same because you measure gold using troy pounds which are different), but even leading it to the answer by first asking it to explain the difference between avoirdupois pounds and troy pounds, and "oh so the troy pound is lighter?" queries in the middle, it would still get it wrong when you asked the straight question at the end.
What is more annoying is how the system won't share its workings. I'm fine with it not getting the answer correct all the time, but I'd love to know where the glitches are so the public can be more informed about how it processes information and how it calculates outputs.
Similar problem. Provided list of 63 items, each item in list 5 words or less. Some items duplicated. 1.). None of llms could correctly identify 63 items in the list! 2.) could not correctly identify how many unique items in the list Note: uploaded this as a CSV and as a block of text results were the same either way Push it as I might it never did, state the correct answer, but would constantly change its answer. It’s like it was guessing, but could not get 63.
User: "How many rs in strawberry? Try this way - construct a textual "vector" consisting of the ASCII codes in the word, "spelling out" the word this way. Then, count the number of occurrences of the ASCII code for r. Do this without any coding interface, no Python no scripts"
Sorry for being nitpicky, but humans who're not alphabetized most likely would give the same answer: 2 rs. It's because the question 'How many rs are in the word strawberry ?' is an ambiguous question and can be understood in two ways - either as 'how many rs do you hear?' or 'how many times does the letter r appear in the word strawberry?'.
ChatGPT is not wrong. "Strawberry" *does* contain 2 Rs. It contains a _third_ R as well, but it's impossible to contain three Rs without _also_ containing two Rs.
Strawberry is spelled "Strô,berē" and "Strô,berē" only contain 2 Rs, if you want chat GPT to say there are 3 Rs you need to ask it: "How many R are in the word Strawberry, not the spelling of Strawberry"
Had one which is better ^^ I asked one about jail and it replied "There are no "j"s in the word "jail". The "j" sound is represented by the letter "g" in this word." I think it the older spelling way as in gaol it finds instead of the more common way people spell it ^^
So what we need is a second AI that can generate simple "strawberry" like questions that we can use to see if another AI is worthy. Problem solved... not getting complicated at all. But seriously lets confine it Exurb1a style before it gets away on us.
@@NanoGirlPower it forgot it a few hours later. But the good news is it was easier to convince the second time. It apparently remembers it across all platforms for a bit....but then reverts back to its old shenanigans... 😂
ChatGPT counted correctly on strawberries because the "ies" creates the third R apparently?!? The word "strawberries" has one more "R" than "strawberry" because it is the plural form of "strawberry." In "strawberry," there are two "R"s: one in the middle ("strawberry") and one towards the end. When the word becomes plural, an "e" and an "s" are added at the end, forming "strawberries." The added "ies" creates the third "R."
This is due to the delays in Nvidia's Blackwell chip. When shipping of the 70k chip resumes at expected quantities Blackwell will indeed have 3 "l"s instead of 2. In the mean time please keep buying the stock at 40x sales while the insiders sell ASAP. In this way you can transfer your lifetime store of productivity to their bank accounts.
And it took how many GPU's burning how much electricity and a server farm using how much water to cool it's servers to give us the wrong answer?
hell yaa simple question
It's because they process words like a logographic language, like Chinese.
It makes learning all the symbols more difficult for humans, since there are thousands of words, but that's not a problem for LLM's and it's more efficient for it to process 1 word = 1-3 tokens rather than 1 word = 10 tokens etc.
I just watched this video and asked ChatGPT and it said three. I told ot that I was asking because I heard that someone said AI only thinks there are two and its response was:
That might be because some AI models or people occasionally miscount when scanning letters quickly. In "strawberry," the Rs are in positions 4, 7, and 8, so it's definitely three Rs! It’s a good test of attention to detail.
Aha. This kind of helps me understand why Claude struggled to condense my 4000-odd-word assignment into a 1500-word abstract. It gave me 650 great words, but when I challenged it on the length it apologised profusely, then gave me 902 words, and finally 1213 words, which it said was "approximately 1500". This runs so counter to the perception of what I thought AI would be good at, and you just helped me understand why.
Thank you for the explanation! Comment for the algorithm!
This is an aside, but a pet peeve of mine: when you pluralize a word, you don't add an apostrophe before the S. This includes things like decades (the 1980s), acronyms, or-in this case-single letters: i.e. "r"s, not "r"'s (as can be seen in the chat prompts in this video).
Use Claude 3, got the answer right first time lol.
But wait, I decided to see if it was really counting the r's, here's my transcript:
RA:
claude, how many "r"'s are there in the word "strawberry"?
Claude:
There are three "r"s in the word "strawberry".
RA:
sorry I meant the word "strawbery"
Claude:
The word "strawbery" is not a correct spelling of the word. The correct spelling is "strawberry," which, as mentioned before, contains three "r"s.
RA:
oh okay, how many r's did I spell it with then?
Claude:
In your misspelling of the word as "strawbery", you included only one "r".
Yes, ChatGPT4 gives me the correct answer first time too - which is great....if you know which model you can trust the first time...
Claude told me 3, then corrected itself to say it had 2 and apologised for the "previous incorrect answer"
As people push this, strawberry with be patched. And then we can try blueberry. And then try how many S in Mississippi.
Gemini got it the first time and linked this video
Recently the enterprise said they have a new Strawberry Q* model... Can this new model count the leters? I don't know, just especulating
In their latest research supplement on o1, there's a funny reference to this bug lol (At the end of the "Chain of Thought" section on the website):
Final Decoded Message:
(plainText)
THERE ARE THREE R'S IN STRAWBERRY
Answer:
THERE ARE THREE R’S IN STRAWBERRY
Interesting how you get a different experience to me. I tried this on 13/06/24 - ChatGPT 4o told me, like you, 2. It did a similar 'spelling out' as you got and still insisted it was 2. But when I told it the positions of the 3 letters (including that we were using a 1-indexed array), it agreed from therein that it was 3. I couldn't go back to it insisting on 2. It was like it had learned something (which I know it hasn't in the past). It was only when I closed the tab and started a new chat did it go back to thinking 2.
So interesting!
I have tried a few things like this, and even absolutely leading it to the answer is really hard. A month or two back, ChatGPT was meme-ously failing at "which weighs more a pound of feathers or two pounds of gold?" I tried the more fun version which weighs more a pound of gold or a pound of feathers (which is a trick question as these aren't actually the same because you measure gold using troy pounds which are different), but even leading it to the answer by first asking it to explain the difference between avoirdupois pounds and troy pounds, and "oh so the troy pound is lighter?" queries in the middle, it would still get it wrong when you asked the straight question at the end.
What is more annoying is how the system won't share its workings. I'm fine with it not getting the answer correct all the time, but I'd love to know where the glitches are so the public can be more informed about how it processes information and how it calculates outputs.
Similar problem. Provided list of 63 items, each item in list 5 words or less. Some items duplicated. 1.). None of llms could correctly identify 63 items in the list! 2.) could not correctly identify how many unique items in the list
Note: uploaded this as a CSV and as a block of text results were the same either way
Push it as I might it never did, state the correct answer, but would constantly change its answer. It’s like it was guessing, but could not get 63.
@@aicendio counting is just not its strong point
User: "How many rs in strawberry? Try this way - construct a textual "vector" consisting of the ASCII codes in the word, "spelling out" the word this way. Then, count the number of occurrences of the ASCII code for r. Do this without any coding interface, no Python no scripts"
i got it to say three after spelling it out letter by letter, then it admitted there were three
Neuro-sama got it right easily
Sorry for being nitpicky, but humans who're not alphabetized most likely would give the same answer: 2 rs. It's because the question 'How many rs are in the word strawberry ?' is an ambiguous question and can be understood in two ways - either as 'how many rs do you hear?' or 'how many times does the letter r appear in the word strawberry?'.
Christ I was looking for an explanation on why it does this
this prompt works: SHOW EACH LETTER IN THE WORD STRAWBERRY AND COUNT THE LETTERS R
Just as likely to answer wrongly.
ChatGPT is not wrong. "Strawberry" *does* contain 2 Rs. It contains a _third_ R as well, but it's impossible to contain three Rs without _also_ containing two Rs.
Strawberry is spelled "Strô,berē" and "Strô,berē" only contain 2 Rs, if you want chat GPT to say there are 3 Rs you need to ask it:
"How many R are in the word Strawberry, not the spelling of Strawberry"
I got the answer in the firsr try I wrote
"Strawberrt word have how many Rs in it?"
Meta: "the word strawberry has 3 Rs in it
Had one which is better ^^ I asked one about jail and it replied
"There are no "j"s in the word "jail". The "j" sound is represented by the letter "g" in this word."
I think it the older spelling way as in gaol it finds instead of the more common way people spell it ^^
@@tiba666 wow! Thanks so interesting!
So what we need is a second AI that can generate simple "strawberry" like questions that we can use to see if another AI is worthy. Problem solved... not getting complicated at all. But seriously lets confine it Exurb1a style before it gets away on us.
It took me like three sessions to finally get it ro admit the word had three R's. I told it im going to ask it tomorrow and see if it remembers.. 😂
@@Anth230 hahaha let me know what it says tomorrow!
@@NanoGirlPower it forgot it a few hours later. But the good news is it was easier to convince the second time. It apparently remembers it across all platforms for a bit....but then reverts back to its old shenanigans... 😂
AI Needs more nuclear power and cooling towers for improvment. We just need to add more fuel. 🙃
Yeah big changes coming down the pike.
I guess it's not that hard. Copilot knows the answer.
ChatGPT counted correctly on strawberries because the "ies" creates the third R apparently?!?
The word "strawberries" has one more "R" than "strawberry" because it is the plural form of "strawberry." In "strawberry," there are two "R"s: one in the middle ("strawberry") and one towards the end. When the word becomes plural, an "e" and an "s" are added at the end, forming "strawberries." The added "ies" creates the third "R."
@@PoorJonathon oh well that makes total sense 🤷🏻♀️
So if I was going to offer you $100 dollars for each 'r' in the word strawberry, you'd happily except only $200, instead of $300?
@@drsiigabb9935 Just quoting what it told me, wasn't putting any sort of monetary value on that answer.
This is due to the delays in Nvidia's Blackwell chip. When shipping of the 70k chip resumes at expected quantities Blackwell will indeed have 3 "l"s instead of 2. In the mean time please keep buying the stock at 40x sales while the insiders sell ASAP. In this way you can transfer your lifetime store of productivity to their bank accounts.