Hosted on MSN
ChatGPT just announced it can pass the 'how many "r"s in strawberry' test, but users found otherwise
ChatGPT passes “strawberry” test but fails when switched to “cranberry” AI still struggles with simple letter-counting despite broader improvements Reasoning tests like “car wash” still expose gaps in ...
Confident mistakes – or lies, if you will – are a common problem of large language models used in AI chatbots, with one common shortcoming of ChatGPT being that it would frequently miscount the number ...
Results that may be inaccessible to you are currently showing.
Hide inaccessible results