Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.

ModerateImprovement@sh.itjust.works · 4 months ago

Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.

bionicjoey@lemmy.ca · 4 months ago

The Turing Test (as some people believe it to be): if you can have a conversation with a computer and not tell if it’s a computer, then it must be intelligent.

AI companies: writes ML model that is specifically designed to convincingly play one side of a conversation, even though it has no ability to understand the things it talks about.

technocrit@lemmy.dbzer0.com · 4 months ago

It’s worth emphasizing that the “Turing Test” is not a good test since it’s not at all scientific.

It’s just another thought experiment that grifters have taken to the bank.

bionicjoey@lemmy.ca · 4 months ago

Also as Turing proposed it it’s meant to be infinitely repeatable. The test isn’t supposed to just be if a machine can convince one person with one conversation. That would be trivial. The real Turing test is the converse, it says that there should be no conversation one could have with the machine where it wouldn’t convince you it’s a human.

kromem@lemmy.world · 4 months ago

The most advanced models absolutely have modeling about what’s being discussed and relationships between concepts.

Even toy models have been shown to build world models from very basic training data.

Honestly, read at least a little bit of the relevant research:

https://www.anthropic.com/news/mapping-mind-language-model

Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.

Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless.

Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless – The Markup