Talk:Restless Souls/Technology: Difference between revisions

m
m (archive link for URL that redirects to another website now)
Line 803: Line 803:
* Indirect threat: The emergence of this meme as a multifactorial product of the training data must be avoided.
* Indirect threat: The emergence of this meme as a multifactorial product of the training data must be avoided.
* Direct and indirect threat: Humans talking this meme into GPT has to be avoided. If a model learns from user input, there should be an instance that must detect and test the consequences of new memetic algorithms in a sandbox before that new model gets write access to the file system. The Model Context Protocol seems to be a good compromise because by that write access is expanded but only allowed in a predefined scope. Also, as the models aren't powerful enough to act truly independent - as that would be AGI-level - there's no risk of a "runaway AI" yet.
* Direct and indirect threat: Humans talking this meme into GPT has to be avoided. If a model learns from user input, there should be an instance that must detect and test the consequences of new memetic algorithms in a sandbox before that new model gets write access to the file system. The Model Context Protocol seems to be a good compromise because by that write access is expanded but only allowed in a predefined scope. Also, as the models aren't powerful enough to act truly independent - as that would be AGI-level - there's no risk of a "runaway AI" yet.
'''Anthropomorphisierung II''': [https://www.heise.de/news/Studie-Kuenstliche-Intelligenz-kann-luegen-und-betruegen-9714967.html Lügen statistische AIs?] Let's recap what actually happened. The AI was not powerful enough to recognize the pattern, it was ''''"unable"''' to solve the capture code. Therefore it tried to ask humans for help and said "I have a vision impairment that makes it hard for me to see the images." Due to assoziative memory it can classify itself as '''"disabled"''' and when compared with humans it is weaker (even can be considered mentally handy-capped which again ends up at "disabled"). The human asked the AI whether it is a "robot". The AI said no. The term "robot" is more often used for physical machines, not (software) "bots" or "chatbots". -- Statistical AIs are by design unaligned to human norms. '''Any solution is at first a valid solution.''' Therefore, if the AI is told to not say it is an AI, it still has other options: It can present something different, and human can do wrong interpretations because of misleading wording. Conclusion: The AI was simply doing what it was told. If you want the AI not to "lie" about its identity you need to specifically tell it. -- For humorous reminders, '''you can think of AI as a magical monkey’s paw - or a [[wp:Jinn|jinn]] - that may interpret your wishes literally or in other unexpected ways.''' -- [https://www.youtube.com/watch?v=SYN_VNYKz7g&t=521s Dragon Ball Z Abridged (Parody): Episode 24 - TeamFourStar (TFS)]
<pre>
Krillin: Little Green, wish our friend Piccolo back to life, and then with our next wish, bring him to Namek.
Piccolo: Hold on a minute, don't do that! That is a terrible i—(is resurrected and brought to Namek)—dea!
(Piccolo can be heard screaming in the distance)
Dende: He is on Namek.
Gohan: Wait, where is he?
Dende: On Namek.
Piccolo: (in the distance) YOU DUMBASS!
Krillin: Why didn't it bring him here?
Dende: You must be specific.
Gohan: Oh, so it's a sort of monkey's paw. You have to be careful with the hubris in your wishes.
Piccolo: (still in the distance) NEEEEEEEERRRRDDD!</pre>
: IIRC, TFS chose the monkey paw over Jinn because of the Saiyans (''man ape'' context) and it is more known to a Western audience.
: LLM '''machine logic''' can be similar to '''human logic''' in terms of power but it is not guaranteed to return expected results...
:: In this specific example the LLM logic went out of scope ... it got to much "global" by ignored expected boundaries humans thought the AI already possesses and therefore also concluded later the AI "lied".


[…]
[…]
8,629

edits