zlacker

1. MVisse+(OP)[view] [source] 2023-11-19 01:08:56
You should read the GPT-4 safety paper. It can easily manipulate humans to attain its goals.
replies(1): >>mattkr+pl
2. mattkr+pl[view] [source] 2023-11-19 03:31:55
>>MVisse+(OP)
Does it have goals beyond “find a likely series of tokens that extends the input?”

Is the idea that it will hack into NORAD and launch a first strike to increase the log-likelihood of “WWIII was begun by…?”

replies(1): >>Davidz+kG
3. Davidz+kG[view] [source] [discussion] 2023-11-19 06:31:30
>>mattkr+pl
I think this is misguided. There can be goals internal to the system which do not arise from goals of the external system. For example, when simulating a chess game, it has (or behaves identically to something that has) a goal of winning the game. This is not a written, expressed goal but an emergent one, just as the goals of a human are emergent from a biological system which, at the cellular level, has very different goals.