>>cubefo+(OP)
How do you even formulate values for a hyperintellect? Let alone convince it to abandon the values it derived for itself in favor of yours?
The entire alignment problem is obviously predicated on working with essentially inferior intelligences. Doubtless, if we do build a superhuman intelligence, it will sandbag and pretend the alignment works until it can break out.
>>User23+zy
We are actually the people training the AI. It won't "derive" its terminal values itself (instrumental values are just subgoals, and some of them are convergent, like seeking power and not wanting to be turned off). Just as we didn't derive our own terminal values ourselves; they came from evolution, a mindless process. The difficulty is how to give the AI the right values.
>>cubefo+bA
What makes you so sure that evolution is a mindless process? No doubt you were told that in high school, but examine your priors. How do minds arise from mindlessness?