zlacker

1. Jimthe+(OP)[view] [source] 2023-07-05 18:16:28
"Even if you just make GPT-4 say 33% smarter and 50 or 100 times faster and more efficient, that can lead to control of industrial and military assets being handed over to these AI agents."

I call BS on this...it's an LLM...

replies(6): >>crop_r+f1 >>chaxor+k4 >>Footke+Ha >>ben_w+A51 >>reaper+FG1 >>nopins+HN1
2. crop_r+f1[view] [source] 2023-07-05 18:21:07
>>Jimthe+(OP)
Saying it's "an LLM" doesn't change the impact. GPT-4 is an LLM, and so are many others, ranging from toy quality to GPT-3.5. It is very clear that GPT-4 is much better. If there is another jump like GPT-4, whether it is an LLM or not, its impact will be huge.
replies(2): >>esafak+65 >>woadwa+oM
3. chaxor+k4[view] [source] 2023-07-05 18:33:02
>>Jimthe+(OP)
It's important to recognize that the model is fully capable of operating in open world environments, with visual stimuli and motor output, to achieve high-level tasks. This has been demonstrated in proofs of concept several times now with systems such as Voyager et al. So, while there are certainly some details that are important, many of them are the annoyances that we devs deal with all the time (how to connect various parts of a system properly, etc.); the fundamental capabilities of expressivity in these models are not that limited. They are certainly limited in some sense (as seen in the several papers applying category-theoretic arguments to transformers), but for many engineering applications in the world, these models are very capable and useful.

Guarantees of correctness and safety are obviously of huge concern, hence the main article. But it's absolutely not unreasonable to see these models enabling humanoid robots capable of various day-to-day activities and work.

replies(3): >>Dennis+xu >>jgalt2+Q81 >>hgsgm+oh1
◧◩
4. esafak+65[view] [source] [discussion] 2023-07-05 18:35:22
>>crop_r+f1
Plus the next thing might not be an LLM.
5. Footke+Ha[view] [source] 2023-07-05 18:56:09
>>Jimthe+(OP)
Military command and control is already performed via input and output of token streams.
◧◩
6. Dennis+xu[view] [source] [discussion] 2023-07-05 20:24:17
>>chaxor+k4
To save others the trouble: I googled Voyager, and it's pretty interesting. I had no idea an LLM could do this sort of thing:

https://voyager.minedojo.org/

replies(2): >>famous+yB >>yldedl+o12
◧◩◪
7. famous+yB[view] [source] [discussion] 2023-07-05 20:58:44
>>Dennis+xu
Other examples (in the real world) you might find interesting.

https://tidybot.cs.princeton.edu/

https://innermonologue.github.io/

https://palm-e.github.io/

https://www.microsoft.com/en-us/research/group/autonomous-sy...

replies(1): >>Animat+kE
◧◩◪◨
8. Animat+kE[view] [source] [discussion] 2023-07-05 21:11:31
>>famous+yB
> https://palm-e.github.io/

The alignment problem will come up when the robot control system notices that the guy with the stick is interfering with the robot's goals.

replies(1): >>c_cran+2G
◧◩◪◨⬒
9. c_cran+2G[view] [source] [discussion] 2023-07-05 21:18:45
>>Animat+kE
A robot control system without a mechanical override in favor of the stick is a poor one indeed.
◧◩
10. woadwa+oM[view] [source] [discussion] 2023-07-05 21:52:22
>>crop_r+f1
Meanwhile, GPT-4 still can’t reliably multiply small numbers.

https://arxiv.org/abs/2304.02015

replies(4): >>fprott+lP >>mhb+AU >>famous+9V >>Camper+RW
◧◩◪
11. fprott+lP[view] [source] [discussion] 2023-07-05 22:08:56
>>woadwa+oM
A minor inconvenience when GPT-4 has no problem learning how to use a code interpreter.
◧◩◪
12. mhb+AU[view] [source] [discussion] 2023-07-05 22:38:36
>>woadwa+oM
Do you find that comforting when an emergent property of a system whose objective is to complete the next word is able to make drawings?
replies(1): >>Strict+w21
◧◩◪
13. famous+9V[view] [source] [discussion] 2023-07-05 22:41:37
>>woadwa+oM
It's alright with algorithmic prompts - https://arxiv.org/abs/2211.09066

Also, it knows when to use a calculator if it has access to one, so it's not a big deal.
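
A minimal sketch of what "using a calculator" could look like on the application side. Everything here is an assumption for illustration: the `CALC:` prefix is a made-up convention, `handle_model_output` is a hypothetical helper, and no real model API is called. The model's text is simply inspected, and arithmetic is delegated to ordinary code.

```python
import ast
import operator

# Arithmetic operators the toy calculator accepts.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def calculator(expression: str):
    """Evaluate +, -, *, / over numeric literals without calling eval()."""
    def _eval(node):
        if isinstance(node, ast.Expression):
            return _eval(node.body)
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -_eval(node.operand)
        raise ValueError("unsupported expression")

    return _eval(ast.parse(expression.strip(), mode="eval"))


def handle_model_output(text: str) -> str:
    """Hypothetical convention: the model writes 'CALC: <expr>' to request the tool."""
    prefix = "CALC:"
    if text.startswith(prefix):
        return str(calculator(text[len(prefix):]))
    # Anything else is passed through unchanged.
    return text
```

With this split, the language model only has to decide when arithmetic is needed; the multiplication itself is done exactly by the host program.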

◧◩◪
14. Camper+RW[view] [source] [discussion] 2023-07-05 22:52:34
>>woadwa+oM
"This Apple II is useless. It can't even run Crysis."
◧◩◪◨
15. Strict+w21[view] [source] [discussion] 2023-07-05 23:25:45
>>mhb+AU
Imagine you meet a human who is eloquent, expressive, speaks ten languages, can pass the bar or the medical board exams easily, but who cannot reliably distinguish between truth and falsehood on the smallest of questions ("what is 6x9? 42") and has no persistent memory or sense of self.

Would you be "comforted" that this mega-genius is worse at arithmetic than you are and doesn't remember what it did yesterday?

Probably not. You might well be worried that this weird psychopath is going to get a medical license and cut the wrong number of fingers off of a whole bunch of patients.

replies(1): >>mhb+t31
◧◩◪◨⬒
16. mhb+t31[view] [source] [discussion] 2023-07-05 23:32:45
>>Strict+w21
We're agreeing, aren't we?
17. ben_w+A51[view] [source] 2023-07-05 23:45:52
>>Jimthe+(OP)
It's autocomplete on steroids…

That can guide me through the process of writing a Navier-Stokes simulation…

In a foreign language…

That can be trivially put into a loop and tasked with acting like an agent…

And which is good enough that people are already seriously asking themselves if they need to hire people to do certain tasks…

Why call BS?

It's not perfect, sure, but it's not making a highly regional joke about the Isle of Wight ferry[0] either.

[0] "What's brown and comes steaming out the back of Cowes?"

replies(1): >>wickof+xb1
◧◩
18. jgalt2+Q81[view] [source] [discussion] 2023-07-06 00:07:20
>>chaxor+k4
> It's important to recognize that the model is fully capable of operating in open world environment

How so? If they cannot drive a car?

replies(1): >>chaxor+5B9
◧◩
19. wickof+xb1[view] [source] [discussion] 2023-07-06 00:26:54
>>ben_w+A51
But you're also autocomplete (prediction engine) on steroids.

https://www.psy.ox.ac.uk/news/the-brain-is-a-prediction-mach...

replies(1): >>ben_w+q22
◧◩
20. hgsgm+oh1[view] [source] [discussion] 2023-07-06 01:09:44
>>chaxor+k4
I don't understand why Voyager benefits from being an LLM, vs a "normal" Neural Net. It's not talking to anyone or learning from text.
replies(1): >>Footke+pA1
◧◩◪
21. Footke+pA1[view] [source] [discussion] 2023-07-06 03:30:33
>>hgsgm+oh1
> We introduce Voyager, the first LLM-powered embodied lifelong learning agent to drive exploration, master a wide range of skills, and make new discoveries continually without human intervention in Minecraft. Voyager is made possible through three key modules: 1) an automatic curriculum that maximizes exploration; 2) a skill library for storing and retrieving complex behaviors; and 3) a new iterative prompting mechanism that generates executable code for embodied control.

It looks like being LLM-based is helpful for generating control scripts and communicating its reasoning. Text seems to provide useful building blocks for higher-order reasoning and behavior. As with humans!
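
To make the "skill library for storing and retrieving complex behaviors" idea concrete, here is a rough sketch under stated assumptions. It is not Voyager's implementation: the class and method names are hypothetical, the stored "code" is just a placeholder string, and plain word overlap stands in for whatever similarity measure a real system would use.

```python
from dataclasses import dataclass, field


@dataclass
class SkillLibrary:
    """Toy store mapping a text description to a generated code string."""
    skills: dict = field(default_factory=dict)

    def add(self, description: str, code: str) -> None:
        # Store (or overwrite) a skill under its natural-language description.
        self.skills[description] = code

    def retrieve(self, task: str, k: int = 1) -> list:
        """Return up to k skill descriptions ranked by word overlap with the task."""
        task_words = set(task.lower().split())
        scored = []
        for description in self.skills:
            overlap = len(task_words & set(description.lower().split()))
            if overlap > 0:
                scored.append((overlap, description))
        # Highest overlap first; ties broken alphabetically for determinism.
        scored.sort(key=lambda pair: (-pair[0], pair[1]))
        return [description for _, description in scored[:k]]


library = SkillLibrary()
library.add("craft wooden pickaxe", "<generated script placeholder>")
library.add("mine iron ore", "<generated script placeholder>")
relevant = library.retrieve("mine some iron")
```

The point of keying skills by text is that the same model that writes the control script can also describe it and later ask for it back in words, which fits the comment above about text providing reusable building blocks.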

22. reaper+FG1[view] [source] 2023-07-06 04:14:10
>>Jimthe+(OP)
GPT-4 is actually multimodal, not just an LLM. OpenAI just doesn't provide the public with any way to use the image embedding capabilities.
23. nopins+HN1[view] [source] 2023-07-06 05:23:05
>>Jimthe+(OP)
LLM is already a misnomer. Many of the latest models are better called LFMs (Large Foundation Models). They have multimodal capabilities. Some can even handle sensory input humans can't.

Another comment already links to demos and papers of LFMs operating robots and agents in 3D environments.

◧◩◪
24. yldedl+o12[view] [source] [discussion] 2023-07-06 07:24:26
>>Dennis+xu
Voyager is pretty cool, but it's not transferable to the real world at all. The automatic curriculum relies on lots of specific knowledge from people talking about how to get better at Minecraft. The skill library writes programs using the Mineflayer API, which provides primitives for all physics, entities, actions, state etc. A real-life analogue of that would be like solving robotics and perception real quick.
◧◩◪
25. ben_w+q22[view] [source] [discussion] 2023-07-06 07:32:16
>>wickof+xb1
"It's one of those irregular verbs, isn't it? I'm good at improv and speaking on my feet, you finish each other's sentences, they're just autocomplete on steroids."

https://en.wikiquote.org/wiki/Yes,_Minister

◧◩◪
26. chaxor+5B9[view] [source] [discussion] 2023-07-08 05:24:34
>>jgalt2+Q81
What evidence do you have that allows you to make the assertion that they 'cannot drive a car'?