zlacker

[return to "LLMs cannot find reasoning errors, but can correct them"]
1. ilaksh+Hm[view] [source] 2023-11-20 21:01:43
>>koie+(OP)
I was just testing Bard with some very simple coding exercises and it did well.

I noticed that they automatically create at least three other draft responses.

I assume that this is a technique that allows them to try multiple times and then select the best one.

Just mentioning it because it seems like another example of not strictly "zero-shot"ing a response. Which seems important for getting good results with these models.

I'm guessing they use batching for this. I wonder if it might become more common to run multiple inference subtasks for the same main task inside of a batch, for purposes of self-correcting agent swarms or something. The outputs from step one are reviewed by the group in step 2, then they try again in step 3.

I guess that only applies for a small department where there is frequently just one person using it at a time.

◧◩
2. Millio+Kt[view] [source] 2023-11-20 21:31:38
>>ilaksh+Hm
IIRC there were some OpenAI docs that recommended doing exactly this, make n generations and use a smaller fine tuned model to select the best one
◧◩◪
3. DaiPlu+9A[view] [source] 2023-11-20 22:00:51
>>Millio+Kt
...does this directly relate to the high operating costs of LLMs-as-a-service, if for every request they have to run n-many redundant LLM requests? So if they could improve things so that a single prompt/request+response has a higher chance of being high-quality they wouldn't need to run alternatives?
◧◩◪◨
4. Millio+bz1[view] [source] 2023-11-21 04:37:01
>>DaiPlu+9A
Another point: Now that I think about it, I doubt this is compatible with streaming the output to the user, which might be an issue in some cases.
◧◩◪◨⬒
5. DaiPlu+9C1[view] [source] 2023-11-21 05:00:13
>>Millio+bz1
I thought the char-by-char teleprinter thing was just an effect (y’know, for user-engagement and to make the interaction feel more genuine) - and that these systems just return output in buffered blocks/pages or whatever-it-is that they wired-up their network to do.

It’s not like DALL-E outputs pixels in scanout order - or in brushstroke order (…er… or does it?)

[go to top]