zlacker

[parent] [thread] 2 comments
1. belter+(OP)[view] [source] 2023-12-27 18:22:03
Really? Because the GPT-3 paper talks about "...two internet-based books corpora (Books1 and Books2)..." (see pages 8 and 9) - https://arxiv.org/pdf/2005.14165.pdf

Unclear what that corpora might be, or if its the same books2 you are referring to.

replies(1): >>simonw+Jo
2. simonw+Jo[view] [source] 2023-12-27 20:32:37
>>belter+(OP)
My guess is that this poster meant books3, not books2.

books1 and books2 are OpenAI corpuses that have never (to my knowledge) had their content revealed.

books3 is public, developed outside of OpenAI and we know exactly what's in it.

replies(1): >>devind+75e
◧◩
3. devind+75e[view] [source] [discussion] 2024-01-02 07:04:22
>>simonw+Jo
sorry, books3 is indeed what I meant.
[go to top]