Nicolette

17%
Flag icon
That wasn’t nearly enough for GPT-3. So Nest expanded the data by adding an even broader scrape of links shared on Reddit as well as a scrape of English-language Wikipedia and a mysterious dataset called Books2, details of which OpenAI has never disclosed, but which two people with knowledge of the dataset told me contained published books ripped from Library Genesis, an online shadow repository of torrented books and scholarly articles.
Empire of AI: Dreams and Nightmares in Sam Altman's OpenAI
Rate this book
Clear rating
Open Preview