🧵Here's an explanation using simple words:
1/6
In other words, they paid people to chit-chat.
2/6
To solve that, they used humans (again) to rank randomly selected answers that ChatGPT was spitting out from best to worst.
3/6
So there are two models:
1. a model that can answer questions like a human.
2. a model that can say how good/bad the answers was.
The last step is brilliant.
4/6
So what was the "reward" here? Spoiler: Not a cookie. They used the score as reward to train the model.
5/6
1. Have a model generate a human-like answer.
2. Have a model score that answer.
3. Have model learn from the score and re-adjust the answer until it gets an A+.
4. Repeat a million times until accurate.
*chef kiss*
6/6
alphasignal.ai
- Every chat costs is in the single-digits cents.
- It will be monetized soon.
- ChatGPT was trained on Azure
- @sama @elonmusk @ilyasut @gdb @woj_zaremba @johnschulman2 are the founders of @OpenAI.
- There are no main authors behind ChatGPT.
More from this author
GoogleAI just released "Muse", a text-to-image generation/editing model via Masked Generative Transformers: - Achieves new SOTA - Zero-shot, Mask-fre...
Game changer. You can now run GPT locally on your macbook with GPT4All, a new 7B LLM based on LLaMa. It's completely open source: demo, data and cod...
Impressive. MetaGPT is about to reach 10,000 stars on Github. It's a Multi-Agent Framework that can behave as an engineer, product manager, architec...
A great read. Stop using the elbow criterion for k-means and how to choose the number of clusters instead (alternatives). "..researchers and reviewer...
Recent Threads
taekook taguan ng anak au wherein jk received a surprising gift from their xmas party… [ christmas special 🎄] https://t.co/WY3C450KpV
@HitWithAHeart I hear him before I see him. The weight of his steps on the stairs. Slower than usual. Measured. Like he’s already bracing for whatev...
(1/7) I'm not going to do a full trailer breakdown for Zach Cregger's Resident Evil film, since we have an early form of the script you can place a lo...
Nikola Jokic is 0-6 against 50+ win teams in the playoffs. https://t.co/l5hCeVCoUj
Uni is a fighter! https://t.co/AXkBVFJ2My
Triggered girl at comedy show… https://t.co/z1BC2qG7Rd
Popular Threads
Ware County, Ga has broken the Dominion algorithm: Using sequestered Dominion Equipment, Ware County ran a equal number of Trump votes and Biden vote...
Top 20 Players with the most goals + assists in football history, only players with assists available (following the Opta criteria for assists) Seaso...
The ICT Mentorship Core Content Month 1 Summarized: https://t.co/6tXJxPMDhm
Winning the Chevening Scholarship + 12 Strong Samples of the Chevening Essay There are four important Essays on the Chevening Scholarship application...
DON'T ARGUE WITH DONKEYS (Fable) The donkey said to the tiger: - "The grass is blue". The tiger replied: - "No, the grass is green." The discussion...
Please retweet and share if you support my and others' vaccine injury recoveries. https://t.co/y8xNWwRUOO