Rattibha

Mark Tenenholtz

2 Tweets 1 reads Jan 05, 2023

There’s now a Python library for RLHF called TRLX!
(The same reinforcement learning strategy used in training ChatGPT)
It works well with Hugging Face models, supports multiple RL strategies, and requires very little code!

Check out the repo here: github.com
Thanks to the wonderful folks with CarperAI!

github.com

Loading suggestions...

Categories

More from this author

Related Threads

Popular Threads

Categories

More from this author

Related Threads

Popular Threads

Unroll Thread