We fine-tuned a gpt-3.5 ReAct agent to be better at chain-of-thought 💭 A big gripe with gpt-3.5-turbo is that its reasoning is worse than gpt-4, ca
