We fine-tuned a gpt-3.5 ReAct agent to be better at chain-of-thought 💠
A big gripe with gpt-3.5-turbo is that its reasoning is worse than gpt-4's, ca