Last week we released s1 - our simple recipe for sample-efficient reasoning & test-time scaling. Weβre releasing π¬π.π trained on the π¬ππ¦π ππ πͺ
Last week we released s1 - our simple recipe for sample-efficient reasoning & test-time scaling. Weβre releasing π¬π.π trained on the π¬ππ¦π ππ πͺ