As someone also building constrained decoders against JSON [1], I was hopeful to...

jumploops · on June 13, 2023

They may have just fine-tuned 3.5 to respond with valid JSON more times than not.

Building magic functions[0] I ran into many examples where JSONSchema broke for gpt-3.5-turbo but worked well for gpt-4.

[0] https://github.com/jumploops/magic

civilitty · on June 13, 2023

Or there’s a trade off between more complex schemas and logit bias going off the rails since there’s probably little to no backtracking.

newhouseb · on June 14, 2023

Good point. Backtracking is certainly possible but it is probably tricky to parallelize at scale if you're trying to coalesce and slam through a bunch of concurrent (unrelated) requests with minimal pre-emption.