Field-level quality diagnostics for structured JSON output. Your JSON is valid. But is it correct?
Per-grammar-role loss decomposition for fine-tuned structured JSON output