> We also found good levels of accuracy: the generated documents were 70% accurate, and the generated code was at 60%.

How is accuracy measured here? Is a document a single file? Is the LLM generating code and some separate kind of “document” such that “code” accuracy can be 60% while “document” accuracy can be 70%?