Message ID | 20240419043820.178731-1-jsnow@redhat.com (mailing list archive) |
---|---|
Headers | show |
Series | Add qapi-domain Sphinx extension | expand |
John Snow <jsnow@redhat.com> writes: > This series adds a new qapi-domain extension for Sphinx, which adds a > series of custom directives for documenting QAPI definitions. > > GitLab CI: https://gitlab.com/jsnow/qemu/-/pipelines/1259566476 > > (Link to a demo HTML page at the end of this cover letter, but I want > you to read the cover letter first to explain what you're seeing.) > > This adds a new QAPI Index page, cross-references for QMP commands, > events, and data types, and improves the aesthetics of the QAPI/QMP > documentation. Cross-references alone will be a *massive* improvement! I'm sure readers will appreciate better looks and an index, too. > This series adds only the new ReST syntax, *not* the autogenerator. The > ReST syntax used in this series is, in general, not intended for anyone > to actually write by hand. This mimics how Sphinx's own autodoc > extension generates Python domain directives, which are then re-parsed > to produce the final result. > > I have prototyped such a generator, but it isn't ready for inclusion > yet. (Rest assured: error context reporting is preserved down to the > line, even in generated ReST. There is no loss in usability for this > approach. It will likely either supplant qapidoc.py or heavily alter > it.) The generator requires only extremely minor changes to > scripts/qapi/parser.py to preserve nested indentation and provide more > accurate line information. It is less invasive than you may > fear. Relying on a secondary ReST parse phase eliminates much of the > complexity of qapidoc.py. Sleep soundly. I'm a Sphinx noob. Let me paraphrase you to make sure I understand. You proprose to generate formatted documentation in two steps: • First, the QAPI generator generates .rst from the QAPI schema. The generated .rst makes use of a custom directives. • Second, Sphinx turns the .rst into formatted documentation. A Sphinx qapi-domain extension implements the custom directives This mirrors how Sphinx works for Python docs. Which is its original use case. Your series demonstrates the second step, with test input you wrote manually. You have code for the first step, but you'd prefer to show it later. Fair? > The purpose of sending this series in its current form is largely to > solicit feedback on general aesthetics, layout, and features. Sphinx is > a wily beast, and feedback at this stage will dictate how and where > certain features are implemented. I'd appreciate help with that. Opinions? > A goal for this syntax (and the generator) is to fully in-line all > command/event/object members, inherited or local, boxed or not, branched > or not. This should provide a boon to using these docs as a reference, > because users will not have to grep around the page looking for various > types, branches, or inherited members. Any arguments types will be > hyperlinked to their definition, further aiding usability. Commands can > be hotlinked from anywhere else in the manual, and will provide a > complete reference directly on the first screenful of information. Let me elaborate a bit here. A command's arguments can be specified inline, like so: { 'command': 'job-cancel', 'data': { 'id': 'str' } } The arguments are then documented right with the command. But they can also be specified by referencing an object type, like so: { 'command': 'block-dirty-bitmap-remove', 'data': 'BlockDirtyBitmap' } Reasons for doing it this way: • Several commands take the same arguments, and you don't want to repeat yourself. • You want generated C take a single struct argument ('boxed': true). • The arguments are a union (which requires 'boxed': true). Drawback: the arguments are then documented elsewhere. Not nice. Bug: the generated documentation fails to point there. You're proposing to inline the argument documentation, so it appears right with the command. An event's data is just like a command's argument. A command's return value can only specified by referencing a type. Same doc usability issue. Similarly, a union type's base can specified inline or by referencing a struct type, and a union's branches must be specified by referencing a struct type. Same doc usability issue. At least, the generated documentation does point to the referenced types. > (Okay, maybe two screenfuls for commands with a ton of > arguments... There's only so much I can do!) *cough* blockdev-add *cough* > This RFC series includes a "sandbox" .rst document that showcases the > features of this domain by writing QAPI directives by hand; this > document will be removed from the series before final inclusion. It's > here to serve as a convenient test-bed for y'all to give feedback. > > All that said, here's the sandbox document fully rendered: > https://jsnow.gitlab.io/qemu/qapi/index.html > > And here's the new QAPI index page created by that sandbox document: > https://jsnow.gitlab.io/qemu/qapi-index.html > > Known issues / points of interest: > > - The formatting upsets checkpatch. The default line length for the > "black" formatting tool is a little long. I'll fix it next spin. > > - I did my best to preserve cross-version compatibility, but some > problems have crept in here and there. This series may require > Sphinx>= 4.0, like the dbus extension. Notably, the Ubuntu build fails > on Gitlab CI currently. The next version will text against more Sphinx > versions more rigorously. Sphinx version 5.3.0 and above should work > perfectly. > > - There's a bug in Sphinx itself that may manifest in your testing, > concerning reported error locations. There's a patch in this series > that fixes it, but it's later in the series. If you run into the bug > while testing with this series, try applying that patch first. > > - QAPI 'namespaces' aren't yet handled. i.e., there's nothing to > distinguish entities between QMP, QGA and QSD yet. That feature will > be added to a future version of this patchset (Likely when the > generator is ready for inclusion: without it, references will clash > and the index will gripe about duplicated entries.) qemu-storage-daemon's QMP is a proper subset of qemu-system-FOO's. Regardless, each of them has its own, independent reference manual. That's defensible. But the way we build them can complicate matters. For instance, when I tried to elide types not used for QMP from the reference manuals, I got defeated by Sphinx caching. > - Per-member features and ifcond are not yet accounted for; though > definition-scoped features and ifconds are. Please feel free to > suggest how you'd like to see those represented. > > - Inlining all members means there is some ambiguity on what to do with > doc block sections on inlined entities; features and members have an > obvious home - body, since, and misc sections are not as obvious on > how to handle. This will feature more prominently in the generator > series. Yes, this is a real problem. The member documentation gets written in the context of the type. It may make sense only in that context. Inlining copies it into a different context. Features may need to combine. Say a command's arguments are a union type, and several branches of the union both contain deprecated members. These branch types all document feature @deprecated. Simply inlining their feature documentation results in feature @deprecated documented several times. Ugh. Combining them would be better. But how? Do we need to rethink how we document features? > - Some features could be implemented in either the QAPI domain syntax > *or* the generator, or some combination of both. Depending on > aesthetic feedback, this may influence where those features should be > implemented. > > - The formatting and aesthetics of branches are a little up in the air, > see the qapi:union patch for more details. > > - It's late, maybe other things. Happy Friday! > > Hope you like it! Looks promising!
Markus Armbruster <armbru@redhat.com> writes: [...] >> The purpose of sending this series in its current form is largely to >> solicit feedback on general aesthetics, layout, and features. Sphinx is >> a wily beast, and feedback at this stage will dictate how and where >> certain features are implemented. > > I'd appreciate help with that. Opinions? Less than clear, let me try again: I'm soliciting opinions on the new look. Check it out... [...] >> This RFC series includes a "sandbox" .rst document that showcases the >> features of this domain by writing QAPI directives by hand; this >> document will be removed from the series before final inclusion. It's >> here to serve as a convenient test-bed for y'all to give feedback. ... here: >> All that said, here's the sandbox document fully rendered: >> https://jsnow.gitlab.io/qemu/qapi/index.html >> >> And here's the new QAPI index page created by that sandbox document: >> https://jsnow.gitlab.io/qemu/qapi-index.html [...]
On Fri, Apr 19, 2024, 10:45 AM Markus Armbruster <armbru@redhat.com> wrote: > John Snow <jsnow@redhat.com> writes: > > > This series adds a new qapi-domain extension for Sphinx, which adds a > > series of custom directives for documenting QAPI definitions. > > > > GitLab CI: https://gitlab.com/jsnow/qemu/-/pipelines/1259566476 > > > > (Link to a demo HTML page at the end of this cover letter, but I want > > you to read the cover letter first to explain what you're seeing.) > > > > This adds a new QAPI Index page, cross-references for QMP commands, > > events, and data types, and improves the aesthetics of the QAPI/QMP > > documentation. > > Cross-references alone will be a *massive* improvement! I'm sure > readers will appreciate better looks and an index, too. > > > This series adds only the new ReST syntax, *not* the autogenerator. The > > ReST syntax used in this series is, in general, not intended for anyone > > to actually write by hand. This mimics how Sphinx's own autodoc > > extension generates Python domain directives, which are then re-parsed > > to produce the final result. > > > > I have prototyped such a generator, but it isn't ready for inclusion > > yet. (Rest assured: error context reporting is preserved down to the > > line, even in generated ReST. There is no loss in usability for this > > approach. It will likely either supplant qapidoc.py or heavily alter > > it.) The generator requires only extremely minor changes to > > scripts/qapi/parser.py to preserve nested indentation and provide more > > accurate line information. It is less invasive than you may > > fear. Relying on a secondary ReST parse phase eliminates much of the > > complexity of qapidoc.py. Sleep soundly. > > I'm a Sphinx noob. Let me paraphrase you to make sure I understand. > > You proprose to generate formatted documentation in two steps: > > • First, the QAPI generator generates .rst from the QAPI schema. The > generated .rst makes use of a custom directives. > Yes, but this .rst file is built in-memory and never makes it to disk, like Sphinx's autodoc for Python. (We can add a debug knob to log it or save it out to disk if needed.) > • Second, Sphinx turns the .rst into formatted documentation. A Sphinx > qapi-domain extension implements the custom directives > Yes. > This mirrors how Sphinx works for Python docs. Which is its original > use case. > > Your series demonstrates the second step, with test input you wrote > manually. > > You have code for the first step, but you'd prefer to show it later. > Right, it's not fully finished, although I have events, commands, and objects working. Unions, Alternates and Events need work. > Fair? > Bingo! > > The purpose of sending this series in its current form is largely to > > solicit feedback on general aesthetics, layout, and features. Sphinx is > > a wily beast, and feedback at this stage will dictate how and where > > certain features are implemented. > > I'd appreciate help with that. Opinions? > > A goal for this syntax (and the generator) is to fully in-line all > > command/event/object members, inherited or local, boxed or not, branched > > or not. This should provide a boon to using these docs as a reference, > > because users will not have to grep around the page looking for various > > types, branches, or inherited members. Any arguments types will be > > hyperlinked to their definition, further aiding usability. Commands can > > be hotlinked from anywhere else in the manual, and will provide a > > complete reference directly on the first screenful of information. > > Let me elaborate a bit here. > > A command's arguments can be specified inline, like so: > > { 'command': 'job-cancel', 'data': { 'id': 'str' } } > > The arguments are then documented right with the command. > > But they can also be specified by referencing an object type, like so: > > { 'command': 'block-dirty-bitmap-remove', > 'data': 'BlockDirtyBitmap' } > > Reasons for doing it this way: > > • Several commands take the same arguments, and you don't want to repeat > yourself. > > • You want generated C take a single struct argument ('boxed': true). > > • The arguments are a union (which requires 'boxed': true). > > Drawback: the arguments are then documented elsewhere. Not nice. > > Bug: the generated documentation fails to point there. > > You're proposing to inline the argument documentation, so it appears > right with the command. > > An event's data is just like a command's argument. > > A command's return value can only specified by referencing a type. Same > doc usability issue. > > Similarly, a union type's base can specified inline or by referencing a > struct type, and a union's branches must be specified by referencing a > struct type. Same doc usability issue. > > At least, the generated documentation does point to the referenced > types. > Right. My proposal is to recursively inline referenced bases for the top-level members so that this manual is useful as a user reference, without worrying about the actual QAPI structure. Return types will just be hotlinked. > > (Okay, maybe two screenfuls for commands with a ton of > > arguments... There's only so much I can do!) > > *cough* blockdev-add *cough* > > This RFC series includes a "sandbox" .rst document that showcases the > > features of this domain by writing QAPI directives by hand; this > > document will be removed from the series before final inclusion. It's > > here to serve as a convenient test-bed for y'all to give feedback. > > > > All that said, here's the sandbox document fully rendered: > > https://jsnow.gitlab.io/qemu/qapi/index.html > > > > And here's the new QAPI index page created by that sandbox document: > > https://jsnow.gitlab.io/qemu/qapi-index.html > > > > Known issues / points of interest: > > > > - The formatting upsets checkpatch. The default line length for the > > "black" formatting tool is a little long. I'll fix it next spin. > > > > - I did my best to preserve cross-version compatibility, but some > > problems have crept in here and there. This series may require > > Sphinx>= 4.0, like the dbus extension. Notably, the Ubuntu build fails > > on Gitlab CI currently. The next version will text against more Sphinx > > versions more rigorously. Sphinx version 5.3.0 and above should work > > perfectly. > > > > - There's a bug in Sphinx itself that may manifest in your testing, > > concerning reported error locations. There's a patch in this series > > that fixes it, but it's later in the series. If you run into the bug > > while testing with this series, try applying that patch first. > > > > - QAPI 'namespaces' aren't yet handled. i.e., there's nothing to > > distinguish entities between QMP, QGA and QSD yet. That feature will > > be added to a future version of this patchset (Likely when the > > generator is ready for inclusion: without it, references will clash > > and the index will gripe about duplicated entries.) > > qemu-storage-daemon's QMP is a proper subset of qemu-system-FOO's. > Regardless, each of them has its own, independent reference manual. > That's defensible. > > But the way we build them can complicate matters. For instance, when I > tried to elide types not used for QMP from the reference manuals, I got > defeated by Sphinx caching. > I haven't tried yet, but I believe it should be possible to "tag" each referenced QAPI object and mark it as "visited". Each namespaced definition would be tagged separately. (i.e. qemu.block-core.blockdev-backup and qsd.block-core.blockdev-backup would be two distinct entities.) Then, the renderer could ignore/squelch untagged entities. (I think.) Maybe some work to do to un-index unvisited entities. Or we can go the other route and bake this info into the schema and squelch stuff earlier. We can add this feature later. I am still not sure why your prototype for this ran into cache issues, but I am convinced it should be fixable by making namespaced entities distinct. We could perhaps bake the namespace into the qapidoc directive, e.g.: .. qapi:autodoc:: schema.json :namespace: QSD (And the default adds no namespace.) > > - Per-member features and ifcond are not yet accounted for; though > > definition-scoped features and ifconds are. Please feel free to > > suggest how you'd like to see those represented. > > > > - Inlining all members means there is some ambiguity on what to do with > > doc block sections on inlined entities; features and members have an > > obvious home - body, since, and misc sections are not as obvious on > > how to handle. This will feature more prominently in the generator > > series. > > Yes, this is a real problem. > > The member documentation gets written in the context of the type. It > may make sense only in that context. Inlining copies it into a > different context. > Features may need to combine. Say a command's arguments are a union > type, and several branches of the union both contain deprecated members. > These branch types all document feature @deprecated. Simply inlining > their feature documentation results in feature @deprecated documented > several times. Ugh. Combining them would be better. But how? Do we > need to rethink how we document features? > To be honest, I figured I'd plow ahead and then find the counter-examples programmatically and decide what to do once I had my pile of edge cases. It might involve rewriting docs in some places, but I think the usability win is completely worth the hassle. It's possible there might be some rework needed to maximize cogency of the generated docs, but I think prioritizing a user-facing reference for QMP is the objectively correct end goal and I'm more than happy to work backwards from that desired state. > > - Some features could be implemented in either the QAPI domain syntax > > *or* the generator, or some combination of both. Depending on > > aesthetic feedback, this may influence where those features should be > > implemented. > > > > - The formatting and aesthetics of branches are a little up in the air, > > see the qapi:union patch for more details. > > > > - It's late, maybe other things. Happy Friday! > > > > Hope you like it! > > Looks promising! > Fingers crossed. This has bothered me about QEMU for years. --js >
John Snow <jsnow@redhat.com> writes: > On Fri, Apr 19, 2024, 10:45 AM Markus Armbruster <armbru@redhat.com> wrote: > >> John Snow <jsnow@redhat.com> writes: >> >> > This series adds a new qapi-domain extension for Sphinx, which adds a >> > series of custom directives for documenting QAPI definitions. >> > >> > GitLab CI: https://gitlab.com/jsnow/qemu/-/pipelines/1259566476 >> > >> > (Link to a demo HTML page at the end of this cover letter, but I want >> > you to read the cover letter first to explain what you're seeing.) >> > >> > This adds a new QAPI Index page, cross-references for QMP commands, >> > events, and data types, and improves the aesthetics of the QAPI/QMP >> > documentation. >> >> Cross-references alone will be a *massive* improvement! I'm sure >> readers will appreciate better looks and an index, too. >> >> > This series adds only the new ReST syntax, *not* the autogenerator. The >> > ReST syntax used in this series is, in general, not intended for anyone >> > to actually write by hand. This mimics how Sphinx's own autodoc >> > extension generates Python domain directives, which are then re-parsed >> > to produce the final result. >> > >> > I have prototyped such a generator, but it isn't ready for inclusion >> > yet. (Rest assured: error context reporting is preserved down to the >> > line, even in generated ReST. There is no loss in usability for this >> > approach. It will likely either supplant qapidoc.py or heavily alter >> > it.) The generator requires only extremely minor changes to >> > scripts/qapi/parser.py to preserve nested indentation and provide more >> > accurate line information. It is less invasive than you may >> > fear. Relying on a secondary ReST parse phase eliminates much of the >> > complexity of qapidoc.py. Sleep soundly. >> >> I'm a Sphinx noob. Let me paraphrase you to make sure I understand. >> >> You proprose to generate formatted documentation in two steps: >> >> • First, the QAPI generator generates .rst from the QAPI schema. The >> generated .rst makes use of a custom directives. >> > > Yes, but this .rst file is built in-memory and never makes it to disk, like > Sphinx's autodoc for Python. Makes sense. > (We can add a debug knob to log it or save it out to disk if needed.) Likely useful at least occasionally. >> • Second, Sphinx turns the .rst into formatted documentation. A Sphinx >> qapi-domain extension implements the custom directives > > Yes. > > >> This mirrors how Sphinx works for Python docs. Which is its original >> use case. >> >> Your series demonstrates the second step, with test input you wrote >> manually. >> >> You have code for the first step, but you'd prefer to show it later. > > Right, it's not fully finished, although I have events, commands, and > objects working. Unions, Alternates and Events need work. > > >> Fair? > > Bingo! Thanks! >> > The purpose of sending this series in its current form is largely to >> > solicit feedback on general aesthetics, layout, and features. Sphinx is >> > a wily beast, and feedback at this stage will dictate how and where >> > certain features are implemented. >> >> I'd appreciate help with that. Opinions? > > >> > A goal for this syntax (and the generator) is to fully in-line all >> > command/event/object members, inherited or local, boxed or not, branched >> > or not. This should provide a boon to using these docs as a reference, >> > because users will not have to grep around the page looking for various >> > types, branches, or inherited members. Any arguments types will be >> > hyperlinked to their definition, further aiding usability. Commands can >> > be hotlinked from anywhere else in the manual, and will provide a >> > complete reference directly on the first screenful of information. >> >> Let me elaborate a bit here. >> >> A command's arguments can be specified inline, like so: >> >> { 'command': 'job-cancel', 'data': { 'id': 'str' } } >> >> The arguments are then documented right with the command. >> >> But they can also be specified by referencing an object type, like so: >> >> { 'command': 'block-dirty-bitmap-remove', >> 'data': 'BlockDirtyBitmap' } >> >> Reasons for doing it this way: >> >> • Several commands take the same arguments, and you don't want to repeat >> yourself. >> >> • You want generated C take a single struct argument ('boxed': true). >> >> • The arguments are a union (which requires 'boxed': true). >> >> Drawback: the arguments are then documented elsewhere. Not nice. >> >> Bug: the generated documentation fails to point there. >> >> You're proposing to inline the argument documentation, so it appears >> right with the command. >> >> An event's data is just like a command's argument. >> >> A command's return value can only specified by referencing a type. Same >> doc usability issue. >> >> Similarly, a union type's base can specified inline or by referencing a >> struct type, and a union's branches must be specified by referencing a >> struct type. Same doc usability issue. >> >> At least, the generated documentation does point to the referenced >> types. >> > > Right. My proposal is to recursively inline referenced bases for the > top-level members so that this manual is useful as a user reference, > without worrying about the actual QAPI structure. > > Return types will just be hotlinked. The argument for inlining the arguments equally applies to results: "users will not have to grep around the page". 102 out of 236 commands return something, usually some object type or an array of some object type. Most of these types are used for exactly one command's return and nothing else. Regardless, it's fine to explore inlining just for arguments. If we can make that work, extending it to return results shouldn't be too hard. >> > (Okay, maybe two screenfuls for commands with a ton of >> > arguments... There's only so much I can do!) >> >> *cough* blockdev-add *cough* > > >> > This RFC series includes a "sandbox" .rst document that showcases the >> > features of this domain by writing QAPI directives by hand; this >> > document will be removed from the series before final inclusion. It's >> > here to serve as a convenient test-bed for y'all to give feedback. >> > >> > All that said, here's the sandbox document fully rendered: >> > https://jsnow.gitlab.io/qemu/qapi/index.html >> > >> > And here's the new QAPI index page created by that sandbox document: >> > https://jsnow.gitlab.io/qemu/qapi-index.html >> > >> > Known issues / points of interest: >> > >> > - The formatting upsets checkpatch. The default line length for the >> > "black" formatting tool is a little long. I'll fix it next spin. >> > >> > - I did my best to preserve cross-version compatibility, but some >> > problems have crept in here and there. This series may require >> > Sphinx>= 4.0, like the dbus extension. Notably, the Ubuntu build fails >> > on Gitlab CI currently. The next version will text against more Sphinx >> > versions more rigorously. Sphinx version 5.3.0 and above should work >> > perfectly. >> > >> > - There's a bug in Sphinx itself that may manifest in your testing, >> > concerning reported error locations. There's a patch in this series >> > that fixes it, but it's later in the series. If you run into the bug >> > while testing with this series, try applying that patch first. >> > >> > - QAPI 'namespaces' aren't yet handled. i.e., there's nothing to >> > distinguish entities between QMP, QGA and QSD yet. That feature will >> > be added to a future version of this patchset (Likely when the >> > generator is ready for inclusion: without it, references will clash >> > and the index will gripe about duplicated entries.) >> >> qemu-storage-daemon's QMP is a proper subset of qemu-system-FOO's. >> Regardless, each of them has its own, independent reference manual. >> That's defensible. >> >> But the way we build them can complicate matters. For instance, when I >> tried to elide types not used for QMP from the reference manuals, I got >> defeated by Sphinx caching. > > I haven't tried yet, but I believe it should be possible to "tag" each > referenced QAPI object and mark it as "visited". Each namespaced definition > would be tagged separately. > > (i.e. qemu.block-core.blockdev-backup and qsd.block-core.blockdev-backup > would be two distinct entities.) > > Then, the renderer could ignore/squelch untagged entities. (I think.) > > Maybe some work to do to un-index unvisited entities. > > Or we can go the other route and bake this info into the schema and squelch > stuff earlier. We can add this feature later. I am still not sure why your > prototype for this ran into cache issues, but I am convinced it should be > fixable by making namespaced entities distinct. From (hazy) memory: when full QMP and storage daemon QMP both include a sub-module for its types, but full QMP uses more of its types than storage daemon QMP does, the storage daemon manual elides more of its types than the QMP manual does. However, sphinx-build's caching uses whatever got built first for *both* manuals. When we happen to build the full QMP manual first, we end up with extra types in the storage daemon manual. When we happen to build the storage daemon manual first, the full manual is missing types. > We could perhaps bake the namespace into the qapidoc directive, e.g.: > > .. qapi:autodoc:: schema.json > :namespace: QSD > > (And the default adds no namespace.) Is it worth it? How useful is the separate QEMU Storage Daemon QMP Reference Manual? It's an exact duplicate of selected chapters of the QEMU QMP Reference Manual... Could we instead document "QMP, but only these chapters"? Diversion: can we come up with a better way of subsetting the full QAPI schema for the storage daemon? We currently subset by having storage-daemon/qapi/qapi-schema.json a subset of the submodules qapi/qapi-schema.json includes. The code generated for the entire schema differs (basically qapi-init-commands.c, qapi-emit-events.[ch], and qapi-introspect.c). The code generated for the submodules should be identical (modulo unused array types, but that's harmless detail). So we ignore the code generated for the storage daemon's submodules. End of diversion. >> > - Per-member features and ifcond are not yet accounted for; though >> > definition-scoped features and ifconds are. Please feel free to >> > suggest how you'd like to see those represented. >> > >> > - Inlining all members means there is some ambiguity on what to do with >> > doc block sections on inlined entities; features and members have an >> > obvious home - body, since, and misc sections are not as obvious on >> > how to handle. This will feature more prominently in the generator >> > series. >> >> Yes, this is a real problem. >> >> The member documentation gets written in the context of the type. It >> may make sense only in that context. Inlining copies it into a >> different context. >> >> Features may need to combine. Say a command's arguments are a union >> type, and several branches of the union both contain deprecated members. >> These branch types all document feature @deprecated. Simply inlining >> their feature documentation results in feature @deprecated documented >> several times. Ugh. Combining them would be better. But how? Do we >> need to rethink how we document features? > > To be honest, I figured I'd plow ahead and then find the counter-examples > programmatically and decide what to do once I had my pile of edge cases. > > It might involve rewriting docs in some places, but I think the usability > win is completely worth the hassle. > > It's possible there might be some rework needed to maximize cogency of the > generated docs, but I think prioritizing a user-facing reference for QMP is > the objectively correct end goal and I'm more than happy to work backwards > from that desired state. I'm not disputing the idea that documenting the arguments right with the command is good. I'm merely pointing out obstacles to pulling that off. Adjusting existing documentation is only half the battle. The other half is making sure documentation stays adjusted. We may have to come up with new documentation rules, and ways to enforce them. >> > - Some features could be implemented in either the QAPI domain syntax >> > *or* the generator, or some combination of both. Depending on >> > aesthetic feedback, this may influence where those features should be >> > implemented. >> > >> > - The formatting and aesthetics of branches are a little up in the air, >> > see the qapi:union patch for more details. >> > >> > - It's late, maybe other things. Happy Friday! >> > >> > Hope you like it! >> >> Looks promising! > > Fingers crossed. This has bothered me about QEMU for years. Thanks!
On Mon, Apr 22, 2024 at 5:20 AM Markus Armbruster <armbru@redhat.com> wrote: > > John Snow <jsnow@redhat.com> writes: > > > On Fri, Apr 19, 2024, 10:45 AM Markus Armbruster <armbru@redhat.com> wrote: > > > >> John Snow <jsnow@redhat.com> writes: > >> > >> > This series adds a new qapi-domain extension for Sphinx, which adds a > >> > series of custom directives for documenting QAPI definitions. > >> > > >> > GitLab CI: https://gitlab.com/jsnow/qemu/-/pipelines/1259566476 > >> > > >> > (Link to a demo HTML page at the end of this cover letter, but I want > >> > you to read the cover letter first to explain what you're seeing.) > >> > > >> > This adds a new QAPI Index page, cross-references for QMP commands, > >> > events, and data types, and improves the aesthetics of the QAPI/QMP > >> > documentation. > >> > >> Cross-references alone will be a *massive* improvement! I'm sure > >> readers will appreciate better looks and an index, too. > >> > >> > This series adds only the new ReST syntax, *not* the autogenerator. The > >> > ReST syntax used in this series is, in general, not intended for anyone > >> > to actually write by hand. This mimics how Sphinx's own autodoc > >> > extension generates Python domain directives, which are then re-parsed > >> > to produce the final result. > >> > > >> > I have prototyped such a generator, but it isn't ready for inclusion > >> > yet. (Rest assured: error context reporting is preserved down to the > >> > line, even in generated ReST. There is no loss in usability for this > >> > approach. It will likely either supplant qapidoc.py or heavily alter > >> > it.) The generator requires only extremely minor changes to > >> > scripts/qapi/parser.py to preserve nested indentation and provide more > >> > accurate line information. It is less invasive than you may > >> > fear. Relying on a secondary ReST parse phase eliminates much of the > >> > complexity of qapidoc.py. Sleep soundly. > >> > >> I'm a Sphinx noob. Let me paraphrase you to make sure I understand. > >> > >> You proprose to generate formatted documentation in two steps: > >> > >> • First, the QAPI generator generates .rst from the QAPI schema. The > >> generated .rst makes use of a custom directives. > >> > > > > Yes, but this .rst file is built in-memory and never makes it to disk, like > > Sphinx's autodoc for Python. > > Makes sense. > > > (We can add a debug knob to log it or save it out to disk if needed.) > > Likely useful at least occasionally. Yep, python's autodoc has such a knob to use the debugging log for this. I just want to point out that avoiding the intermediate file on-disk is actually the mechanism by which I can preserve source lines, so this is how it's gotta be. I build an intermediate doc in-memory with source filenames and source lines along with the modified doc block content so that ReST errors can be tracked back directly to the QAPI json files. If we saved to disk and parsed that, it'd obliterate that information. > > >> • Second, Sphinx turns the .rst into formatted documentation. A Sphinx > >> qapi-domain extension implements the custom directives > > > > Yes. > > > > > >> This mirrors how Sphinx works for Python docs. Which is its original > >> use case. > >> > >> Your series demonstrates the second step, with test input you wrote > >> manually. > >> > >> You have code for the first step, but you'd prefer to show it later. > > > > Right, it's not fully finished, although I have events, commands, and > > objects working. Unions, Alternates and Events need work. > > > > > >> Fair? > > > > Bingo! > > Thanks! > > >> > The purpose of sending this series in its current form is largely to > >> > solicit feedback on general aesthetics, layout, and features. Sphinx is > >> > a wily beast, and feedback at this stage will dictate how and where > >> > certain features are implemented. > >> > >> I'd appreciate help with that. Opinions? > > > > > >> > A goal for this syntax (and the generator) is to fully in-line all > >> > command/event/object members, inherited or local, boxed or not, branched > >> > or not. This should provide a boon to using these docs as a reference, > >> > because users will not have to grep around the page looking for various > >> > types, branches, or inherited members. Any arguments types will be > >> > hyperlinked to their definition, further aiding usability. Commands can > >> > be hotlinked from anywhere else in the manual, and will provide a > >> > complete reference directly on the first screenful of information. > >> > >> Let me elaborate a bit here. > >> > >> A command's arguments can be specified inline, like so: > >> > >> { 'command': 'job-cancel', 'data': { 'id': 'str' } } > >> > >> The arguments are then documented right with the command. > >> > >> But they can also be specified by referencing an object type, like so: > >> > >> { 'command': 'block-dirty-bitmap-remove', > >> 'data': 'BlockDirtyBitmap' } > >> > >> Reasons for doing it this way: > >> > >> • Several commands take the same arguments, and you don't want to repeat > >> yourself. > >> > >> • You want generated C take a single struct argument ('boxed': true). > >> > >> • The arguments are a union (which requires 'boxed': true). > >> > >> Drawback: the arguments are then documented elsewhere. Not nice. > >> > >> Bug: the generated documentation fails to point there. > >> > >> You're proposing to inline the argument documentation, so it appears > >> right with the command. > >> > >> An event's data is just like a command's argument. > >> > >> A command's return value can only specified by referencing a type. Same > >> doc usability issue. > >> > >> Similarly, a union type's base can specified inline or by referencing a > >> struct type, and a union's branches must be specified by referencing a > >> struct type. Same doc usability issue. > >> > >> At least, the generated documentation does point to the referenced > >> types. > >> > > > > Right. My proposal is to recursively inline referenced bases for the > > top-level members so that this manual is useful as a user reference, > > without worrying about the actual QAPI structure. > > > > Return types will just be hotlinked. > > The argument for inlining the arguments equally applies to results: > "users will not have to grep around the page". > > 102 out of 236 commands return something, usually some object type or an > array of some object type. Most of these types are used for exactly one > command's return and nothing else. > > Regardless, it's fine to explore inlining just for arguments. If we can > make that work, extending it to return results shouldn't be too hard. Yeah. Future work, if we want it. Otherwise, I think it's not *too* bad to hotlink to the return arg type; but if that's info that you want to "hide" from the user API, I get it - we can work on squanching that in too. (For a follow-up though, please.) > > >> > (Okay, maybe two screenfuls for commands with a ton of > >> > arguments... There's only so much I can do!) > >> > >> *cough* blockdev-add *cough* > > > > > >> > This RFC series includes a "sandbox" .rst document that showcases the > >> > features of this domain by writing QAPI directives by hand; this > >> > document will be removed from the series before final inclusion. It's > >> > here to serve as a convenient test-bed for y'all to give feedback. > >> > > >> > All that said, here's the sandbox document fully rendered: > >> > https://jsnow.gitlab.io/qemu/qapi/index.html > >> > > >> > And here's the new QAPI index page created by that sandbox document: > >> > https://jsnow.gitlab.io/qemu/qapi-index.html > >> > > >> > Known issues / points of interest: > >> > > >> > - The formatting upsets checkpatch. The default line length for the > >> > "black" formatting tool is a little long. I'll fix it next spin. > >> > > >> > - I did my best to preserve cross-version compatibility, but some > >> > problems have crept in here and there. This series may require > >> > Sphinx>= 4.0, like the dbus extension. Notably, the Ubuntu build fails > >> > on Gitlab CI currently. The next version will text against more Sphinx > >> > versions more rigorously. Sphinx version 5.3.0 and above should work > >> > perfectly. > >> > > >> > - There's a bug in Sphinx itself that may manifest in your testing, > >> > concerning reported error locations. There's a patch in this series > >> > that fixes it, but it's later in the series. If you run into the bug > >> > while testing with this series, try applying that patch first. > >> > > >> > - QAPI 'namespaces' aren't yet handled. i.e., there's nothing to > >> > distinguish entities between QMP, QGA and QSD yet. That feature will > >> > be added to a future version of this patchset (Likely when the > >> > generator is ready for inclusion: without it, references will clash > >> > and the index will gripe about duplicated entries.) > >> > >> qemu-storage-daemon's QMP is a proper subset of qemu-system-FOO's. > >> Regardless, each of them has its own, independent reference manual. > >> That's defensible. > >> > >> But the way we build them can complicate matters. For instance, when I > >> tried to elide types not used for QMP from the reference manuals, I got > >> defeated by Sphinx caching. > > > > I haven't tried yet, but I believe it should be possible to "tag" each > > referenced QAPI object and mark it as "visited". Each namespaced definition > > would be tagged separately. > > > > (i.e. qemu.block-core.blockdev-backup and qsd.block-core.blockdev-backup > > would be two distinct entities.) > > > > Then, the renderer could ignore/squelch untagged entities. (I think.) > > > > Maybe some work to do to un-index unvisited entities. > > > > Or we can go the other route and bake this info into the schema and squelch > > stuff earlier. We can add this feature later. I am still not sure why your > > prototype for this ran into cache issues, but I am convinced it should be > > fixable by making namespaced entities distinct. > > From (hazy) memory: when full QMP and storage daemon QMP both include a > sub-module for its types, but full QMP uses more of its types than > storage daemon QMP does, the storage daemon manual elides more of its > types than the QMP manual does. However, sphinx-build's caching uses > whatever got built first for *both* manuals. When we happen to build > the full QMP manual first, we end up with extra types in the storage > daemon manual. When we happen to build the storage daemon manual first, > the full manual is missing types. Weird. I still don't understand where that info is getting cached, to be honest ... > > > We could perhaps bake the namespace into the qapidoc directive, e.g.: > > > > .. qapi:autodoc:: schema.json > > :namespace: QSD > > > > (And the default adds no namespace.) > > Is it worth it? How useful is the separate QEMU Storage Daemon QMP > Reference Manual? It's an exact duplicate of selected chapters of the > QEMU QMP Reference Manual... Could we instead document "QMP, but only > these chapters"? I dunno. That's up to you and Kevin, I think? For now, I'm just following "What we've got" but with cross-referencing and indexing improvements. Those improvements require distinguishing QMP and QSD somehow. Adding simple namespace support seems like the most flexible *and* dead simple way to do it. Other improvements can come later if desired, I think. If you need help adding Sphinx features to make it happen, you know where I live. > > Diversion: can we come up with a better way of subsetting the full QAPI > schema for the storage daemon? Out of scope for me I'm afraid, but clue me in to the discussion if/when you have it. > > We currently subset by having storage-daemon/qapi/qapi-schema.json a > subset of the submodules qapi/qapi-schema.json includes. The code > generated for the entire schema differs (basically qapi-init-commands.c, > qapi-emit-events.[ch], and qapi-introspect.c). The code generated for > the submodules should be identical (modulo unused array types, but > that's harmless detail). So we ignore the code generated for the > storage daemon's submodules. Ah, so you are debating a type of pragma or other meta-include that just fishes out specific definitions? That could work, I think - I'm not sure how the QAPI Schema end would look, but the Sphinx domain could provide a meta-directive for adding a QSD support tag to specific entities or some such thing, and the single QMP reference could advertise a qemu-storage-daemon support flag. Or something - I'm not really privy to the big picture when it comes to how we want to handle QMP modularization and differentiation between different binaries. > > End of diversion. > > >> > - Per-member features and ifcond are not yet accounted for; though > >> > definition-scoped features and ifconds are. Please feel free to > >> > suggest how you'd like to see those represented. > >> > > >> > - Inlining all members means there is some ambiguity on what to do with > >> > doc block sections on inlined entities; features and members have an > >> > obvious home - body, since, and misc sections are not as obvious on > >> > how to handle. This will feature more prominently in the generator > >> > series. > >> > >> Yes, this is a real problem. > >> > >> The member documentation gets written in the context of the type. It > >> may make sense only in that context. Inlining copies it into a > >> different context. > >> > >> Features may need to combine. Say a command's arguments are a union > >> type, and several branches of the union both contain deprecated members. > >> These branch types all document feature @deprecated. Simply inlining > >> their feature documentation results in feature @deprecated documented > >> several times. Ugh. Combining them would be better. But how? Do we > >> need to rethink how we document features? > > > > To be honest, I figured I'd plow ahead and then find the counter-examples > > programmatically and decide what to do once I had my pile of edge cases. > > > > It might involve rewriting docs in some places, but I think the usability > > win is completely worth the hassle. > > > > It's possible there might be some rework needed to maximize cogency of the > > generated docs, but I think prioritizing a user-facing reference for QMP is > > the objectively correct end goal and I'm more than happy to work backwards > > from that desired state. > > I'm not disputing the idea that documenting the arguments right with the > command is good. I'm merely pointing out obstacles to pulling that off. > > Adjusting existing documentation is only half the battle. The other > half is making sure documentation stays adjusted. We may have to come > up with new documentation rules, and ways to enforce them. For the sake of argument, let's say we forbid everything except arg/features from definitions destined to be used as base/inherited types. This would be very easy to enforce at the qapidoc level where the doc inlining is performed by yelping when the base type contains additional documentation sections. Now, in the real world, maybe sometimes those sections are useful and we don't want to get rid of them, esp. because they may contain useful documentation that we don't want to duplicate in the source files. My plan is to just forbid them at first and enumerate the cases where they occur, then decide which ones are better off being moved elsewhere or explicitly tolerated. The generator's tolerance can be adjusted accordingly and we can formulate a rule for exactly how doc blocks are combined and merged. I think it won't be a problem to enforce it programmatically. I'll get back to you on how often and precisely where these cases occur so you can take a look and see how you feel. > > >> > - Some features could be implemented in either the QAPI domain syntax > >> > *or* the generator, or some combination of both. Depending on > >> > aesthetic feedback, this may influence where those features should be > >> > implemented. > >> > > >> > - The formatting and aesthetics of branches are a little up in the air, > >> > see the qapi:union patch for more details. > >> > > >> > - It's late, maybe other things. Happy Friday! > >> > > >> > Hope you like it! > >> > >> Looks promising! > > > > Fingers crossed. This has bothered me about QEMU for years. > > Thanks! >
On Mon, Apr 22, 2024 at 12:38 PM John Snow <jsnow@redhat.com> wrote: > > On Mon, Apr 22, 2024 at 5:20 AM Markus Armbruster <armbru@redhat.com> wrote: > > > > John Snow <jsnow@redhat.com> writes: > > > > > On Fri, Apr 19, 2024, 10:45 AM Markus Armbruster <armbru@redhat.com> wrote: > > > > > >> John Snow <jsnow@redhat.com> writes: > > >> > > >> > This series adds a new qapi-domain extension for Sphinx, which adds a > > >> > series of custom directives for documenting QAPI definitions. > > >> > > > >> > GitLab CI: https://gitlab.com/jsnow/qemu/-/pipelines/1259566476 > > >> > > > >> > (Link to a demo HTML page at the end of this cover letter, but I want > > >> > you to read the cover letter first to explain what you're seeing.) > > >> > > > >> > This adds a new QAPI Index page, cross-references for QMP commands, > > >> > events, and data types, and improves the aesthetics of the QAPI/QMP > > >> > documentation. > > >> > > >> Cross-references alone will be a *massive* improvement! I'm sure > > >> readers will appreciate better looks and an index, too. > > >> > > >> > This series adds only the new ReST syntax, *not* the autogenerator. The > > >> > ReST syntax used in this series is, in general, not intended for anyone > > >> > to actually write by hand. This mimics how Sphinx's own autodoc > > >> > extension generates Python domain directives, which are then re-parsed > > >> > to produce the final result. > > >> > > > >> > I have prototyped such a generator, but it isn't ready for inclusion > > >> > yet. (Rest assured: error context reporting is preserved down to the > > >> > line, even in generated ReST. There is no loss in usability for this > > >> > approach. It will likely either supplant qapidoc.py or heavily alter > > >> > it.) The generator requires only extremely minor changes to > > >> > scripts/qapi/parser.py to preserve nested indentation and provide more > > >> > accurate line information. It is less invasive than you may > > >> > fear. Relying on a secondary ReST parse phase eliminates much of the > > >> > complexity of qapidoc.py. Sleep soundly. > > >> > > >> I'm a Sphinx noob. Let me paraphrase you to make sure I understand. > > >> > > >> You proprose to generate formatted documentation in two steps: > > >> > > >> • First, the QAPI generator generates .rst from the QAPI schema. The > > >> generated .rst makes use of a custom directives. > > >> > > > > > > Yes, but this .rst file is built in-memory and never makes it to disk, like > > > Sphinx's autodoc for Python. > > > > Makes sense. > > > > > (We can add a debug knob to log it or save it out to disk if needed.) > > > > Likely useful at least occasionally. > > Yep, python's autodoc has such a knob to use the debugging log for > this. I just want to point out that avoiding the intermediate file > on-disk is actually the mechanism by which I can preserve source > lines, so this is how it's gotta be. > > I build an intermediate doc in-memory with source filenames and source > lines along with the modified doc block content so that ReST errors > can be tracked back directly to the QAPI json files. If we saved to > disk and parsed that, it'd obliterate that information. > > > > > >> • Second, Sphinx turns the .rst into formatted documentation. A Sphinx > > >> qapi-domain extension implements the custom directives > > > > > > Yes. > > > > > > > > >> This mirrors how Sphinx works for Python docs. Which is its original > > >> use case. > > >> > > >> Your series demonstrates the second step, with test input you wrote > > >> manually. > > >> > > >> You have code for the first step, but you'd prefer to show it later. > > > > > > Right, it's not fully finished, although I have events, commands, and > > > objects working. Unions, Alternates and Events need work. > > > > > > > > >> Fair? > > > > > > Bingo! > > > > Thanks! > > > > >> > The purpose of sending this series in its current form is largely to > > >> > solicit feedback on general aesthetics, layout, and features. Sphinx is > > >> > a wily beast, and feedback at this stage will dictate how and where > > >> > certain features are implemented. > > >> > > >> I'd appreciate help with that. Opinions? > > > > > > > > >> > A goal for this syntax (and the generator) is to fully in-line all > > >> > command/event/object members, inherited or local, boxed or not, branched > > >> > or not. This should provide a boon to using these docs as a reference, > > >> > because users will not have to grep around the page looking for various > > >> > types, branches, or inherited members. Any arguments types will be > > >> > hyperlinked to their definition, further aiding usability. Commands can > > >> > be hotlinked from anywhere else in the manual, and will provide a > > >> > complete reference directly on the first screenful of information. > > >> > > >> Let me elaborate a bit here. > > >> > > >> A command's arguments can be specified inline, like so: > > >> > > >> { 'command': 'job-cancel', 'data': { 'id': 'str' } } > > >> > > >> The arguments are then documented right with the command. > > >> > > >> But they can also be specified by referencing an object type, like so: > > >> > > >> { 'command': 'block-dirty-bitmap-remove', > > >> 'data': 'BlockDirtyBitmap' } > > >> > > >> Reasons for doing it this way: > > >> > > >> • Several commands take the same arguments, and you don't want to repeat > > >> yourself. > > >> > > >> • You want generated C take a single struct argument ('boxed': true). > > >> > > >> • The arguments are a union (which requires 'boxed': true). > > >> > > >> Drawback: the arguments are then documented elsewhere. Not nice. > > >> > > >> Bug: the generated documentation fails to point there. > > >> > > >> You're proposing to inline the argument documentation, so it appears > > >> right with the command. > > >> > > >> An event's data is just like a command's argument. > > >> > > >> A command's return value can only specified by referencing a type. Same > > >> doc usability issue. > > >> > > >> Similarly, a union type's base can specified inline or by referencing a > > >> struct type, and a union's branches must be specified by referencing a > > >> struct type. Same doc usability issue. > > >> > > >> At least, the generated documentation does point to the referenced > > >> types. > > >> > > > > > > Right. My proposal is to recursively inline referenced bases for the > > > top-level members so that this manual is useful as a user reference, > > > without worrying about the actual QAPI structure. > > > > > > Return types will just be hotlinked. > > > > The argument for inlining the arguments equally applies to results: > > "users will not have to grep around the page". > > > > 102 out of 236 commands return something, usually some object type or an > > array of some object type. Most of these types are used for exactly one > > command's return and nothing else. > > > > Regardless, it's fine to explore inlining just for arguments. If we can > > make that work, extending it to return results shouldn't be too hard. > > Yeah. Future work, if we want it. Otherwise, I think it's not *too* > bad to hotlink to the return arg type; but if that's info that you > want to "hide" from the user API, I get it - we can work on squanching > that in too. (For a follow-up though, please.) > > > > > >> > (Okay, maybe two screenfuls for commands with a ton of > > >> > arguments... There's only so much I can do!) > > >> > > >> *cough* blockdev-add *cough* > > > > > > > > >> > This RFC series includes a "sandbox" .rst document that showcases the > > >> > features of this domain by writing QAPI directives by hand; this > > >> > document will be removed from the series before final inclusion. It's > > >> > here to serve as a convenient test-bed for y'all to give feedback. > > >> > > > >> > All that said, here's the sandbox document fully rendered: > > >> > https://jsnow.gitlab.io/qemu/qapi/index.html > > >> > > > >> > And here's the new QAPI index page created by that sandbox document: > > >> > https://jsnow.gitlab.io/qemu/qapi-index.html > > >> > > > >> > Known issues / points of interest: > > >> > > > >> > - The formatting upsets checkpatch. The default line length for the > > >> > "black" formatting tool is a little long. I'll fix it next spin. > > >> > > > >> > - I did my best to preserve cross-version compatibility, but some > > >> > problems have crept in here and there. This series may require > > >> > Sphinx>= 4.0, like the dbus extension. Notably, the Ubuntu build fails > > >> > on Gitlab CI currently. The next version will text against more Sphinx > > >> > versions more rigorously. Sphinx version 5.3.0 and above should work > > >> > perfectly. > > >> > > > >> > - There's a bug in Sphinx itself that may manifest in your testing, > > >> > concerning reported error locations. There's a patch in this series > > >> > that fixes it, but it's later in the series. If you run into the bug > > >> > while testing with this series, try applying that patch first. > > >> > > > >> > - QAPI 'namespaces' aren't yet handled. i.e., there's nothing to > > >> > distinguish entities between QMP, QGA and QSD yet. That feature will > > >> > be added to a future version of this patchset (Likely when the > > >> > generator is ready for inclusion: without it, references will clash > > >> > and the index will gripe about duplicated entries.) > > >> > > >> qemu-storage-daemon's QMP is a proper subset of qemu-system-FOO's. > > >> Regardless, each of them has its own, independent reference manual. > > >> That's defensible. > > >> > > >> But the way we build them can complicate matters. For instance, when I > > >> tried to elide types not used for QMP from the reference manuals, I got > > >> defeated by Sphinx caching. > > > > > > I haven't tried yet, but I believe it should be possible to "tag" each > > > referenced QAPI object and mark it as "visited". Each namespaced definition > > > would be tagged separately. > > > > > > (i.e. qemu.block-core.blockdev-backup and qsd.block-core.blockdev-backup > > > would be two distinct entities.) > > > > > > Then, the renderer could ignore/squelch untagged entities. (I think.) > > > > > > Maybe some work to do to un-index unvisited entities. > > > > > > Or we can go the other route and bake this info into the schema and squelch > > > stuff earlier. We can add this feature later. I am still not sure why your > > > prototype for this ran into cache issues, but I am convinced it should be > > > fixable by making namespaced entities distinct. > > > > From (hazy) memory: when full QMP and storage daemon QMP both include a > > sub-module for its types, but full QMP uses more of its types than > > storage daemon QMP does, the storage daemon manual elides more of its > > types than the QMP manual does. However, sphinx-build's caching uses > > whatever got built first for *both* manuals. When we happen to build > > the full QMP manual first, we end up with extra types in the storage > > daemon manual. When we happen to build the storage daemon manual first, > > the full manual is missing types. > > Weird. I still don't understand where that info is getting cached, to > be honest ... > > > > > > We could perhaps bake the namespace into the qapidoc directive, e.g.: > > > > > > .. qapi:autodoc:: schema.json > > > :namespace: QSD > > > > > > (And the default adds no namespace.) > > > > Is it worth it? How useful is the separate QEMU Storage Daemon QMP > > Reference Manual? It's an exact duplicate of selected chapters of the > > QEMU QMP Reference Manual... Could we instead document "QMP, but only > > these chapters"? > > I dunno. That's up to you and Kevin, I think? For now, I'm just > following "What we've got" but with cross-referencing and indexing > improvements. Those improvements require distinguishing QMP and QSD > somehow. Adding simple namespace support seems like the most flexible > *and* dead simple way to do it. Other improvements can come later if > desired, I think. > > If you need help adding Sphinx features to make it happen, you know > where I live. > > > > > Diversion: can we come up with a better way of subsetting the full QAPI > > schema for the storage daemon? > > Out of scope for me I'm afraid, but clue me in to the discussion > if/when you have it. > > > > > We currently subset by having storage-daemon/qapi/qapi-schema.json a > > subset of the submodules qapi/qapi-schema.json includes. The code > > generated for the entire schema differs (basically qapi-init-commands.c, > > qapi-emit-events.[ch], and qapi-introspect.c). The code generated for > > the submodules should be identical (modulo unused array types, but > > that's harmless detail). So we ignore the code generated for the > > storage daemon's submodules. > > Ah, so you are debating a type of pragma or other meta-include that > just fishes out specific definitions? That could work, I think - I'm > not sure how the QAPI Schema end would look, but the Sphinx domain > could provide a meta-directive for adding a QSD support tag to > specific entities or some such thing, and the single QMP reference > could advertise a qemu-storage-daemon support flag. > > Or something - I'm not really privy to the big picture when it comes > to how we want to handle QMP modularization and differentiation > between different binaries. > > > > > End of diversion. > > > > >> > - Per-member features and ifcond are not yet accounted for; though > > >> > definition-scoped features and ifconds are. Please feel free to > > >> > suggest how you'd like to see those represented. > > >> > > > >> > - Inlining all members means there is some ambiguity on what to do with > > >> > doc block sections on inlined entities; features and members have an > > >> > obvious home - body, since, and misc sections are not as obvious on > > >> > how to handle. This will feature more prominently in the generator > > >> > series. > > >> > > >> Yes, this is a real problem. > > >> > > >> The member documentation gets written in the context of the type. It > > >> may make sense only in that context. Inlining copies it into a > > >> different context. > > >> > > >> Features may need to combine. Say a command's arguments are a union > > >> type, and several branches of the union both contain deprecated members. > > >> These branch types all document feature @deprecated. Simply inlining > > >> their feature documentation results in feature @deprecated documented > > >> several times. Ugh. Combining them would be better. But how? Do we > > >> need to rethink how we document features? > > > > > > To be honest, I figured I'd plow ahead and then find the counter-examples > > > programmatically and decide what to do once I had my pile of edge cases. > > > > > > It might involve rewriting docs in some places, but I think the usability > > > win is completely worth the hassle. > > > > > > It's possible there might be some rework needed to maximize cogency of the > > > generated docs, but I think prioritizing a user-facing reference for QMP is > > > the objectively correct end goal and I'm more than happy to work backwards > > > from that desired state. > > > > I'm not disputing the idea that documenting the arguments right with the > > command is good. I'm merely pointing out obstacles to pulling that off. > > > > Adjusting existing documentation is only half the battle. The other > > half is making sure documentation stays adjusted. We may have to come > > up with new documentation rules, and ways to enforce them. > > For the sake of argument, let's say we forbid everything except > arg/features from definitions destined to be used as base/inherited > types. This would be very easy to enforce at the qapidoc level where > the doc inlining is performed by yelping when the base type contains > additional documentation sections. > > Now, in the real world, maybe sometimes those sections are useful and > we don't want to get rid of them, esp. because they may contain useful > documentation that we don't want to duplicate in the source files. > > My plan is to just forbid them at first and enumerate the cases where > they occur, then decide which ones are better off being moved > elsewhere or explicitly tolerated. The generator's tolerance can be > adjusted accordingly and we can formulate a rule for exactly how doc > blocks are combined and merged. I think it won't be a problem to > enforce it programmatically. > > I'll get back to you on how often and precisely where these cases > occur so you can take a look and see how you feel. > For a warmup, let's look at every unique instance of non-empty paragraph text on an object that is used as a base anywhere: Warning: AudiodevPerDirectionOptions - inlined paragraph Warning: BlockdevOptionsCurlBase - inlined paragraph Warning: BlockdevOptionsGenericCOWFormat - inlined paragraph Warning: BlockdevOptionsGenericFormat - inlined paragraph Warning: BlockExportOptionsNbdBase - inlined paragraph Warning: BlockNodeInfo - inlined paragraph Warning: ChardevCommon - inlined paragraph Warning: CpuInstanceProperties - inlined paragraph Warning: CryptodevBackendProperties - inlined paragraph Warning: EventLoopBaseProperties - inlined paragraph Warning: MemoryBackendProperties - inlined paragraph Warning: NetfilterProperties - inlined paragraph Warning: QCryptoBlockAmendOptionsLUKS - inlined paragraph Warning: QCryptoBlockCreateOptionsLUKS - inlined paragraph Warning: QCryptoBlockInfoBase - inlined paragraph Warning: QCryptoBlockOptionsBase - inlined paragraph Warning: QCryptoBlockOptionsLUKS - inlined paragraph Warning: RngProperties - inlined paragraph Warning: SecretCommonProperties - inlined paragraph Warning: SpiceBasicInfo - inlined paragraph Warning: TlsCredsProperties - inlined paragraph Warning: VncBasicInfo - inlined paragraph There's 22 instances. Let's look at what they say: AudiodevPerDirectionOptions: "General audio backend options that are used for both playback and recording." BlockdevOptionsCurlBase: "Driver specific block device options shared by all protocols supported by the curl backend." BlockdevOptionsGenericCOWFormat: "Driver specific block device options for image format that have no option besides their data source and an optional backing file." BlockdevOptionsGenericFormat: "Driver specific block device options for image format that have no option besides their data source." BlockExportOptionsNbdBase: "An NBD block export (common options shared between nbd-server-add and the NBD branch of block-export-add)." BlockNodeInfo: "Information about a QEMU image file" ChardevCommon: "Configuration shared across all chardev backends" CpuInstanceProperties: "List of properties to be used for hotplugging a CPU instance, it should be passed by management with device_add command when a CPU is being hotplugged. Which members are optional and which mandatory depends on the architecture and board. For s390x see :ref:`cpu-topology-s390x`. The ids other than the node-id specify the position of the CPU within the CPU topology (as defined by the machine property "smp", thus see also type @SMPConfiguration)" CryptodevBackendProperties: "Properties for cryptodev-backend and cryptodev-backend-builtin objects." EventLoopBaseProperties: "Common properties for event loops" MemoryBackendProperties: "Properties for objects of classes derived from memory-backend." NetfilterProperties: "Properties for objects of classes derived from netfilter." QCryptoBlockAmendOptionsLUKS: "This struct defines the update parameters that activate/de-activate set of keyslots" QCryptoBlockCreateOptionsLUKS: "The options that apply to LUKS encryption format initialization" QCryptoBlockInfoBase: "The common information that applies to all full disk encryption formats" QCryptoBlockOptionsBase: "The common options that apply to all full disk encryption formats" QCryptoBlockOptionsLUKS: "The options that apply to LUKS encryption format" RngProperties: "Properties for objects of classes derived from rng." SecretCommonProperties: "Properties for objects of classes derived from secret-common." SpiceBasicInfo: "The basic information for SPICE network connection" TlsCredsProperties: "Properties for objects of classes derived from tls-creds." VncBasicInfo: "The basic information for vnc network connection" ... Alright. So like 98% of the time, it's functionally useless information for the end-user. The only thing that looks mildly interesting is CpuInstanceProperties and *maybe* QCryptoBlockAmendOptionsLUKS. I propose we add a new section to QAPI doc blocks that gets ignored from rendered documentation, like "Comment:" -- to keep any info that might be mildly relevant to a developer that explains the *factoring* of the object, but won't be exposed in user-facing documents. On the occasion we DO want to inline documentation paragraphs, we can leave them in and have the doc generator inline them. There's probably very few cases where we actually want this. Let's take a look at any tagged sections now, excluding "Since": Warning: BackupCommon - inlined tag section Note Warning: CpuInstanceProperties - inlined, tag section Note Warning: MemoryBackendProperties - inlined tag section Note Not very many! Just three. BackupCommon: Note: @on-source-error and @on-target-error only affect background I/O. If an error occurs during a guest write request, the device's rerror/werror actions will be used. BackupCommon is used as a base for DriveBackup and BlockdevBackup. In this case, this note probably does belong on both. It should be inlined and included. CpuInstanceProperties: Note: management should be prepared to pass through additional properties with device_add. This is only used as a base with no other addon args for NumaCpuOptions, in turn only used for an argument. This is probably right to pass through, too. (Though I admit I don't really know what it means... we can discuss the doc *quality* another day.) MemoryBackendProperties: Note: prealloc=true and reserve=false cannot be set at the same time. With reserve=true, the behavior depends on the operating system: for example, Linux will not reserve swap space for shared file mappings -- "not applicable". In contrast, reserve=false will bail out if it cannot be configured accordingly. This is used in MemoryBackendFileProperties, MemoryBackendMemfdProperties, and MemoryBackendEpcProperties. None are commands. I think the note here should also be inlined into each object. So, I think that: - Most naked paragraphs are usually useless to the end-user, and we should put them behind a Comment section to prevent them from being inlined. - Any *other* paragraph or section should be included in the descendent. We'll just review with that eye going forward. - Since: ... we'll talk about later this week. --js
John Snow <jsnow@redhat.com> writes: > On Mon, Apr 22, 2024 at 5:20 AM Markus Armbruster <armbru@redhat.com> wrote: >> >> John Snow <jsnow@redhat.com> writes: >> >> > On Fri, Apr 19, 2024, 10:45 AM Markus Armbruster <armbru@redhat.com> wrote: >> > >> >> John Snow <jsnow@redhat.com> writes: >> >> >> >> > This series adds a new qapi-domain extension for Sphinx, which adds a >> >> > series of custom directives for documenting QAPI definitions. >> >> > >> >> > GitLab CI: https://gitlab.com/jsnow/qemu/-/pipelines/1259566476 >> >> > >> >> > (Link to a demo HTML page at the end of this cover letter, but I want >> >> > you to read the cover letter first to explain what you're seeing.) >> >> > >> >> > This adds a new QAPI Index page, cross-references for QMP commands, >> >> > events, and data types, and improves the aesthetics of the QAPI/QMP >> >> > documentation. >> >> >> >> Cross-references alone will be a *massive* improvement! I'm sure >> >> readers will appreciate better looks and an index, too. >> >> >> >> > This series adds only the new ReST syntax, *not* the autogenerator. The >> >> > ReST syntax used in this series is, in general, not intended for anyone >> >> > to actually write by hand. This mimics how Sphinx's own autodoc >> >> > extension generates Python domain directives, which are then re-parsed >> >> > to produce the final result. >> >> > >> >> > I have prototyped such a generator, but it isn't ready for inclusion >> >> > yet. (Rest assured: error context reporting is preserved down to the >> >> > line, even in generated ReST. There is no loss in usability for this >> >> > approach. It will likely either supplant qapidoc.py or heavily alter >> >> > it.) The generator requires only extremely minor changes to >> >> > scripts/qapi/parser.py to preserve nested indentation and provide more >> >> > accurate line information. It is less invasive than you may >> >> > fear. Relying on a secondary ReST parse phase eliminates much of the >> >> > complexity of qapidoc.py. Sleep soundly. >> >> >> >> I'm a Sphinx noob. Let me paraphrase you to make sure I understand. >> >> >> >> You proprose to generate formatted documentation in two steps: >> >> >> >> • First, the QAPI generator generates .rst from the QAPI schema. The >> >> generated .rst makes use of a custom directives. >> >> >> > >> > Yes, but this .rst file is built in-memory and never makes it to disk, like >> > Sphinx's autodoc for Python. >> >> Makes sense. >> >> > (We can add a debug knob to log it or save it out to disk if needed.) >> >> Likely useful at least occasionally. > > Yep, python's autodoc has such a knob to use the debugging log for > this. I just want to point out that avoiding the intermediate file > on-disk is actually the mechanism by which I can preserve source > lines, so this is how it's gotta be. > > I build an intermediate doc in-memory with source filenames and source > lines along with the modified doc block content so that ReST errors > can be tracked back directly to the QAPI json files. If we saved to > disk and parsed that, it'd obliterate that information. Sounds just fine to me. >> >> • Second, Sphinx turns the .rst into formatted documentation. A Sphinx >> >> qapi-domain extension implements the custom directives >> > >> > Yes. >> > >> > >> >> This mirrors how Sphinx works for Python docs. Which is its original >> >> use case. >> >> >> >> Your series demonstrates the second step, with test input you wrote >> >> manually. >> >> >> >> You have code for the first step, but you'd prefer to show it later. >> > >> > Right, it's not fully finished, although I have events, commands, and >> > objects working. Unions, Alternates and Events need work. >> > >> > >> >> Fair? >> > >> > Bingo! >> >> Thanks! >> >> >> > The purpose of sending this series in its current form is largely to >> >> > solicit feedback on general aesthetics, layout, and features. Sphinx is >> >> > a wily beast, and feedback at this stage will dictate how and where >> >> > certain features are implemented. >> >> >> >> I'd appreciate help with that. Opinions? >> > >> > >> >> > A goal for this syntax (and the generator) is to fully in-line all >> >> > command/event/object members, inherited or local, boxed or not, branched >> >> > or not. This should provide a boon to using these docs as a reference, >> >> > because users will not have to grep around the page looking for various >> >> > types, branches, or inherited members. Any arguments types will be >> >> > hyperlinked to their definition, further aiding usability. Commands can >> >> > be hotlinked from anywhere else in the manual, and will provide a >> >> > complete reference directly on the first screenful of information. >> >> >> >> Let me elaborate a bit here. >> >> >> >> A command's arguments can be specified inline, like so: >> >> >> >> { 'command': 'job-cancel', 'data': { 'id': 'str' } } >> >> >> >> The arguments are then documented right with the command. >> >> >> >> But they can also be specified by referencing an object type, like so: >> >> >> >> { 'command': 'block-dirty-bitmap-remove', >> >> 'data': 'BlockDirtyBitmap' } >> >> >> >> Reasons for doing it this way: >> >> >> >> • Several commands take the same arguments, and you don't want to repeat >> >> yourself. >> >> >> >> • You want generated C take a single struct argument ('boxed': true). >> >> >> >> • The arguments are a union (which requires 'boxed': true). >> >> >> >> Drawback: the arguments are then documented elsewhere. Not nice. >> >> >> >> Bug: the generated documentation fails to point there. >> >> >> >> You're proposing to inline the argument documentation, so it appears >> >> right with the command. >> >> >> >> An event's data is just like a command's argument. >> >> >> >> A command's return value can only specified by referencing a type. Same >> >> doc usability issue. >> >> >> >> Similarly, a union type's base can specified inline or by referencing a >> >> struct type, and a union's branches must be specified by referencing a >> >> struct type. Same doc usability issue. >> >> >> >> At least, the generated documentation does point to the referenced >> >> types. >> >> >> > >> > Right. My proposal is to recursively inline referenced bases for the >> > top-level members so that this manual is useful as a user reference, >> > without worrying about the actual QAPI structure. >> > >> > Return types will just be hotlinked. >> >> The argument for inlining the arguments equally applies to results: >> "users will not have to grep around the page". >> >> 102 out of 236 commands return something, usually some object type or an >> array of some object type. Most of these types are used for exactly one >> command's return and nothing else. >> >> Regardless, it's fine to explore inlining just for arguments. If we can >> make that work, extending it to return results shouldn't be too hard. > > Yeah. Future work, if we want it. Otherwise, I think it's not *too* > bad to hotlink to the return arg type; I think hiding a command's results behind a link is roughly as bad as hiding its arguments behind a link. > but if that's info that you > want to "hide" from the user API, I get it - we can work on squanching > that in too. (For a follow-up though, please.) It's not about hiding type names. Sure, type names are not ABI, since there's nothing the user can do with them. But as long as that's understood, they can be useful in documentation. We use them as text for links to and a title for the documentation of a type. Messing with type names affects link texts and titles in documentation, which is fine. >> >> > (Okay, maybe two screenfuls for commands with a ton of >> >> > arguments... There's only so much I can do!) >> >> >> >> *cough* blockdev-add *cough* >> > >> > >> >> > This RFC series includes a "sandbox" .rst document that showcases the >> >> > features of this domain by writing QAPI directives by hand; this >> >> > document will be removed from the series before final inclusion. It's >> >> > here to serve as a convenient test-bed for y'all to give feedback. >> >> > >> >> > All that said, here's the sandbox document fully rendered: >> >> > https://jsnow.gitlab.io/qemu/qapi/index.html >> >> > >> >> > And here's the new QAPI index page created by that sandbox document: >> >> > https://jsnow.gitlab.io/qemu/qapi-index.html >> >> > >> >> > Known issues / points of interest: >> >> > >> >> > - The formatting upsets checkpatch. The default line length for the >> >> > "black" formatting tool is a little long. I'll fix it next spin. >> >> > >> >> > - I did my best to preserve cross-version compatibility, but some >> >> > problems have crept in here and there. This series may require >> >> > Sphinx>= 4.0, like the dbus extension. Notably, the Ubuntu build fails >> >> > on Gitlab CI currently. The next version will text against more Sphinx >> >> > versions more rigorously. Sphinx version 5.3.0 and above should work >> >> > perfectly. >> >> > >> >> > - There's a bug in Sphinx itself that may manifest in your testing, >> >> > concerning reported error locations. There's a patch in this series >> >> > that fixes it, but it's later in the series. If you run into the bug >> >> > while testing with this series, try applying that patch first. >> >> > >> >> > - QAPI 'namespaces' aren't yet handled. i.e., there's nothing to >> >> > distinguish entities between QMP, QGA and QSD yet. That feature will >> >> > be added to a future version of this patchset (Likely when the >> >> > generator is ready for inclusion: without it, references will clash >> >> > and the index will gripe about duplicated entries.) >> >> >> >> qemu-storage-daemon's QMP is a proper subset of qemu-system-FOO's. >> >> Regardless, each of them has its own, independent reference manual. >> >> That's defensible. >> >> >> >> But the way we build them can complicate matters. For instance, when I >> >> tried to elide types not used for QMP from the reference manuals, I got >> >> defeated by Sphinx caching. >> > >> > I haven't tried yet, but I believe it should be possible to "tag" each >> > referenced QAPI object and mark it as "visited". Each namespaced definition >> > would be tagged separately. >> > >> > (i.e. qemu.block-core.blockdev-backup and qsd.block-core.blockdev-backup >> > would be two distinct entities.) >> > >> > Then, the renderer could ignore/squelch untagged entities. (I think.) >> > >> > Maybe some work to do to un-index unvisited entities. >> > >> > Or we can go the other route and bake this info into the schema and squelch >> > stuff earlier. We can add this feature later. I am still not sure why your >> > prototype for this ran into cache issues, but I am convinced it should be >> > fixable by making namespaced entities distinct. >> >> From (hazy) memory: when full QMP and storage daemon QMP both include a >> sub-module for its types, but full QMP uses more of its types than >> storage daemon QMP does, the storage daemon manual elides more of its >> types than the QMP manual does. However, sphinx-build's caching uses >> whatever got built first for *both* manuals. When we happen to build >> the full QMP manual first, we end up with extra types in the storage >> daemon manual. When we happen to build the storage daemon manual first, >> the full manual is missing types. > > Weird. I still don't understand where that info is getting cached, to > be honest ... Me neither. >> > We could perhaps bake the namespace into the qapidoc directive, e.g.: >> > >> > .. qapi:autodoc:: schema.json >> > :namespace: QSD >> > >> > (And the default adds no namespace.) >> >> Is it worth it? How useful is the separate QEMU Storage Daemon QMP >> Reference Manual? It's an exact duplicate of selected chapters of the >> QEMU QMP Reference Manual... Could we instead document "QMP, but only >> these chapters"? > > I dunno. That's up to you and Kevin, I think? For now, I'm just > following "What we've got" but with cross-referencing and indexing > improvements. Those improvements require distinguishing QMP and QSD > somehow. Adding simple namespace support seems like the most flexible > *and* dead simple way to do it. Other improvements can come later if > desired, I think. If you want namespace support to keep QMP and QGA apart, go for it. If it's just for full QMP vs. qemu-storage-daemon QMP, I suggest to ignore the issue for now. > If you need help adding Sphinx features to make it happen, you know > where I live. > >> Diversion: can we come up with a better way of subsetting the full QAPI >> schema for the storage daemon? > > Out of scope for me I'm afraid, but clue me in to the discussion > if/when you have it. Sure! >> We currently subset by having storage-daemon/qapi/qapi-schema.json a >> subset of the submodules qapi/qapi-schema.json includes. The code >> generated for the entire schema differs (basically qapi-init-commands.c, >> qapi-emit-events.[ch], and qapi-introspect.c). The code generated for >> the submodules should be identical (modulo unused array types, but >> that's harmless detail). So we ignore the code generated for the >> storage daemon's submodules. > > Ah, so you are debating a type of pragma or other meta-include that > just fishes out specific definitions? That could work, I think - I'm > not sure how the QAPI Schema end would look, but the Sphinx domain > could provide a meta-directive for adding a QSD support tag to > specific entities or some such thing, and the single QMP reference > could advertise a qemu-storage-daemon support flag. > > Or something - I'm not really privy to the big picture when it comes > to how we want to handle QMP modularization and differentiation > between different binaries. > >> End of diversion. >> >> >> > - Per-member features and ifcond are not yet accounted for; though >> >> > definition-scoped features and ifconds are. Please feel free to >> >> > suggest how you'd like to see those represented. >> >> > >> >> > - Inlining all members means there is some ambiguity on what to do with >> >> > doc block sections on inlined entities; features and members have an >> >> > obvious home - body, since, and misc sections are not as obvious on >> >> > how to handle. This will feature more prominently in the generator >> >> > series. >> >> >> >> Yes, this is a real problem. >> >> >> >> The member documentation gets written in the context of the type. It >> >> may make sense only in that context. Inlining copies it into a >> >> different context. >> >> >> >> Features may need to combine. Say a command's arguments are a union >> >> type, and several branches of the union both contain deprecated members. >> >> These branch types all document feature @deprecated. Simply inlining >> >> their feature documentation results in feature @deprecated documented >> >> several times. Ugh. Combining them would be better. But how? Do we >> >> need to rethink how we document features? >> > >> > To be honest, I figured I'd plow ahead and then find the counter-examples >> > programmatically and decide what to do once I had my pile of edge cases. >> > >> > It might involve rewriting docs in some places, but I think the usability >> > win is completely worth the hassle. >> > >> > It's possible there might be some rework needed to maximize cogency of the >> > generated docs, but I think prioritizing a user-facing reference for QMP is >> > the objectively correct end goal and I'm more than happy to work backwards >> > from that desired state. >> >> I'm not disputing the idea that documenting the arguments right with the >> command is good. I'm merely pointing out obstacles to pulling that off. >> >> Adjusting existing documentation is only half the battle. The other >> half is making sure documentation stays adjusted. We may have to come >> up with new documentation rules, and ways to enforce them. > > For the sake of argument, let's say we forbid everything except > arg/features from definitions destined to be used as base/inherited > types. This would be very easy to enforce at the qapidoc level where > the doc inlining is performed by yelping when the base type contains > additional documentation sections. Yes. The type's doc comment is then no longer fit for generating standalone documentation for the type, only for generating inlined documentation. > Now, in the real world, maybe sometimes those sections are useful and > we don't want to get rid of them, esp. because they may contain useful > documentation that we don't want to duplicate in the source files. Or a type is used both in a position where its documentation is to be inlined, and a position where it isn't. The latter requires us to generate standalone documentation for the type, so we can link to it. Whether this is a problem in practice I can't say. > My plan is to just forbid them at first and enumerate the cases where > they occur, then decide which ones are better off being moved > elsewhere or explicitly tolerated. The generator's tolerance can be > adjusted accordingly and we can formulate a rule for exactly how doc > blocks are combined and merged. I think it won't be a problem to > enforce it programmatically. > > I'll get back to you on how often and precisely where these cases > occur so you can take a look and see how you feel. > >> >> >> > - Some features could be implemented in either the QAPI domain syntax >> >> > *or* the generator, or some combination of both. Depending on >> >> > aesthetic feedback, this may influence where those features should be >> >> > implemented. >> >> > >> >> > - The formatting and aesthetics of branches are a little up in the air, >> >> > see the qapi:union patch for more details. >> >> > >> >> > - It's late, maybe other things. Happy Friday! >> >> > >> >> > Hope you like it! >> >> >> >> Looks promising! >> > >> > Fingers crossed. This has bothered me about QEMU for years. >> >> Thanks!
John Snow <jsnow@redhat.com> writes: > On Mon, Apr 22, 2024 at 12:38 PM John Snow <jsnow@redhat.com> wrote: >> >> On Mon, Apr 22, 2024 at 5:20 AM Markus Armbruster <armbru@redhat.com> wrote: >> > >> > John Snow <jsnow@redhat.com> writes: >> > >> > > On Fri, Apr 19, 2024, 10:45 AM Markus Armbruster <armbru@redhat.com> wrote: >> > > >> > >> John Snow <jsnow@redhat.com> writes: >> > >> >> > >> > This series adds a new qapi-domain extension for Sphinx, which adds a >> > >> > series of custom directives for documenting QAPI definitions. [...] >> > >> > Known issues / points of interest: [...] >> > >> > - Inlining all members means there is some ambiguity on what to do with >> > >> > doc block sections on inlined entities; features and members have an >> > >> > obvious home - body, since, and misc sections are not as obvious on >> > >> > how to handle. This will feature more prominently in the generator >> > >> > series. >> > >> >> > >> Yes, this is a real problem. >> > >> >> > >> The member documentation gets written in the context of the type. It >> > >> may make sense only in that context. Inlining copies it into a >> > >> different context. >> > >> >> > >> Features may need to combine. Say a command's arguments are a union >> > >> type, and several branches of the union both contain deprecated members. >> > >> These branch types all document feature @deprecated. Simply inlining >> > >> their feature documentation results in feature @deprecated documented >> > >> several times. Ugh. Combining them would be better. But how? Do we >> > >> need to rethink how we document features? >> > > >> > > To be honest, I figured I'd plow ahead and then find the counter-examples >> > > programmatically and decide what to do once I had my pile of edge cases. >> > > >> > > It might involve rewriting docs in some places, but I think the usability >> > > win is completely worth the hassle. >> > > >> > > It's possible there might be some rework needed to maximize cogency of the >> > > generated docs, but I think prioritizing a user-facing reference for QMP is >> > > the objectively correct end goal and I'm more than happy to work backwards >> > > from that desired state. >> > >> > I'm not disputing the idea that documenting the arguments right with the >> > command is good. I'm merely pointing out obstacles to pulling that off. >> > >> > Adjusting existing documentation is only half the battle. The other >> > half is making sure documentation stays adjusted. We may have to come >> > up with new documentation rules, and ways to enforce them. >> >> For the sake of argument, let's say we forbid everything except >> arg/features from definitions destined to be used as base/inherited >> types. This would be very easy to enforce at the qapidoc level where >> the doc inlining is performed by yelping when the base type contains >> additional documentation sections. >> >> Now, in the real world, maybe sometimes those sections are useful and >> we don't want to get rid of them, esp. because they may contain useful >> documentation that we don't want to duplicate in the source files. >> >> My plan is to just forbid them at first and enumerate the cases where >> they occur, then decide which ones are better off being moved >> elsewhere or explicitly tolerated. The generator's tolerance can be >> adjusted accordingly and we can formulate a rule for exactly how doc >> blocks are combined and merged. I think it won't be a problem to >> enforce it programmatically. >> >> I'll get back to you on how often and precisely where these cases >> occur so you can take a look and see how you feel. >> > > For a warmup, let's look at every unique instance of non-empty > paragraph text on an object that is used as a base anywhere: > > Warning: AudiodevPerDirectionOptions - inlined paragraph > Warning: BlockdevOptionsCurlBase - inlined paragraph > Warning: BlockdevOptionsGenericCOWFormat - inlined paragraph > Warning: BlockdevOptionsGenericFormat - inlined paragraph > Warning: BlockExportOptionsNbdBase - inlined paragraph > Warning: BlockNodeInfo - inlined paragraph > Warning: ChardevCommon - inlined paragraph > Warning: CpuInstanceProperties - inlined paragraph > Warning: CryptodevBackendProperties - inlined paragraph > Warning: EventLoopBaseProperties - inlined paragraph > Warning: MemoryBackendProperties - inlined paragraph > Warning: NetfilterProperties - inlined paragraph > Warning: QCryptoBlockAmendOptionsLUKS - inlined paragraph > Warning: QCryptoBlockCreateOptionsLUKS - inlined paragraph > Warning: QCryptoBlockInfoBase - inlined paragraph > Warning: QCryptoBlockOptionsBase - inlined paragraph > Warning: QCryptoBlockOptionsLUKS - inlined paragraph > Warning: RngProperties - inlined paragraph > Warning: SecretCommonProperties - inlined paragraph > Warning: SpiceBasicInfo - inlined paragraph > Warning: TlsCredsProperties - inlined paragraph > Warning: VncBasicInfo - inlined paragraph > > There's 22 instances. > > Let's look at what they say: > > AudiodevPerDirectionOptions: "General audio backend options that are > used for both playback and recording." > BlockdevOptionsCurlBase: "Driver specific block device options shared > by all protocols supported by the curl backend." > BlockdevOptionsGenericCOWFormat: "Driver specific block device options > for image format that have no option besides their data source and an > optional backing file." > BlockdevOptionsGenericFormat: "Driver specific block device options > for image format that have no option besides their data source." > BlockExportOptionsNbdBase: "An NBD block export (common options shared > between nbd-server-add and the NBD branch of block-export-add)." > BlockNodeInfo: "Information about a QEMU image file" > ChardevCommon: "Configuration shared across all chardev backends" > > CpuInstanceProperties: > "List of properties to be used for hotplugging a CPU instance, it > should be passed by management with device_add command when a CPU is > being hotplugged. > > Which members are optional and which mandatory depends on the > architecture and board. > > For s390x see :ref:`cpu-topology-s390x`. > > The ids other than the node-id specify the position of the CPU > within the CPU topology (as defined by the machine property "smp", > thus see also type @SMPConfiguration)" > > CryptodevBackendProperties: "Properties for cryptodev-backend and > cryptodev-backend-builtin objects." > EventLoopBaseProperties: "Common properties for event loops" > MemoryBackendProperties: "Properties for objects of classes derived > from memory-backend." > NetfilterProperties: "Properties for objects of classes derived from netfilter." > QCryptoBlockAmendOptionsLUKS: "This struct defines the update > parameters that activate/de-activate set of keyslots" > QCryptoBlockCreateOptionsLUKS: "The options that apply to LUKS > encryption format initialization" > QCryptoBlockInfoBase: "The common information that applies to all full > disk encryption formats" > QCryptoBlockOptionsBase: "The common options that apply to all full > disk encryption formats" > QCryptoBlockOptionsLUKS: "The options that apply to LUKS encryption format" > RngProperties: "Properties for objects of classes derived from rng." > SecretCommonProperties: "Properties for objects of classes derived > from secret-common." > SpiceBasicInfo: "The basic information for SPICE network connection" > TlsCredsProperties: "Properties for objects of classes derived from tls-creds." > VncBasicInfo: "The basic information for vnc network connection" > > ... Alright. So like 98% of the time, it's functionally useless > information for the end-user. The only thing that looks mildly > interesting is CpuInstanceProperties and *maybe* > QCryptoBlockAmendOptionsLUKS. > > I propose we add a new section to QAPI doc blocks that gets ignored > from rendered documentation, like "Comment:" -- to keep any info that > might be mildly relevant to a developer that explains the *factoring* > of the object, but won't be exposed in user-facing documents. Two existing ways to add comments that don't go into generated documentation: 1. Write a non-doc comment. ## # This is a doc comment. ## # # This isn't. # 2. TODO sections in a doc comment. Not sure we need more ways, but if we do, we'll create them. > On the occasion we DO want to inline documentation paragraphs, we can > leave them in and have the doc generator inline them. There's probably > very few cases where we actually want this. > > Let's take a look at any tagged sections now, excluding "Since": > > Warning: BackupCommon - inlined tag section Note > Warning: CpuInstanceProperties - inlined, tag section Note > Warning: MemoryBackendProperties - inlined tag section Note > > Not very many! Just three. > > BackupCommon: > > Note: @on-source-error and @on-target-error only affect background > I/O. If an error occurs during a guest write request, the > device's rerror/werror actions will be used. > > BackupCommon is used as a base for DriveBackup and BlockdevBackup. In > this case, this note probably does belong on both. It should be > inlined and included. > > CpuInstanceProperties: > > Note: management should be prepared to pass through additional > properties with device_add. > > This is only used as a base with no other addon args for > NumaCpuOptions, in turn only used for an argument. This is probably > right to pass through, too. (Though I admit I don't really know what > it means... we can discuss the doc *quality* another day.) Oh boy, don't get me started! > MemoryBackendProperties: > > Note: prealloc=true and reserve=false cannot be set at the same > time. With reserve=true, the behavior depends on the operating > system: for example, Linux will not reserve swap space for > shared file mappings -- "not applicable". In contrast, > reserve=false will bail out if it cannot be configured > accordingly. > > This is used in MemoryBackendFileProperties, > MemoryBackendMemfdProperties, and MemoryBackendEpcProperties. None are > commands. I think the note here should also be inlined into each > object. > > So, I think that: > > - Most naked paragraphs are usually useless to the end-user, and we > should put them behind a Comment section to prevent them from being > inlined. We may want to delete them instead. > - Any *other* paragraph or section should be included in the > descendent. We'll just review with that eye going forward. But where exactly does it go? The question applies not just to tagged sections such as "Note:", but to argument, member, and feature descriptions, too. Our current answer for argument / member descriptions is "right after the body section". Works because we permit only the body section before argument / member descriptions . Pretty arbitrary restriction, though. Fine print: can be a bit more complicated for unions, but let's ignore that here. I guess our current answer for feature descriptions is something like "right after argument / member descriptions if they exist, else right after the body section". What could our answer be for other sections? In my experience, the people writing the doc comments rarely check how they come out in generated documentation. The closer the doc comments are to the generated documentation, the higher the chance it comes out as intended. The more complex the inlining gets, the higher the chance of mishaps. If cases of complex inlining are rare, we could sidestep complex inlining by inlining manually. If that's undesirable or impractical, we could require explicit instructions where the inlined material should go. I'm merely thinking aloud; these are certainly not requests, just ideas, and possibly bad ones. > - Since: ... we'll talk about later this week. Yes.
On Tue, Apr 23, 2024, 3:48 AM Markus Armbruster <armbru@redhat.com> wrote: > John Snow <jsnow@redhat.com> writes: > > > On Mon, Apr 22, 2024 at 12:38 PM John Snow <jsnow@redhat.com> wrote: > >> > >> On Mon, Apr 22, 2024 at 5:20 AM Markus Armbruster <armbru@redhat.com> > wrote: > >> > > >> > John Snow <jsnow@redhat.com> writes: > >> > > >> > > On Fri, Apr 19, 2024, 10:45 AM Markus Armbruster <armbru@redhat.com> > wrote: > >> > > > >> > >> John Snow <jsnow@redhat.com> writes: > >> > >> > >> > >> > This series adds a new qapi-domain extension for Sphinx, which > adds a > >> > >> > series of custom directives for documenting QAPI definitions. > > [...] > > >> > >> > Known issues / points of interest: > > [...] > > >> > >> > - Inlining all members means there is some ambiguity on what to > do with > >> > >> > doc block sections on inlined entities; features and members > have an > >> > >> > obvious home - body, since, and misc sections are not as > obvious on > >> > >> > how to handle. This will feature more prominently in the > generator > >> > >> > series. > >> > >> > >> > >> Yes, this is a real problem. > >> > >> > >> > >> The member documentation gets written in the context of the type. > It > >> > >> may make sense only in that context. Inlining copies it into a > >> > >> different context. > >> > >> > >> > >> Features may need to combine. Say a command's arguments are a > union > >> > >> type, and several branches of the union both contain deprecated > members. > >> > >> These branch types all document feature @deprecated. Simply > inlining > >> > >> their feature documentation results in feature @deprecated > documented > >> > >> several times. Ugh. Combining them would be better. But how? > Do we > >> > >> need to rethink how we document features? > >> > > > >> > > To be honest, I figured I'd plow ahead and then find the > counter-examples > >> > > programmatically and decide what to do once I had my pile of edge > cases. > >> > > > >> > > It might involve rewriting docs in some places, but I think the > usability > >> > > win is completely worth the hassle. > >> > > > >> > > It's possible there might be some rework needed to maximize cogency > of the > >> > > generated docs, but I think prioritizing a user-facing reference > for QMP is > >> > > the objectively correct end goal and I'm more than happy to work > backwards > >> > > from that desired state. > >> > > >> > I'm not disputing the idea that documenting the arguments right with > the > >> > command is good. I'm merely pointing out obstacles to pulling that > off. > >> > > >> > Adjusting existing documentation is only half the battle. The other > >> > half is making sure documentation stays adjusted. We may have to come > >> > up with new documentation rules, and ways to enforce them. > >> > >> For the sake of argument, let's say we forbid everything except > >> arg/features from definitions destined to be used as base/inherited > >> types. This would be very easy to enforce at the qapidoc level where > >> the doc inlining is performed by yelping when the base type contains > >> additional documentation sections. > >> > >> Now, in the real world, maybe sometimes those sections are useful and > >> we don't want to get rid of them, esp. because they may contain useful > >> documentation that we don't want to duplicate in the source files. > >> > >> My plan is to just forbid them at first and enumerate the cases where > >> they occur, then decide which ones are better off being moved > >> elsewhere or explicitly tolerated. The generator's tolerance can be > >> adjusted accordingly and we can formulate a rule for exactly how doc > >> blocks are combined and merged. I think it won't be a problem to > >> enforce it programmatically. > >> > >> I'll get back to you on how often and precisely where these cases > >> occur so you can take a look and see how you feel. > >> > > > > For a warmup, let's look at every unique instance of non-empty > > paragraph text on an object that is used as a base anywhere: > > > > Warning: AudiodevPerDirectionOptions - inlined paragraph > > Warning: BlockdevOptionsCurlBase - inlined paragraph > > Warning: BlockdevOptionsGenericCOWFormat - inlined paragraph > > Warning: BlockdevOptionsGenericFormat - inlined paragraph > > Warning: BlockExportOptionsNbdBase - inlined paragraph > > Warning: BlockNodeInfo - inlined paragraph > > Warning: ChardevCommon - inlined paragraph > > Warning: CpuInstanceProperties - inlined paragraph > > Warning: CryptodevBackendProperties - inlined paragraph > > Warning: EventLoopBaseProperties - inlined paragraph > > Warning: MemoryBackendProperties - inlined paragraph > > Warning: NetfilterProperties - inlined paragraph > > Warning: QCryptoBlockAmendOptionsLUKS - inlined paragraph > > Warning: QCryptoBlockCreateOptionsLUKS - inlined paragraph > > Warning: QCryptoBlockInfoBase - inlined paragraph > > Warning: QCryptoBlockOptionsBase - inlined paragraph > > Warning: QCryptoBlockOptionsLUKS - inlined paragraph > > Warning: RngProperties - inlined paragraph > > Warning: SecretCommonProperties - inlined paragraph > > Warning: SpiceBasicInfo - inlined paragraph > > Warning: TlsCredsProperties - inlined paragraph > > Warning: VncBasicInfo - inlined paragraph > > > > There's 22 instances. > > > > Let's look at what they say: > > > > AudiodevPerDirectionOptions: "General audio backend options that are > > used for both playback and recording." > > BlockdevOptionsCurlBase: "Driver specific block device options shared > > by all protocols supported by the curl backend." > > BlockdevOptionsGenericCOWFormat: "Driver specific block device options > > for image format that have no option besides their data source and an > > optional backing file." > > BlockdevOptionsGenericFormat: "Driver specific block device options > > for image format that have no option besides their data source." > > BlockExportOptionsNbdBase: "An NBD block export (common options shared > > between nbd-server-add and the NBD branch of block-export-add)." > > BlockNodeInfo: "Information about a QEMU image file" > > ChardevCommon: "Configuration shared across all chardev backends" > > > > CpuInstanceProperties: > > "List of properties to be used for hotplugging a CPU instance, it > > should be passed by management with device_add command when a CPU is > > being hotplugged. > > > > Which members are optional and which mandatory depends on the > > architecture and board. > > > > For s390x see :ref:`cpu-topology-s390x`. > > > > The ids other than the node-id specify the position of the CPU > > within the CPU topology (as defined by the machine property "smp", > > thus see also type @SMPConfiguration)" > > > > CryptodevBackendProperties: "Properties for cryptodev-backend and > > cryptodev-backend-builtin objects." > > EventLoopBaseProperties: "Common properties for event loops" > > MemoryBackendProperties: "Properties for objects of classes derived > > from memory-backend." > > NetfilterProperties: "Properties for objects of classes derived from > netfilter." > > QCryptoBlockAmendOptionsLUKS: "This struct defines the update > > parameters that activate/de-activate set of keyslots" > > QCryptoBlockCreateOptionsLUKS: "The options that apply to LUKS > > encryption format initialization" > > QCryptoBlockInfoBase: "The common information that applies to all full > > disk encryption formats" > > QCryptoBlockOptionsBase: "The common options that apply to all full > > disk encryption formats" > > QCryptoBlockOptionsLUKS: "The options that apply to LUKS encryption > format" > > RngProperties: "Properties for objects of classes derived from rng." > > SecretCommonProperties: "Properties for objects of classes derived > > from secret-common." > > SpiceBasicInfo: "The basic information for SPICE network connection" > > TlsCredsProperties: "Properties for objects of classes derived from > tls-creds." > > VncBasicInfo: "The basic information for vnc network connection" > > > > ... Alright. So like 98% of the time, it's functionally useless > > information for the end-user. The only thing that looks mildly > > interesting is CpuInstanceProperties and *maybe* > > QCryptoBlockAmendOptionsLUKS. > > > > I propose we add a new section to QAPI doc blocks that gets ignored > > from rendered documentation, like "Comment:" -- to keep any info that > > might be mildly relevant to a developer that explains the *factoring* > > of the object, but won't be exposed in user-facing documents. > > Two existing ways to add comments that don't go into generated > documentation: > > 1. Write a non-doc comment. > > ## > # This is a doc comment. > ## > > # > # This isn't. > # > > 2. TODO sections in a doc comment. > > Not sure we need more ways, but if we do, we'll create them. > Yeah. we could just delete them. Just offering a section in case you don't want to lose a mandate that every entity should be documented. I don't have strong feelings beyond "I think we don't need to inline these, but we should make it obvious when we do or do not." If you want to just pull out these paragraphs to be comments before/after the doc block or just delete them, I don't have strong feelings. > > On the occasion we DO want to inline documentation paragraphs, we can > > leave them in and have the doc generator inline them. There's probably > > very few cases where we actually want this. > > > > Let's take a look at any tagged sections now, excluding "Since": > > > > Warning: BackupCommon - inlined tag section Note > > Warning: CpuInstanceProperties - inlined, tag section Note > > Warning: MemoryBackendProperties - inlined tag section Note > > > > Not very many! Just three. > > > > BackupCommon: > > > > Note: @on-source-error and @on-target-error only affect background > > I/O. If an error occurs during a guest write request, the > > device's rerror/werror actions will be used. > > > > BackupCommon is used as a base for DriveBackup and BlockdevBackup. In > > this case, this note probably does belong on both. It should be > > inlined and included. > > > > CpuInstanceProperties: > > > > Note: management should be prepared to pass through additional > > properties with device_add. > > > > This is only used as a base with no other addon args for > > NumaCpuOptions, in turn only used for an argument. This is probably > > right to pass through, too. (Though I admit I don't really know what > > it means... we can discuss the doc *quality* another day.) > > Oh boy, don't get me started! > :) > > MemoryBackendProperties: > > > > Note: prealloc=true and reserve=false cannot be set at the same > > time. With reserve=true, the behavior depends on the operating > > system: for example, Linux will not reserve swap space for > > shared file mappings -- "not applicable". In contrast, > > reserve=false will bail out if it cannot be configured > > accordingly. > > > > This is used in MemoryBackendFileProperties, > > MemoryBackendMemfdProperties, and MemoryBackendEpcProperties. None are > > commands. I think the note here should also be inlined into each > > object. > > > > So, I think that: > > > > - Most naked paragraphs are usually useless to the end-user, and we > > should put them behind a Comment section to prevent them from being > > inlined. > > We may want to delete them instead. > Sure, yeah. You're the maintainer, so your call. (1) New comment section (2) Pulled out as a non-doc-block comment (3) Just deleted. > > - Any *other* paragraph or section should be included in the > > descendent. We'll just review with that eye going forward. > > But where exactly does it go? > "Into the HTML. Any other questions?" ;) > The question applies not just to tagged sections such as "Note:", but to > argument, member, and feature descriptions, too. > Our current answer for argument / member descriptions is "right after > the body section". Works because we permit only the body section before > argument / member descriptions . Pretty arbitrary restriction, though. > > Fine print: can be a bit more complicated for unions, but let's ignore > that here. > > I guess our current answer for feature descriptions is something like > "right after argument / member descriptions if they exist, else right > after the body section". > > What could our answer be for other sections? > Inherited members: Injected *before* the member section *when already present*. i.e., members are source order, ancestor-to-descendent. Features are the same. The system does not care if they are duplicated. The only ambiguity arises when the final descendent does not *have* member or feature sections. Current prototype's hack: if we leave the doc sections and members/features remain undocumented, they are appended to the end of the block. I do agree this isn't ideal. We had a call off-list earlier today where you suggested the idea of a section or token that would mark the location for such things to be inlined in the event that there was no obvious place otherwise. I don't have a better idea; and it would make this action explicit rather than implicit. It would only be needed in very few doc blocks and it would be possible to write a very, very detailed error message in the circumstances where this was missed. i.e. a simple section that's just simply: Inherited-Members: ... or Inherited-Features: ... would do perfectly well. Example error message: 'QAPI definition foo has inherited <features/members> [from QAPI <meta-type> bar] but no local <features/members> section in its documentation block. Please annotate where they should be included in generated documentation by including the line "Inherited-<Features/Members>: ..." in the doc block.' (With a pointer to the source file and line where the doc block starts.) If this line is erroneously *included* in a doc block that doesn't need it, we can also emit a (fatal) warning urging its removal and an auditing of the rendered HTML output. That leaves other sections. We also spoke about the possibility of eliminating "notes" and "examples" in exchange for explicit ReST/sphinx syntax. I think that's probably a good idea overall, because it increases flexibility in how we can annotate examples and remove more custom qapidoc parsing code. So, let's assume the only thing left would be "where do we inline inherited paragraphs?" How about this: If we inherit from an object that *has* such paragraphs, then the descendent needs to mark the spot for its inclusion. Copying from the above ideas for members/features, how about: Inherited-paragraphs: ... The docgen uses this as the splice point. If the splice point is absent and there are no paragraphs to splice in, there's no problem. If there are, we can emit a very detailed warning that explains exactly what went wrong and how to fix it. Similarly, if the splice point is indicated but absent, we can warn and urge its removal alongside an audit of the generated documentation. This way, the feature is self-documenting and self-correcting. It will catch regressions written before the feature existed and give guidance on how to fix it. It will catch the large majority of rebase and refactoring mistakes. It adds a little complexity to the QAPIDoc parser, but not very much. (And in fact, with the removal of Notes and Examples, it may indeed be possible to refactor the parser to be drastically simpler. Alongside using all_sections, once qapidoc.py is rewritten/replaced, even more drastic simplifications are possible.) > In my experience, the people writing the doc comments rarely check how > they come out in generated documentation. > > The closer the doc comments are to the generated documentation, the > higher the chance it comes out as intended. > > The more complex the inlining gets, the higher the chance of mishaps. > > If cases of complex inlining are rare, we could sidestep complex > inlining by inlining manually. > > If that's undesirable or impractical, we could require explicit > instructions where the inlined material should go. > > I'm merely thinking aloud; these are certainly not requests, just ideas, > and possibly bad ones. > > > - Since: ... we'll talk about later this week. > > Yes. > Also discussed on call, but I'm leaving this alone for right now. Current prototype: inherited "since" sections are ignored but emit a non-fatal warning. We can audit these cases and decide if that's an acceptable shortcoming while we work on something more robust, or if it needs addressing before merge. (Or if we just issue manual source corrections in the interim before your proposed changes to generate such info are merged.) OK. So: I quite like the idea of inherited placeholders if and only if they are required, and would like to proceed with prototyping this mechanism. Are you OK with me proceeding to prototype this? (It's okay to have reservations and we may still change our mind, but I want a tentative ACK before I embark on this as it is categorically more disruptive than any of the changes I've made thus far. i.e. does it *conceptually* work for you?) --js
John Snow <jsnow@redhat.com> writes: > On Tue, Apr 23, 2024, 3:48 AM Markus Armbruster <armbru@redhat.com> wrote: > >> John Snow <jsnow@redhat.com> writes: >> >> > On Mon, Apr 22, 2024 at 12:38 PM John Snow <jsnow@redhat.com> wrote: >> >> >> >> On Mon, Apr 22, 2024 at 5:20 AM Markus Armbruster <armbru@redhat.com> wrote: >> >> > >> >> > John Snow <jsnow@redhat.com> writes: >> >> > >> >> > > On Fri, Apr 19, 2024, 10:45 AM Markus Armbruster <armbru@redhat.com> wrote: >> >> > > >> >> > >> John Snow <jsnow@redhat.com> writes: >> >> > >> >> >> > >> > This series adds a new qapi-domain extension for Sphinx, which adds a >> >> > >> > series of custom directives for documenting QAPI definitions. >> >> [...] >> >> >> > >> > Known issues / points of interest: >> >> [...] >> >> >> > >> > - Inlining all members means there is some ambiguity on what to do with >> >> > >> > doc block sections on inlined entities; features and members have an >> >> > >> > obvious home - body, since, and misc sections are not as obvious on >> >> > >> > how to handle. This will feature more prominently in the generator >> >> > >> > series. >> >> > >> >> >> > >> Yes, this is a real problem. >> >> > >> >> >> > >> The member documentation gets written in the context of the type. It >> >> > >> may make sense only in that context. Inlining copies it into a >> >> > >> different context. >> >> > >> >> >> > >> Features may need to combine. Say a command's arguments are a union >> >> > >> type, and several branches of the union both contain deprecated members. >> >> > >> These branch types all document feature @deprecated. Simply inlining >> >> > >> their feature documentation results in feature @deprecated documented >> >> > >> several times. Ugh. Combining them would be better. But how? Do we >> >> > >> need to rethink how we document features? >> >> > > >> >> > > To be honest, I figured I'd plow ahead and then find the counter-examples >> >> > > programmatically and decide what to do once I had my pile of edge cases. >> >> > > >> >> > > It might involve rewriting docs in some places, but I think the usability >> >> > > win is completely worth the hassle. >> >> > > >> >> > > It's possible there might be some rework needed to maximize cogency of the >> >> > > generated docs, but I think prioritizing a user-facing reference for QMP is >> >> > > the objectively correct end goal and I'm more than happy to work backwards >> >> > > from that desired state. >> >> > >> >> > I'm not disputing the idea that documenting the arguments right with the >> >> > command is good. I'm merely pointing out obstacles to pulling that off. >> >> > >> >> > Adjusting existing documentation is only half the battle. The other >> >> > half is making sure documentation stays adjusted. We may have to come >> >> > up with new documentation rules, and ways to enforce them. >> >> >> >> For the sake of argument, let's say we forbid everything except >> >> arg/features from definitions destined to be used as base/inherited >> >> types. This would be very easy to enforce at the qapidoc level where >> >> the doc inlining is performed by yelping when the base type contains >> >> additional documentation sections. >> >> >> >> Now, in the real world, maybe sometimes those sections are useful and >> >> we don't want to get rid of them, esp. because they may contain useful >> >> documentation that we don't want to duplicate in the source files. >> >> >> >> My plan is to just forbid them at first and enumerate the cases where >> >> they occur, then decide which ones are better off being moved >> >> elsewhere or explicitly tolerated. The generator's tolerance can be >> >> adjusted accordingly and we can formulate a rule for exactly how doc >> >> blocks are combined and merged. I think it won't be a problem to >> >> enforce it programmatically. >> >> >> >> I'll get back to you on how often and precisely where these cases >> >> occur so you can take a look and see how you feel. >> >> >> > >> > For a warmup, let's look at every unique instance of non-empty >> > paragraph text on an object that is used as a base anywhere: >> > >> > Warning: AudiodevPerDirectionOptions - inlined paragraph >> > Warning: BlockdevOptionsCurlBase - inlined paragraph >> > Warning: BlockdevOptionsGenericCOWFormat - inlined paragraph >> > Warning: BlockdevOptionsGenericFormat - inlined paragraph >> > Warning: BlockExportOptionsNbdBase - inlined paragraph >> > Warning: BlockNodeInfo - inlined paragraph >> > Warning: ChardevCommon - inlined paragraph >> > Warning: CpuInstanceProperties - inlined paragraph >> > Warning: CryptodevBackendProperties - inlined paragraph >> > Warning: EventLoopBaseProperties - inlined paragraph >> > Warning: MemoryBackendProperties - inlined paragraph >> > Warning: NetfilterProperties - inlined paragraph >> > Warning: QCryptoBlockAmendOptionsLUKS - inlined paragraph >> > Warning: QCryptoBlockCreateOptionsLUKS - inlined paragraph >> > Warning: QCryptoBlockInfoBase - inlined paragraph >> > Warning: QCryptoBlockOptionsBase - inlined paragraph >> > Warning: QCryptoBlockOptionsLUKS - inlined paragraph >> > Warning: RngProperties - inlined paragraph >> > Warning: SecretCommonProperties - inlined paragraph >> > Warning: SpiceBasicInfo - inlined paragraph >> > Warning: TlsCredsProperties - inlined paragraph >> > Warning: VncBasicInfo - inlined paragraph >> > >> > There's 22 instances. >> > >> > Let's look at what they say: >> > >> > AudiodevPerDirectionOptions: "General audio backend options that are >> > used for both playback and recording." >> > BlockdevOptionsCurlBase: "Driver specific block device options shared >> > by all protocols supported by the curl backend." >> > BlockdevOptionsGenericCOWFormat: "Driver specific block device options >> > for image format that have no option besides their data source and an >> > optional backing file." >> > BlockdevOptionsGenericFormat: "Driver specific block device options >> > for image format that have no option besides their data source." >> > BlockExportOptionsNbdBase: "An NBD block export (common options shared >> > between nbd-server-add and the NBD branch of block-export-add)." >> > BlockNodeInfo: "Information about a QEMU image file" >> > ChardevCommon: "Configuration shared across all chardev backends" >> > >> > CpuInstanceProperties: >> > "List of properties to be used for hotplugging a CPU instance, it >> > should be passed by management with device_add command when a CPU is >> > being hotplugged. >> > >> > Which members are optional and which mandatory depends on the >> > architecture and board. >> > >> > For s390x see :ref:`cpu-topology-s390x`. >> > >> > The ids other than the node-id specify the position of the CPU >> > within the CPU topology (as defined by the machine property "smp", >> > thus see also type @SMPConfiguration)" >> > >> > CryptodevBackendProperties: "Properties for cryptodev-backend and >> > cryptodev-backend-builtin objects." >> > EventLoopBaseProperties: "Common properties for event loops" >> > MemoryBackendProperties: "Properties for objects of classes derived >> > from memory-backend." >> > NetfilterProperties: "Properties for objects of classes derived from netfilter." >> > QCryptoBlockAmendOptionsLUKS: "This struct defines the update >> > parameters that activate/de-activate set of keyslots" >> > QCryptoBlockCreateOptionsLUKS: "The options that apply to LUKS >> > encryption format initialization" >> > QCryptoBlockInfoBase: "The common information that applies to all full >> > disk encryption formats" >> > QCryptoBlockOptionsBase: "The common options that apply to all full >> > disk encryption formats" >> > QCryptoBlockOptionsLUKS: "The options that apply to LUKS encryption format" >> > RngProperties: "Properties for objects of classes derived from rng." >> > SecretCommonProperties: "Properties for objects of classes derived >> > from secret-common." >> > SpiceBasicInfo: "The basic information for SPICE network connection" >> > TlsCredsProperties: "Properties for objects of classes derived from tls-creds." >> > VncBasicInfo: "The basic information for vnc network connection" >> > >> > ... Alright. So like 98% of the time, it's functionally useless >> > information for the end-user. The only thing that looks mildly >> > interesting is CpuInstanceProperties and *maybe* >> > QCryptoBlockAmendOptionsLUKS. >> > >> > I propose we add a new section to QAPI doc blocks that gets ignored >> > from rendered documentation, like "Comment:" -- to keep any info that >> > might be mildly relevant to a developer that explains the *factoring* >> > of the object, but won't be exposed in user-facing documents. >> >> Two existing ways to add comments that don't go into generated >> documentation: >> >> 1. Write a non-doc comment. >> >> ## >> # This is a doc comment. >> ## >> >> # >> # This isn't. >> # >> >> 2. TODO sections in a doc comment. >> >> Not sure we need more ways, but if we do, we'll create them. > > Yeah. we could just delete them. Just offering a section in case you don't > want to lose a mandate that every entity should be documented. I don't want to think about things like "does this need documentation", or "where does the documentation go". I want our tools to be able to decide that for us. > I don't have strong feelings beyond "I think we don't need to inline these, > but we should make it obvious when we do or do not." > > If you want to just pull out these paragraphs to be comments before/after > the doc block or just delete them, I don't have strong feelings. I figure almost all of them are thoroughly uninteresting. If that turns out to be the case, we delete the uninteresting ones, then decide what to do about each of the few interesting ones. >> > On the occasion we DO want to inline documentation paragraphs, we can >> > leave them in and have the doc generator inline them. There's probably >> > very few cases where we actually want this. >> > >> > Let's take a look at any tagged sections now, excluding "Since": >> > >> > Warning: BackupCommon - inlined tag section Note >> > Warning: CpuInstanceProperties - inlined, tag section Note >> > Warning: MemoryBackendProperties - inlined tag section Note >> > >> > Not very many! Just three. >> > >> > BackupCommon: >> > >> > Note: @on-source-error and @on-target-error only affect background >> > I/O. If an error occurs during a guest write request, the >> > device's rerror/werror actions will be used. >> > >> > BackupCommon is used as a base for DriveBackup and BlockdevBackup. In >> > this case, this note probably does belong on both. It should be >> > inlined and included. >> > >> > CpuInstanceProperties: >> > >> > Note: management should be prepared to pass through additional >> > properties with device_add. >> > >> > This is only used as a base with no other addon args for >> > NumaCpuOptions, in turn only used for an argument. This is probably >> > right to pass through, too. (Though I admit I don't really know what >> > it means... we can discuss the doc *quality* another day.) >> >> Oh boy, don't get me started! > > :) > > >> > MemoryBackendProperties: >> > >> > Note: prealloc=true and reserve=false cannot be set at the same >> > time. With reserve=true, the behavior depends on the operating >> > system: for example, Linux will not reserve swap space for >> > shared file mappings -- "not applicable". In contrast, >> > reserve=false will bail out if it cannot be configured >> > accordingly. >> > >> > This is used in MemoryBackendFileProperties, >> > MemoryBackendMemfdProperties, and MemoryBackendEpcProperties. None are >> > commands. I think the note here should also be inlined into each >> > object. >> > >> > So, I think that: >> > >> > - Most naked paragraphs are usually useless to the end-user, and we >> > should put them behind a Comment section to prevent them from being >> > inlined. >> >> We may want to delete them instead. > > Sure, yeah. You're the maintainer, so your call. > > (1) New comment section > (2) Pulled out as a non-doc-block comment > (3) Just deleted. > > >> > - Any *other* paragraph or section should be included in the >> > descendent. We'll just review with that eye going forward. >> >> But where exactly does it go? > > "Into the HTML. Any other questions?" > > ;) Would you be interested in a product manager role? ;-P >> The question applies not just to tagged sections such as "Note:", but to >> argument, member, and feature descriptions, too. > > >> Our current answer for argument / member descriptions is "right after >> the body section". Works because we permit only the body section before >> argument / member descriptions . Pretty arbitrary restriction, though. >> >> Fine print: can be a bit more complicated for unions, but let's ignore >> that here. >> >> I guess our current answer for feature descriptions is something like >> "right after argument / member descriptions if they exist, else right >> after the body section". >> >> What could our answer be for other sections? > > Inherited members: Injected *before* the member section *when already > present*. i.e., members are source order, ancestor-to-descendent. > > Features are the same. The system does not care if they are duplicated. The reader might. We'll see. > The only ambiguity arises when the final descendent does not *have* member > or feature sections. Yes. > Current prototype's hack: if we leave the doc sections and members/features > remain undocumented, they are appended to the end of the block. > > I do agree this isn't ideal. We had a call off-list earlier today where you > suggested the idea of a section or token that would mark the location for > such things to be inlined in the event that there was no obvious place > otherwise. I don't have a better idea; and it would make this action > explicit rather than implicit. It would only be needed in very few doc > blocks and it would be possible to write a very, very detailed error > message in the circumstances where this was missed. > > i.e. a simple section that's just simply: > > Inherited-Members: ... > > or > > Inherited-Features: ... > > would do perfectly well. > > Example error message: 'QAPI definition foo has inherited > <features/members> [from QAPI <meta-type> bar] but no local > <features/members> section in its documentation block. Please annotate > where they should be included in generated documentation by including the > line "Inherited-<Features/Members>: ..." in the doc block.' > > (With a pointer to the source file and line where the doc block starts.) > > If this line is erroneously *included* in a doc block that doesn't need it, > we can also emit a (fatal) warning urging its removal and an auditing of > the rendered HTML output. > > That leaves other sections. We also spoke about the possibility of > eliminating "notes" and "examples" in exchange for explicit ReST/sphinx > syntax. I think that's probably a good idea overall, because it increases > flexibility in how we can annotate examples and remove more custom qapidoc > parsing code. Yes. We're reinventing reST there for historical reasons only. And our reinvented wheel is decidedly less round than reST's. > So, let's assume the only thing left would be "where do we inline inherited > paragraphs?" > > How about this: > > If we inherit from an object that *has* such paragraphs, then the > descendent needs to mark the spot for its inclusion. > > Copying from the above ideas for members/features, how about: > > Inherited-paragraphs: ... > > The docgen uses this as the splice point. If the splice point is absent and > there are no paragraphs to splice in, there's no problem. If there are, we > can emit a very detailed warning that explains exactly what went wrong and > how to fix it. Similarly, if the splice point is indicated but absent, we > can warn and urge its removal alongside an audit of the generated > documentation. > > This way, the feature is self-documenting and self-correcting. It will > catch regressions written before the feature existed and give guidance on > how to fix it. It will catch the large majority of rebase and refactoring > mistakes. Worth exploring. > It adds a little complexity to the QAPIDoc parser, but not very much. > > (And in fact, with the removal of Notes and Examples, it may indeed be > possible to refactor the parser to be drastically simpler. Alongside using > all_sections, once qapidoc.py is rewritten/replaced, even more drastic > simplifications are possible.) When I rewrote the doc parser, I aimed for such simplifications, but qapidoc.py resisted change too much to be worth it, so I shelved the idea. >> In my experience, the people writing the doc comments rarely check how >> they come out in generated documentation. >> >> The closer the doc comments are to the generated documentation, the >> higher the chance it comes out as intended. >> >> The more complex the inlining gets, the higher the chance of mishaps. >> >> If cases of complex inlining are rare, we could sidestep complex >> inlining by inlining manually. >> >> If that's undesirable or impractical, we could require explicit >> instructions where the inlined material should go. >> >> I'm merely thinking aloud; these are certainly not requests, just ideas, >> and possibly bad ones. >> >> > - Since: ... we'll talk about later this week. >> >> Yes. > > Also discussed on call, but I'm leaving this alone for right now. > > Current prototype: inherited "since" sections are ignored but emit a > non-fatal warning. We can audit these cases and decide if that's an > acceptable shortcoming while we work on something more robust, or if it > needs addressing before merge. > > (Or if we just issue manual source corrections in the interim before your > proposed changes to generate such info are merged.) > > OK. > > So: I quite like the idea of inherited placeholders if and only if they are > required, and would like to proceed with prototyping this mechanism. Are > you OK with me proceeding to prototype this? Yes. I'm curious how many of them we'll need in practice. > (It's okay to have reservations and we may still change our mind, but I > want a tentative ACK before I embark on this as it is categorically more > disruptive than any of the changes I've made thus far. i.e. does it > *conceptually* work for you?) Well, we need *some* solution to make inlining viable. Since inlining is desirable, and no better solution comes to mind, let's explore this one.