[v5] documentation: add tutorial for object walking

The target audience is a Git contributor who is just getting started
with the concept of object walking. The goal is to prepare this
contributor to be able to understand and modify existing commands which
perform revision walks more easily, although it will also prepare
contributors to create new commands which perform walks.

The tutorial covers a basic overview of the structs involved during
object walk, setting up a basic commit walk, setting up a basic
all-object walk, and adding some configuration changes to both walk
types. It intentionally does not cover how to create new commands or
search for options from the command line or gitconfigs.

There is an associated patchset at
https://github.com/nasamuffin/git/tree/revwalk that contains a reference
implementation of the code generated by this tutorial.

Signed-off-by: Emily Shaffer <emilyshaffer@google.com>
Helped-by: Eric Sunshine <sunshine@sunshineco.com>
---
The main change in this version is a reword that frames the tutorial as
an "object walk tutorial which also covers commit-only walk", rather than
a "revision walk tutorial which also covers all-object walk", per Junio's
suggestion.

Since I had spent some time away from the patch, I also made some small
changes to the wording.
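
To make the reframing concrete, here is a rough, standalone sketch of the
distinction the tutorial now leads with. It is a toy model only: the
`toy_*` names and the struct layout are hypothetical and are not Git's
real `struct object` or `rev_info` machinery. The commit-only walk follows
parent links; the all-object walk additionally descends into each commit's
tree and uses a `seen` bit (similar in spirit to the SEEN flag) so shared
objects are counted once.

```c
#include <stddef.h>

/* Toy model only: hypothetical types, not Git's real structs. */
enum toy_type { TOY_COMMIT, TOY_TREE, TOY_BLOB };

struct toy_object {
	enum toy_type type;
	int seen;                      /* set once the object has been visited */
	struct toy_object *parent;     /* commits only: single parent or NULL */
	struct toy_object *tree;       /* commits only: root tree */
	struct toy_object *entries[4]; /* trees only: NULL-terminated children */
};

/* Count a tree or blob and everything below it, skipping seen objects. */
static int toy_walk_tree(struct toy_object *obj)
{
	int count = 1;
	size_t i;

	if (!obj || obj->seen)
		return 0;
	obj->seen = 1;
	if (obj->type == TOY_TREE)
		for (i = 0; i < 4 && obj->entries[i]; i++)
			count += toy_walk_tree(obj->entries[i]);
	return count;
}

/* Commit-only walk: follow parent links from head and count commits. */
static int toy_walk_commits(struct toy_object *head)
{
	int count = 0;

	for (; head; head = head->parent)
		count++;
	return count;
}

/* All-object walk: each commit plus every unseen tree and blob it reaches. */
static int toy_walk_all(struct toy_object *head)
{
	int count = 0;

	for (; head; head = head->parent)
		count += 1 + toy_walk_tree(head->tree);
	return count;
}

/* Build a two-commit history that shares one blob, then run both walks. */
static void toy_example(int *commits, int *all)
{
	struct toy_object blob_a = { TOY_BLOB, 0, NULL, NULL, { NULL } };
	struct toy_object blob_b = { TOY_BLOB, 0, NULL, NULL, { NULL } };
	struct toy_object tree1 = { TOY_TREE, 0, NULL, NULL, { NULL } };
	struct toy_object tree2 = { TOY_TREE, 0, NULL, NULL, { NULL } };
	struct toy_object c1 = { TOY_COMMIT, 0, NULL, NULL, { NULL } };
	struct toy_object c2 = { TOY_COMMIT, 0, NULL, NULL, { NULL } };

	tree1.entries[0] = &blob_a;
	tree2.entries[0] = &blob_a; /* unchanged file shared by both trees */
	tree2.entries[1] = &blob_b;
	c1.tree = &tree1;
	c2.tree = &tree2;
	c2.parent = &c1;

	*commits = toy_walk_commits(&c2);
	*all = toy_walk_all(&c2);
}
```

Calling `toy_example()` from a small driver yields two commits for the
commit-only walk and six objects for the all-object walk, since the shared
blob is only counted the first time it is reached. The tutorial itself
relies on Git's own traversal code rather than anything like this.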

Here's the rangediff from gitster/es/walken-tutorial:

1:  1ed29a34d1 ! 1:  554d940af7 documentation: add tutorial for revision walking
     a => b | 0
     1 file changed, 0 insertions(+), 0 deletions(-)

    @@ Metadata
     Author: Emily Shaffer <emilyshaffer@google.com>

      ## Commit message ##
    -    documentation: add tutorial for revision walking
    +    documentation: add tutorial for object walking

    -    Existing documentation on revision walks seems to be primarily intended
    +    Existing documentation on object walks seems to be primarily intended
         as a reference for those already familiar with the procedure. This
         tutorial attempts to give an entry-level guide to a couple of bare-bones
    -    revision walks so that new Git contributors can learn the concepts
    +    object walks so that new Git contributors can learn the concepts
         without having to wade through options parsing or special casing.

         The target audience is a Git contributor who is just getting started
    -    with the concept of revision walking. The goal is to prepare this
    +    with the concept of object walking. The goal is to prepare this
         contributor to be able to understand and modify existing commands which
         perform revision walks more easily, although it will also prepare
         contributors to create new commands which perform walks.

         The tutorial covers a basic overview of the structs involved during
    -    revision walk, setting up a basic commit walk, setting up a basic
    +    object walk, setting up a basic commit walk, setting up a basic
         all-object walk, and adding some configuration changes to both walk
         types. It intentionally does not cover how to create new commands or
         search for options from the command line or gitconfigs.
    @@ Commit message

         Signed-off-by: Emily Shaffer <emilyshaffer@google.com>
         Helped-by: Eric Sunshine <sunshine@sunshineco.com>
    -    Signed-off-by: Junio C Hamano <gitster@pobox.com>

      ## Documentation/Makefile ##
     @@ Documentation/Makefile: API_DOCS = $(patsubst %.txt,%,$(filter-out technical/api-index-skel.txt technica
    @@ Documentation/Makefile: API_DOCS = $(patsubst %.txt,%,$(filter-out technical/api
      TECH_DOCS += technical/hash-function-transition
      TECH_DOCS += technical/http-protocol

    - ## Documentation/MyFirstRevWalk.txt (new) ##
    + ## Documentation/MyFirstObjectWalk.txt (new) ##
     @@
    -+My First Revision Walk
    ++My First Object Walk
     +======================
     +
    -+== What's a Revision Walk?
    ++== What's an Object Walk?
     +
    -+The revision walk is a key concept in Git - this is the process that underpins
    ++The object walk is a key concept in Git - this is the process that underpins
     +operations like `git log`, `git blame`, and `git reflog`. Beginning at HEAD, the
     +list of objects is found by walking parent relationships between objects. The
    -+revision walk can also be used to determine whether or not a given object is
    ++object walk can also be used to determine whether or not a given object is
     +reachable from the current HEAD pointer.
     +
    ++A related concept is the revision walk, which is focused on commit objects and
    ++their relationships.
    ++
     +=== Related Reading
     +
     +- `Documentation/user-manual.txt` under "Hacking Git" contains some coverage of
     +  the revision walker in its various incarnations.
     +- `Documentation/technical/api-revision-walking.txt`
     +- https://eagain.net/articles/git-for-computer-scientists/[Git for Computer Scientists]
    -+  gives a good overview of the types of objects in Git and what your revision
    ++  gives a good overview of the types of objects in Git and what your object
     +  walk is really describing.
     +
     +== Setting Up
    @@ Documentation/MyFirstRevWalk.txt (new)
     +/*
     + * "git walken"
     + *
    -+ * Part of the "My First Revision Walk" tutorial.
    ++ * Part of the "My First Object Walk" tutorial.
     + */
     +
     +#include "builtin.h"
    @@ Documentation/MyFirstRevWalk.txt (new)
     +
     +Per entry, we find:
     +
    -+`item` is the object provided upon which to base the revision walk. Items in Git
    ++`item` is the object provided upon which to base the object walk. Items in Git
     +can be blobs, trees, commits, or tags. (See `Documentation/gittutorial-2.txt`.)
     +
     +`name` is the object ID (OID) of the object - a hex string you may be familiar
    @@ Documentation/MyFirstRevWalk.txt (new)
     +
     +First, let's see if we can replicate the output of `git log --oneline`. We'll
     +refer back to the implementation frequently to discover norms when performing
    -+a revision walk of our own.
    ++an object walk of our own.
     +
     +To do so, we'll first find all the commits, in order, which preceded the current
     +commit. We'll extract the name and subject of the commit from each.
    @@ Documentation/MyFirstRevWalk.txt (new)
     +
     +=== Setting Up
     +
    -+Preparing for your revision walk has some distinct stages.
    ++Preparing for your object walk has some distinct stages.
     +
     +1. Perform default setup for this mode, and others which may be invoked.
     +2. Check configuration files for relevant settings.
    @@ Documentation/MyFirstRevWalk.txt (new)
     +`grep` and `diff` to initialize themselves by calling each of their
     +initialization functions.
     +
    -+For our purposes, within `git walken`, for the first example we don't intend to
    -+use any other components within Git, and we don't have any configuration to do.
    -+However, we may want to add some later, so for now, we can add an empty
    -+placeholder. Create a new function in `builtin/walken.c`:
    ++For our first example within `git walken`, we don't intend to use any other
    ++components within Git, and we don't have any configuration to do.  However, we
    ++may want to add some later, so for now, we can add an empty placeholder. Create
    ++a new function in `builtin/walken.c`:
     +
     +----
     +static void init_walken_defaults(void)
    @@ Documentation/MyFirstRevWalk.txt (new)
     +}
     +----
     +
    -+// TODO: Checking CLI options
    -+
     +==== Setting Up `rev_info`
     +
     +Now that we've gathered external configuration and options, it's time to
    @@ Documentation/MyFirstRevWalk.txt (new)
     +	 */
     +	get_commit_format("oneline", rev);
     +
    -+	/* Start our revision walk at HEAD. */
    ++	/* Start our object walk at HEAD. */
     +	add_head_to_pending(rev);
     +}
     +----
    @@ Documentation/MyFirstRevWalk.txt (new)
     +
     +There are a few ways that we can change the order of the commits during a
     +revision walk. Firstly, we can use the `enum rev_sort_order` to choose from some
    -+sane orderings.
    ++typical orderings.
     +
     +`topo_order` is the same as `git log --topo-order`: we avoid showing a parent
     +before all of its children have been shown, and we avoid mixing commits which
    @@ Documentation/MyFirstRevWalk.txt (new)
     +We also have the capability to enumerate all objects which were omitted by a
     +filter, like with `git log --filter=<spec> --filter-print-omitted`. Asking
     +`traverse_commit_list_filtered()` to populate the `omitted` list means that our
    -+revision walk does not perform any better than an unfiltered revision walk; all
    ++object walk does not perform any better than an unfiltered object walk; all
     +reachable objects are walked in order to populate the list.
     +
     +First, add the `struct oidset` and related items we will use to iterate it:

<end rangediff>

 Documentation/Makefile              |   1 +
 Documentation/MyFirstObjectWalk.txt | 905 ++++++++++++++++++++++++++++
 2 files changed, 906 insertions(+)
 create mode 100644 Documentation/MyFirstObjectWalk.txt

Message ID	20191010151932.2716-1-emilyshaffer@google.com (mailing list archive)
State	New, archived
Headers
	Return-Path: <SRS0=0T8v=YD=vger.kernel.org=git-owner@kernel.org>
	Date: Thu, 10 Oct 2019 08:19:32 -0700
	In-Reply-To: <20190806231952.39155-1-emilyshaffer@google.com>
	Message-Id: <20191010151932.2716-1-emilyshaffer@google.com>
	Mime-Version: 1.0
	Subject: [PATCH v5] documentation: add tutorial for object walking
	From: Emily Shaffer <emilyshaffer@google.com>
	To: git@vger.kernel.org
	Cc: Emily Shaffer <emilyshaffer@google.com>, Junio C Hamano <gitster@pobox.com>, Eric Sunshine <sunshine@sunshineco.com>, Jonathan Tan <jonathantanmy@google.com>, Josh Steadmon <steadmon@google.com>
	Content-Type: text/plain; charset="UTF-8"
	Sender: git-owner@vger.kernel.org
	Precedence: bulk
Series	[v5] documentation: add tutorial for object walking
