From 75485c89fbee8a433ff53a688f7f645f6ec3b160 Mon Sep 17 00:00:00 2001 From: Junio C Hamano Date: Sat, 19 May 2007 04:20:33 +0000 Subject: [PATCH] Autogenerated HTML docs for v1.5.2-rc3-83-gc906 --- RelNotes-1.5.1.5.txt | 15 +- git-rev-list.html | 6 +- git-rev-list.txt | 4 +- git.html | 2 +- git.txt | 4 +- glossary.html | 235 +++--- glossary.txt | 228 +++--- howto/dangling-objects.txt | 109 --- howto/isolate-bugs-with-bisect.txt | 65 -- howto/make-dist.txt | 52 -- howto/using-topic-branches.txt | 296 ------- user-manual.html | 753 ++++++++++++----- user-manual.txt | 1220 +++++++++++++++++++++------- 13 files changed, 1767 insertions(+), 1222 deletions(-) delete mode 100644 howto/dangling-objects.txt delete mode 100644 howto/isolate-bugs-with-bisect.txt delete mode 100644 howto/make-dist.txt delete mode 100644 howto/using-topic-branches.txt diff --git a/RelNotes-1.5.1.5.txt b/RelNotes-1.5.1.5.txt index 5cfe0b5b0..85e821c43 100644 --- a/RelNotes-1.5.1.5.txt +++ b/RelNotes-1.5.1.5.txt @@ -6,6 +6,17 @@ Fixes since v1.5.1.4 * Bugfixes + - git-send-email did not understand aliases file for mutt, which + allows leading whitespaces. + + - git-format-patch emitted Content-Type and Content-Transfer-Encoding + headers for non ASCII contents, but failed to add MIME-Version. + + - git-name-rev had a buffer overrun with a deep history. + + - contributed script import-tars did not get the directory in + tar archives interpreted correctly. + - git-svn was reported to segfault for many people on list and #git; hopefully this has been fixed. @@ -28,10 +39,10 @@ Fixes since v1.5.1.4 and extern, which caused trouble compiling with Forte12 compilers on Sun. - - Many documentation fixes. + - Many many documentation fixes and updates. -- exec >/var/tmp/1 -O=v1.5.1.4-26-gb4b20b2 +O=v1.5.1.4-48-gcecb98a echo O=`git describe refs/heads/maint` git shortlog --no-merges $O..refs/heads/maint diff --git a/git-rev-list.html b/git-rev-list.html index e672e5aa2..1d691795b 100644 --- a/git-rev-list.html +++ b/git-rev-list.html @@ -533,7 +533,7 @@ limiting may be applied.

Limit the commits output to ones with author/committer - header lines that match the specified pattern. + header lines that match the specified pattern (regular expression).

@@ -542,7 +542,7 @@ limiting may be applied.

Limit the commits output to ones with log message that - matches the specified pattern. + matches the specified pattern (regular expression).

@@ -1056,7 +1056,7 @@ and the git-list <git@vger.kernel.org>.

diff --git a/git-rev-list.txt b/git-rev-list.txt index ab90a22f1..c3c2043d1 100644 --- a/git-rev-list.txt +++ b/git-rev-list.txt @@ -207,12 +207,12 @@ limiting may be applied. --author='pattern', --committer='pattern':: Limit the commits output to ones with author/committer - header lines that match the specified pattern. + header lines that match the specified pattern (regular expression). --grep='pattern':: Limit the commits output to ones with log message that - matches the specified pattern. + matches the specified pattern (regular expression). --remove-empty:: diff --git a/git.html b/git.html index e2d64cbe2..6107ae49d 100644 --- a/git.html +++ b/git.html @@ -2333,7 +2333,7 @@ contributors on the git-list <git@vger.kernel.org>.

diff --git a/git.txt b/git.txt index 273ca173d..99b8f5208 100644 --- a/git.txt +++ b/git.txt @@ -40,7 +40,9 @@ Documentation for older releases are available here: * link:RelNotes-1.5.1.txt[release notes for 1.5.1] -* link:v1.5.1.4/git.html[documentation for release 1.5.1.4] +* link:v1.5.1.5/git.html[documentation for release 1.5.1.5] + +* link:RelNotes-1.5.1.5.txt[release notes for 1.5.1.5] * link:RelNotes-1.5.1.4.txt[release notes for 1.5.1.4] diff --git a/glossary.html b/glossary.html index 5e3b56d06..6e92a3b26 100644 --- a/glossary.html +++ b/glossary.html @@ -269,9 +269,9 @@ div.exampleblock-content {

- Via the alternates mechanism, a repository can - inherit part of its object database from another - object database, which is called "alternate". + Via the alternates mechanism, a repository + can inherit part of its object database + from another object database, which is called "alternate".

@@ -279,7 +279,7 @@ div.exampleblock-content {

- A bare repository is normally an appropriately + A bare repository is normally an appropriately named directory with a .git suffix that does not have a locally checked-out copy of any of the files under revision control. That is, all of the git @@ -303,10 +303,15 @@ div.exampleblock-content {

- A non-cyclical graph of revisions, i.e. the complete history of a - particular revision, which is called the - branch head. The heads - are stored in $GIT_DIR/refs/heads/. + A "branch" is an active line of development. The most recent + commit on a branch is referred to as the tip of + that branch. The tip of the branch is referenced by a branch + head, which moves forward as additional development + is done on the branch. A single git + repository can track an arbitrary number of + branches, but your working tree is + associated with just one of them (the "current" or "checked out" + branch), and HEAD points to that branch.

@@ -357,7 +362,7 @@ div.exampleblock-content { as a new series of changes on top of different codebase. In GIT, this is performed by "git cherry-pick" command to extract the change introduced by an existing commit and to record it based on the tip - of the current branch as a new commit. + of the current branch as a new commit.

@@ -365,7 +370,7 @@ div.exampleblock-content {

- A working tree is clean, if it + A working tree is clean, if it corresponds to the revision referenced by the current head. Also see "dirty".

@@ -375,11 +380,17 @@ div.exampleblock-content {

- As a verb: The action of storing the current state of the - index in the object database. The - result is a revision. As a noun: Short hand for - commit object. + As a noun: A single point in the + git history; the entire history of a project is represented as a + set of interrelated commits. The word "commit" is often + used by git in the same places other revision control systems + use the words "revision" or "version". Also used as a short + hand for commit object.

+

As a verb: The action of storing a new snapshot of the project's +state in the git history, by creating a new commit representing the current +state of the index and advancing HEAD +to point at the new commit.

commit object @@ -409,7 +420,7 @@ div.exampleblock-content {

Directed acyclic graph. The commit objects form a directed acyclic graph, because they have parents (directed), and the - graph of commit objects is acyclic (there is no + graph of commit objects is acyclic (there is no chain which begins and ends with the same object).

@@ -421,11 +432,22 @@ div.exampleblock-content {

An unreachable object which is not reachable even from other unreachable objects; a - dangling object has no references to it from any + dangling object has no references to it from any reference or object in the repository.

+detached HEAD +
+
+

+ Normally the HEAD stores the name of a + branch. However, git also allows you to check + out an arbitrary commit that isn't necessarily the tip of any + particular branch. In this case HEAD is said to be "detached". +

+
+
dircache
@@ -446,7 +468,7 @@ div.exampleblock-content {

- A working tree is said to be dirty if + A working tree is said to be "dirty" if it contains modifications which have not been committed to the current branch.

@@ -471,7 +493,7 @@ div.exampleblock-content { branch's changes that happen to be a descendant of what you have. In such these cases, you do not make a new merge commit but instead just update to his - revision. This will happen frequently on a + revision. This will happen frequently on a tracking branch of a remote repository.

@@ -482,10 +504,10 @@ div.exampleblock-content {

Fetching a branch means to get the - branch's head ref from a remote - repository, to find out which objects are missing - from the local object database, and to get them, - too. + branch's head ref from a remote + repository, to find out which objects are + missing from the local object database, + and to get them, too. See also git-fetch(1).

@@ -514,7 +536,7 @@ div.exampleblock-content { Grafts enables two otherwise different lines of development to be joined together by recording fake ancestry information for commits. This way you can make git pretend the set of parents a commit has - is different from what was recorded when the commit was + is different from what was recorded when the commit was created. Configured via the .git/info/grafts file.

@@ -531,8 +553,22 @@ div.exampleblock-content {

- The top of a branch. It contains a ref to the - corresponding commit object. + A named reference to the commit at the tip of a + branch. Heads are stored in + $GIT_DIR/refs/heads/, except when using packed refs. (See + git-pack-refs(1).) +

+
+
+HEAD +
+
+

+ The current branch. In more detail: Your working tree is normally derived from the state of the tree + referred to by HEAD. HEAD is a reference to one of the + heads in your repository, except when using a + detached HEAD, in which case it may + reference an arbitrary commit.

@@ -540,9 +576,7 @@ div.exampleblock-content {

- A ref pointing to a head. Often, this is - abbreviated to "head". Head refs are stored in - $GIT_DIR/refs/heads/. + A synonym for head.

@@ -554,7 +588,7 @@ div.exampleblock-content { to optional scripts that allow a developer to add functionality or checking. Typically, the hooks allow for a command to be pre-verified and potentially aborted, and allow for a post-notification after the - operation is done. The hook scripts are found in the + operation is done. The hook scripts are found in the $GIT_DIR/hooks/ directory, and are enabled by simply making them executable.

@@ -565,7 +599,7 @@ div.exampleblock-content {

A collection of files with stat information, whose contents are stored - as objects. The index is a stored version of your working + as objects. The index is a stored version of your working tree. Truth be told, it can also contain a second, and even a third version of a working tree, which are used when merging. @@ -577,9 +611,9 @@ div.exampleblock-content {

The information regarding a particular file, stored in the - index. An index entry can be unmerged, - if a merge was started, but not yet finished (i.e. if the - index contains multiple versions of that file). + index. An index entry can be unmerged, if a + merge was started, but not yet finished (i.e. if + the index contains multiple versions of that file).

@@ -587,11 +621,11 @@ div.exampleblock-content {

- The default development branch. Whenever you create a git - repository, a branch named - "master" is created, and becomes the active - branch. In most cases, this contains the local - development, though that is purely conventional and not required. + The default development branch. Whenever you + create a git repository, a branch named + "master" is created, and becomes the active branch. In most + cases, this contains the local development, though that is + purely by convention and is not required.

@@ -599,12 +633,25 @@ div.exampleblock-content {

- To merge branches means to try to accumulate the changes - since a common ancestor and apply them to the first - branch. An automatic merge uses heuristics - to accomplish that. Evidently, an automatic merge can - fail. + As a verb: To bring the contents of another + branch (possibly from an external + repository) into the current branch. In the + case where the merged-in branch is from a different repository, + this is done by first fetching the remote branch + and then merging the result into the current branch. This + combination of fetch and merge operations is called a + pull. Merging is performed by an automatic process + that identifies changes made since the branches diverged, and + then applies all those changes together. In cases where changes + conflict, manual intervention may be required to complete the + merge.

+

As a noun: unless it is a fast forward, a +successful merge results in the creation of a new commit +representing the result of the merge, and having as +parents the tips of the merged branches. +This commit is referred to as a "merge commit", or sometimes just a +"merge".

object @@ -613,7 +660,7 @@ div.exampleblock-content {

The unit of storage in git. It is uniquely identified by the SHA1 of its contents. Consequently, an - object can not be changed. + object can not be changed.

@@ -640,9 +687,9 @@ div.exampleblock-content {

The unique identifier of an object. The hash - of the object's contents using the Secure Hash Algorithm + of the object's contents using the Secure Hash Algorithm 1 and usually represented by the 40 character hexadecimal encoding of - the hash of the object (possibly followed by + the hash of the object (possibly followed by a white space).

@@ -672,7 +719,7 @@ div.exampleblock-content {

The default upstream repository. Most projects have at least one upstream project which they track. By default - origin is used for that purpose. New upstream updates + origin is used for that purpose. New upstream updates will be fetched into remote tracking branches named origin/name-of-upstream-branch, which you can see using "git branch -r". @@ -694,7 +741,7 @@ div.exampleblock-content {

The list of identifiers, and other information, of the objects in a pack, to assist in efficiently accessing the contents of a - pack. + pack.

@@ -734,7 +781,7 @@ div.exampleblock-content {

Cute name for programs and program suites depending on core git, presenting a high level access to - core git. Porcelains expose more of a SCM + core git. Porcelains expose more of a SCM interface than the plumbing.

@@ -744,7 +791,7 @@ div.exampleblock-content {

Pulling a branch means to fetch it and - merge it. + merge it. See also git-pull(1).

@@ -752,16 +799,16 @@ div.exampleblock-content {

- Pushing a branch means to get the branch's + Pushing a branch means to get the branch's head ref from a remote repository, - find out if it is an ancestor to the branch's local - head ref is a direct, and in that case, putting all + find out if it is an ancestor to the branch's local + head ref is a direct, and in that case, putting all objects, which are reachable from the local - head ref, and which are missing from the remote - repository, into the remote + head ref, and which are missing from the remote + repository, into the remote object database, and updating the remote - head ref. If the remote head is not an - ancestor to the local head, the push fails. + head ref. If the remote head is not an + ancestor to the local head, the push fails.

@@ -770,8 +817,8 @@ div.exampleblock-content {

All of the ancestors of a given commit are said to be - reachable from that commit. More - generally, one object is reachable from + "reachable" from that commit. More + generally, one object is reachable from another if we can reach the one from the other by a chain that follows tags to whatever they tag, commits to their parents or trees, and @@ -805,17 +852,16 @@ div.exampleblock-content {

A refspec is used by fetch and - push to describe the mapping between remote ref - and local ref. They are combined with a colon in the format - <src>:<dst>, preceded by an optional plus sign, +. For example: git - fetch $URL refs/heads/master:refs/heads/origin means - "grab the master branch head - from the $URL and store it as my origin - branch head". And git push - $URL refs/heads/master:refs/heads/to-upstream means - "publish my master branch - head as to-upstream branch at $URL". See - also git-push(1) + push to describe the mapping between remote + ref and local ref. They are combined with a colon in + the format <src>:<dst>, preceded by an optional plus sign, +. + For example: git fetch $URL + refs/heads/master:refs/heads/origin means "grab the master + branch head from the $URL and store + it as my origin branch head". And git push + $URL refs/heads/master:refs/heads/to-upstream means "publish my + master branch head as to-upstream branch at $URL". See also + git-push(1)

@@ -823,10 +869,11 @@ div.exampleblock-content {

- A collection of refs together with an object database containing all objects which are reachable - from the refs, possibly accompanied by meta data from one or more - porcelains. A repository can share an - object database with other repositories. + A collection of refs together with an + object database containing all objects + which are reachable from the refs, possibly + accompanied by meta data from one or more porcelains. A + repository can share an object database with other repositories.

@@ -878,12 +925,12 @@ div.exampleblock-content {

- A shallow repository has an incomplete + A shallow repository has an incomplete history some of whose commits have parents cauterized away (in other words, git is told to pretend that these commits do not have the parents, even though they are recorded in the commit object). This is sometimes useful when you are interested only in the recent history of a project even though the real history recorded in the - upstream is much larger. A shallow repository + upstream is much larger. A shallow repository is created by giving the —depth option to git-clone(1), and its history can be later deepened with git-fetch(1).

@@ -893,11 +940,12 @@ div.exampleblock-content {

- Symbolic reference: instead of containing the SHA1 id - itself, it is of the format ref: refs/some/thing and when - referenced, it recursively dereferences to this reference. HEAD is a - prime example of a symref. Symbolic references are - manipulated with the git-symbolic-ref(1) command. + Symbolic reference: instead of containing the SHA1 + id itself, it is of the format ref: refs/some/thing and when + referenced, it recursively dereferences to this reference. + HEAD is a prime example of a symref. Symbolic + references are manipulated with the git-symbolic-ref(1) + command.

@@ -905,14 +953,14 @@ div.exampleblock-content {

- A ref pointing to a tag or + A ref pointing to a tag or commit object. In contrast to a head, a tag is not changed by a commit. Tags (not tag objects) are stored in $GIT_DIR/refs/tags/. A git tag has nothing to do with a Lisp tag (which would be called an object type in git's context). A tag is most typically used to mark a particular point in the - commit ancestry chain. + commit ancestry chain.

@@ -921,7 +969,7 @@ div.exampleblock-content {

An object containing a ref pointing to - another object, which can contain a message just like a + another object, which can contain a message just like a commit object. It can also contain a (PGP) signature, in which case it is called a "signed tag object".

@@ -944,8 +992,9 @@ div.exampleblock-content {

A regular git branch that is used to follow changes from - another repository. A tracking branch should not contain direct modifications or have local commits - made to it. A tracking branch can usually be + another repository. A tracking + branch should not contain direct modifications or have local commits + made to it. A tracking branch can usually be identified as the right-hand-side ref in a Pull: refspec.

@@ -955,8 +1004,8 @@ div.exampleblock-content {

- Either a working tree, or a tree object together with the dependent blob and tree objects - (i.e. a stored representation of a working tree). + Either a working tree, or a tree object together with the dependent blob and tree objects + (i.e. a stored representation of a working tree).

@@ -974,8 +1023,7 @@ div.exampleblock-content {

- A ref pointing to either a commit object, a tree object, or a tag object pointing to a tag or commit or - tree object. + A ref pointing to either a commit object, a tree object, or a tag object pointing to a tag or commit or tree object.

@@ -1001,8 +1049,9 @@ div.exampleblock-content {

- The set of files and directories currently being worked on, i.e. you can - work in your working tree without using git at all. + The tree of actual checked out files. The working tree is + normally equal to the HEAD plus any local changes + that you have made but not yet committed.

@@ -1010,7 +1059,7 @@ div.exampleblock-content { diff --git a/glossary.txt b/glossary.txt index 2465514e4..489c3e9d5 100644 --- a/glossary.txt +++ b/glossary.txt @@ -2,12 +2,12 @@ GIT Glossary ============ [[def_alternate_object_database]]alternate object database:: - Via the alternates mechanism, a <> can - inherit part of its <> from another - <>, which is called "alternate". + Via the alternates mechanism, a <> + can inherit part of its <> + from another object database, which is called "alternate". [[def_bare_repository]]bare repository:: - A <> is normally an appropriately + A bare repository is normally an appropriately named <> with a `.git` suffix that does not have a locally checked-out copy of any of the files under <> control. That is, all of the `git` @@ -21,10 +21,15 @@ GIT Glossary Untyped <>, e.g. the contents of a file. [[def_branch]]branch:: - A non-cyclical graph of revisions, i.e. the complete history of a - particular <>, which is called the - branch <>. The heads - are stored in `$GIT_DIR/refs/heads/`. + A "branch" is an active line of development. The most recent + <> on a branch is referred to as the tip of + that branch. The tip of the branch is referenced by a branch + <>, which moves forward as additional development + is done on the branch. A single git + <> can track an arbitrary number of + branches, but your <> is + associated with just one of them (the "current" or "checked out" + branch), and <> points to that branch. [[def_cache]]cache:: Obsolete for: <>. @@ -50,18 +55,25 @@ GIT Glossary as a new series of changes on top of different codebase. In GIT, this is performed by "git cherry-pick" command to extract the change introduced by an existing <> and to record it based on the tip - of the current <> as a new <>. + of the current <> as a new commit. [[def_clean]]clean:: - A <> is <>, if it + A <> is clean, if it corresponds to the <> referenced by the current <>. Also see "<>". [[def_commit]]commit:: - As a verb: The action of storing the current state of the - <> in the <>. The - result is a <>. As a noun: Short hand for - <>. + As a noun: A single point in the + git history; the entire history of a project is represented as a + set of interrelated commits. The word "commit" is often + used by git in the same places other revision control systems + use the words "revision" or "version". Also used as a short + hand for <>. ++ +As a verb: The action of storing a new snapshot of the project's +state in the git history, by creating a new commit representing the current +state of the <> and advancing <> +to point at the new commit. [[def_commit_object]]commit object:: An <> which contains the information about a @@ -77,16 +89,22 @@ GIT Glossary [[def_DAG]]DAG:: Directed acyclic graph. The <> objects form a directed acyclic graph, because they have parents (directed), and the - graph of <> objects is acyclic (there is no + graph of commit objects is acyclic (there is no <> which begins and ends with the same <>). [[def_dangling_object]]dangling object:: An <> which is not <> even from other unreachable objects; a - <> has no references to it from any + dangling object has no references to it from any reference or <> in the <>. +[[def_detached_HEAD]]detached HEAD:: + Normally the <> stores the name of a + <>. However, git also allows you to check + out an arbitrary commit that isn't necessarily the tip of any + particular branch. In this case HEAD is said to be "detached". + [[def_dircache]]dircache:: You are *waaaaay* behind. @@ -94,7 +112,7 @@ GIT Glossary The list you get with "ls" :-) [[def_dirty]]dirty:: - A <> is said to be <> if + A <> is said to be "dirty" if it contains modifications which have not been committed to the current <>. @@ -109,16 +127,16 @@ GIT Glossary <>'s changes that happen to be a descendant of what you have. In such these cases, you do not make a new <> <> but instead just update to his - <>. This will happen frequently on a + revision. This will happen frequently on a <> of a remote <>. [[def_fetch]]fetch:: Fetching a <> means to get the - <>'s <> from a remote - <>, to find out which objects are missing - from the local <>, and to get them, - too. + branch's <> from a remote + <>, to find out which objects are + missing from the local <>, + and to get them, too. See also gitlink:git-fetch[1]. [[def_file_system]]file system:: Linus Torvalds originally designed git to be a user space file system, @@ -132,61 +150,83 @@ GIT Glossary Grafts enables two otherwise different lines of development to be joined together by recording fake ancestry information for commits. This way you can make git pretend the set of parents a <> has - is different from what was recorded when the <> was + is different from what was recorded when the commit was created. Configured via the `.git/info/grafts` file. [[def_hash]]hash:: In git's context, synonym to <>. [[def_head]]head:: - The top of a <>. It contains a <> to the - corresponding <>. + A named reference to the <> at the tip of a + <>. Heads are stored in + `$GIT_DIR/refs/heads/`, except when using packed refs. (See + gitlink:git-pack-refs[1].) + +[[def_HEAD]]HEAD:: + The current branch. In more detail: Your <> is normally derived from the state of the tree + referred to by HEAD. HEAD is a reference to one of the + <> in your repository, except when using a + <>, in which case it may + reference an arbitrary commit. [[def_head_ref]]head ref:: - A <> pointing to a <>. Often, this is - abbreviated to "<>". Head refs are stored in - `$GIT_DIR/refs/heads/`. + A synonym for <>. [[def_hook]]hook:: During the normal execution of several git commands, call-outs are made to optional scripts that allow a developer to add functionality or checking. Typically, the hooks allow for a command to be pre-verified and potentially aborted, and allow for a post-notification after the - operation is done. The <> scripts are found in the + operation is done. The hook scripts are found in the `$GIT_DIR/hooks/` <>, and are enabled by simply making them executable. [[def_index]]index:: A collection of files with stat information, whose contents are stored - as objects. The <> is a stored version of your working + as objects. The index is a stored version of your working <>. Truth be told, it can also contain a second, and even a third version of a <>, which are used when merging. [[def_index_entry]]index entry:: The information regarding a particular file, stored in the - <>. An <> can be unmerged, - if a <> was started, but not yet finished (i.e. if the - <> contains multiple versions of that file). + <>. An index entry can be unmerged, if a + <> was started, but not yet finished (i.e. if + the index contains multiple versions of that file). [[def_master]]master:: - The default development <>. Whenever you create a git - <>, a <> named - "<>" is created, and becomes the active - <>. In most cases, this contains the local - development, though that is purely conventional and not required. + The default development <>. Whenever you + create a git <>, a branch named + "master" is created, and becomes the active branch. In most + cases, this contains the local development, though that is + purely by convention and is not required. [[def_merge]]merge:: - To <> branches means to try to accumulate the changes - since a common ancestor and apply them to the first - <>. An automatic <> uses heuristics - to accomplish that. Evidently, an automatic <> can - fail. + As a verb: To bring the contents of another + <> (possibly from an external + <>) into the current branch. In the + case where the merged-in branch is from a different repository, + this is done by first <> the remote branch + and then merging the result into the current branch. This + combination of fetch and merge operations is called a + <>. Merging is performed by an automatic process + that identifies changes made since the branches diverged, and + then applies all those changes together. In cases where changes + conflict, manual intervention may be required to complete the + merge. ++ +As a noun: unless it is a <>, a +successful merge results in the creation of a new <> +representing the result of the merge, and having as +<> the tips of the merged <>. +This commit is referred to as a "merge commit", or sometimes just a +"merge". [[def_object]]object:: The unit of storage in git. It is uniquely identified by the <> of its contents. Consequently, an - <> can not be changed. + object can not be changed. [[def_object_database]]object database:: Stores a set of "objects", and an individual <> is @@ -198,9 +238,9 @@ GIT Glossary [[def_object_name]]object name:: The unique identifier of an <>. The <> - of the <>'s contents using the Secure Hash Algorithm + of the object's contents using the Secure Hash Algorithm 1 and usually represented by the 40 character hexadecimal encoding of - the <> of the <> (possibly followed by + the <> of the object (possibly followed by a white space). [[def_object_type]]object type:: @@ -215,7 +255,7 @@ GIT Glossary [[def_origin]]origin:: The default upstream <>. Most projects have at least one upstream project which they track. By default - '<>' is used for that purpose. New upstream updates + 'origin' is used for that purpose. New upstream updates will be fetched into remote tracking branches named origin/name-of-upstream-branch, which you can see using "git <> -r". @@ -227,7 +267,7 @@ GIT Glossary [[def_pack_index]]pack index:: The list of identifiers, and other information, of the objects in a <>, to assist in efficiently accessing the contents of a - <>. + pack. [[def_parent]]parent:: A <> contains a (possibly empty) list @@ -247,29 +287,29 @@ GIT Glossary [[def_porcelain]]porcelain:: Cute name for programs and program suites depending on <>, presenting a high level access to - <>. Porcelains expose more of a <> + core git. Porcelains expose more of a <> interface than the <>. [[def_pull]]pull:: Pulling a <> means to <> it and - <> it. + <> it. See also gitlink:git-pull[1]. [[def_push]]push:: - Pushing a <> means to get the <>'s + Pushing a <> means to get the branch's <> from a remote <>, - find out if it is an ancestor to the <>'s local - <> is a direct, and in that case, putting all + find out if it is an ancestor to the branch's local + head ref is a direct, and in that case, putting all objects, which are <> from the local - <>, and which are missing from the remote - <>, into the remote + head ref, and which are missing from the remote + repository, into the remote <>, and updating the remote - <>. If the remote <> is not an - ancestor to the local <>, the <> fails. + head ref. If the remote <> is not an + ancestor to the local head, the push fails. [[def_reachable]]reachable:: All of the ancestors of a given <> are said to be - <> from that <>. More - generally, one <> is <> from + "reachable" from that commit. More + generally, one <> is reachable from another if we can reach the one from the other by a <> that follows <> to whatever they tag, <> to their parents or trees, and @@ -288,24 +328,23 @@ GIT Glossary [[def_refspec]]refspec:: A <> is used by <> and - <> to describe the mapping between remote <> - and local <>. They are combined with a colon in the format - :, preceded by an optional plus sign, +. For example: `git - fetch $URL refs/heads/master:refs/heads/origin` means - "grab the master <> <> - from the $URL and store it as my origin - <> <>". And `git <> - $URL refs/heads/master:refs/heads/to-upstream` means - "publish my master <> - <> as to-upstream <> at $URL". See - also gitlink:git-push[1] + <> to describe the mapping between remote + <> and local ref. They are combined with a colon in + the format :, preceded by an optional plus sign, +. + For example: `git fetch $URL + refs/heads/master:refs/heads/origin` means "grab the master + <> <> from the $URL and store + it as my origin branch head". And `git push + $URL refs/heads/master:refs/heads/to-upstream` means "publish my + master branch head as to-upstream branch at $URL". See also + gitlink:git-push[1] [[def_repository]]repository:: - A collection of refs together with an <> containing all objects which are <> - from the refs, possibly accompanied by meta data from one or more - porcelains. A <> can share an - <> with other repositories. + A collection of refs together with an + <> containing all objects + which are <> from the refs, possibly + accompanied by meta data from one or more porcelains. A + repository can share an object database with other repositories. [[def_resolve]]resolve:: The action of fixing up manually what a failed automatic @@ -327,36 +366,37 @@ GIT Glossary Synonym for <>. [[def_shallow_repository]]shallow repository:: - A <> has an incomplete + A shallow repository has an incomplete history some of whose commits have parents cauterized away (in other words, git is told to pretend that these commits do not have the parents, even though they are recorded in the <>). This is sometimes useful when you are interested only in the recent history of a project even though the real history recorded in the - upstream is much larger. A <> + upstream is much larger. A shallow repository is created by giving the `--depth` option to gitlink:git-clone[1], and its history can be later deepened with gitlink:git-fetch[1]. [[def_symref]]symref:: - Symbolic reference: instead of containing the <> id - itself, it is of the format 'ref: refs/some/thing' and when - referenced, it recursively dereferences to this reference. 'HEAD' is a - prime example of a <>. Symbolic references are - manipulated with the gitlink:git-symbolic-ref[1] command. + Symbolic reference: instead of containing the <> + id itself, it is of the format 'ref: refs/some/thing' and when + referenced, it recursively dereferences to this reference. + '<>' is a prime example of a symref. Symbolic + references are manipulated with the gitlink:git-symbolic-ref[1] + command. [[def_tag]]tag:: - A <> pointing to a <> or + A <> pointing to a tag or <>. In contrast to a <>, a tag is not changed by a <>. Tags (not <>) are stored in `$GIT_DIR/refs/tags/`. A git tag has nothing to do with a Lisp tag (which would be called an <> in git's context). A tag is most typically used to mark a particular point in the - <> ancestry <>. + commit ancestry <>. [[def_tag_object]]tag object:: An <> containing a <> pointing to - another <>, which can contain a message just like a + another object, which can contain a message just like a <>. It can also contain a (PGP) signature, in which case it is called a "signed <>". @@ -370,16 +410,16 @@ GIT Glossary [[def_tracking_branch]]tracking branch:: A regular git <> that is used to follow changes from - another <>. A <> should not contain direct modifications or have local commits - made to it. A <> can usually be + another <>. A tracking + branch should not contain direct modifications or have local commits + made to it. A tracking branch can usually be identified as the right-hand-side <> in a Pull: <>. [[def_tree]]tree:: Either a <>, or a <> together with the dependent blob and <> objects - (i.e. a stored representation of a <>). + object>> together with the dependent blob and tree objects + (i.e. a stored representation of a working tree). [[def_tree_object]]tree object:: An <> containing a list of file names and modes along @@ -389,8 +429,7 @@ GIT Glossary [[def_tree-ish]]tree-ish:: A <> pointing to either a <>, a <>, or a <> pointing to a <> or <> or - <>. + object>> pointing to a tag or commit or tree object. [[def_unmerged_index]]unmerged index:: An <> which contains unmerged @@ -401,5 +440,6 @@ GIT Glossary <>, <>, or any other reference. [[def_working_tree]]working tree:: - The set of files and directories currently being worked on, i.e. you can - work in your <> without using git at all. + The tree of actual checked out files. The working tree is + normally equal to the <> plus any local changes + that you have made but not yet committed. diff --git a/howto/dangling-objects.txt b/howto/dangling-objects.txt deleted file mode 100644 index e82ddae3c..000000000 --- a/howto/dangling-objects.txt +++ /dev/null @@ -1,109 +0,0 @@ -From: Linus Torvalds -Subject: Re: Question about fsck-objects output -Date: Thu, 25 Jan 2007 12:01:06 -0800 (PST) -Message-ID: -Archived-At: -Abstract: Linus describes what dangling objects are, when they - are left behind, and how to view their relationship with branch - heads in gitk - -On Thu, 25 Jan 2007, Larry Streepy wrote: - -> Sorry to ask such a basic question, but I can't quite decipher the output of -> fsck-objects. When I run it, I get this: -> -> git fsck-objects -> dangling commit 2213f6d4dd39ca8baebd0427723723e63208521b -> dangling commit f0d4e00196bd5ee54463e9ea7a0f0e8303da767f -> dangling blob 6a6d0b01b3e96d49a8f2c7addd4ef8c3bd1f5761 -> -> -> Even after a "repack -a -d" they still exist. The man page has a short -> explanation, but, at least for me, it wasn't fully enlightening. :-) -> -> The man page says that dangling commits could be "root" commits, but since my -> repo started as a clone of another repo, I don't see how I could have any root -> commits. Also, the page doesn't really describe what a dangling blob is. -> -> So, can someone explain what these artifacts are and if they are a problem -> that I should be worried about? - -The most common situation is that you've rebased a branch (or you have -pulled from somebody else who rebased a branch, like the "pu" branch in -the git.git archive itself). - -What happens is that the old head of the original branch still exists, as -does obviously everything it pointed to. The branch pointer itself just -doesn't, since you replaced it with another one. - -However, there are certainly other situations too that cause dangling -objects. For example, the "dangling blob" situation you have tends to be -because you did a "git add" of a file, but then, before you actually -committed it and made it part of the bigger picture, you changed something -else in that file and committed that *updated* thing - the old state that -you added originally ends up not being pointed to by any commit/tree, so -it's now a dangling blob object. - -Similarly, when the "recursive" merge strategy runs, and finds that there -are criss-cross merges and thus more than one merge base (which is fairly -unusual, but it does happen), it will generate one temporary midway tree -(or possibly even more, if you had lots of criss-crossing merges and -more than two merge bases) as a temporary internal merge base, and again, -those are real objects, but the end result will not end up pointing to -them, so they end up "dangling" in your repository. - -Generally, dangling objects aren't anything to worry about. They can even -be very useful: if you screw something up, the dangling objects can be how -you recover your old tree (say, you did a rebase, and realized that you -really didn't want to - you can look at what dangling objects you have, -and decide to reset your head to some old dangling state). - -For commits, the most useful thing to do with dangling objects tends to be -to do a simple - - gitk --not --all - -which means exactly what it sounds like: it says that you want to see the -commit history that is described by the dangling commit(s), but you do NOT -want to see the history that is described by all your branches and tags -(which are the things you normally reach). That basically shows you in a -nice way what the danglign commit was (and notice that it might not be -just one commit: we only report the "tip of the line" as being dangling, -but there might be a whole deep and complex commit history that has gotten -dropped - rebasing will do that). - -For blobs and trees, you can't do the same, but you can examine them. You -can just do - - git show - -to show what the contents of the blob were (or, for a tree, basically what -the "ls" for that directory was), and that may give you some idea of what -the operation was that left that dangling object. - -Usually, dangling blobs and trees aren't very interesting. They're almost -always the result of either being a half-way mergebase (the blob will -often even have the conflict markers from a merge in it, if you have had -conflicting merges that you fixed up by hand), or simply because you -interrupted a "git fetch" with ^C or something like that, leaving _some_ -of the new objects in the object database, but just dangling and useless. - -Anyway, once you are sure that you're not interested in any dangling -state, you can just prune all unreachable objects: - - git prune - -and they'll be gone. But you should only run "git prune" on a quiescent -repository - it's kind of like doing a filesystem fsck recovery: you don't -want to do that while the filesystem is mounted. - -(The same is true of "git-fsck-objects" itself, btw - but since -git-fsck-objects never actually *changes* the repository, it just reports -on what it found, git-fsck-objects itself is never "dangerous" to run. -Running it while somebody is actually changing the repository can cause -confusing and scary messages, but it won't actually do anything bad. In -contrast, running "git prune" while somebody is actively changing the -repository is a *BAD* idea). - - Linus - diff --git a/howto/isolate-bugs-with-bisect.txt b/howto/isolate-bugs-with-bisect.txt deleted file mode 100644 index 926bbdc3c..000000000 --- a/howto/isolate-bugs-with-bisect.txt +++ /dev/null @@ -1,65 +0,0 @@ -From: Linus Torvalds -To: git@vger.kernel.org -Date: 2005-11-08 1:31:34 -Subject: Real-life kernel debugging scenario -Abstract: Short-n-sweet, Linus tells us how to leverage `git-bisect` to perform - bug isolation on a repository where "good" and "bad" revisions are known - in order to identify a suspect commit. - - -How To Use git-bisect To Isolate a Bogus Commit -=============================================== - -The way to use "git bisect" couldn't be easier. - -Figure out what the oldest bad state you know about is (that's usually the -head of "master", since that's what you just tried to boot and failed at). -Also, figure out the most recent known-good commit (usually the _previous_ -kernel you ran: and if you've only done a single "pull" in between, it -will be ORIG_HEAD). - -Then do - - git bisect start - git bisect bad master <- mark "master" as the bad state - git bisect good ORIG_HEAD <- mark ORIG_HEAD as good (or - whatever other known-good - thing you booted last) - -and at this point "git bisect" will churn for a while, and tell you what -the mid-point between those two commits are, and check that state out as -the head of the new "bisect" branch. - -Compile and reboot. - -If it's good, just do - - git bisect good <- mark current head as good - -otherwise, reboot into a good kernel instead, and do (surprise surprise, -git really is very intuitive): - - git bisect bad <- mark current head as bad - -and whatever you do, git will select a new half-way point. Do this for a -while, until git tells you exactly which commit was the first bad commit. -That's your culprit. - -It really works wonderfully well, except for the case where there was -_another_ commit that broke something in between, like introduced some -stupid compile error. In that case you should not mark that commit good or -bad: you should try to find another commit close-by, and do a "git reset ---hard " to try out _that_ commit instead, and then test that -instead (and mark it good or bad). - -You can do "git bisect visualize" while you do all this to see what's -going on by starting up gitk on the bisection range. - -Finally, once you've figured out exactly which commit was bad, you can -then go back to the master branch, and try reverting just that commit: - - git checkout master - git revert - -to verify that the top-of-kernel works with that single commit reverted. - diff --git a/howto/make-dist.txt b/howto/make-dist.txt deleted file mode 100644 index 00e330b29..000000000 --- a/howto/make-dist.txt +++ /dev/null @@ -1,52 +0,0 @@ -Date: Fri, 12 Aug 2005 22:39:48 -0700 (PDT) -From: Linus Torvalds -To: Dave Jones -cc: git@vger.kernel.org -Subject: Re: Fwd: Re: git checkout -f branch doesn't remove extra files -Abstract: In this article, Linus talks about building a tarball, - incremental patch, and ChangeLog, given a base release and two - rc releases, following the convention of giving the patch from - the base release and the latest rc, with ChangeLog between the - last rc and the latest rc. - -On Sat, 13 Aug 2005, Dave Jones wrote: -> -> > Git actually has a _lot_ of nifty tools. I didn't realize that people -> > didn't know about such basic stuff as "git-tar-tree" and "git-ls-files". -> -> Maybe its because things are moving so fast :) Or maybe I just wasn't -> paying attention on that day. (I even read the git changes via RSS, -> so I should have no excuse). - -Well, git-tar-tree has been there since late April - it's actually one of -those really early commands. I'm pretty sure the RSS feed came later ;) - -I use it all the time in doing releases, it's a lot faster than creating a -tar tree by reading the filesystem (even if you don't have to check things -out). A hidden pearl. - -This is my crappy "release-script": - - [torvalds@g5 ~]$ cat bin/release-script - #!/bin/sh - stable="$1" - last="$2" - new="$3" - echo "# git-tag v$new" - echo "git-tar-tree v$new linux-$new | gzip -9 > ../linux-$new.tar.gz" - echo "git-diff-tree -p v$stable v$new | gzip -9 > ../patch-$new.gz" - echo "git-rev-list --pretty v$new ^v$last > ../ChangeLog-$new" - echo "git-rev-list --pretty=short v$new ^v$last | git-shortlog > ../ShortLog" - echo "git-diff-tree -p v$last v$new | git-apply --stat > ../diffstat-$new" - -and when I want to do a new kernel release I literally first tag it, and -then do - - release-script 2.6.12 2.6.13-rc6 2.6.13-rc7 - -and check that things look sane, and then just cut-and-paste the commands. - -Yeah, it's stupid. - - Linus - diff --git a/howto/using-topic-branches.txt b/howto/using-topic-branches.txt deleted file mode 100644 index 2c98194cb..000000000 --- a/howto/using-topic-branches.txt +++ /dev/null @@ -1,296 +0,0 @@ -Date: Mon, 15 Aug 2005 12:17:41 -0700 -From: tony.luck@intel.com -Subject: Some tutorial text (was git/cogito workshop/bof at linuxconf au?) -Abstract: In this article, Tony Luck discusses how he uses GIT - as a Linux subsystem maintainer. - -Here's something that I've been putting together on how I'm using -GIT as a Linux subsystem maintainer. - --Tony - -Last updated w.r.t. GIT 1.1 - -Linux subsystem maintenance using GIT -------------------------------------- - -My requirements here are to be able to create two public trees: - -1) A "test" tree into which patches are initially placed so that they -can get some exposure when integrated with other ongoing development. -This tree is available to Andrew for pulling into -mm whenever he wants. - -2) A "release" tree into which tested patches are moved for final -sanity checking, and as a vehicle to send them upstream to Linus -(by sending him a "please pull" request.) - -Note that the period of time that each patch spends in the "test" tree -is dependent on the complexity of the change. Since GIT does not support -cherry picking, it is not practical to simply apply all patches to the -test tree and then pull to the release tree as that would leave trivial -patches blocked in the test tree waiting for complex changes to accumulate -enough test time to graduate. - -Back in the BitKeeper days I achieved this by creating small forests of -temporary trees, one tree for each logical grouping of patches, and then -pulling changes from these trees first to the test tree, and then to the -release tree. At first I replicated this in GIT, but then I realised -that I could so this far more efficiently using branches inside a single -GIT repository. - -So here is the step-by-step guide how this all works for me. - -First create your work tree by cloning Linus's public tree: - - $ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux-2.6.git work - -Change directory into the cloned tree you just created - - $ cd work - -Set up a remotes file so that you can fetch the latest from Linus' master -branch into a local branch named "linus": - - $ cat > .git/remotes/linus - URL: git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux-2.6.git - Pull: master:linus - ^D - -and create the linus branch: - - $ git branch linus - -The "linus" branch will be used to track the upstream kernel. To update it, -you simply run: - - $ git fetch linus - -you can do this frequently (and it should be safe to do so with pending -work in your tree, but perhaps not if you are in mid-merge). - -If you need to keep track of other public trees, you can add remote branches -for them too: - - $ git branch another - $ cat > .git/remotes/another - URL: ... insert URL here ... - Pull: name-of-branch-in-this-remote-tree:another - ^D - -and run: - - $ git fetch another - -Now create the branches in which you are going to work, these start -out at the current tip of the linus branch. - - $ git branch test linus - $ git branch release linus - -These can be easily kept up to date by merging from the "linus" branch: - - $ git checkout test && git merge "Auto-update from upstream" test linus - $ git checkout release && git merge "Auto-update from upstream" release linus - -Important note! If you have any local changes in these branches, then -this merge will create a commit object in the history (with no local -changes git will simply do a "Fast forward" merge). Many people dislike -the "noise" that this creates in the Linux history, so you should avoid -doing this capriciously in the "release" branch, as these noisy commits -will become part of the permanent history when you ask Linus to pull -from the release branch. - -Set up so that you can push upstream to your public tree (you need to -log-in to the remote system and create an empty tree there before the -first push). - - $ cat > .git/remotes/mytree - URL: master.kernel.org:/pub/scm/linux/kernel/git/aegl/linux-2.6.git - Push: release - Push: test - ^D - -and the push both the test and release trees using: - - $ git push mytree - -or push just one of the test and release branches using: - - $ git push mytree test -or - $ git push mytree release - -Now to apply some patches from the community. Think of a short -snappy name for a branch to hold this patch (or related group of -patches), and create a new branch from the current tip of the -linus branch: - - $ git checkout -b speed-up-spinlocks linus - -Now you apply the patch(es), run some tests, and commit the change(s). If -the patch is a multi-part series, then you should apply each as a separate -commit to this branch. - - $ ... patch ... test ... commit [ ... patch ... test ... commit ]* - -When you are happy with the state of this change, you can pull it into the -"test" branch in preparation to make it public: - - $ git checkout test && git merge "Pull speed-up-spinlock changes" test speed-up-spinlocks - -It is unlikely that you would have any conflicts here ... but you might if you -spent a while on this step and had also pulled new versions from upstream. - -Some time later when enough time has passed and testing done, you can pull the -same branch into the "release" tree ready to go upstream. This is where you -see the value of keeping each patch (or patch series) in its own branch. It -means that the patches can be moved into the "release" tree in any order. - - $ git checkout release && git merge "Pull speed-up-spinlock changes" release speed-up-spinlocks - -After a while, you will have a number of branches, and despite the -well chosen names you picked for each of them, you may forget what -they are for, or what status they are in. To get a reminder of what -changes are in a specific branch, use: - - $ git-whatchanged branchname ^linus | git-shortlog - -To see whether it has already been merged into the test or release branches -use: - - $ git-rev-list branchname ^test -or - $ git-rev-list branchname ^release - -[If this branch has not yet been merged you will see a set of SHA1 values -for the commits, if it has been merged, then there will be no output] - -Once a patch completes the great cycle (moving from test to release, then -pulled by Linus, and finally coming back into your local "linus" branch) -the branch for this change is no longer needed. You detect this when the -output from: - - $ git-rev-list branchname ^linus - -is empty. At this point the branch can be deleted: - - $ git branch -d branchname - -Some changes are so trivial that it is not necessary to create a separate -branch and then merge into each of the test and release branches. For -these changes, just apply directly to the "release" branch, and then -merge that into the "test" branch. - -To create diffstat and shortlog summaries of changes to include in a "please -pull" request to Linus you can use: - - $ git-whatchanged -p release ^linus | diffstat -p1 -and - $ git-whatchanged release ^linus | git-shortlog - - -Here are some of the scripts that I use to simplify all this even further. - -==== update script ==== -# Update a branch in my GIT tree. If the branch to be updated -# is "linus", then pull from kernel.org. Otherwise merge local -# linus branch into test|release branch - -case "$1" in -test|release) - git checkout $1 && git merge "Auto-update from upstream" $1 linus - ;; -linus) - before=$(cat .git/refs/heads/linus) - git fetch linus - after=$(cat .git/refs/heads/linus) - if [ $before != $after ] - then - git-whatchanged $after ^$before | git-shortlog - fi - ;; -*) - echo "Usage: $0 linus|test|release" 1>&2 - exit 1 - ;; -esac - -==== merge script ==== -# Merge a branch into either the test or release branch - -pname=$0 - -usage() -{ - echo "Usage: $pname branch test|release" 1>&2 - exit 1 -} - -if [ ! -f .git/refs/heads/"$1" ] -then - echo "Can't see branch <$1>" 1>&2 - usage -fi - -case "$2" in -test|release) - if [ $(git-rev-list $1 ^$2 | wc -c) -eq 0 ] - then - echo $1 already merged into $2 1>&2 - exit 1 - fi - git checkout $2 && git merge "Pull $1 into $2 branch" $2 $1 - ;; -*) - usage - ;; -esac - -==== status script ==== -# report on status of my ia64 GIT tree - -gb=$(tput setab 2) -rb=$(tput setab 1) -restore=$(tput setab 9) - -if [ `git-rev-list release ^test | wc -c` -gt 0 ] -then - echo $rb Warning: commits in release that are not in test $restore - git-whatchanged release ^test -fi - -for branch in `ls .git/refs/heads` -do - if [ $branch = linus -o $branch = test -o $branch = release ] - then - continue - fi - - echo -n $gb ======= $branch ====== $restore " " - status= - for ref in test release linus - do - if [ `git-rev-list $branch ^$ref | wc -c` -gt 0 ] - then - status=$status${ref:0:1} - fi - done - case $status in - trl) - echo $rb Need to pull into test $restore - ;; - rl) - echo "In test" - ;; - l) - echo "Waiting for linus" - ;; - "") - echo $rb All done $restore - ;; - *) - echo $rb "<$status>" $restore - ;; - esac - git-whatchanged $branch ^linus | git-shortlog -done diff --git a/user-manual.html b/user-manual.html index 5406bb892..ad1337bf6 100644 --- a/user-manual.html +++ b/user-manual.html @@ -1,81 +1,12 @@ -Git User's Manual (for version 1.5.1 or newer)

Git User's Manual (for version 1.5.1 or newer)


Table of Contents

Preface
1. Git Quick Start
Creating a new repository
Managing branches
Exploring history
Making changes
Merging
Sharing your changes
Repository maintenance
2. Repositories and Branches
How to get a git repository
How to check out a different version of a project
Understanding History: Commits
Understanding history: commits, parents, and reachability
Understanding history: History diagrams
Understanding history: What is a branch?
Manipulating branches
Examining an old version without creating a new branch
Examining branches from a remote repository
Naming branches, tags, and other references
Updating a repository with git fetch
Fetching branches from other repositories
3. Exploring git history
How to use bisect to find a regression
Naming commits
Creating tags
Browsing revisions
Generating diffs
Viewing old file versions
Examples
Check whether two branches point at the same history
Find first tagged version including a given fix
4. Developing with git
Telling git your name
Creating a new repository
How to make a commit
Creating good commit messages
How to merge
Resolving a merge
Getting conflict-resolution help during a merge
Undoing a merge
Fast-forward merges
Fixing mistakes
Fixing a mistake with a new commit
Fixing a mistake by editing history
Checking out an old version of a file
Ensuring good performance
Ensuring reliability
Checking the repository for corruption
Recovering lost changes
5. Sharing development with others
Getting updates with git pull
Submitting patches to a project
Importing patches to a project
Setting up a public repository
Exporting a git repository via http
Exporting a git repository via the git protocol
Pushing changes to a public repository
Setting up a shared repository
Allow web browsing of a repository
Examples
6. Rewriting history and maintaining patch series
Creating the perfect patch series
Keeping a patch series up to date using git-rebase
Modifying a single commit
Reordering or selecting from a patch series
Other tools
Problems with rewriting history
7. Advanced branch management
Fetching individual branches
git fetch and fast-forwards
Forcing git fetch to do non-fast-forward updates
Configuring remote branches
8. Git internals
The Object Database
Blob Object
Tree Object
Commit Object
Trust
Tag Object
The "index" aka "Current Directory Cache"
The Workflow
working directory -> index
index -> object database
object database -> index
index -> working directory
Tying it all together
Examining the data
Merging multiple trees
Merging multiple trees, continued
How git stores objects efficiently: pack files
Dangling objects
9. GIT Glossary
10. Notes and todo list for this manual

Preface

This manual is designed to be readable by someone with basic unix -command-line skills, but no previous knowledge of git.

Chapter 1 gives a brief overview of git commands, without any -explanation; you may prefer to skip to chapter 2 on a first reading.

Chapters 2 and 3 explain how to fetch and study a project using -git—the tools you'd need to build and test a particular version of a -software project, to search for regressions, and so on.

Chapter 4 explains how to do development with git, and chapter 5 how -to share that development with others.

Further chapters cover more specialized topics.

Comprehensive reference documentation is available through the man -pages. For a command such as "git clone", just use

$ man git-clone

Chapter 1. Git Quick Start

This is a quick summary of the major commands; the following chapters -will explain how these work in more detail.

Creating a new repository

From a tarball:

$ tar xzf project.tar.gz
-$ cd project
-$ git init
-Initialized empty Git repository in .git/
-$ git add .
-$ git commit

From a remote repository:

$ git clone git://example.com/pub/project.git
-$ cd project

Managing branches

$ git branch         # list all local branches in this repo
-$ git checkout test  # switch working directory to branch "test"
-$ git branch new     # create branch "new" starting at current HEAD
-$ git branch -d new  # delete branch "new"

Instead of basing new branch on current HEAD (the default), use:

$ git branch new test    # branch named "test"
-$ git branch new v2.6.15 # tag named v2.6.15
-$ git branch new HEAD^   # commit before the most recent
-$ git branch new HEAD^^  # commit before that
-$ git branch new test~10 # ten commits before tip of branch "test"

Create and switch to a new branch at the same time:

$ git checkout -b new v2.6.15

Update and examine branches from the repository you cloned from:

$ git fetch             # update
-$ git branch -r         # list
-  origin/master
-  origin/next
-  ...
-$ git checkout -b masterwork origin/master

Fetch a branch from a different repository, and give it a new -name in your repository:

$ git fetch git://example.com/project.git theirbranch:mybranch
-$ git fetch git://example.com/project.git v2.6.15:mybranch

Keep a list of repositories you work with regularly:

$ git remote add example git://example.com/project.git
-$ git remote                    # list remote repositories
-example
-origin
-$ git remote show example       # get details
-* remote example
-  URL: git://example.com/project.git
-  Tracked remote branches
-    master next ...
-$ git fetch example             # update branches from example
-$ git branch -r                 # list all remote branches

Exploring history

$ gitk                      # visualize and browse history
-$ git log                   # list all commits
-$ git log src/              # ...modifying src/
-$ git log v2.6.15..v2.6.16  # ...in v2.6.16, not in v2.6.15
-$ git log master..test      # ...in branch test, not in branch master
-$ git log test..master      # ...in branch master, but not in test
-$ git log test...master     # ...in one branch, not in both
-$ git log -S'foo()'         # ...where difference contain "foo()"
-$ git log --since="2 weeks ago"
-$ git log -p                # show patches as well
-$ git show                  # most recent commit
-$ git diff v2.6.15..v2.6.16 # diff between two tagged versions
-$ git diff v2.6.15..HEAD    # diff with current head
-$ git grep "foo()"          # search working directory for "foo()"
-$ git grep v2.6.15 "foo()"  # search old tree for "foo()"
-$ git show v2.6.15:a.txt    # look at old version of a.txt

Search for regressions:

$ git bisect start
-$ git bisect bad                # current version is bad
-$ git bisect good v2.6.13-rc2   # last known good revision
-Bisecting: 675 revisions left to test after this
-                                # test here, then:
-$ git bisect good               # if this revision is good, or
-$ git bisect bad                # if this revision is bad.
-                                # repeat until done.

Making changes

Make sure git knows who to blame:

$ cat >>~/.gitconfig <<\EOF
-[user]
-        name = Your Name Comes Here
-        email = you@yourdomain.example.com
-EOF

Select file contents to include in the next commit, then make the -commit:

$ git add a.txt    # updated file
-$ git add b.txt    # new file
-$ git rm c.txt     # old file
-$ git commit

Or, prepare and create the commit in one step:

$ git commit d.txt # use latest content only of d.txt
-$ git commit -a    # use latest content of all tracked files

Merging

$ git merge test   # merge branch "test" into the current branch
-$ git pull git://example.com/project.git master
-                   # fetch and merge in remote branch
-$ git pull . test  # equivalent to git merge test

Sharing your changes

Importing or exporting patches:

$ git format-patch origin..HEAD # format a patch for each commit
-                                # in HEAD but not in origin
-$ git am mbox # import patches from the mailbox "mbox"

Fetch a branch in a different git repository, then merge into the -current branch:

$ git pull git://example.com/project.git theirbranch

Store the fetched branch into a local branch before merging into the -current branch:

$ git pull git://example.com/project.git theirbranch:mybranch

After creating commits on a local branch, update the remote -branch with your commits:

$ git push ssh://example.com/project.git mybranch:theirbranch

When remote and local branch are both named "test":

$ git push ssh://example.com/project.git test

Shortcut version for a frequently used remote repository:

$ git remote add example ssh://example.com/project.git
-$ git push example test

Repository maintenance

Check for corruption:

$ git fsck

Recompress, remove unused cruft:

$ git gc

Chapter 2. Repositories and Branches

How to get a git repository

It will be useful to have a git repository to experiment with as you +Git User's Manual (for version 1.5.1 or newer)

Git User's Manual (for version 1.5.1 or newer)


Table of Contents

Preface
1. Repositories and Branches
How to get a git repository
How to check out a different version of a project
Understanding History: Commits
Understanding history: commits, parents, and reachability
Understanding history: History diagrams
Understanding history: What is a branch?
Manipulating branches
Examining an old version without creating a new branch
Examining branches from a remote repository
Naming branches, tags, and other references
Updating a repository with git fetch
Fetching branches from other repositories
2. Exploring git history
How to use bisect to find a regression
Naming commits
Creating tags
Browsing revisions
Generating diffs
Viewing old file versions
Examples
Counting the number of commits on a branch
Check whether two branches point at the same history
Find first tagged version including a given fix
Showing commits unique to a given branch
Creating a changelog and tarball for a software release
3. Developing with git
Telling git your name
Creating a new repository
How to make a commit
Creating good commit messages
How to merge
Resolving a merge
Getting conflict-resolution help during a merge
Undoing a merge
Fast-forward merges
Fixing mistakes
Fixing a mistake with a new commit
Fixing a mistake by editing history
Checking out an old version of a file
Ensuring good performance
Ensuring reliability
Checking the repository for corruption
Recovering lost changes
4. Sharing development with others
Getting updates with git pull
Submitting patches to a project
Importing patches to a project
Public git repositories
Setting up a public repository
Exporting a git repository via the git protocol
Exporting a git repository via http
Pushing changes to a public repository
Setting up a shared repository
Allowing web browsing of a repository
Examples
Maintaining topic branches for a Linux subsystem maintainer
5. Rewriting history and maintaining patch series
Creating the perfect patch series
Keeping a patch series up to date using git-rebase
Modifying a single commit
Reordering or selecting from a patch series
Other tools
Problems with rewriting history
6. Advanced branch management
Fetching individual branches
git fetch and fast-forwards
Forcing git fetch to do non-fast-forward updates
Configuring remote branches
7. Git internals
The Object Database
Blob Object
Tree Object
Commit Object
Trust
Tag Object
The "index" aka "Current Directory Cache"
The Workflow
working directory -> index
index -> object database
object database -> index
index -> working directory
Tying it all together
Examining the data
Merging multiple trees
Merging multiple trees, continued
How git stores objects efficiently: pack files
Dangling objects
A birds-eye view of Git's source code
8. GIT Glossary
A. Git Quick Start
Creating a new repository
Managing branches
Exploring history
Making changes
Merging
Sharing your changes
Repository maintenance
B. Notes and todo list for this manual

Preface

Git is a fast distributed revision control system.

This manual is designed to be readable by someone with basic unix +command-line skills, but no previous knowledge of git.

Chapter 1, Repositories and Branches and Chapter 2, Exploring git history explain how +to fetch and study a project using git—read these chapters to learn how +to build and test a particular version of a software project, search for +regressions, and so on.

People needing to do actual development will also want to read +Chapter 3, Developing with git and Chapter 4, Sharing development with others.

Further chapters cover more specialized topics.

Comprehensive reference documentation is available through the man +pages. For a command such as "git clone", just use

$ man git-clone

See also Appendix A, Git Quick Start for a brief overview of git commands, +without any explanation.

Also, see Appendix B, Notes and todo list for this manual for ways that you can help make this manual more +complete.

Chapter 1. Repositories and Branches

How to get a git repository

It will be useful to have a git repository to experiment with as you read this manual.

The best way to get one is by using the git-clone(1) command to download a copy of an existing repository for a project that you are interested in. If you don't already have a project in mind, here @@ -157,7 +88,7 @@ example in email), then you are guaranteed that name will refer to the same commit in their repository that it does in yours (assuming their repository has that commit at all). Since the object name is computed as a hash over the contents of the commit, you are guaranteed that the commit can never change -without its name also changing.

In fact, in Chapter 8, Git internals we shall see that everything stored in git +without its name also changing.

In fact, in Chapter 7, Git internals we shall see that everything stored in git history, including file data and directory contents, is stored in an object with a name that is a hash of its contents.

Understanding history: commits, parents, and reachability

Every commit (except the very first commit in a project) also has a parent commit which shows what happened before this commit. @@ -293,7 +224,7 @@ a new stanza:

$ ...

This is what causes git to track the remote's branches; you may modify or delete these configuration options by editing .git/config with a text editor. (See the "CONFIGURATION FILE" section of -git-config(1) for details.)

Chapter 2. Exploring git history

Git is best thought of as a tool for storing the history of a collection of files. It does this by storing compressed snapshots of the contents of a file heirarchy, together with "commits" which show the relationships between these snapshots.

Git provides extremely flexible and fast tools for exploring the @@ -384,7 +315,10 @@ will not be the same as the diff produced by the git-diff example.

$ git show v2.5:fs/locks.c

Before the colon may be anything that names a commit, and after it -may be any path to a file tracked by git.

Examples

Check whether two branches point at the same history

Suppose you want to check whether two branches point at the same point +may be any path to a file tracked by git.

Examples

Counting the number of commits on a branch

Suppose you want to know how many commits you've made on "mybranch" +since it diverged from "origin":

$ git log --pretty=oneline origin..mybranch | wc -l

Alternatively, you may often see this sort of thing done with the +lower-level command git-rev-list(1), which just lists the SHA1's +of all the given commits:

$ git rev-list origin..mybranch | wc -l

Check whether two branches point at the same history

Suppose you want to check whether two branches point at the same point in history.

$ git diff origin..master

will tell you whether the contents of the project are the same at the two branches; in theory, however, it's possible that the same project contents could have been arrived at by two different historical @@ -420,7 +354,38 @@ available
   ! [v1.5.0-rc2] GIT v1.5.0-rc2
...

then search for a line that looks like

+ ++ [e05db0fd] Fix warnings in sha1_file.c - use C99 printf format if
available

Which shows that e05db0fd is reachable from itself, from v1.5.0-rc1, and -from v1.5.0-rc2, but not from v1.5.0-rc0.

Chapter 4. Developing with git

Telling git your name

Before creating any commits, you should introduce yourself to git. The +from v1.5.0-rc2, but not from v1.5.0-rc0.

Showing commits unique to a given branch

Suppose you would like to see all the commits reachable from the branch +head named "master" but not from any other head in your repository.

We can list all the heads in this repository with +git-show-ref(1):

$ git show-ref --heads
+bf62196b5e363d73353a9dcf094c59595f3153b7 refs/heads/core-tutorial
+db768d5504c1bb46f63ee9d6e1772bd047e05bf9 refs/heads/maint
+a07157ac624b2524a059a3414e99f6f44bebc1e7 refs/heads/master
+24dbc180ea14dc1aebe09f14c8ecf32010690627 refs/heads/tutorial-2
+1e87486ae06626c2f31eaa63d26fc0fd646c8af2 refs/heads/tutorial-fixes

We can get just the branch-head names, and remove "master", with +the help of the standard utilities cut and grep:

$ git show-ref --heads | cut -d' ' -f2 | grep -v '^refs/heads/master'
+refs/heads/core-tutorial
+refs/heads/maint
+refs/heads/tutorial-2
+refs/heads/tutorial-fixes

And then we can ask to see all the commits reachable from master +but not from these other heads:

$ gitk master --not $( git show-ref --heads | cut -d' ' -f2 |
+                                grep -v '^refs/heads/master' )

Obviously, endless variations are possible; for example, to see all +commits reachable from some head but not from any tag in the repository:

$ gitk ($ git show-ref --heads ) --not  $( git show-ref --tags )

(See git-rev-parse(1) for explanations of commit-selecting +syntax such as —not.)

Creating a changelog and tarball for a software release

The git-archive(1) command can create a tar or zip archive from +any version of a project; for example:

$ git archive --format=tar --prefix=project/ HEAD | gzip >latest.tar.gz

will use HEAD to produce a tar archive in which each filename is +preceded by "prefix/".

If you're releasing a new version of a software project, you may want +to simultaneously make a changelog to include in the release +announcement.

Linus Torvalds, for example, makes new kernel releases by tagging them, +then running:

$ release-script 2.6.12 2.6.13-rc6 2.6.13-rc7

where release-script is a shell script that looks like:

#!/bin/sh
+stable="$1"
+last="$2"
+new="$3"
+echo "# git tag v$new"
+echo "git archive --prefix=linux-$new/ v$new | gzip -9 > ../linux-$new.tar.gz"
+echo "git diff v$stable v$new | gzip -9 > ../patch-$new.gz"
+echo "git log --no-merges v$new ^v$last > ../ChangeLog-$new"
+echo "git shortlog --no-merges v$new ^v$last > ../ShortLog"
+echo "git diff --stat --summary -M v$last v$new > ../diffstat-$new"

and then he just cut-and-pastes the output commands after verifying that +they look OK.

Chapter 3. Developing with git

Telling git your name

Before creating any commits, you should introduce yourself to git. The easiest way to do so is to make sure the following lines appear in a file named .gitconfig in your home directory:

[user]
        name = Your Name Comes Here
@@ -581,7 +546,7 @@ changes, giving you a chance to edit the old commit message first.

Again, been merged into another branch; use git-revert(1) instead in that case.

It is also possible to edit commits further back in the history, but this is an advanced topic to be left for -another chapter.

Checking out an old version of a file

In the process of undoing a previous bad change, you may find it +another chapter.

Checking out an old version of a file

In the process of undoing a previous bad change, you may find it useful to check out an older version of a particular file using git-checkout(1). We've used git checkout before to switch branches, but it has quite different behavior if it is given a path @@ -644,7 +609,7 @@ you get exactly the history reachable from that commit that is lost. "tip of the line" as being dangling, but there might be a whole deep and complex commit history that was dropped.)

If you decide you want the history back, you can always create a new reference pointing to it, for example, a new branch:

$ git branch recovered-branch 7281251ddd

Other types of dangling objects (blobs and trees) are also possible, and -dangling objects can arise in other situations.

Chapter 4. Sharing development with others

Getting updates with git pull

After you clone a repository and make a few changes of your own, you may wish to check the original repository for updates and merge them into your own work.

We have already seen how to keep remote tracking branches up to date with git-fetch(1), and how to merge two branches. So you can merge in changes from the @@ -681,18 +646,17 @@ leave your tree and index untouched, you may omit that option.)

Once the i resolution, instead of creating a new commit, just run

$ git am --resolved

and git will create the commit for you and continue applying the remaining patches from the mailbox.

The final result will be a series of commits, one for each patch in the original mailbox, with authorship and commit log message each -taken from the message containing each patch.

Setting up a public repository

Another way to submit changes to a project is to simply tell the -maintainer of that project to pull from your repository, exactly as -you did in the section "Getting updates with git pull".

If you and maintainer both have accounts on the same machine, then -then you can just pull changes from each other's repositories -directly; note that all of the commands (git-clone(1), -git-fetch[1], git-pull[1], etc.) that accept a URL as an argument -will also accept a local directory name; so, for example, you can -use

$ git clone /path/to/repository
-$ git pull /path/to/other/repository

If this sort of setup is inconvenient or impossible, another (more -common) option is to set up a public repository on a public server. -This also allows you to cleanly separate private work in progress -from publicly visible work.

You will continue to do your day-to-day work in your personal +taken from the message containing each patch.

Public git repositories

Another way to submit changes to a project is to tell the maintainer of +that project to pull the changes from your repository using git-pull[1]. +In the section "Getting updates with git pull" we described this as a way to get updates from the "main" +repository, but it works just as well in the other direction.

If you and the maintainer both have accounts on the same machine, then +you can just pull changes from each other's repositories directly; +commands that accepts repository URLs as arguments will also accept a +local directory name:

$ git clone /path/to/repository
+$ git pull /path/to/other/repository

However, the more common way to do this is to maintain a separate public +repository (usually on a different host) for others to pull changes +from. This is usually more convenient, and allows you to cleanly +separate private work in progress from publicly visible work.

You will continue to do your day-to-day work in your personal repository, but periodically "push" changes from your personal repository into your public repository, allowing other developers to pull from that repository. So the flow of changes, in a situation @@ -705,15 +669,23 @@ your personal repo ------------------> your public repo | | | | | they push V -their public repo <------------------- their repo

Now, assume your personal repository is in the directory ~/proj. We -first create a new clone of the repository:

$ git clone --bare ~/proj proj.git

The resulting directory proj.git contains a "bare" git repository—it is -just the contents of the ".git" directory, without a checked-out copy of -a working directory.

Next, copy proj.git to the server where you plan to host the +their public repo <------------------- their repo

Setting up a public repository

Assume your personal repository is in the directory ~/proj. We +first create a new clone of the repository and tell git-daemon that it +is meant to be public:

$ git clone --bare ~/proj proj.git
+$ touch proj.git/git-daemon-export-ok

The resulting directory proj.git contains a "bare" git repository—it is +just the contents of the ".git" directory, without any files checked out +around it.

Next, copy proj.git to the server where you plan to host the public repository. You can use scp, rsync, or whatever is most -convenient.

If somebody else maintains the public server, they may already have -set up a git service for you, and you may skip to the section -"Pushing changes to a public repository", below.

Otherwise, the following sections explain how to export your newly -created public repository:

Exporting a git repository via http

The git protocol gives better performance and reliability, but on a +convenient.

Exporting a git repository via the git protocol

This is the preferred method.

If someone else administers the server, they should tell you what +directory to put the repository in, and what git:// url it will appear +at. You can then skip to the section +"Pushing changes to a public repository", below.

Otherwise, all you need to do is start git-daemon(1); it will +listen on port 9418. By default, it will allow access to any directory +that looks like a git directory and contains the magic file +git-daemon-export-ok. Passing some directory paths as git-daemon +arguments will further restrict the exports to those paths.

You can also run git-daemon as an inetd service; see the +git-daemon(1) man page for details. (See especially the +examples section.)

Exporting a git repository via http

The git protocol gives better performance and reliability, but on a host with a web server set up, http exports may be simpler to set up.

All you need to do is place the newly created bare git repository in a directory that is exported by the web server, and make some adjustments to give web clients some extra information they need:

$ mv proj.git /home/you/public_html/proj.git
@@ -725,8 +697,7 @@ $ clone or pull from that url, for example with a commandline like:

$ git clone http://yourserver.com/~you/proj.git

(See also setup-git-server-over-http for a slightly more sophisticated setup using WebDAV which also -allows pushing over http.)

Exporting a git repository via the git protocol

This is the preferred method.

For now, we refer you to the git-daemon(1) man page for -instructions. (See especially the examples section.)

Pushing changes to a public repository

Note that the two techniques outline above (exporting via +allows pushing over http.)

Pushing changes to a public repository

Note that the two techniques outlined above (exporting via http or git) allow other maintainers to fetch your latest changes, but they do not allow write access, which you will need to update the public repository with the @@ -742,13 +713,168 @@ save typing; so, for example, after

$         url = ssh://yourserver.com/~you/proj.git
EOF

you should be able to perform the above push with just

$ git push public-repo master

See the explanations of the remote.<name>.url, branch.<name>.remote, and remote.<name>.push options in git-config(1) for -details.

Setting up a shared repository

Another way to collaborate is by using a model similar to that +details.

Setting up a shared repository

Another way to collaborate is by using a model similar to that commonly used in CVS, where several developers with special rights all push to and pull from a single shared repository. See git for CVS users for instructions on how to -set this up.

Allow web browsing of a repository

The gitweb cgi script provides users an easy way to browse your +set this up.

Allowing web browsing of a repository

The gitweb cgi script provides users an easy way to browse your project's files and history without having to install git; see the file -gitweb/INSTALL in the git source tree for instructions on setting it up.

Examples

TODO: topic branches, typical roles as in everyday.txt, ?

Chapter 6. Rewriting history and maintaining patch series

Normally commits are only added to a project, never taken away or +gitweb/INSTALL in the git source tree for instructions on setting it up.

Examples

Maintaining topic branches for a Linux subsystem maintainer

This describes how Tony Luck uses git in his role as maintainer of the +IA64 architecture for the Linux kernel.

He uses two public branches:

  • +A "test" tree into which patches are initially placed so that they + can get some exposure when integrated with other ongoing development. + This tree is available to Andrew for pulling into -mm whenever he + wants. +
  • +A "release" tree into which tested patches are moved for final sanity + checking, and as a vehicle to send them upstream to Linus (by sending + him a "please pull" request.) +

He also uses a set of temporary branches ("topic branches"), each +containing a logical grouping of patches.

To set this up, first create your work tree by cloning Linus's public +tree:

$ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux-2.6.git work
+$ cd work

Linus's tree will be stored in the remote branch named origin/master, +and can be updated using git-fetch(1); you can track other +public trees using git-remote(1) to set up a "remote" and +git-fetch[1] to keep them up-to-date; see Chapter 1, Repositories and Branches.

Now create the branches in which you are going to work; these start out +at the current tip of origin/master branch, and should be set up (using +the —track option to git-branch(1)) to merge changes in from +Linus by default.

$ git branch --track test origin/master
+$ git branch --track release origin/master

These can be easily kept up to date using git-pull(1)

$ git checkout test && git pull
+$ git checkout release && git pull

Important note! If you have any local changes in these branches, then +this merge will create a commit object in the history (with no local +changes git will simply do a "Fast forward" merge). Many people dislike +the "noise" that this creates in the Linux history, so you should avoid +doing this capriciously in the "release" branch, as these noisy commits +will become part of the permanent history when you ask Linus to pull +from the release branch.

A few configuration variables (see git-config(1)) can +make it easy to push both branches to your public tree. (See +the section called “Setting up a public repository”.)

$ cat >> .git/config <<EOF
+[remote "mytree"]
+        url =  master.kernel.org:/pub/scm/linux/kernel/git/aegl/linux-2.6.git
+        push = release
+        push = test
+EOF

Then you can push both the test and release trees using +git-push(1):

$ git push mytree

or push just one of the test and release branches using:

$ git push mytree test

or

$ git push mytree release

Now to apply some patches from the community. Think of a short +snappy name for a branch to hold this patch (or related group of +patches), and create a new branch from the current tip of Linus's +branch:

$ git checkout -b speed-up-spinlocks origin

Now you apply the patch(es), run some tests, and commit the change(s). If +the patch is a multi-part series, then you should apply each as a separate +commit to this branch.

$ ... patch ... test  ... commit [ ... patch ... test ... commit ]*

When you are happy with the state of this change, you can pull it into the +"test" branch in preparation to make it public:

$ git checkout test && git pull . speed-up-spinlocks

It is unlikely that you would have any conflicts here … but you might if you +spent a while on this step and had also pulled new versions from upstream.

Some time later when enough time has passed and testing done, you can pull the +same branch into the "release" tree ready to go upstream. This is where you +see the value of keeping each patch (or patch series) in its own branch. It +means that the patches can be moved into the "release" tree in any order.

$ git checkout release && git pull . speed-up-spinlocks

After a while, you will have a number of branches, and despite the +well chosen names you picked for each of them, you may forget what +they are for, or what status they are in. To get a reminder of what +changes are in a specific branch, use:

$ git log linux..branchname | git-shortlog

To see whether it has already been merged into the test or release branches +use:

$ git log test..branchname

or

$ git log release..branchname

(If this branch has not yet been merged you will see some log entries. +If it has been merged, then there will be no output.)

Once a patch completes the great cycle (moving from test to release, +then pulled by Linus, and finally coming back into your local +"origin/master" branch) the branch for this change is no longer needed. +You detect this when the output from:

$ git log origin..branchname

is empty. At this point the branch can be deleted:

$ git branch -d branchname

Some changes are so trivial that it is not necessary to create a separate +branch and then merge into each of the test and release branches. For +these changes, just apply directly to the "release" branch, and then +merge that into the "test" branch.

To create diffstat and shortlog summaries of changes to include in a "please +pull" request to Linus you can use:

$ git diff --stat origin..release

and

$ git log -p origin..release | git shortlog

Here are some of the scripts that simplify all this even further.

==== update script ====
+# Update a branch in my GIT tree.  If the branch to be updated
+# is origin, then pull from kernel.org.  Otherwise merge
+# origin/master branch into test|release branch
+
+case "$1" in
+test|release)
+        git checkout $1 && git pull . origin
+        ;;
+origin)
+        before=$(cat .git/refs/remotes/origin/master)
+        git fetch origin
+        after=$(cat .git/refs/remotes/origin/master)
+        if [ $before != $after ]
+        then
+                git log $before..$after | git shortlog
+        fi
+        ;;
+*)
+        echo "Usage: $0 origin|test|release" 1>&2
+        exit 1
+        ;;
+esac

==== merge script ====
+# Merge a branch into either the test or release branch
+
+pname=$0
+
+usage()
+{
+        echo "Usage: $pname branch test|release" 1>&2
+        exit 1
+}
+
+if [ ! -f .git/refs/heads/"$1" ]
+then
+        echo "Can't see branch <$1>" 1>&2
+        usage
+fi
+
+case "$2" in
+test|release)
+        if [ $(git log $2..$1 | wc -c) -eq 0 ]
+        then
+                echo $1 already merged into $2 1>&2
+                exit 1
+        fi
+        git checkout $2 && git pull . $1
+        ;;
+*)
+        usage
+        ;;
+esac

==== status script ====
+# report on status of my ia64 GIT tree
+
+gb=$(tput setab 2)
+rb=$(tput setab 1)
+restore=$(tput setab 9)
+
+if [ `git rev-list test..release | wc -c` -gt 0 ]
+then
+        echo $rb Warning: commits in release that are not in test $restore
+        git log test..release
+fi
+
+for branch in `ls .git/refs/heads`
+do
+        if [ $branch = test -o $branch = release ]
+        then
+                continue
+        fi
+
+        echo -n $gb ======= $branch ====== $restore " "
+        status=
+        for ref in test release origin/master
+        do
+                if [ `git rev-list $ref..$branch | wc -c` -gt 0 ]
+                then
+                        status=$status${ref:0:1}
+                fi
+        done
+        case $status in
+        trl)
+                echo $rb Need to pull into test $restore
+                ;;
+        rl)
+                echo "In test"
+                ;;
+        l)
+                echo "Waiting for linus"
+                ;;
+        "")
+                echo $rb All done $restore
+                ;;
+        *)
+                echo $rb "<$status>" $restore
+                ;;
+        esac
+        git log origin/master..$branch | git shortlog
+done

Chapter 5. Rewriting history and maintaining patch series

Normally commits are only added to a project, never taken away or replaced. Git is designed with this assumption, and violating it will cause git's merge machinery (for example) to do the wrong thing.

However, there is a situation in which it can be useful to violate this assumption.

Creating the perfect patch series

Suppose you are a contributor to a large project, and you want to add a @@ -842,7 +968,7 @@ new. The results are likely to be unexpected.

You may still choose to pub and it may be useful for others to be able to fetch those branches in order to examine or test them, but they should not attempt to pull such branches into their own work.

For true distributed development that supports proper merging, -published branches should never be rewritten.

Chapter 7. Advanced branch management

Fetching individual branches

Instead of using git-remote(1), you can also choose just +published branches should never be rewritten.

Chapter 6. Advanced branch management

Fetching individual branches

Instead of using git-remote(1), you can also choose just to update one branch at a time, and to store it locally under an arbitrary name:

$ git fetch origin todo:my-todo-work

The first argument, "origin", just tells git to fetch from the repository you originally cloned from. The second argument tells git @@ -888,7 +1014,7 @@ $ throwing away commits on mybranch.

Also note that all of the above configuration can be performed by directly editing the file .git/config instead of using git-config(1).

See git-config(1) for more details on the configuration -options mentioned above.

Chapter 7. Git internals

Git depends on two fundamental abstractions: the "object database", and the "current directory cache" aka "index".

The Object Database

The object database is literally just a content-addressable collection of objects. All objects are named by their content, which is approximated by the SHA1 hash of the object itself. Objects may refer @@ -1222,7 +1348,7 @@ objects will work exactly as they did before.

The

Dangling objects

The git-fsck(1) command will sometimes complain about dangling objects. They are not a problem.

The most common cause of dangling objects is that you've rebased a branch, or you have pulled from somebody else who rebased a branch—see -Chapter 6, Rewriting history and maintaining patch series. In that case, the old head of the original +Chapter 5, Rewriting history and maintaining patch series. In that case, the old head of the original branch still exists, as does everything it pointed to. The branch pointer itself just doesn't, since you replaced it with another one.

There are also other situations that cause dangling objects. For example, a "dangling blob" may arise because you did a "git add" of a @@ -1260,16 +1386,113 @@ on what it found, git-fsck itself is never "dangerous" to run. Running it while somebody is actually changing the repository can cause confusing and scary messages, but it won't actually do anything bad. In contrast, running "git prune" while somebody is actively changing the -repository is a BAD idea).

Chapter 9. GIT Glossary

+repository is a BAD idea).

A birds-eye view of Git's source code

It is not always easy for new developers to find their way through Git's +source code. This section gives you a little guidance to show where to +start.

A good place to start is with the contents of the initial commit, with:

$ git checkout e83c5163

The initial revision lays the foundation for almost everything git has +today, but is small enough to read in one sitting.

Note that terminology has changed since that revision. For example, the +README in that revision uses the word "changeset" to describe what we +now call a commit.

Also, we do not call it "cache" any more, but "index", however, the +file is still called cache.h. Remark: Not much reason to change it now, +especially since there is no good single name for it anyway, because it is +basically _the_ header file which is included by _all_ of Git's C sources.

If you grasp the ideas in that initial commit, you should check out a +more recent version and skim cache.h, object.h and commit.h.

In the early days, Git (in the tradition of UNIX) was a bunch of programs +which were extremely simple, and which you used in scripts, piping the +output of one into another. This turned out to be good for initial +development, since it was easier to test new things. However, recently +many of these parts have become builtins, and some of the core has been +"libified", i.e. put into libgit.a for performance, portability reasons, +and to avoid code duplication.

By now, you know what the index is (and find the corresponding data +structures in cache.h), and that there are just a couple of object types +(blobs, trees, commits and tags) which inherit their common structure from +struct object, which is their first member (and thus, you can cast e.g. +(struct object *)commit to achieve the _same_ as &commit->object, i.e. +get at the object name and flags).

Now is a good point to take a break to let this information sink in.

Next step: get familiar with the object naming. Read the section called “Naming commits”. +There are quite a few ways to name an object (and not only revisions!). +All of these are handled in sha1_name.c. Just have a quick look at +the function get_sha1(). A lot of the special handling is done by +functions like get_sha1_basic() or the likes.

This is just to get you into the groove for the most libified part of Git: +the revision walker.

Basically, the initial version of git log was a shell script:

$ git-rev-list --pretty $(git-rev-parse --default HEAD "$@") | \
+        LESS=-S ${PAGER:-less}

What does this mean?

git-rev-list is the original version of the revision walker, which +_always_ printed a list of revisions to stdout. It is still functional, +and needs to, since most new Git programs start out as scripts using +git-rev-list.

git-rev-parse is not as important any more; it was only used to filter out +options that were relevant for the different plumbing commands that were +called by the script.

Most of what git-rev-list did is contained in revision.c and +revision.h. It wraps the options in a struct named rev_info, which +controls how and what revisions are walked, and more.

The original job of git-rev-parse is now taken by the function +setup_revisions(), which parses the revisions and the common command line +options for the revision walker. This information is stored in the struct +rev_info for later consumption. You can do your own command line option +parsing after calling setup_revisions(). After that, you have to call +prepare_revision_walk() for initialization, and then you can get the +commits one by one with the function get_revision().

If you are interested in more details of the revision walking process, +just have a look at the first implementation of cmd_log(); call +git-show v1.3.0155^24 and scroll down to that function (note that you +no longer need to call setup_pager() directly).

Nowadays, git log is a builtin, which means that it is _contained_ in the +command git. The source side of a builtin is

  • +a function called cmd_<bla>, typically defined in builtin-<bla>.c, + and declared in builtin.h, +
  • +an entry in the commands[] array in git.c, and +
  • +an entry in BUILTIN_OBJECTS in the Makefile. +

Sometimes, more than one builtin is contained in one source file. For +example, cmd_whatchanged() and cmd_log() both reside in builtin-log.c, +since they share quite a bit of code. In that case, the commands which are +_not_ named like the .c file in which they live have to be listed in +BUILT_INS in the Makefile.

git log looks more complicated in C than it does in the original script, +but that allows for a much greater flexibility and performance.

Here again it is a good point to take a pause.

Lesson three is: study the code. Really, it is the best way to learn about +the organization of Git (after you know the basic concepts).

So, think about something which you are interested in, say, "how can I +access a blob just knowing the object name of it?". The first step is to +find a Git command with which you can do it. In this example, it is either +git show or git cat-file.

For the sake of clarity, let's stay with git cat-file, because it

  • +is plumbing, and +
  • +was around even in the initial commit (it literally went only through + some 20 revisions as cat-file.c, was renamed to builtin-cat-file.c + when made a builtin, and then saw less than 10 versions). +

So, look into builtin-cat-file.c, search for cmd_cat_file() and look what +it does.

        git_config(git_default_config);
+        if (argc != 3)
+                usage("git-cat-file [-t|-s|-e|-p|<type>] <sha1>");
+        if (get_sha1(argv[2], sha1))
+                die("Not a valid object name %s", argv[2]);

Let's skip over the obvious details; the only really interesting part +here is the call to get_sha1(). It tries to interpret argv[2] as an +object name, and if it refers to an object which is present in the current +repository, it writes the resulting SHA-1 into the variable sha1.

Two things are interesting here:

  • +get_sha1() returns 0 on _success_. This might surprise some new + Git hackers, but there is a long tradition in UNIX to return different + negative numbers in case of different errors — and 0 on success. +
  • +the variable sha1 in the function signature of get_sha1() is unsigned + char *, but is actually expected to be a pointer to unsigned + char[20]. This variable will contain the 160-bit SHA-1 of the given + commit. Note that whenever a SHA-1 is passed as unsigned char *, it + is the binary representation, as opposed to the ASCII representation in + hex characters, which is passed as char *. +

You will see both of these things throughout the code.

Now, for the meat:

        case 0:
+                buf = read_object_with_reference(sha1, argv[1], &size, NULL);

This is how you read a blob (actually, not only a blob, but any type of +object). To know how the function read_object_with_reference() actually +works, find the source code for it (something like git grep +read_object_with | grep ":[a-z]" in the git repository), and read +the source.

To find out how the result can be used, just read on in cmd_cat_file():

        write_or_die(1, buf, size);

Sometimes, you do not know where to look for a feature. In many such cases, +it helps to search through the output of git log, and then git show the +corresponding commit.

Example: If you know that there was some test case for git bundle, but +do not remember where it was (yes, you _could_ git grep bundle t/, but that +does not illustrate the point!):

$ git log --no-merges t/

In the pager (less), just search for "bundle", go a few lines back, +and see that it is in commit 18449ab0… Now just copy this object name, +and paste it into the command line

$ git show 18449ab0

Voila.

Another example: Find out what to do in order to make some script a +builtin:

$ git log --no-merges --diff-filter=A builtin-*.c

You see, Git is actually the best tool to find out about the source of Git +itself!

Chapter 8. GIT Glossary

alternate object database
- Via the alternates mechanism, a repository can - inherit part of its object database from another - object database, which is called "alternate". + Via the alternates mechanism, a repository + can inherit part of its object database + from another object database, which is called "alternate".
bare repository
- A bare repository is normally an appropriately + A bare repository is normally an appropriately named directory with a .git suffix that does not have a locally checked-out copy of any of the files under revision control. That is, all of the git @@ -1285,10 +1508,15 @@ repository is a BAD idea).

branch
- A non-cyclical graph of revisions, i.e. the complete history of a - particular revision, which is called the - branch head. The heads - are stored in $GIT_DIR/refs/heads/. + A "branch" is an active line of development. The most recent + commit on a branch is referred to as the tip of + that branch. The tip of the branch is referenced by a branch + head, which moves forward as additional development + is done on the branch. A single git + repository can track an arbitrary number of + branches, but your working tree is + associated with just one of them (the "current" or "checked out" + branch), and HEAD points to that branch.
cache
@@ -1319,21 +1547,26 @@ repository is a BAD idea).

commit and to record it based on the tip - of the current branch as a new commit. + of the current branch as a new commit.
clean
- A working tree is clean, if it + A working tree is clean, if it corresponds to the revision referenced by the current head. Also see "dirty".
commit -
- As a verb: The action of storing the current state of the - index in the object database. The - result is a revision. As a noun: Short hand for - commit object. -
+

+ As a noun: A single point in the + git history; the entire history of a project is represented as a + set of interrelated commits. The word "commit" is often + used by git in the same places other revision control systems + use the words "revision" or "version". Also used as a short + hand for commit object. +

As a verb: The action of storing a new snapshot of the project's +state in the git history, by creating a new commit representing the current +state of the index and advancing HEAD +to point at the new commit.

commit object
An object which contains the information about a @@ -1351,7 +1584,7 @@ repository is a BAD idea).

Directed acyclic graph. The commit objects form a directed acyclic graph, because they have parents (directed), and the - graph of commit objects is acyclic (there is no + graph of commit objects is acyclic (there is no chain which begins and ends with the same object).
@@ -1359,9 +1592,16 @@ repository is a BAD idea).

An unreachable object which is not reachable even from other unreachable objects; a - dangling object has no references to it from any + dangling object has no references to it from any reference or object in the repository.
+detached HEAD +
+ Normally the HEAD stores the name of a + branch. However, git also allows you to check + out an arbitrary commit that isn't necessarily the tip of any + particular branch. In this case HEAD is said to be "detached". +
dircache
You are waaaaay behind. @@ -1372,7 +1612,7 @@ repository is a BAD idea).

dirty
- A working tree is said to be dirty if + A working tree is said to be "dirty" if it contains modifications which have not been committed to the current branch.
@@ -1389,17 +1629,17 @@ repository is a BAD idea).

branch's changes that happen to be a descendant of what you have. In such these cases, you do not make a new merge commit but instead just update to his - revision. This will happen frequently on a + revision. This will happen frequently on a tracking branch of a remote repository.
fetch
Fetching a branch means to get the - branch's head ref from a remote - repository, to find out which objects are missing - from the local object database, and to get them, - too. + branch's head ref from a remote + repository, to find out which objects are + missing from the local object database, + and to get them, too. See also git-fetch(1).
file system
@@ -1416,7 +1656,7 @@ repository is a BAD idea).

commit has - is different from what was recorded when the commit was + is different from what was recorded when the commit was created. Configured via the .git/info/grafts file.
hash @@ -1425,14 +1665,22 @@ repository is a BAD idea).

head
- The top of a branch. It contains a ref to the - corresponding commit object. + A named reference to the commit at the tip of a + branch. Heads are stored in + $GIT_DIR/refs/heads/, except when using packed refs. (See + git-pack-refs(1).) +
+HEAD +
+ The current branch. In more detail: Your working tree is normally derived from the state of the tree + referred to by HEAD. HEAD is a reference to one of the + heads in your repository, except when using a + detached HEAD, in which case it may + reference an arbitrary commit.
head ref
- A ref pointing to a head. Often, this is - abbreviated to "head". Head refs are stored in - $GIT_DIR/refs/heads/. + A synonym for head.
hook
@@ -1440,14 +1688,14 @@ repository is a BAD idea).

hook scripts are found in the + operation is done. The hook scripts are found in the $GIT_DIR/hooks/ directory, and are enabled by simply making them executable.
index
A collection of files with stat information, whose contents are stored - as objects. The index is a stored version of your working + as objects. The index is a stored version of your working tree. Truth be told, it can also contain a second, and even a third version of a working tree, which are used when merging. @@ -1455,31 +1703,43 @@ repository is a BAD idea).

index entry
The information regarding a particular file, stored in the - index. An index entry can be unmerged, - if a merge was started, but not yet finished (i.e. if the - index contains multiple versions of that file). + index. An index entry can be unmerged, if a + merge was started, but not yet finished (i.e. if + the index contains multiple versions of that file).
master
- The default development branch. Whenever you create a git - repository, a branch named - "master" is created, and becomes the active - branch. In most cases, this contains the local - development, though that is purely conventional and not required. + The default development branch. Whenever you + create a git repository, a branch named + "master" is created, and becomes the active branch. In most + cases, this contains the local development, though that is + purely by convention and is not required.
merge -
- To merge branches means to try to accumulate the changes - since a common ancestor and apply them to the first - branch. An automatic merge uses heuristics - to accomplish that. Evidently, an automatic merge can - fail. -
+

+ As a verb: To bring the contents of another + branch (possibly from an external + repository) into the current branch. In the + case where the merged-in branch is from a different repository, + this is done by first fetching the remote branch + and then merging the result into the current branch. This + combination of fetch and merge operations is called a + pull. Merging is performed by an automatic process + that identifies changes made since the branches diverged, and + then applies all those changes together. In cases where changes + conflict, manual intervention may be required to complete the + merge. +

As a noun: unless it is a fast forward, a +successful merge results in the creation of a new commit +representing the result of the merge, and having as +parents the tips of the merged branches. +This commit is referred to as a "merge commit", or sometimes just a +"merge".

object
The unit of storage in git. It is uniquely identified by the SHA1 of its contents. Consequently, an - object can not be changed. + object can not be changed.
object database
@@ -1494,9 +1754,9 @@ repository is a BAD idea).

object name
The unique identifier of an object. The hash - of the object's contents using the Secure Hash Algorithm + of the object's contents using the Secure Hash Algorithm 1 and usually represented by the 40 character hexadecimal encoding of - the hash of the object (possibly followed by + the hash of the object (possibly followed by a white space).
object type @@ -1514,7 +1774,7 @@ repository is a BAD idea).

The default upstream repository. Most projects have at least one upstream project which they track. By default - origin is used for that purpose. New upstream updates + origin is used for that purpose. New upstream updates will be fetched into remote tracking branches named origin/name-of-upstream-branch, which you can see using "git branch -r". @@ -1528,7 +1788,7 @@ repository is a BAD idea).

The list of identifiers, and other information, of the objects in a pack, to assist in efficiently accessing the contents of a - pack. + pack.
parent
@@ -1552,32 +1812,32 @@ repository is a BAD idea).

Cute name for programs and program suites depending on core git, presenting a high level access to - core git. Porcelains expose more of a SCM + core git. Porcelains expose more of a SCM interface than the plumbing.
pull
Pulling a branch means to fetch it and - merge it. + merge it. See also git-pull(1).
push
- Pushing a branch means to get the branch's + Pushing a branch means to get the branch's head ref from a remote repository, - find out if it is an ancestor to the branch's local - head ref is a direct, and in that case, putting all + find out if it is an ancestor to the branch's local + head ref is a direct, and in that case, putting all objects, which are reachable from the local - head ref, and which are missing from the remote - repository, into the remote + head ref, and which are missing from the remote + repository, into the remote object database, and updating the remote - head ref. If the remote head is not an - ancestor to the local head, the push fails. + head ref. If the remote head is not an + ancestor to the local head, the push fails.
reachable
All of the ancestors of a given commit are said to be - reachable from that commit. More - generally, one object is reachable from + "reachable" from that commit. More + generally, one object is reachable from another if we can reach the one from the other by a chain that follows tags to whatever they tag, commits to their parents or trees, and @@ -1599,24 +1859,24 @@ repository is a BAD idea).

refspec
A refspec is used by fetch and - push to describe the mapping between remote ref - and local ref. They are combined with a colon in the format - <src>:<dst>, preceded by an optional plus sign, +. For example: git - fetch $URL refs/heads/master:refs/heads/origin means - "grab the master branch head - from the $URL and store it as my origin - branch head". And git push - $URL refs/heads/master:refs/heads/to-upstream means - "publish my master branch - head as to-upstream branch at $URL". See - also git-push(1) + push to describe the mapping between remote + ref and local ref. They are combined with a colon in + the format <src>:<dst>, preceded by an optional plus sign, +. + For example: git fetch $URL + refs/heads/master:refs/heads/origin means "grab the master + branch head from the $URL and store + it as my origin branch head". And git push + $URL refs/heads/master:refs/heads/to-upstream means "publish my + master branch head as to-upstream branch at $URL". See also + git-push(1)
repository
- A collection of refs together with an object database containing all objects which are reachable - from the refs, possibly accompanied by meta data from one or more - porcelains. A repository can share an - object database with other repositories. + A collection of refs together with an + object database containing all objects + which are reachable from the refs, possibly + accompanied by meta data from one or more porcelains. A + repository can share an object database with other repositories.
resolve
@@ -1644,38 +1904,39 @@ repository is a BAD idea).

shallow repository
- A shallow repository has an incomplete + A shallow repository has an incomplete history some of whose commits have parents cauterized away (in other words, git is told to pretend that these commits do not have the parents, even though they are recorded in the commit object). This is sometimes useful when you are interested only in the recent history of a project even though the real history recorded in the - upstream is much larger. A shallow repository + upstream is much larger. A shallow repository is created by giving the —depth option to git-clone(1), and its history can be later deepened with git-fetch(1).
symref
- Symbolic reference: instead of containing the SHA1 id - itself, it is of the format ref: refs/some/thing and when - referenced, it recursively dereferences to this reference. HEAD is a - prime example of a symref. Symbolic references are - manipulated with the git-symbolic-ref(1) command. + Symbolic reference: instead of containing the SHA1 + id itself, it is of the format ref: refs/some/thing and when + referenced, it recursively dereferences to this reference. + HEAD is a prime example of a symref. Symbolic + references are manipulated with the git-symbolic-ref(1) + command.
tag
- A ref pointing to a tag or + A ref pointing to a tag or commit object. In contrast to a head, a tag is not changed by a commit. Tags (not tag objects) are stored in $GIT_DIR/refs/tags/. A git tag has nothing to do with a Lisp tag (which would be called an object type in git's context). A tag is most typically used to mark a particular point in the - commit ancestry chain. + commit ancestry chain.
tag object
An object containing a ref pointing to - another object, which can contain a message just like a + another object, which can contain a message just like a commit object. It can also contain a (PGP) signature, in which case it is called a "signed tag object".
@@ -1690,15 +1951,16 @@ repository is a BAD idea).

tracking branch
A regular git branch that is used to follow changes from - another repository. A tracking branch should not contain direct modifications or have local commits - made to it. A tracking branch can usually be + another repository. A tracking + branch should not contain direct modifications or have local commits + made to it. A tracking branch can usually be identified as the right-hand-side ref in a Pull: refspec.
tree
- Either a working tree, or a tree object together with the dependent blob and tree objects - (i.e. a stored representation of a working tree). + Either a working tree, or a tree object together with the dependent blob and tree objects + (i.e. a stored representation of a working tree).
tree object
@@ -1708,8 +1970,7 @@ repository is a BAD idea).

tree-ish
- A ref pointing to either a commit object, a tree object, or a tag object pointing to a tag or commit or - tree object. + A ref pointing to either a commit object, a tree object, or a tag object pointing to a tag or commit or tree object.
unmerged index
@@ -1723,9 +1984,81 @@ repository is a BAD idea).

working tree
- The set of files and directories currently being worked on, i.e. you can - work in your working tree without using git at all. -

Chapter 10. Notes and todo list for this manual

This is a work in progress.

The basic requirements: + The tree of actual checked out files. The working tree is + normally equal to the HEAD plus any local changes + that you have made but not yet committed. +

Appendix A. Git Quick Start

This is a quick summary of the major commands; the following chapters +will explain how these work in more detail.

Creating a new repository

From a tarball:

$ tar xzf project.tar.gz
+$ cd project
+$ git init
+Initialized empty Git repository in .git/
+$ git add .
+$ git commit

From a remote repository:

$ git clone git://example.com/pub/project.git
+$ cd project

Managing branches

$ git branch         # list all local branches in this repo
+$ git checkout test  # switch working directory to branch "test"
+$ git branch new     # create branch "new" starting at current HEAD
+$ git branch -d new  # delete branch "new"

Instead of basing new branch on current HEAD (the default), use:

$ git branch new test    # branch named "test"
+$ git branch new v2.6.15 # tag named v2.6.15
+$ git branch new HEAD^   # commit before the most recent
+$ git branch new HEAD^^  # commit before that
+$ git branch new test~10 # ten commits before tip of branch "test"

Create and switch to a new branch at the same time:

$ git checkout -b new v2.6.15

Update and examine branches from the repository you cloned from:

$ git fetch             # update
+$ git branch -r         # list
+  origin/master
+  origin/next
+  ...
+$ git checkout -b masterwork origin/master

Fetch a branch from a different repository, and give it a new +name in your repository:

$ git fetch git://example.com/project.git theirbranch:mybranch
+$ git fetch git://example.com/project.git v2.6.15:mybranch

Keep a list of repositories you work with regularly:

$ git remote add example git://example.com/project.git
+$ git remote                    # list remote repositories
+example
+origin
+$ git remote show example       # get details
+* remote example
+  URL: git://example.com/project.git
+  Tracked remote branches
+    master next ...
+$ git fetch example             # update branches from example
+$ git branch -r                 # list all remote branches

Exploring history

$ gitk                      # visualize and browse history
+$ git log                   # list all commits
+$ git log src/              # ...modifying src/
+$ git log v2.6.15..v2.6.16  # ...in v2.6.16, not in v2.6.15
+$ git log master..test      # ...in branch test, not in branch master
+$ git log test..master      # ...in branch master, but not in test
+$ git log test...master     # ...in one branch, not in both
+$ git log -S'foo()'         # ...where difference contain "foo()"
+$ git log --since="2 weeks ago"
+$ git log -p                # show patches as well
+$ git show                  # most recent commit
+$ git diff v2.6.15..v2.6.16 # diff between two tagged versions
+$ git diff v2.6.15..HEAD    # diff with current head
+$ git grep "foo()"          # search working directory for "foo()"
+$ git grep v2.6.15 "foo()"  # search old tree for "foo()"
+$ git show v2.6.15:a.txt    # look at old version of a.txt

Search for regressions:

$ git bisect start
+$ git bisect bad                # current version is bad
+$ git bisect good v2.6.13-rc2   # last known good revision
+Bisecting: 675 revisions left to test after this
+                                # test here, then:
+$ git bisect good               # if this revision is good, or
+$ git bisect bad                # if this revision is bad.
+                                # repeat until done.

Making changes

Make sure git knows who to blame:

$ cat >>~/.gitconfig <<\EOF
+[user]
+        name = Your Name Comes Here
+        email = you@yourdomain.example.com
+EOF

Select file contents to include in the next commit, then make the +commit:

$ git add a.txt    # updated file
+$ git add b.txt    # new file
+$ git rm c.txt     # old file
+$ git commit

Or, prepare and create the commit in one step:

$ git commit d.txt # use latest content only of d.txt
+$ git commit -a    # use latest content of all tracked files

Merging

$ git merge test   # merge branch "test" into the current branch
+$ git pull git://example.com/project.git master
+                   # fetch and merge in remote branch
+$ git pull . test  # equivalent to git merge test

Sharing your changes

Importing or exporting patches:

$ git format-patch origin..HEAD # format a patch for each commit
+                                # in HEAD but not in origin
+$ git am mbox # import patches from the mailbox "mbox"

Fetch a branch in a different git repository, then merge into the +current branch:

$ git pull git://example.com/project.git theirbranch

Store the fetched branch into a local branch before merging into the +current branch:

$ git pull git://example.com/project.git theirbranch:mybranch

After creating commits on a local branch, update the remote +branch with your commits:

$ git push ssh://example.com/project.git mybranch:theirbranch

When remote and local branch are both named "test":

$ git push ssh://example.com/project.git test

Shortcut version for a frequently used remote repository:

$ git remote add example ssh://example.com/project.git
+$ git push example test

Repository maintenance

Check for corruption:

$ git fsck

Recompress, remove unused cruft:

$ git gc

Appendix B. Notes and todo list for this manual

This is a work in progress.

The basic requirements: - It must be readable in order, from beginning to end, by someone intelligent with a basic grasp of the unix commandline, but without any special knowledge of git. If diff --git a/user-manual.txt b/user-manual.txt index 8d6688633..c4bff474d 100644 --- a/user-manual.txt +++ b/user-manual.txt @@ -1,18 +1,19 @@ Git User's Manual (for version 1.5.1 or newer) ______________________________________________ + +Git is a fast distributed revision control system. + This manual is designed to be readable by someone with basic unix command-line skills, but no previous knowledge of git. -Chapter 1 gives a brief overview of git commands, without any -explanation; you may prefer to skip to chapter 2 on a first reading. +<> and <> explain how +to fetch and study a project using git--read these chapters to learn how +to build and test a particular version of a software project, search for +regressions, and so on. -Chapters 2 and 3 explain how to fetch and study a project using -git--the tools you'd need to build and test a particular version of a -software project, to search for regressions, and so on. - -Chapter 4 explains how to do development with git, and chapter 5 how -to share that development with others. +People needing to do actual development will also want to read +<> and <>. Further chapters cover more specialized topics. @@ -23,237 +24,12 @@ pages. For a command such as "git clone", just use $ man git-clone ------------------------------------------------ -[[git-quick-start]] -Git Quick Start -=============== - -This is a quick summary of the major commands; the following chapters -will explain how these work in more detail. - -[[quick-creating-a-new-repository]] -Creating a new repository -------------------------- - -From a tarball: - ------------------------------------------------ -$ tar xzf project.tar.gz -$ cd project -$ git init -Initialized empty Git repository in .git/ -$ git add . -$ git commit ------------------------------------------------ - -From a remote repository: - ------------------------------------------------ -$ git clone git://example.com/pub/project.git -$ cd project ------------------------------------------------ - -[[managing-branches]] -Managing branches ------------------ - ------------------------------------------------ -$ git branch # list all local branches in this repo -$ git checkout test # switch working directory to branch "test" -$ git branch new # create branch "new" starting at current HEAD -$ git branch -d new # delete branch "new" ------------------------------------------------ - -Instead of basing new branch on current HEAD (the default), use: - ------------------------------------------------ -$ git branch new test # branch named "test" -$ git branch new v2.6.15 # tag named v2.6.15 -$ git branch new HEAD^ # commit before the most recent -$ git branch new HEAD^^ # commit before that -$ git branch new test~10 # ten commits before tip of branch "test" ------------------------------------------------ - -Create and switch to a new branch at the same time: - ------------------------------------------------ -$ git checkout -b new v2.6.15 ------------------------------------------------ - -Update and examine branches from the repository you cloned from: - ------------------------------------------------ -$ git fetch # update -$ git branch -r # list - origin/master - origin/next - ... -$ git checkout -b masterwork origin/master ------------------------------------------------ - -Fetch a branch from a different repository, and give it a new -name in your repository: - ------------------------------------------------ -$ git fetch git://example.com/project.git theirbranch:mybranch -$ git fetch git://example.com/project.git v2.6.15:mybranch ------------------------------------------------ - -Keep a list of repositories you work with regularly: - ------------------------------------------------ -$ git remote add example git://example.com/project.git -$ git remote # list remote repositories -example -origin -$ git remote show example # get details -* remote example - URL: git://example.com/project.git - Tracked remote branches - master next ... -$ git fetch example # update branches from example -$ git branch -r # list all remote branches ------------------------------------------------ - - -[[exploring-history]] -Exploring history ------------------ - ------------------------------------------------ -$ gitk # visualize and browse history -$ git log # list all commits -$ git log src/ # ...modifying src/ -$ git log v2.6.15..v2.6.16 # ...in v2.6.16, not in v2.6.15 -$ git log master..test # ...in branch test, not in branch master -$ git log test..master # ...in branch master, but not in test -$ git log test...master # ...in one branch, not in both -$ git log -S'foo()' # ...where difference contain "foo()" -$ git log --since="2 weeks ago" -$ git log -p # show patches as well -$ git show # most recent commit -$ git diff v2.6.15..v2.6.16 # diff between two tagged versions -$ git diff v2.6.15..HEAD # diff with current head -$ git grep "foo()" # search working directory for "foo()" -$ git grep v2.6.15 "foo()" # search old tree for "foo()" -$ git show v2.6.15:a.txt # look at old version of a.txt ------------------------------------------------ - -Search for regressions: - ------------------------------------------------ -$ git bisect start -$ git bisect bad # current version is bad -$ git bisect good v2.6.13-rc2 # last known good revision -Bisecting: 675 revisions left to test after this - # test here, then: -$ git bisect good # if this revision is good, or -$ git bisect bad # if this revision is bad. - # repeat until done. ------------------------------------------------ - -[[making-changes]] -Making changes --------------- - -Make sure git knows who to blame: - ------------------------------------------------- -$ cat >>~/.gitconfig <<\EOF -[user] - name = Your Name Comes Here - email = you@yourdomain.example.com -EOF ------------------------------------------------- - -Select file contents to include in the next commit, then make the -commit: - ------------------------------------------------ -$ git add a.txt # updated file -$ git add b.txt # new file -$ git rm c.txt # old file -$ git commit ------------------------------------------------ - -Or, prepare and create the commit in one step: - ------------------------------------------------ -$ git commit d.txt # use latest content only of d.txt -$ git commit -a # use latest content of all tracked files ------------------------------------------------ - -[[merging]] -Merging -------- - ------------------------------------------------ -$ git merge test # merge branch "test" into the current branch -$ git pull git://example.com/project.git master - # fetch and merge in remote branch -$ git pull . test # equivalent to git merge test ------------------------------------------------ - -[[sharing-your-changes]] -Sharing your changes --------------------- - -Importing or exporting patches: - ------------------------------------------------ -$ git format-patch origin..HEAD # format a patch for each commit - # in HEAD but not in origin -$ git am mbox # import patches from the mailbox "mbox" ------------------------------------------------ - -Fetch a branch in a different git repository, then merge into the -current branch: - ------------------------------------------------ -$ git pull git://example.com/project.git theirbranch ------------------------------------------------ - -Store the fetched branch into a local branch before merging into the -current branch: - ------------------------------------------------ -$ git pull git://example.com/project.git theirbranch:mybranch ------------------------------------------------ - -After creating commits on a local branch, update the remote -branch with your commits: - ------------------------------------------------ -$ git push ssh://example.com/project.git mybranch:theirbranch ------------------------------------------------ - -When remote and local branch are both named "test": - ------------------------------------------------ -$ git push ssh://example.com/project.git test ------------------------------------------------ - -Shortcut version for a frequently used remote repository: - ------------------------------------------------ -$ git remote add example ssh://example.com/project.git -$ git push example test ------------------------------------------------ - -[[repository-maintenance]] -Repository maintenance ----------------------- - -Check for corruption: - ------------------------------------------------ -$ git fsck ------------------------------------------------ +See also <> for a brief overview of git commands, +without any explanation. -Recompress, remove unused cruft: +Also, see <> for ways that you can help make this manual more +complete. ------------------------------------------------ -$ git gc ------------------------------------------------ [[repositories-and-branches]] Repositories and Branches @@ -917,6 +693,25 @@ may be any path to a file tracked by git. Examples -------- +[[counting-commits-on-a-branch]] +Counting the number of commits on a branch +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ + +Suppose you want to know how many commits you've made on "mybranch" +since it diverged from "origin": + +------------------------------------------------- +$ git log --pretty=oneline origin..mybranch | wc -l +------------------------------------------------- + +Alternatively, you may often see this sort of thing done with the +lower-level command gitlink:git-rev-list[1], which just lists the SHA1's +of all the given commits: + +------------------------------------------------- +$ git rev-list origin..mybranch | wc -l +------------------------------------------------- + [[checking-for-equal-branches]] Check whether two branches point at the same history ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ @@ -1035,24 +830,114 @@ available Which shows that e05db0fd is reachable from itself, from v1.5.0-rc1, and from v1.5.0-rc2, but not from v1.5.0-rc0. +[[showing-commits-unique-to-a-branch]] +Showing commits unique to a given branch +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ -[[Developing-with-git]] -Developing with git -=================== +Suppose you would like to see all the commits reachable from the branch +head named "master" but not from any other head in your repository. -[[telling-git-your-name]] -Telling git your name ---------------------- +We can list all the heads in this repository with +gitlink:git-show-ref[1]: -Before creating any commits, you should introduce yourself to git. The -easiest way to do so is to make sure the following lines appear in a -file named .gitconfig in your home directory: +------------------------------------------------- +$ git show-ref --heads +bf62196b5e363d73353a9dcf094c59595f3153b7 refs/heads/core-tutorial +db768d5504c1bb46f63ee9d6e1772bd047e05bf9 refs/heads/maint +a07157ac624b2524a059a3414e99f6f44bebc1e7 refs/heads/master +24dbc180ea14dc1aebe09f14c8ecf32010690627 refs/heads/tutorial-2 +1e87486ae06626c2f31eaa63d26fc0fd646c8af2 refs/heads/tutorial-fixes +------------------------------------------------- ------------------------------------------------- -[user] - name = Your Name Comes Here - email = you@yourdomain.example.com ------------------------------------------------- +We can get just the branch-head names, and remove "master", with +the help of the standard utilities cut and grep: + +------------------------------------------------- +$ git show-ref --heads | cut -d' ' -f2 | grep -v '^refs/heads/master' +refs/heads/core-tutorial +refs/heads/maint +refs/heads/tutorial-2 +refs/heads/tutorial-fixes +------------------------------------------------- + +And then we can ask to see all the commits reachable from master +but not from these other heads: + +------------------------------------------------- +$ gitk master --not $( git show-ref --heads | cut -d' ' -f2 | + grep -v '^refs/heads/master' ) +------------------------------------------------- + +Obviously, endless variations are possible; for example, to see all +commits reachable from some head but not from any tag in the repository: + +------------------------------------------------- +$ gitk ($ git show-ref --heads ) --not $( git show-ref --tags ) +------------------------------------------------- + +(See gitlink:git-rev-parse[1] for explanations of commit-selecting +syntax such as `--not`.) + +[[making-a-release]] +Creating a changelog and tarball for a software release +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ + +The gitlink:git-archive[1] command can create a tar or zip archive from +any version of a project; for example: + +------------------------------------------------- +$ git archive --format=tar --prefix=project/ HEAD | gzip >latest.tar.gz +------------------------------------------------- + +will use HEAD to produce a tar archive in which each filename is +preceded by "prefix/". + +If you're releasing a new version of a software project, you may want +to simultaneously make a changelog to include in the release +announcement. + +Linus Torvalds, for example, makes new kernel releases by tagging them, +then running: + +------------------------------------------------- +$ release-script 2.6.12 2.6.13-rc6 2.6.13-rc7 +------------------------------------------------- + +where release-script is a shell script that looks like: + +------------------------------------------------- +#!/bin/sh +stable="$1" +last="$2" +new="$3" +echo "# git tag v$new" +echo "git archive --prefix=linux-$new/ v$new | gzip -9 > ../linux-$new.tar.gz" +echo "git diff v$stable v$new | gzip -9 > ../patch-$new.gz" +echo "git log --no-merges v$new ^v$last > ../ChangeLog-$new" +echo "git shortlog --no-merges v$new ^v$last > ../ShortLog" +echo "git diff --stat --summary -M v$last v$new > ../diffstat-$new" +------------------------------------------------- + +and then he just cut-and-pastes the output commands after verifying that +they look OK. + +[[Developing-with-git]] +Developing with git +=================== + +[[telling-git-your-name]] +Telling git your name +--------------------- + +Before creating any commits, you should introduce yourself to git. The +easiest way to do so is to make sure the following lines appear in a +file named .gitconfig in your home directory: + +------------------------------------------------ +[user] + name = Your Name Comes Here + email = you@yourdomain.example.com +------------------------------------------------ (See the "CONFIGURATION FILE" section of gitlink:git-config[1] for details on the configuration file.) @@ -1789,31 +1674,30 @@ The final result will be a series of commits, one for each patch in the original mailbox, with authorship and commit log message each taken from the message containing each patch. -[[setting-up-a-public-repository]] -Setting up a public repository ------------------------------- +[[public-repositories]] +Public git repositories +----------------------- -Another way to submit changes to a project is to simply tell the -maintainer of that project to pull from your repository, exactly as -you did in the section "<>". +Another way to submit changes to a project is to tell the maintainer of +that project to pull the changes from your repository using git-pull[1]. +In the section "<>" we described this as a way to get updates from the "main" +repository, but it works just as well in the other direction. -If you and maintainer both have accounts on the same machine, then -then you can just pull changes from each other's repositories -directly; note that all of the commands (gitlink:git-clone[1], -git-fetch[1], git-pull[1], etc.) that accept a URL as an argument -will also accept a local directory name; so, for example, you can -use +If you and the maintainer both have accounts on the same machine, then +you can just pull changes from each other's repositories directly; +commands that accepts repository URLs as arguments will also accept a +local directory name: ------------------------------------------------- $ git clone /path/to/repository $ git pull /path/to/other/repository ------------------------------------------------- -If this sort of setup is inconvenient or impossible, another (more -common) option is to set up a public repository on a public server. -This also allows you to cleanly separate private work in progress -from publicly visible work. +However, the more common way to do this is to maintain a separate public +repository (usually on a different host) for others to pull changes +from. This is usually more convenient, and allows you to cleanly +separate private work in progress from publicly visible work. You will continue to do your day-to-day work in your personal repository, but periodically "push" changes from your personal @@ -1832,32 +1716,52 @@ like this: | they push V their public repo <------------------- their repo -Now, assume your personal repository is in the directory ~/proj. We -first create a new clone of the repository: +[[setting-up-a-public-repository]] +Setting up a public repository +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ + +Assume your personal repository is in the directory ~/proj. We +first create a new clone of the repository and tell git-daemon that it +is meant to be public: ------------------------------------------------- $ git clone --bare ~/proj proj.git +$ touch proj.git/git-daemon-export-ok ------------------------------------------------- The resulting directory proj.git contains a "bare" git repository--it is -just the contents of the ".git" directory, without a checked-out copy of -a working directory. +just the contents of the ".git" directory, without any files checked out +around it. Next, copy proj.git to the server where you plan to host the public repository. You can use scp, rsync, or whatever is most convenient. -If somebody else maintains the public server, they may already have -set up a git service for you, and you may skip to the section +[[exporting-via-git]] +Exporting a git repository via the git protocol +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ + +This is the preferred method. + +If someone else administers the server, they should tell you what +directory to put the repository in, and what git:// url it will appear +at. You can then skip to the section "<>", below. -Otherwise, the following sections explain how to export your newly -created public repository: +Otherwise, all you need to do is start gitlink:git-daemon[1]; it will +listen on port 9418. By default, it will allow access to any directory +that looks like a git directory and contains the magic file +git-daemon-export-ok. Passing some directory paths as git-daemon +arguments will further restrict the exports to those paths. + +You can also run git-daemon as an inetd service; see the +gitlink:git-daemon[1] man page for details. (See especially the +examples section.) [[exporting-via-http]] Exporting a git repository via http ------------------------------------ +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ The git protocol gives better performance and reliability, but on a host with a web server set up, http exports may be simpler to set up. @@ -1889,20 +1793,11 @@ link:howto/setup-git-server-over-http.txt[setup-git-server-over-http] for a slightly more sophisticated setup using WebDAV which also allows pushing over http.) -[[exporting-via-git]] -Exporting a git repository via the git protocol ------------------------------------------------ - -This is the preferred method. - -For now, we refer you to the gitlink:git-daemon[1] man page for -instructions. (See especially the examples section.) - [[pushing-changes-to-a-public-repository]] Pushing changes to a public repository --------------------------------------- +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ -Note that the two techniques outline above (exporting via +Note that the two techniques outlined above (exporting via <> or <>) allow other maintainers to fetch your latest changes, but they do not allow write access, which you will need to update the public repository with the @@ -1954,7 +1849,7 @@ details. [[setting-up-a-shared-repository]] Setting up a shared repository ------------------------------- +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Another way to collaborate is by using a model similar to that commonly used in CVS, where several developers with special rights @@ -1963,8 +1858,8 @@ link:cvs-migration.html[git for CVS users] for instructions on how to set this up. [[setting-up-gitweb]] -Allow web browsing of a repository ----------------------------------- +Allowing web browsing of a repository +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ The gitweb cgi script provides users an easy way to browse your project's files and history without having to install git; see the file @@ -1974,7 +1869,302 @@ gitweb/INSTALL in the git source tree for instructions on setting it up. Examples -------- -TODO: topic branches, typical roles as in everyday.txt, ? +[[maintaining-topic-branches]] +Maintaining topic branches for a Linux subsystem maintainer +~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ + +This describes how Tony Luck uses git in his role as maintainer of the +IA64 architecture for the Linux kernel. + +He uses two public branches: + + - A "test" tree into which patches are initially placed so that they + can get some exposure when integrated with other ongoing development. + This tree is available to Andrew for pulling into -mm whenever he + wants. + + - A "release" tree into which tested patches are moved for final sanity + checking, and as a vehicle to send them upstream to Linus (by sending + him a "please pull" request.) + +He also uses a set of temporary branches ("topic branches"), each +containing a logical grouping of patches. + +To set this up, first create your work tree by cloning Linus's public +tree: + +------------------------------------------------- +$ git clone git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux-2.6.git work +$ cd work +------------------------------------------------- + +Linus's tree will be stored in the remote branch named origin/master, +and can be updated using gitlink:git-fetch[1]; you can track other +public trees using gitlink:git-remote[1] to set up a "remote" and +git-fetch[1] to keep them up-to-date; see <>. + +Now create the branches in which you are going to work; these start out +at the current tip of origin/master branch, and should be set up (using +the --track option to gitlink:git-branch[1]) to merge changes in from +Linus by default. + +------------------------------------------------- +$ git branch --track test origin/master +$ git branch --track release origin/master +------------------------------------------------- + +These can be easily kept up to date using gitlink:git-pull[1] + +------------------------------------------------- +$ git checkout test && git pull +$ git checkout release && git pull +------------------------------------------------- + +Important note! If you have any local changes in these branches, then +this merge will create a commit object in the history (with no local +changes git will simply do a "Fast forward" merge). Many people dislike +the "noise" that this creates in the Linux history, so you should avoid +doing this capriciously in the "release" branch, as these noisy commits +will become part of the permanent history when you ask Linus to pull +from the release branch. + +A few configuration variables (see gitlink:git-config[1]) can +make it easy to push both branches to your public tree. (See +<>.) + +------------------------------------------------- +$ cat >> .git/config <&2 + exit 1 + ;; +esac +------------------------------------------------- + +------------------------------------------------- +==== merge script ==== +# Merge a branch into either the test or release branch + +pname=$0 + +usage() +{ + echo "Usage: $pname branch test|release" 1>&2 + exit 1 +} + +if [ ! -f .git/refs/heads/"$1" ] +then + echo "Can't see branch <$1>" 1>&2 + usage +fi + +case "$2" in +test|release) + if [ $(git log $2..$1 | wc -c) -eq 0 ] + then + echo $1 already merged into $2 1>&2 + exit 1 + fi + git checkout $2 && git pull . $1 + ;; +*) + usage + ;; +esac +------------------------------------------------- + +------------------------------------------------- +==== status script ==== +# report on status of my ia64 GIT tree + +gb=$(tput setab 2) +rb=$(tput setab 1) +restore=$(tput setab 9) + +if [ `git rev-list test..release | wc -c` -gt 0 ] +then + echo $rb Warning: commits in release that are not in test $restore + git log test..release +fi + +for branch in `ls .git/refs/heads` +do + if [ $branch = test -o $branch = release ] + then + continue + fi + + echo -n $gb ======= $branch ====== $restore " " + status= + for ref in test release origin/master + do + if [ `git rev-list $ref..$branch | wc -c` -gt 0 ] + then + status=$status${ref:0:1} + fi + done + case $status in + trl) + echo $rb Need to pull into test $restore + ;; + rl) + echo "In test" + ;; + l) + echo "Waiting for linus" + ;; + "") + echo $rb All done $restore + ;; + *) + echo $rb "<$status>" $restore + ;; + esac + git log origin/master..$branch | git shortlog +done +------------------------------------------------- [[cleaning-up-history]] @@ -3160,12 +3350,454 @@ confusing and scary messages, but it won't actually do anything bad. In contrast, running "git prune" while somebody is actively changing the repository is a *BAD* idea). +[[birdview-on-the-source-code]] +A birds-eye view of Git's source code +------------------------------------- + +It is not always easy for new developers to find their way through Git's +source code. This section gives you a little guidance to show where to +start. + +A good place to start is with the contents of the initial commit, with: + +---------------------------------------------------- +$ git checkout e83c5163 +---------------------------------------------------- + +The initial revision lays the foundation for almost everything git has +today, but is small enough to read in one sitting. + +Note that terminology has changed since that revision. For example, the +README in that revision uses the word "changeset" to describe what we +now call a <>. + +Also, we do not call it "cache" any more, but "index", however, the +file is still called `cache.h`. Remark: Not much reason to change it now, +especially since there is no good single name for it anyway, because it is +basically _the_ header file which is included by _all_ of Git's C sources. + +If you grasp the ideas in that initial commit, you should check out a +more recent version and skim `cache.h`, `object.h` and `commit.h`. + +In the early days, Git (in the tradition of UNIX) was a bunch of programs +which were extremely simple, and which you used in scripts, piping the +output of one into another. This turned out to be good for initial +development, since it was easier to test new things. However, recently +many of these parts have become builtins, and some of the core has been +"libified", i.e. put into libgit.a for performance, portability reasons, +and to avoid code duplication. + +By now, you know what the index is (and find the corresponding data +structures in `cache.h`), and that there are just a couple of object types +(blobs, trees, commits and tags) which inherit their common structure from +`struct object`, which is their first member (and thus, you can cast e.g. +`(struct object *)commit` to achieve the _same_ as `&commit->object`, i.e. +get at the object name and flags). + +Now is a good point to take a break to let this information sink in. + +Next step: get familiar with the object naming. Read <>. +There are quite a few ways to name an object (and not only revisions!). +All of these are handled in `sha1_name.c`. Just have a quick look at +the function `get_sha1()`. A lot of the special handling is done by +functions like `get_sha1_basic()` or the likes. + +This is just to get you into the groove for the most libified part of Git: +the revision walker. + +Basically, the initial version of `git log` was a shell script: + +---------------------------------------------------------------- +$ git-rev-list --pretty $(git-rev-parse --default HEAD "$@") | \ + LESS=-S ${PAGER:-less} +---------------------------------------------------------------- + +What does this mean? + +`git-rev-list` is the original version of the revision walker, which +_always_ printed a list of revisions to stdout. It is still functional, +and needs to, since most new Git programs start out as scripts using +`git-rev-list`. + +`git-rev-parse` is not as important any more; it was only used to filter out +options that were relevant for the different plumbing commands that were +called by the script. + +Most of what `git-rev-list` did is contained in `revision.c` and +`revision.h`. It wraps the options in a struct named `rev_info`, which +controls how and what revisions are walked, and more. + +The original job of `git-rev-parse` is now taken by the function +`setup_revisions()`, which parses the revisions and the common command line +options for the revision walker. This information is stored in the struct +`rev_info` for later consumption. You can do your own command line option +parsing after calling `setup_revisions()`. After that, you have to call +`prepare_revision_walk()` for initialization, and then you can get the +commits one by one with the function `get_revision()`. + +If you are interested in more details of the revision walking process, +just have a look at the first implementation of `cmd_log()`; call +`git-show v1.3.0~155^2~4` and scroll down to that function (note that you +no longer need to call `setup_pager()` directly). + +Nowadays, `git log` is a builtin, which means that it is _contained_ in the +command `git`. The source side of a builtin is + +- a function called `cmd_`, typically defined in `builtin-.c`, + and declared in `builtin.h`, + +- an entry in the `commands[]` array in `git.c`, and + +- an entry in `BUILTIN_OBJECTS` in the `Makefile`. + +Sometimes, more than one builtin is contained in one source file. For +example, `cmd_whatchanged()` and `cmd_log()` both reside in `builtin-log.c`, +since they share quite a bit of code. In that case, the commands which are +_not_ named like the `.c` file in which they live have to be listed in +`BUILT_INS` in the `Makefile`. + +`git log` looks more complicated in C than it does in the original script, +but that allows for a much greater flexibility and performance. + +Here again it is a good point to take a pause. + +Lesson three is: study the code. Really, it is the best way to learn about +the organization of Git (after you know the basic concepts). + +So, think about something which you are interested in, say, "how can I +access a blob just knowing the object name of it?". The first step is to +find a Git command with which you can do it. In this example, it is either +`git show` or `git cat-file`. + +For the sake of clarity, let's stay with `git cat-file`, because it + +- is plumbing, and + +- was around even in the initial commit (it literally went only through + some 20 revisions as `cat-file.c`, was renamed to `builtin-cat-file.c` + when made a builtin, and then saw less than 10 versions). + +So, look into `builtin-cat-file.c`, search for `cmd_cat_file()` and look what +it does. + +------------------------------------------------------------------ + git_config(git_default_config); + if (argc != 3) + usage("git-cat-file [-t|-s|-e|-p|] "); + if (get_sha1(argv[2], sha1)) + die("Not a valid object name %s", argv[2]); +------------------------------------------------------------------ + +Let's skip over the obvious details; the only really interesting part +here is the call to `get_sha1()`. It tries to interpret `argv[2]` as an +object name, and if it refers to an object which is present in the current +repository, it writes the resulting SHA-1 into the variable `sha1`. + +Two things are interesting here: + +- `get_sha1()` returns 0 on _success_. This might surprise some new + Git hackers, but there is a long tradition in UNIX to return different + negative numbers in case of different errors -- and 0 on success. + +- the variable `sha1` in the function signature of `get_sha1()` is `unsigned + char \*`, but is actually expected to be a pointer to `unsigned + char[20]`. This variable will contain the 160-bit SHA-1 of the given + commit. Note that whenever a SHA-1 is passed as `unsigned char \*`, it + is the binary representation, as opposed to the ASCII representation in + hex characters, which is passed as `char *`. + +You will see both of these things throughout the code. + +Now, for the meat: + +----------------------------------------------------------------------------- + case 0: + buf = read_object_with_reference(sha1, argv[1], &size, NULL); +----------------------------------------------------------------------------- + +This is how you read a blob (actually, not only a blob, but any type of +object). To know how the function `read_object_with_reference()` actually +works, find the source code for it (something like `git grep +read_object_with | grep ":[a-z]"` in the git repository), and read +the source. + +To find out how the result can be used, just read on in `cmd_cat_file()`: + +----------------------------------- + write_or_die(1, buf, size); +----------------------------------- + +Sometimes, you do not know where to look for a feature. In many such cases, +it helps to search through the output of `git log`, and then `git show` the +corresponding commit. + +Example: If you know that there was some test case for `git bundle`, but +do not remember where it was (yes, you _could_ `git grep bundle t/`, but that +does not illustrate the point!): + +------------------------ +$ git log --no-merges t/ +------------------------ + +In the pager (`less`), just search for "bundle", go a few lines back, +and see that it is in commit 18449ab0... Now just copy this object name, +and paste it into the command line + +------------------- +$ git show 18449ab0 +------------------- + +Voila. + +Another example: Find out what to do in order to make some script a +builtin: + +------------------------------------------------- +$ git log --no-merges --diff-filter=A builtin-*.c +------------------------------------------------- + +You see, Git is actually the best tool to find out about the source of Git +itself! + [[glossary]] include::glossary.txt[] +[[git-quick-start]] +Appendix A: Git Quick Start +=========================== + +This is a quick summary of the major commands; the following chapters +will explain how these work in more detail. + +[[quick-creating-a-new-repository]] +Creating a new repository +------------------------- + +From a tarball: + +----------------------------------------------- +$ tar xzf project.tar.gz +$ cd project +$ git init +Initialized empty Git repository in .git/ +$ git add . +$ git commit +----------------------------------------------- + +From a remote repository: + +----------------------------------------------- +$ git clone git://example.com/pub/project.git +$ cd project +----------------------------------------------- + +[[managing-branches]] +Managing branches +----------------- + +----------------------------------------------- +$ git branch # list all local branches in this repo +$ git checkout test # switch working directory to branch "test" +$ git branch new # create branch "new" starting at current HEAD +$ git branch -d new # delete branch "new" +----------------------------------------------- + +Instead of basing new branch on current HEAD (the default), use: + +----------------------------------------------- +$ git branch new test # branch named "test" +$ git branch new v2.6.15 # tag named v2.6.15 +$ git branch new HEAD^ # commit before the most recent +$ git branch new HEAD^^ # commit before that +$ git branch new test~10 # ten commits before tip of branch "test" +----------------------------------------------- + +Create and switch to a new branch at the same time: + +----------------------------------------------- +$ git checkout -b new v2.6.15 +----------------------------------------------- + +Update and examine branches from the repository you cloned from: + +----------------------------------------------- +$ git fetch # update +$ git branch -r # list + origin/master + origin/next + ... +$ git checkout -b masterwork origin/master +----------------------------------------------- + +Fetch a branch from a different repository, and give it a new +name in your repository: + +----------------------------------------------- +$ git fetch git://example.com/project.git theirbranch:mybranch +$ git fetch git://example.com/project.git v2.6.15:mybranch +----------------------------------------------- + +Keep a list of repositories you work with regularly: + +----------------------------------------------- +$ git remote add example git://example.com/project.git +$ git remote # list remote repositories +example +origin +$ git remote show example # get details +* remote example + URL: git://example.com/project.git + Tracked remote branches + master next ... +$ git fetch example # update branches from example +$ git branch -r # list all remote branches +----------------------------------------------- + + +[[exploring-history]] +Exploring history +----------------- + +----------------------------------------------- +$ gitk # visualize and browse history +$ git log # list all commits +$ git log src/ # ...modifying src/ +$ git log v2.6.15..v2.6.16 # ...in v2.6.16, not in v2.6.15 +$ git log master..test # ...in branch test, not in branch master +$ git log test..master # ...in branch master, but not in test +$ git log test...master # ...in one branch, not in both +$ git log -S'foo()' # ...where difference contain "foo()" +$ git log --since="2 weeks ago" +$ git log -p # show patches as well +$ git show # most recent commit +$ git diff v2.6.15..v2.6.16 # diff between two tagged versions +$ git diff v2.6.15..HEAD # diff with current head +$ git grep "foo()" # search working directory for "foo()" +$ git grep v2.6.15 "foo()" # search old tree for "foo()" +$ git show v2.6.15:a.txt # look at old version of a.txt +----------------------------------------------- + +Search for regressions: + +----------------------------------------------- +$ git bisect start +$ git bisect bad # current version is bad +$ git bisect good v2.6.13-rc2 # last known good revision +Bisecting: 675 revisions left to test after this + # test here, then: +$ git bisect good # if this revision is good, or +$ git bisect bad # if this revision is bad. + # repeat until done. +----------------------------------------------- + +[[making-changes]] +Making changes +-------------- + +Make sure git knows who to blame: + +------------------------------------------------ +$ cat >>~/.gitconfig <<\EOF +[user] + name = Your Name Comes Here + email = you@yourdomain.example.com +EOF +------------------------------------------------ + +Select file contents to include in the next commit, then make the +commit: + +----------------------------------------------- +$ git add a.txt # updated file +$ git add b.txt # new file +$ git rm c.txt # old file +$ git commit +----------------------------------------------- + +Or, prepare and create the commit in one step: + +----------------------------------------------- +$ git commit d.txt # use latest content only of d.txt +$ git commit -a # use latest content of all tracked files +----------------------------------------------- + +[[merging]] +Merging +------- + +----------------------------------------------- +$ git merge test # merge branch "test" into the current branch +$ git pull git://example.com/project.git master + # fetch and merge in remote branch +$ git pull . test # equivalent to git merge test +----------------------------------------------- + +[[sharing-your-changes]] +Sharing your changes +-------------------- + +Importing or exporting patches: + +----------------------------------------------- +$ git format-patch origin..HEAD # format a patch for each commit + # in HEAD but not in origin +$ git am mbox # import patches from the mailbox "mbox" +----------------------------------------------- + +Fetch a branch in a different git repository, then merge into the +current branch: + +----------------------------------------------- +$ git pull git://example.com/project.git theirbranch +----------------------------------------------- + +Store the fetched branch into a local branch before merging into the +current branch: + +----------------------------------------------- +$ git pull git://example.com/project.git theirbranch:mybranch +----------------------------------------------- + +After creating commits on a local branch, update the remote +branch with your commits: + +----------------------------------------------- +$ git push ssh://example.com/project.git mybranch:theirbranch +----------------------------------------------- + +When remote and local branch are both named "test": + +----------------------------------------------- +$ git push ssh://example.com/project.git test +----------------------------------------------- + +Shortcut version for a frequently used remote repository: + +----------------------------------------------- +$ git remote add example ssh://example.com/project.git +$ git push example test +----------------------------------------------- + +[[repository-maintenance]] +Repository maintenance +---------------------- + +Check for corruption: + +----------------------------------------------- +$ git fsck +----------------------------------------------- + +Recompress, remove unused cruft: + +----------------------------------------------- +$ git gc +----------------------------------------------- + + [[todo]] -Notes and todo list for this manual -=================================== +Appendix B: Notes and todo list for this manual +=============================================== This is a work in progress. -- 2.26.2