Setup and Config
Getting and Creating Projects
Branching and Merging
Sharing and Updating Projects
Inspection and Comparison
- Command-line interface conventions
- Everyday Git
- Frequently Asked Questions (FAQ)
- All guides...
- 2.35.1 → 2.40.0 no changes
- 2.35.0 01/24/22
- 2.29.1 → 2.34.7 no changes
- 2.29.0 10/19/20
- 2.27.1 → 2.28.1 no changes
- 2.27.0 06/01/20
- 2.25.1 → 2.26.3 no changes
- 2.25.0 01/13/20
A submodule is a repository embedded inside another repository. The submodule has its own history; the repository it is embedded in is called a superproject.
On the filesystem, a submodule usually (but not always - see FORMS below)
consists of (i) a Git directory located under the
directory of its superproject, (ii) a working directory inside the
superproject’s working directory, and a
.git file at the root of
the submodule’s working directory pointing to (i).
Assuming the submodule has a Git directory at
and a working directory at
path/to/bar/, the superproject tracks the
submodule via a
gitlink entry in the tree at
path/to/bar and an entry
.gitmodules file (see gitmodules) of the form
submodule.foo.path = path/to/bar.
gitlink entry contains the object name of the commit that the
superproject expects the submodule’s working directory to be at.
submodule.foo.* in the
.gitmodules file gives additional
hints to Git’s porcelain layer. For example, the
setting specifies where to obtain the submodule.
Submodules can be used for at least two different use cases:
Using another project while maintaining independent history. Submodules allow you to contain the working tree of another project within your own working tree while keeping the history of both projects separate. Also, since submodules are fixed to an arbitrary version, the other project can be independently developed without affecting the superproject, allowing the superproject project to fix itself to new versions only when desired.
Splitting a (logically single) project into multiple repositories and tying them back together. This can be used to overcome current limitations of Git’s implementation to have finer grained access:
Size of the Git repository: In its current form Git scales up poorly for large repositories containing content that is not compressed by delta computation between trees. For example, you can use submodules to hold large binary assets and these repositories can be shallowly cloned such that you do not have a large history locally.
Transfer size: In its current form Git requires the whole working tree present. It does not allow partial trees to be transferred in fetch or clone. If the project you work on consists of multiple repositories tied together as submodules in a superproject, you can avoid fetching the working trees of the repositories you are not interested in.
Access control: By restricting user access to submodules, this can be used to implement read/write policies for different users.
The configuration of submodules
Submodule operations can be configured using the following mechanisms (from highest to lowest precedence):
The command line for those commands that support taking submodules as part of their pathspecs. Most commands have a boolean flag
--recurse-submoduleswhich specify whether to recurse into submodules. Examples are
checkout. Some commands take enums, such as
push, where you can specify how submodules are affected.
The configuration inside the submodule. This includes
$GIT_DIR/configin the submodule, but also settings in the tree such as a
.gitignorefiles that specify behavior of commands inside the submodule.
For example an effect from the submodule’s
.gitignorefile would be observed when you run
git status --ignore-submodules=nonein the superproject. This collects information from the submodule’s working directory by running
statusin the submodule while paying attention to the
.gitignorefile of the submodule.
$GIT_DIR/configfile would come into play when running
git push --recurse-submodules=checkin the superproject, as this would check if the submodule has any changes not published to any remote. The remotes are configured in the submodule as usual in the
The configuration file
$GIT_DIR/configin the superproject. Git only recurses into active submodules (see "ACTIVE SUBMODULES" section below).
If the submodule is not yet initialized, then the configuration inside the submodule does not exist yet, so where to obtain the submodule from is configured here for example.
.gitmodulesfile inside the superproject. A project usually uses this file to suggest defaults for the upstream collection of repositories for the mapping that is required between a submodule’s name and its path.
This file mainly serves as the mapping between the name and path of submodules in the superproject, such that the submodule’s Git directory can be located.
If the submodule has never been initialized, this is the only place where submodule configuration is found. It serves as the last fallback to specify where to obtain the submodule from.
Submodules can take the following forms:
The basic form described in DESCRIPTION with a Git directory, a working directory, a
gitlink, and a
"Old-form" submodule: A working directory with an embedded
.gitdirectory, and the tracking
.gitmodulesentry in the superproject. This is typically found in repositories generated using older versions of Git.
It is possible to construct these old form repositories manually.
When deinitialized or deleted (see below), the submodule’s Git directory is automatically moved to
$GIT_DIR/modules/<name>/of the superproject.
Deinitialized submodule: A
gitlink, and a
.gitmodulesentry, but no submodule working directory. The submodule’s Git directory may be there as after deinitializing the Git directory is kept around. The directory which is supposed to be the working directory is empty instead.
A submodule can be deinitialized by running
git submodule deinit. Besides emptying the working directory, this command only modifies the superproject’s
$GIT_DIR/configfile, so the superproject’s history is not affected. This can be undone using
git submodule init.
Deleted submodule: A submodule can be deleted by running
git rm <submodule path> && git commit. This can be undone using
The deletion removes the superproject’s tracking data, which are both the
gitlinkentry and the section in the
.gitmodulesfile. The submodule’s working directory is removed from the file system, but the Git directory is kept around as it to make it possible to checkout past commits without requiring fetching from another repository.
To completely remove a submodule, manually delete
A submodule is considered active,
submodule.<name>.activeis set to
if the submodule’s path matches the pathspec in
and these are evaluated in this order.
[submodule "foo"] active = false url = https://example.org/foo [submodule "bar"] active = true url = https://example.org/bar [submodule "baz"] url = https://example.org/baz
In the above config only the submodule bar and baz are active, bar due to (1) and baz due to (3). foo is inactive because (1) takes precedence over (3)
Note that (3) is a historical artefact and will be ignored if the
(1) and (2) specify that the submodule is not active. In other words,
if we have a
submodule.<name>.active set to
false or if the
submodule’s path is excluded in the pathspec in
url doesn’t matter whether it is present or not. This is illustrated in
the example that follows.
[submodule "foo"] active = true url = https://example.org/foo [submodule "bar"] url = https://example.org/bar [submodule "baz"] url = https://example.org/baz [submodule "bob"] ignore = true [submodule] active = b* active = :(exclude) baz
In here all submodules except baz (foo, bar, bob) are active. foo due to its own active flag and all the others due to the submodule active pathspec, which specifies that any submodule starting with b except baz are also active, regardless of the presence of the .url field.
Workflow for a third party library
# Add a submodule git submodule add <URL> <path>
# Occasionally update the submodule to a new version: git -C <path> checkout <new version> git add <path> git commit -m "update submodule to new version"
# See the list of submodules in a superproject git submodule status
# See FORMS on removing submodules
Workflow for an artificially split repo
# Enable recursion for relevant commands, such that # regular commands recurse into submodules by default git config --global submodule.recurse true
# Unlike most other commands below, clone still needs # its own recurse flag: git clone --recurse <URL> <directory> cd <directory>
# Get to know the code: git grep foo git ls-files --recurse-submodules
# Get new code git fetch git pull --rebase
# Change worktree git checkout git reset
When cloning or pulling a repository containing submodules the submodules
will not be checked out by default; you can instruct
clone to recurse
into submodules. The
update subcommands of
will maintain submodules checked out and at an appropriate revision in
your working tree. Alternatively you can set
submodule.recurse to have
checkout recursing into submodules (note that
affects other Git commands, see git-config for a complete list).
Part of the git suite