29. Python Developer FAQ — Python Developer's Guide

29.2.1. For everyone

The following FAQs are intended for both core developers and contributors.

29.2.1.2. I already know how to use Git, can I use that instead?

While the main workflow for core developers requires Mercurial, if you just want to generate patches with git diff and post them to the issue tracker, there is a semi-offical read-only git mirror of the main CPython repository. To create a local clone based on this mirror rather than the main repository:

git clone git://github.com/python/cpython

The mirror’s master branch tracks the main repository’s default branch, while the maintenance branch names (2.7, 3.4, etc) are mapped directly.

Please only use this approach if you’re already an experienced Git user and don’t require assistance with the specifics of version control commands. All other parts of this developer’s guide assume the use of Mercurial for local version control.

29.2.1.3. What do I need to use Mercurial?

29.2.1.3.1. UNIX

First, you need to download Mercurial. Most UNIX-based operating systems have binary packages available. Most package management systems also have native Mercurial packages available.

If you have push rights, you need OpenSSH. This is needed to verify your identity when performing commits. As with Mercurial, binary packages are typically available either online or through the platform’s package management system.

Mercurial does not use its own compression via SSH because it is better to enable compression at the SSH level. Enabling SSH compression can make cloning a remote repository much faster. You can configure it in your ~/.ssh/config file; for example:

Host hg.python.org
  Compression yes
29.2.1.3.2. Windows

The recommended option on Windows is to download TortoiseHg which integrates with Windows Explorer and also bundles the command line client (meaning you can type hg in a DOS box). Note that most entries in this FAQ only cover the command line client in detail - refer to the TortoiseHg documentation for assistance with its graphical interface.

If you have push rights, you need to configure Mercurial to work with your SSH keys. For that, open your Mercurial configuration file (you can do so by opening the TortoiseHg Global Settings dialog and then clicking “Edit File”). If there is no [ui] section, create it by typing just that on a line by itself. Then add the following line:

ssh = TortoisePlink.exe -ssh -2 -C -i C:\path\to\yourkey.ppk

where C:\path\to\yourkey.ppk should be replaced with the actual path to your SSH private key.

Note

If your private key is in OpenSSH format, you must first convert it to PuTTY format by loading it into PuTTYgen.

29.2.1.4. What’s a working copy? What’s a repository?

Mercurial is a “distributed” version control system. This means that each participant, even casual contributors, download a complete copy (called a clone, since it is obtained by calling hg clone) of the central repository which can be treated as a stand-alone repository for all purposes. That copy is called in the FAQ the local repository, to differentiate with any remote repository you might also interact with.

But you don’t modify files directly in the local repository; Mercurial doesn’t allow for it. You modify files in what’s called the working copy associated with your local repository: you also run compilations and tests there. Once you are satisfied with your changes, you can commit them; committing records the changes as a new revision in the local repository.

Changes in your local repository don’t get automatically shared with the rest of the world. Mercurial ensures that you have to do so explicitly (this allows you to experiment quite freely with multiple branches of development, all on your private computer). The main commands for doing so are hg pull and hg push.

29.2.1.5. Which branches are in my local repository?

Typing hg branches displays the open branches in your local repository:

$ hg branches
default                    93085:030fda7b1de8
2.7                        93060:7ba47bbfe38d
3.5                        99283:4d5417444961 (inactive)
3.4                        93082:5fd481150b35 (inactive)
3.3                        93079:cda907a02a80 (inactive)
3.2                        92975:eac54f7a8018 (inactive)

29.2.1.6. Why are some branches marked “inactive”?

Assuming you get the following output:

$ hg branches
default                    93085:030fda7b1de8
3.5                        99283:4d5417444961 (inactive)

This means all changesets in the “3.5” branch have been merged into the “default” branch (or any other branch, if such exists).

29.2.1.7. Which branch is currently checked out in my working copy?

Use:

Or to get more information:

$ hg summary
parent: 68026:f12ef116dd10 tip
 In FTP.close() method, make sure to also close the socket object, not only the file.
branch: default
commit: (clean)
update: (current)

29.2.1.8. How do I switch between branches inside my working copy?

Simply use hg update to checkout another branch in the current directory:

$ hg branch
default
$ hg update 3.5
86 files updated, 0 files merged, 11 files removed, 0 files unresolved
$ hg branch
3.5

Adding the -v option to hg update will list all updated files.

Note that, due to some previously built executables being used as a part of the build process, you may sometimes run into issues when attempting to switch between Python 2.x and Python 3.x branches. In these cases, it is best to run a make distclean to ensure that all previously built files are removed.

29.2.1.9. I want to keep a separate working copy per development branch, is it possible?

There are two ways:

  1. Use the “share extension” as described in the Multiple Clones Approach section;
  2. Create several clones of your local repository;

If you want to use the second way, you can do:

$ hg clone cpython py35
updating to branch default
3434 files updated, 0 files merged, 0 files removed, 0 files unresolved
$ cd py35
$ hg update 3.5
86 files updated, 0 files merged, 11 files removed, 0 files unresolved

The current branch in a working copy is “sticky”: if you pull in some new changes, hg update will update to the head of the current branch.

29.2.1.11. How do I create a shorthand alias for a remote repository?

In your global .hgrc file add a section similar to the following:

[paths]
dg = ssh://hg@hg.python.org/devguide

This example creates a dg alias for the devguide repository on hg.python.org. This allows “dg” to be entered instead of the full URL for commands taking a repository argument (e.g. hg pull dg or hg outgoing dg).

Anywhere that <remote repository> is used in the commands in this FAQ, hg should accept an alias in place of a complete remote URL.

29.2.1.12. How do I compare my local repository to a remote repository?

To display the list of changes that are in your local repository, but not in the remote, use:

hg outgoing <remote repository>

This is the list of changes that will be sent if you call hg push <remote repository>. It does not include any uncommitted changes in your working copy!

Conversely, for the list of changes that are in the remote repository but not in the local, use:

hg incoming <remote repository>

This is the list of changes that will be retrieved if you call hg pull <remote repository>.

Note

In most daily use, you will work against the default remote repository, and therefore simply type hg outgoing and hg incoming.

In this case, you can also get a synthetic summary using hg summary --remote.

29.2.1.13. How do I update my local repository to be in sync with a remote repository?

Run:

hg pull <remote repository>

from the repository you wish to pull the latest changes into. Most of the time, that repository is a clone of the repository you want to pull from, so you can simply type:

This doesn’t update your working copy, though. See below:

29.2.1.14. How do I update my working copy with the latest changes?

Do:

This will update your working copy with the latest changes on the current branch. If you had uncommitted changes in your working copy, they will be merged in.

If you find yourself typing often hg pull followed by hg update, be aware that you can combine them in a single command:

29.2.1.15. How do I apply a patch?

If you want to try out or review a patch generated using Mercurial, do:

patch -p1 < somework.patch

This will apply the changes in your working copy without committing them. If the patch was not created by Mercurial (for example, a patch created by Subversion and thus lacking any a/b directory prefixes in the patch), replace -p1 with -p0.

If the patch contains renames, deletions or copies, and you intend committing it after your review, you might prefer using:

hg import --no-commit somework.patch

If you want to work on the patch using mq (Mercurial Queues), type instead:

hg qimport somework.patch

This will create a patch in your queue with a name that matches the filename. You can use the -n argument to specify a different name. To have the patch applied to the working copy, type:

Finally, to delete the patch, first un-apply it if necessary using hg qpop, then do:

hg qdelete somework.patch

29.2.1.16. How do I solve conflicts when applying a patch fails?

The standard patch command, as well as hg import, will produce unhelpful *.rej files when it fails applying parts of a patch. We suggest you try the mpatch utility, which can help resolve a number of common causes of patch rejects.

To make use of mpatch transparent, you can define a shell alias in one of your startup files. For example, if you want it to open the kdiff3 merge program to fix failing patch hunks:

alias patch='mpatch --merge=kdiff3'

or if you want it to automatically solve conflicts by using heuristics:

alias patch='mpatch --auto --no-merge'

29.2.1.17. How do I add a file or directory to the repository?

Simply specify the path to the file or directory to add and run:

If PATH is a directory, Mercurial will recursively add any files in that directory and its descendants.

If you want Mercurial to figure out by itself which files should be added and/or removed, just run:

Be careful though, as it might add some files that are not desired in the repository (such as build products, cache files, or other data).

You will then need to run hg commit (as discussed below) to commit the file(s) to your local repository.

29.2.1.18. What’s the best way to split a file into several files?

To split a file into several files (e.g. a module converted to a package or a long doc file divided in two separate documents) use hg copy:

hg copy module.rst module2.rst

and then remove the parts that are not necessary from module.rst and module2.rst. This allows Mercurial to know that the content of module2.rst used to be in module.rst, and will make subsequent merges easier. If necessary, you can also use hg copy several times.

If you simply create module2.rst, add it with hg add, and copy part of the content from module.rst, Mercurial won’t know that the two file are related.

29.2.1.19. How do I delete a file or directory in the repository?

Specify the path to be removed with:

This will remove the file or the directory from your working copy; you will have to commit your changes for the removal to be recorded in your local repository.

29.2.1.20. What files are modified in my working copy?

Running:

will list any pending changes in the working copy. These changes will get committed to the local repository if you issue an hg commit without specifying any path.

Some key indicators that can appear in the first column of output are:

A Scheduled to be added
R Scheduled to be removed
M Modified locally
? Not under version control

If you want a line-by-line listing of the differences, use:

29.2.1.21. How do I revert a file I have modified back to the version in the repository?

Running:

will revert PATH to its version in the repository, throwing away any changes you made locally. If you run:

from the root of your working copy it will recursively restore everything to match up with the repository.

29.2.1.22. How do I find out who edited or what revision changed a line last?

You want:

This will output to stdout every line of the file along with which revision last modified that line. When you have the revision number, it is then easy to display it in detail.

29.2.1.23. How can I see a list of log messages for a file or specific revision?

To see the history of changes for a specific file, run:

That will list all messages of revisions which modified the file specified in PATH. If PATH is omitted, all revisions are listed.

If you want to display line-by-line differences for each revision as well, add the -p option:

If you want to view the differences for a specific revision, run:

hg log -vp -r <revision number>

29.2.1.24. How can I see the changeset graph in my repository?

In Mercurial repositories, changesets don’t form a simple list, but rather a graph: every changeset has one or two parents (it’s called a merge changeset in the latter case), and can have any number of children.

The graphlog extension is very useful for examining the structure of the changeset graph. It is bundled with Mercurial.

Graphical tools, such as TortoiseHG, will display the changeset graph by default.

29.2.1.25. How do I update to a specific release tag?

Run:

to get a list of tags. To update your working copy to a specific tag, use:

29.2.1.26. How do I find which changeset introduced a bug or regression?

hg bisect, as the name indicates, helps you do a bisection of a range of changesets.

You need two changesets to start the search: one that is “good” (doesn’t have the bug), and one that is “bad” (has the bug). Usually, you have just noticed the bug in your working copy, so you can start with:

Then you must update to a previous changeset that doesn’t have the bug. You can conveniently choose a faraway changeset (for example a former release), and check that it is indeed “good”. Then type:

Mercurial will automatically bisect so as to narrow the range of possible culprits, until a single changeset is isolated. Each time Mercurial presents you with a new changeset, re-compile Python and run the offending test, for example:

make -j2
./python -m test -uall test_sometest

Then, type either hg bisect --good or hg bisect --bad depending on whether the test succeeded or failed.

29.2.1.27. How come feature XYZ isn’t available in Mercurial?

Mercurial comes with many bundled extensions which can be explicitly enabled. You can get a list of them by typing hg help extensions. Some of these extensions, such as color, can prettify output; others, such as fetch or graphlog, add new Mercurial commands.

There are also many configuration options to tweak various aspects of the command line and other Mercurial behaviour; typing man hgrc displays their documentation inside your terminal.

In the end, please refer to the Mercurial wiki, especially the pages about extensions (including third-party ones) and the tips and tricks.

29.2.2. For core developers

These FAQs are intended mainly for core developers.

29.2.2.1. How do I commit a change to a file?

To commit any changes to a file (which includes adding a new file or deleting an existing one), you use the command:

PATH is optional: if it is omitted, all changes in your working copy will be committed to the local repository. When you commit, be sure that all changes are desired by reviewing them first; also, when making commits that you intend to push to public repositories, you should not commit together unrelated changes.

To abort a commit that you are in the middle of, leave the message empty (i.e., close the text editor without adding any text for the message). Mercurial will then abort the commit operation so that you can try again later.

Once a change is committed to your local repository, it is still only visible by you. This means you are free to experiment with as many local commits you feel like.

Note

If you do not like the default text editor Mercurial uses for entering commit messages, you may specify a different editor, either by changing the EDITOR environment variable or by setting a Mercurial-specific editor in your global .hgrc with the editor option in the [ui] section.

29.2.2.2. How do I solve merge conflicts?

The easiest way is to install KDiff3 — Mercurial will open it automatically in case of conflicts, and you can then use it to solve the conflicts and save the resulting file(s). KDiff3 will also take care of marking the conflicts as resolved.

If you don’t use a merge tool, you can use hg resolve --list to list the conflicting files, resolve the conflicts manually, and the use hg resolve --mark <file path> to mark these conflicts as resolved. You can also use hg resolve -am to mark all the conflicts as resolved.

Note

Mercurial will use KDiff3 automatically if it’s installed and it can find it — you don’t need to change any settings. KDiff3 is also already included in the installer of TortoiseHg. For more information, see http://mercurial.selenic.com/wiki/KDiff3.

29.2.2.3. How do I make a null merge?

If you committed something (e.g. on 3.5) that shouldn’t be ported on newer branches (e.g. on default), you have to do a null merge:

cd 3.x
hg merge 3.5
hg revert -ar default
hg resolve -am  # needed only if the merge created conflicts
hg ci -m '#12345: null merge with 3.5.'

Before committing, hg status should list all the merged files as M, but hg diff should produce no output. This will record the merge without actually changing the content of the files.

29.2.2.4. I got “abort: push creates new remote heads!” while pushing, what do I do?

If you see this message while pushing, it means that you committed something on a clone that was not up to date, thus creating a new head. This usually happens for two reasons:

  1. You forgot to run hg pull and/or hg up before committing;
  2. Someone else pushed on the main repo just before you, causing a push race;

First of all you should pull the new changesets using hg pull. Then you can use hg heads to see which branches have multiple heads.

If only one branch has multiple heads, you can do:

cd default
hg heads .
hg up csid-of-the-other-head
hg merge
hg ci -m 'Merge heads.'

hg heads . will show you the two heads of the current branch: the one you pulled and the one you created with your commit (you can also specify a branch with hg heads <branch>). While not strictly necessary, it is highly recommended to switch to the other head before merging. This way you will be merging only your changeset with the rest, and in case of conflicts it will be a lot easier.

If more than one branch has multiple heads, you have to repeat these steps for each branch. Since this creates new changesets, you will also have to merge them between branches. For example, if both 3.5 and default have multiple heads, you should first merge heads in 3.5, then merge heads in default, and finally merge 3.5 with default using hg merge 3.5 as usual.

In order to avoid this, you should always remember to pull and update before committing.

29.2.2.5. How do I undo the changes made in a recent commit?

First, this should not happen if you take the habit of reviewing changes before committing them.

In any case, run:

hg backout <revision number>

This will modify your working copy so that all changes in <revision number> (including added or deleted files) are undone. You then need to commit these changes so that the backout gets permanently recorded.

Note

These instructions are for Mercurial 1.7 and higher. hg backout has a slightly different behaviour in versions before 1.7.