Erik Elfström 0179ca7a62 clean: improve performance when removing lots of directories
"git clean" uses resolve_gitlink_ref() to check for the presence of
nested git repositories, but it has the drawback of creating a
ref_cache entry for every directory that should potentially be
cleaned. The linear search through the ref_cache list causes a massive
performance hit for large number of directories.

Modify clean.c:remove_dirs to use setup.c:is_git_directory and
setup.c:read_gitfile_gently instead.

Both these functions will open files and parse contents when they find
something that looks like a git repository. This is ok from a
performance standpoint since finding repository candidates should be
comparatively rare.

Using is_git_directory and read_gitfile_gently should give a more
standardized check for what is and what isn't a git repository but
also gives three behavioral changes.

The first change is that we will now detect and avoid cleaning empty
nested git repositories (only init run). This is desirable.

Second, we will no longer die when cleaning a file named ".git" with
garbage content (it will be cleaned instead). This is also desirable.

The last change is that we will detect and avoid cleaning empty bare
repositories that have been placed in a directory named ".git". This
is not desirable but should have no real user impact since we already
fail to clean non-empty bare repositories in the same scenario. This
is thus deemed acceptable.

On top of this we add some extra precautions. If read_gitfile_gently
fails to open the git file, read the git file or verify the path in
the git file we assume that the path with the git file is a valid
repository and avoid cleaning.

Update t7300 to reflect these changes in behavior.

The time to clean an untracked directory containing 100000 sub
directories went from 61s to 1.7s after this change.

Helped-by: Jeff King <peff@peff.net>
Signed-off-by: Erik Elfström <erik.elfstrom@gmail.com>
Signed-off-by: Junio C Hamano <gitster@pobox.com>
2015-06-15 13:14:24 -07:00
2014-02-27 14:01:48 -08:00
2015-05-26 13:24:46 -07:00
2015-06-05 12:23:18 -07:00
2015-04-18 18:35:48 -07:00
2015-03-12 13:45:18 -07:00
2015-01-14 09:32:04 -08:00
2014-07-28 10:14:33 -07:00
2015-05-05 21:00:23 -07:00
2015-05-05 21:00:23 -07:00
2014-05-15 09:49:12 -07:00
2014-05-15 09:49:12 -07:00
2012-10-29 03:08:30 -04:00
2015-05-05 21:00:23 -07:00
2015-03-10 20:53:52 -07:00
2015-05-05 21:00:23 -07:00
2013-12-09 14:54:48 -08:00
2015-05-20 10:19:12 -07:00
2014-12-22 12:27:20 -08:00
2014-12-12 14:31:42 -08:00
2014-03-31 15:29:27 -07:00
2015-05-19 13:17:49 -07:00
2015-05-26 13:24:46 -07:00
2015-06-05 12:17:37 -07:00
2014-01-17 12:21:20 -08:00
2014-12-22 12:27:41 -08:00
2014-10-29 10:09:35 -07:00
2015-06-01 12:45:14 -07:00
2015-02-26 20:19:21 +00:00
2015-06-05 12:22:33 -07:00
2015-05-11 14:23:39 -07:00
2014-10-08 13:05:25 -07:00
2014-09-29 12:36:11 -07:00
2014-07-07 13:56:38 -07:00
2014-07-07 13:56:38 -07:00
2015-06-05 12:17:37 -07:00
2013-05-08 15:31:54 -07:00
2015-03-13 22:43:11 -07:00
2015-06-05 12:17:37 -07:00
2015-03-27 13:02:32 -07:00
2015-05-22 12:41:45 -07:00
2014-10-20 12:23:48 -07:00
2015-03-23 11:12:58 -07:00
2013-07-29 12:32:25 -07:00
2014-10-19 15:28:30 -07:00
2014-07-21 12:35:39 -07:00
2014-03-31 15:29:27 -07:00
2015-05-26 13:24:46 -07:00
2014-10-10 16:02:26 -07:00
2015-05-25 12:19:39 -07:00
2015-06-05 12:22:33 -07:00
2015-06-05 12:17:37 -07:00
2015-05-22 09:33:08 -07:00
2014-09-15 11:29:46 -07:00
2015-02-11 13:44:07 -08:00
2015-05-05 21:00:23 -07:00
2015-06-05 12:17:37 -07:00
2014-06-13 11:49:40 -07:00
2014-12-22 12:27:30 -08:00
2014-12-22 12:27:30 -08:00
2015-06-05 12:17:37 -07:00
2014-03-31 15:29:27 -07:00
2015-05-11 14:23:39 -07:00
2015-03-22 21:39:18 -07:00
2015-01-07 19:56:44 -08:00
2014-09-02 13:28:44 -07:00
2015-06-05 12:17:37 -07:00
2015-06-05 12:17:36 -07:00

////////////////////////////////////////////////////////////////

	Git - the stupid content tracker

////////////////////////////////////////////////////////////////

"git" can mean anything, depending on your mood.

 - random three-letter combination that is pronounceable, and not
   actually used by any common UNIX command.  The fact that it is a
   mispronunciation of "get" may or may not be relevant.
 - stupid. contemptible and despicable. simple. Take your pick from the
   dictionary of slang.
 - "global information tracker": you're in a good mood, and it actually
   works for you. Angels sing, and a light suddenly fills the room.
 - "goddamn idiotic truckload of sh*t": when it breaks

Git is a fast, scalable, distributed revision control system with an
unusually rich command set that provides both high-level operations
and full access to internals.

Git is an Open Source project covered by the GNU General Public
License version 2 (some parts of it are under different licenses,
compatible with the GPLv2). It was originally written by Linus
Torvalds with help of a group of hackers around the net.

Please read the file INSTALL for installation instructions.

See Documentation/gittutorial.txt to get started, then see
Documentation/giteveryday.txt for a useful minimum set of commands, and
Documentation/git-commandname.txt for documentation of each command.
If git has been correctly installed, then the tutorial can also be
read with "man gittutorial" or "git help tutorial", and the
documentation of each command with "man git-commandname" or "git help
commandname".

CVS users may also want to read Documentation/gitcvs-migration.txt
("man gitcvs-migration" or "git help cvs-migration" if git is
installed).

Many Git online resources are accessible from http://git-scm.com/
including full documentation and Git related tools.

The user discussion and development of Git take place on the Git
mailing list -- everyone is welcome to post bug reports, feature
requests, comments and patches to git@vger.kernel.org (read
Documentation/SubmittingPatches for instructions on patch submission).
To subscribe to the list, send an email with just "subscribe git" in
the body to majordomo@vger.kernel.org. The mailing list archives are
available at http://news.gmane.org/gmane.comp.version-control.git/,
http://marc.info/?l=git and other archival sites.

The maintainer frequently sends the "What's cooking" reports that
list the current status of various development topics to the mailing
list.  The discussion following them give a good reference for
project status, development direction and remaining tasks.
Description
No description provided
Readme 582 MiB
Languages
C 50.5%
Shell 38.7%
Perl 4.5%
Tcl 3.2%
Python 0.8%
Other 2.1%