This is an archive of the discontinued Mercurial Phabricator instance.

Differential D4134

perf: call _generatechangelog() instead of group()
Closed, Public

Authored by indygreg on Aug 6 2018, 3:51 PM.


Details

Reviewers

None

Group Reviewers

hg-reviewers

Commits

rHGa1f694779b2f: perf: call _generatechangelog() instead of group()

Summary

Now that we have a separate function for generating just the changelog
bits, the perf command should call it so it gets more accurate
behavior.

This changes the results of this command on my hg repo significantly
(before, then after):

! wall 1.390502 comb 1.390000 user 1.370000 sys 0.020000 (best of 8)
! wall 1.768750 comb 1.760000 user 1.760000 sys 0.000000 (best of 6)

Profiling seems to reveal that ~20% of execution time is spent in
progress bar accounting and printing! If we run with
progress.disable=true:

! wall 1.639134 comb 1.650000 user 1.630000 sys 0.020000 (best of 7)
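To put the three timing lines above side by side, the following sketch parses them and computes the relative change in wall time. It assumes the first line is the measurement before this change and the second is the measurement after it, which is a reading of the summary rather than something the output itself states. The helper names `parse_timing` and `relative_change` are made up for this illustration and are not part of perf.py.

```python
import re

# Timing lines copied verbatim from the summary.
BEFORE = "! wall 1.390502 comb 1.390000 user 1.370000 sys 0.020000 (best of 8)"
AFTER = "! wall 1.768750 comb 1.760000 user 1.760000 sys 0.000000 (best of 6)"
NO_PROGRESS = "! wall 1.639134 comb 1.650000 user 1.630000 sys 0.020000 (best of 7)"

_LINE = re.compile(
    r"! wall (?P<wall>\d+\.\d+) comb (?P<comb>\d+\.\d+) "
    r"user (?P<user>\d+\.\d+) sys (?P<sys>\d+\.\d+) "
    r"\(best of (?P<runs>\d+)\)"
)


def parse_timing(line):
    """Split one timing line into its numeric fields (hypothetical helper)."""
    match = _LINE.match(line)
    if match is None:
        raise ValueError("unrecognized timing line: %r" % line)
    fields = {k: float(v) for k, v in match.groupdict().items() if k != "runs"}
    fields["runs"] = int(match.group("runs"))
    return fields


def relative_change(old, new):
    """Return the change from old to new as a fraction of old."""
    return (new - old) / old


before = parse_timing(BEFORE)["wall"]
after = parse_timing(AFTER)["wall"]
no_progress = parse_timing(NO_PROGRESS)["wall"]

# Cost of switching from group() to _generatechangelog().
slowdown = relative_change(before, after)
# Effect of progress.disable=true on the new code path.
saving = relative_change(after, no_progress)

print("switch to _generatechangelog(): %+.1f%% wall time" % (slowdown * 100))
print("progress.disable=true:          %+.1f%% wall time" % (saving * 100))
```

Under that reading, the switch costs roughly 27% more wall time, and disabling progress output recovers about 7% of the new figure. The ~20% mentioned in the summary comes from profiling, so it need not match the wall-clock difference.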

A nice speedup. But profiling still shows a good chunk of time being
spent in progress bar accounting code. The reason is that the
progress bar is conditionally enabled via an argument to
cgpacker.group(). The previous code in perf.py calling into group()
did not enable the progress bar but _generatechangelog() always does.
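The difference between the two call paths can be illustrated with a toy generator that only does progress accounting when the caller passes a progress object. This is a simplified sketch, not Mercurial's code: `emit_chunks` and `CountingProgress` are hypothetical names, and the real changegroup code does considerably more work per node.

```python
class CountingProgress:
    """Hypothetical stand-in for a progress bar: it only counts updates."""

    def __init__(self):
        self.updates = 0

    def update(self, pos):
        self.updates += 1


def emit_chunks(nodes, progress=None):
    """Yield one chunk per node, reporting progress only if asked to."""
    for i, node in enumerate(nodes):
        if progress is not None:
            # Per-node accounting: this is the extra work a caller pays
            # for when progress reporting is always enabled.
            progress.update(i + 1)
        yield b"chunk:" + node


nodes = [str(i).encode("ascii") for i in range(1000)]

# Analogous to the old perf.py call: no progress object is passed.
quiet = list(emit_chunks(nodes))

# Analogous to a code path that always enables progress.
bar = CountingProgress()
tracked = list(emit_chunks(nodes, progress=bar))

print(len(quiet), len(tracked), bar.updates)
```

Both calls produce the same chunks. The second one also performs one progress update per node, so a benchmark that moves from the first style to the second measures that accounting as well.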

I think it is important for the perf* commands to capture real-world
use cases. And this code always runs with an active progress bar. So
the regression is acceptable.

That being said, terminal printing performance can vary substantially.
I don't think perf* commands should test terminal printing unless
explicitly desired. So I've disabled progress bar printing in this
command.
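The change wraps the timed call in `ui.configoverride()` so that the setting applies only while the benchmark runs. The general pattern, a scoped override that restores the previous value afterwards, can be sketched with a small context manager. `ToyConfig` below is a hypothetical stand-in written for this example; it does not reproduce Mercurial's `ui` class or its config handling.

```python
import contextlib

_MISSING = object()


class ToyConfig:
    """Hypothetical config store keyed by (section, name) tuples."""

    def __init__(self):
        self._values = {}

    def get(self, section, name, default=None):
        return self._values.get((section, name), default)

    @contextlib.contextmanager
    def override(self, overrides):
        """Apply overrides for the duration of a with block, then undo them."""
        saved = {key: self._values.get(key, _MISSING) for key in overrides}
        self._values.update(overrides)
        try:
            yield
        finally:
            # Restore prior values even if the timed code raises.
            for key, old in saved.items():
                if old is _MISSING:
                    self._values.pop(key, None)
                else:
                    self._values[key] = old


config = ToyConfig()
seen = []


def benchmark():
    # Record what the benchmarked code would observe.
    seen.append(config.get("progress", "disable", False))


with config.override({("progress", "disable"): True}):
    benchmark()

restored = config.get("progress", "disable", False)
print(seen, restored)
```

Scoping the override to the `with` block keeps the setting from leaking into anything that runs after the timed section.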

Diff Detail

Repository

rHG Mercurial

Lint

Automatic diff as part of commit; lint not applicable.

Unit

Automatic diff as part of commit; unit tests not applicable.

Event Timeline

indygreg created this revision. Aug 6 2018, 3:51 PM

Herald added a reviewer: hg-reviewers. Aug 6 2018, 3:51 PM

Herald added a subscriber: mercurial-devel.

indygreg added a child revision: D4135: changegroup: key off changelogdone. Aug 6 2018, 3:51 PM

Closed by commit rHGa1f694779b2f: perf: call _generatechangelog() instead of group() (authored by indygreg). Aug 9 2018, 2:15 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents
Changeset List

Path
M	contrib/perf.py (17 lines)

Status	Author	Revision
Closed	indygreg	D4279 manifest: use rev() instead of nodemap.__contains__
Closed	indygreg	D4278 manifest: rename manifestlog._treeinmem to ._treemanifests
Closed	indygreg	D4277 manifest: add getstorage() to manifestlog and use it globally
Closed	indygreg	D4276 manifest: rename dir argument and attribute to tree
Closed	indygreg	D4275 manifest: set appropriate cache entry when clearing _dirlogcache()
Closed	indygreg	D4274 manifest: remove addgroup() from manifestlog and imanifestlog
Closed	indygreg	D4273 repository: clarify role of imanifestlog
Closed	indygreg	D4272 changegroup: change topics during generation
Closed	indygreg	D4271 changegroup: rename mfs to manifests
Closed	indygreg	D4270 changegroup: clean up changelog callback
Closed	indygreg	D4269 changegroup: call rev() on manifestlog instance
Closed	indygreg	D4268 manifest: rename dir to tree to avoid shadowing built-in
Closed	indygreg	D4236 repository: remove candelta() from ifileindex
Closed	indygreg	D4235 changegroup: rename dir to tree to avoid shadowing a built-in
Closed	indygreg	D4227 repository: remove storedeltachains from ifilestorage
Closed	indygreg	D4226 repository: establish API for emitting revision deltas
Closed	indygreg	D4225 repository: formalize interfaces for revision deltas and requests
Closed	indygreg	D4224 changegroup: move node sorting into deltagroup()
Closed	indygreg	D4217 changegroup: invert conditional and dedent
Closed	indygreg	D4216 changegroup: capture base node instead of rev in delta request
Closed	indygreg	D4215 changegroup: introduce requests to define delta generation
Closed	indygreg	D4214 changegroup: refactor delta parent code
Closed	indygreg	D4213 changegroup: differentiate between fulltext and diff based deltas
Closed	indygreg	D4212 changegroup: minor cleanups to deltagroup()
Closed	indygreg	D4211 changegroup: emit revisiondelta instances from deltagroup()
Closed	indygreg	D4210 changegroup: move file chunk emission to generate()
Closed	indygreg	D4209 changegroup: move manifest chunk emission to generate()
Closed	indygreg	D4208 changegroup: move size tracking and end of manifests to generate()
Closed	indygreg	D4207 changegroup: emit delta group close chunk outside of deltagroup()
Closed	indygreg	D4206 changegroup: extract cgpacker.group() to standalone function
Closed	indygreg	D4205 changegroup: pass all state into group()
Closed	indygreg	D4199 changegroup: inline _prune() into call sites
Closed	indygreg	D4198 changegroup: inline _packmanifests() into generatemanifests()
Closed	indygreg	D4197 changegroup: invert conditional and dedent
Closed	indygreg	D4196 changegroup: make _revisiondeltanarrow() a standalone function
Closed	indygreg	D4195 changegroup: pass state into _revisiondeltanarrow
Closed	indygreg	D4194 changegroup: inline _close()
Closed	indygreg	D4193 changegroup: pass clrevtolocalrev to each group
Closed	indygreg	D4192 changegroup: combine _generatefiles() into generatefiles()
Closed	indygreg	D4191 changegroup: define linknodes callbacks in generatefiles()
Closed	indygreg	D4190 changegroup: track changelog to manifest revision map explicitly
Closed	indygreg	D4189 changegroup: remove _clnodetorev
Closed	indygreg	D4188 changegroup: rename _fullnodes to _fullclnodes
Closed	indygreg	D4187 changegroup: move part of _revisiondeltanarrow into group()
Closed	indygreg	D4186 changegroup: populate _clnodetorev as part of changelog linknode lookup
Closed	indygreg	D4142 changegroup: extract _revisiondeltanormal() to standalone function
Closed	indygreg	D4141 changegroup: inline _revchunk() into group()
Closed	indygreg	D4140 changegroup: pass mfdicts properly
Closed	indygreg	D4139 changegroup: pass sorted revisions into group() (API)
Closed	indygreg	D4138 changegroup: pull _fileheader out of cgpacker
Closed	indygreg	D4137 changegroup: factor changelogdone into an argument
Closed	indygreg	D4136 changegroup: record changelogdone after fully consuming its data
Closed	indygreg	D4135 changegroup: key off changelogdone
Closed	indygreg	D4134 perf: call _generatechangelog() instead of group()
Closed	indygreg	D4133 changegroup: factor changelog chunk generation into own function
Closed	indygreg	D4132 changegroup: pass function to resolve delta parents into constructor
Closed	indygreg	D4155 changegroup: restore original behavior of _nextclrevtolocalrev

Diff 10141

contrib/perf.py

     This measures the time spent processing the changelog during a
     bundle operation. This occurs during `hg bundle` and on a server
     processing a `getbundle` wire protocol request (handles clones
     and pull requests).

     By default, all revisions are added to the changegroup.
     """
     cl = repo.changelog
-    revs = [cl.lookup(r) for r in repo.revs(rev or 'all()')]
+    nodes = [cl.lookup(r) for r in repo.revs(rev or 'all()')]
     bundler = changegroup.getbundler(version, repo)

-    def lookup(node):
-        # The real bundler reads the revision in order to access the
-        # manifest node and files list. Do that here.
-        cl.read(node)
-        return node
-
     def d():
-        for chunk in bundler.group(revs, cl, lookup):
+        state, chunks = bundler._generatechangelog(cl, nodes)
+        for chunk in chunks:
             pass

     timer, fm = gettimer(ui, opts)
-    timer(d)
+
+    # Terminal printing can interfere with timing. So disable it.
+    with ui.configoverride({('progress', 'disable'): True}):
+        timer(d)
+
     fm.end()

 @command('perfdirs', formatteropts)
 def perfdirs(ui, repo, **opts):
     timer, fm = gettimer(ui, opts)
     dirstate = repo.dirstate
     'a' in dirstate
     def d():

Diff	ID	Description	Created	Lint	Unit
Diff 1	10005		Aug 6 2018, 3:51 PM	★	★
Diff 2	10141	rHGa1f694779b2f891c522eeb5b833a62bbf29e1c84	Aug 6 2018, 1:43 PM	★	★