This is an archive of the discontinued Mercurial Phabricator instance.

Differential D5449

pull: fix inconsistent view of bookmarks during pull (issue4700)
Closed, Public

Authored by valentin.gatienbaron on Dec 18 2018, 8:58 AM.


Details

Reviewers

None

Group Reviewers

hg-reviewers

Commits

rHGbad05a6afdc8: pull: fix inconsistent view of bookmarks during pull (issue4700)

Summary

I had a share where a pull apparently pulled a bookmark but not the
revision pointed to by the bookmark, which I suspect is due to this
(and if not, we might as well remove known issues in this area).

I fix this by doing all the queries that could read the
bookmarks in one round trip.
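To illustrate why a single round trip helps, here is a minimal sketch using a toy in-memory server. ToyServer, handle, pull_two_roundtrips and pull_one_roundtrip are hypothetical names made up for this example; none of this is Mercurial's wire protocol. The only assumption is that the server answers all commands of one request against the same state.

```python
class ToyServer:
    """Hypothetical stand-in for a remote repository (not Mercurial's API)."""

    def __init__(self):
        self.bookmarks = {'Y': 'node-old'}
        self.roundtrips = 0

    def move_bookmark(self, name, node):
        self.bookmarks[name] = node

    def handle(self, commands):
        """Answer every command of one request against a single snapshot."""
        self.roundtrips += 1
        snapshot = dict(self.bookmarks)
        results = []
        for command, arg in commands:
            if command == 'listkeys':
                results.append(dict(snapshot))
            elif command == 'lookup':
                results.append(snapshot[arg])
            else:
                raise ValueError('unknown command: %r' % command)
        return results


def pull_two_roundtrips(server, rev, between=lambda: None):
    # Two requests: the server state may change between them.
    (bookmarks,) = server.handle([('listkeys', 'bookmarks')])
    between()  # simulates a concurrent push moving the bookmark
    (node,) = server.handle([('lookup', rev)])
    return bookmarks, node


def pull_one_roundtrip(server, rev):
    # One request: both answers come from the same snapshot.
    bookmarks, node = server.handle(
        [('listkeys', 'bookmarks'), ('lookup', rev)])
    return bookmarks, node


racy = ToyServer()
racy_bookmarks, racy_node = pull_two_roundtrips(
    racy, 'Y', between=lambda: racy.move_bookmark('Y', 'node-new'))

batched = ToyServer()
batched_bookmarks, batched_node = pull_one_roundtrip(batched, 'Y')

print(racy_bookmarks['Y'], racy_node)
print(batched_bookmarks['Y'], batched_node)
```

In the two-request variant the bookmark listing and the looked-up node can disagree, which is the inconsistency described in the summary; in the batched variant they cannot.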

I had to change the handling of the case where the server doesn't
support the lookup query, because if it fails, it would otherwise make
fremotebookmark.result() block forever. This is due to
wireprotov1peer.peerexecutor.sendcommands's behavior (it fills a
single future if any query fails synchronously and leaves all other
futures unchanged), but I don't know if the fix is to cancel all other
futures, or to keep going with the other queries.
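The blocking behaviour and the two candidate fixes can be sketched with the standard library's concurrent.futures.Future. This is a toy model under stated assumptions: send_batch and its on_failure modes are hypothetical and only imitate the behaviour described above; they are not the code of wireprotov1peer.peerexecutor.sendcommands.

```python
from concurrent.futures import CancelledError, Future, TimeoutError


def send_batch(calls, on_failure):
    """Toy batch sender. calls is a list of (prepare, future) pairs.

    prepare() either returns a value or raises synchronously.
    on_failure selects what happens to the other futures:
      'leave'    - only the failing future is filled (behaviour described above)
      'cancel'   - all other futures are cancelled
      'continue' - the failing call is dropped, the rest still complete
    """
    prepared = []
    for prepare, fut in calls:
        try:
            prepared.append((prepare(), fut))
        except Exception as exc:
            fut.set_exception(exc)
            if on_failure == 'leave':
                return
            if on_failure == 'cancel':
                for _, other in calls:
                    if other is not fut:
                        other.cancel()
                return
            # 'continue': fall through and keep preparing the other calls
    for value, fut in prepared:
        fut.set_result(value)


def make_calls():
    def listkeys():
        return {'Y': 'node'}

    def lookup():
        raise LookupError("server doesn't support lookup")

    return [(listkeys, Future()), (lookup, Future())]


def outcome(on_failure):
    """Report what a caller waiting on the listkeys future would see."""
    calls = make_calls()
    send_batch(calls, on_failure)
    fbookmarks = calls[0][1]
    try:
        return fbookmarks.result(timeout=0.1)
    except TimeoutError:
        return 'would block forever'
    except CancelledError:
        return 'cancelled'


for mode in ('leave', 'cancel', 'continue'):
    print(mode, outcome(mode))
```

The short timeout stands in for a result() call that would otherwise never return. Which of 'cancel' or 'continue' is the better fix is exactly the open question raised in the summary.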

Diff Detail

Repository

rHG Mercurial

Lint

Lint Skipped

Unit

Unit Tests Skipped

Event Timeline

valentin.gatienbaron created this revision. Dec 18 2018, 8:58 AM

Herald added a reviewer: hg-reviewers. Dec 18 2018, 8:58 AM

Herald added a subscriber: mercurial-devel.

                 revs = [] # actually, nodes
+                if other.capable('lookupns'):
+                    def lookupns(e, r):
+                        return e.callcommand('lookupns', {'key': r}).result()
+                else:
+                    def lookupns(e, r):
+                        return e.callcommand('lookup', {'key': r}).result(), ''
                 for r in oldrevs:
                     with other.commandexecutor() as e:
-                        node = e.callcommand('lookup', {'key': r}).result()
+                        node, ns = lookupns(e, r)
+                        if ns == 'bookmarks':
+                            if r in remotebookmarks():
+                                node = remotebookmarks()[r]

I'm not an expert, but I feel it's wrong to rely on the client for data
consistency. Can't we somehow make the peer serve a "snapshot" of the
repository for the entire session? @indygreg Any updates in the v2 protocol
regarding this?

valentin.gatienbaron updated this revision to Diff 12895. Dec 19 2018, 8:31 AM

I went this way in part because it's a fairly simple change, and it might be useful in other circumstances (to interpret pulling a tag as actually pulling the tag itself perhaps, though I don't know concretely how that would work).

AFAIK, the protocol is never stateful (not sure how that'd work with http), so you can't rely on a notion of session-level state on the server to guarantee consistency.
I can see two plausible alternatives:

1. doing all the lookup queries and the bookmarks query in a single roundtrip. Is it guaranteed that a group of batchable queries will be sent in one go and won't be split up (if there are many of them for instance)?
2. tweak the new lookup query to be instead lookup(name, bookmark=node), which would mean: lookup(name) as if there existed a bookmark called name pointing to node. So kind of the snapshot idea, where you give the server the relevant part of the snapshot.

> AFAIK, the protocol is never stateful (not sure how that'd work with http), so you can't rely on a notion of session-level state on the server to guarantee consistency.

True. I think the new v2 protocol will eventually get around the issue
since it will be more properly batched. @indygreg ?

> I can see two plausible alternatives:
> 1. doing all the lookup queries and the bookmarks query in a single roundtrip. Is it guaranteed that a group of batchable queries will be sent in one go and won't be split up (if there are many of them for instance)?
> 2. tweak the new lookup query to be instead `lookup(name, bookmark=node)`, which would mean: `lookup(name)` as if there existed a bookmark called `name` pointing to `node`. So kind of the snapshot idea, where you give the server the relevant part of the snapshot.

3. do lookup() again at the end of the transaction to detect a race, and abort

If the race doesn't occur frequently, I think it's okay to discard the pulled
data.
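For comparison, the detect-and-abort idea could look roughly like the following. This is only a sketch under assumptions: pull_with_recheck, RaceDetected and the lookup/fetch callables are hypothetical names for this example, not Mercurial functions.

```python
class RaceDetected(Exception):
    """Raised when the name resolved to a different node after the pull."""


def pull_with_recheck(lookup, fetch, name):
    """Resolve name, fetch data, then resolve again and abort on mismatch."""
    before = lookup(name)
    data = fetch(before)
    after = lookup(name)
    if after != before:
        # Discard the pulled data; the caller has to retry.
        raise RaceDetected('%s moved from %s to %s during pull'
                           % (name, before, after))
    return data


# Quiet server: nothing moves, the pull succeeds.
quiet_state = {'Y': 'node-old'}
quiet_result = pull_with_recheck(
    quiet_state.get, lambda node: ['changesets up to ' + node], 'Y')

# Busy server: the bookmark moves while data is being fetched.
busy_state = {'Y': 'node-old'}


def fetch_while_bookmark_moves(node):
    busy_state['Y'] = 'node-new'
    return ['changesets up to ' + node]


try:
    pull_with_recheck(busy_state.get, fetch_while_bookmark_moves, 'Y')
    aborted = False
except RaceDetected:
    aborted = True

print(quiet_result, aborted)
```

The cost of this approach is an extra round trip plus an abort whose only remedy is to retry.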

valentin.gatienbaron edited the summary of this revision. Dec 20 2018, 10:49 PM

valentin.gatienbaron updated this revision to Diff 12938.

I'd prefer not to have unpredictable aborts where, when they happen, the solution is "try again". A year or two ago, the "remote heads changed during push" errors were like that, and they were annoying.

I tried the batching approach, and it seems to work fine, and doesn't require protocol changes. Based on reading the code, a batch of RPCs never gets split up. Perhaps one concern is whether this limits the number of -r arguments that can be passed when using http.

Queued, thanks.

I have a concern about compatibility with ancient Mercurial. Can you check it
and send a follow-up as needed?

> I had to change the handling of the case where the server doesn't
> support the lookup query, because if it fails, it would otherwise make
> fremotebookmark.result() block forever. This is due to
> wireprotov1peer.peerexecutor.sendcommands's behavior (it fills a
> single future if any query fails synchronously and leaves all other
> futures unchanged), but I don't know if the fix is to cancel all other
> futures, or to keep going with the other queries.

@indygreg

+        if opts['bookmark'] or revs:
+            # The list of bookmark used here is the same used to actually update
+            # the bookmark names, to avoid the race from issue 4689 and we do
+            # all lookup and bookmark queries in one go so they see the same
+            # version of the server state (issue 4700).
+            nodes = []
+            fnodes = []
+            revs = revs or []
+            if revs and not other.capable('lookup'):
+                err = _("other repository doesn't support revision lookup, "
+                        "so a rev cannot be specified.")
+                raise error.Abort(err)
+            with other.commandexecutor() as e:
+                fremotebookmarks = e.callcommand('listkeys', {
+                    'namespace': 'bookmarks'
+                })
+                for r in revs:
+                    fnodes.append(e.callcommand('lookup', {'key': r}))

IIRC, listkeys is a newer command than lookup. If the peer doesn't support
listkeys, I suspect this batch query would fail. In that case, maybe listkeys
has to be skipped if the peer doesn't support it and if --bookmark is not
specified.

Closed by commit rHGbad05a6afdc8: pull: fix inconsistent view of bookmarks during pull (issue4700) (authored by valentin.gatienbaron). Dec 23 2018, 10:10 PM

This revision was automatically updated to reflect the committed changes.

Thanks!

> IIRC, listkeys is a newer command than lookup. If the peer doesn't support listkeys, I suspect this batch query would fail. In that case, maybe listkeys has to be skipped if the peer doesn't support it and if --bookmark is not specified.

listkeys shouldn't need a compatibility check in the caller, because it's defined like this in wireprotov1peer.py (https://www.mercurial-scm.org/repo/hg-committed/file/tip/mercurial/wireprotov1peer.py#l381):

 @batchable
 def listkeys(self, namespace):
     if not self.capable('pushkey'):
         yield {}, None
...

> > IIRC, listkeys is a newer command than lookup. If the peer doesn't support listkeys, I suspect this batch query would fail. In that case, maybe listkeys has to be skipped if the peer doesn't support it and if --bookmark is not specified.
> listkeys shouldn't need a compatibility check in the caller, because it's defined like this in wireprotov1peer.py (https://www.mercurial-scm.org/repo/hg-committed/file/tip/mercurial/wireprotov1peer.py#l381):
>    @batchable
>    def listkeys(self, namespace):
>        if not self.capable('pushkey'):
>            yield {}, None

Yeah, but the batch executor appears not to handle such cases.

https://www.mercurial-scm.org/repo/hg-committed/file/tip/mercurial/wireprotov1peer.py#l241
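To make the concern concrete, here is a toy model of a generator-based batchable command that may answer locally, together with a batch runner that handles that case. Everything here is an assumption for illustration: ToyFuture, this listkeys and run_batch are hypothetical and only imitate the shape of the snippet quoted above (a first yield of either encoded arguments plus a future, or a local result plus None); they are not the real wireprotov1peer code.

```python
class ToyFuture:
    """Minimal placeholder that the batch runner fills with the reply."""
    value = None


def listkeys(capabilities, namespace):
    """Toy batchable command written as a generator."""
    if 'pushkey' not in capabilities:
        # No remote call needed: hand back a local result and no future.
        yield {}, None
        return
    fut = ToyFuture()
    yield {'namespace': namespace}, fut
    # The runner has filled fut.value by the time we are resumed.
    yield dict(fut.value)


def run_batch(generators, send):
    """Run all commands, sending only those that need the server."""
    results = [None] * len(generators)
    pending = []
    for i, gen in enumerate(generators):
        args_or_result, fut = next(gen)
        if fut is None:
            results[i] = args_or_result  # answered locally
        else:
            pending.append((i, gen, args_or_result, fut))
    if pending:
        replies = send([args for _, _, args, _ in pending])
        for (i, gen, _, fut), reply in zip(pending, replies):
            fut.value = reply
            results[i] = next(gen)
    return results


sent = []


def fake_send(requests):
    sent.append(requests)
    return [{'Y': 'node'} for _ in requests]


old_peer = run_batch([listkeys(set(), 'bookmarks')], fake_send)
requests_after_old_peer = len(sent)
new_peer = run_batch([listkeys({'pushkey'}, 'bookmarks')], fake_send)

print(old_peer, new_peer, sent)
```

A runner that always expects a future from the first yield would mishandle the local-result case, which is the kind of gap the comment above points at.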

Revision Contents
Changeset List

			Path	Packages
M			mercurial/commands.py (60 lines)
M			tests/test-bookmarks-pushpull.t (3 lines)

Diff	ID	Description	Created	Lint	Unit
Base		Base
Diff 1	12893		Dec 18 2018, 8:58 AM	★	★
Diff 2	12895		Dec 19 2018, 8:31 AM	★	★
Diff 3	12938		Dec 20 2018, 10:49 PM	★	★
Diff 4	12965	rHGbad05a6afdc89cc58a2af320698ab29bd8de62d4	Dec 20 2018, 10:28 PM	★	★

Commit	Parents	Author	Summary	Date
		Valentin Gatien-Baron		Dec 20 2018, 10:28 PM

Status	Author	Revision
Closed	valentin.gatienbaron	D5449 pull: fix inconsistent view of bookmarks during pull (issue4700)
Abandoned	valentin.gatienbaron	D5448 pull: update comment and refactor in preparation for next commit
Closed	valentin.gatienbaron	D5447 test: adding test of issue4700

Diff 12938

mercurial/commands.py


     source, branches = hg.parseurl(ui.expandpath(source), opts.get('branch'))
     ui.status(_('pulling from %s\n') % util.hidepassword(source))
     other = hg.peer(repo, opts, source)
     try:
         revs, checkout = hg.addbranchrevs(repo, other, branches,
                                           opts.get('rev'))
 
         pullopargs = {}
-        if opts.get('bookmark'):
-            if not revs:
-                revs = []
-            # The list of bookmark used here is not the one used to actually
-            # update the bookmark name. This can result in the revision pulled
-            # not ending up with the name of the bookmark because of a race
-            # condition on the server. (See issue 4689 for details)
-            remotebookmarks = other.listkeys('bookmarks')
+
+        nodes = None
+        if opts['bookmark'] or revs:
+            # The list of bookmark used here is the same used to actually update
+            # the bookmark names, to avoid the race from issue 4689 and we do
+            # all lookup and bookmark queries in one go so they see the same
+            # version of the server state (issue 4700).
+            nodes = []
+            fnodes = []
+            revs = revs or []
+            if revs and not other.capable('lookup'):
+                err = _("other repository doesn't support revision lookup, "
+                        "so a rev cannot be specified.")
+                raise error.Abort(err)
+            with other.commandexecutor() as e:
+                fremotebookmarks = e.callcommand('listkeys', {
+                    'namespace': 'bookmarks'
+                })
+                for r in revs:
+                    fnodes.append(e.callcommand('lookup', {'key': r}))
+            remotebookmarks = fremotebookmarks.result()
             remotebookmarks = bookmarks.unhexlifybookmarks(remotebookmarks)
             pullopargs['remotebookmarks'] = remotebookmarks
             for b in opts['bookmark']:
                 b = repo._bookmarks.expandname(b)
                 if b not in remotebookmarks:
                     raise error.Abort(_('remote bookmark %s not found!') % b)
-                revs.append(hex(remotebookmarks[b]))
+                nodes.append(remotebookmarks[b])
+            for i, rev in enumerate(revs):
+                node = fnodes[i].result()
+                nodes.append(node)
+                if rev == checkout:
+                    checkout = node
 
-        if revs:
-            try:
-                # When 'rev' is a bookmark name, we cannot guarantee that it
-                # will be updated with that name because of a race condition
-                # server side. (See issue 4689 for details)
-                oldrevs = revs
-                revs = [] # actually, nodes
-                for r in oldrevs:
-                    with other.commandexecutor() as e:
-                        node = e.callcommand('lookup', {'key': r}).result()
-
-                    revs.append(node)
-                    if r == checkout:
-                        checkout = node
-            except error.CapabilityError:
-                err = _("other repository doesn't support revision lookup, "
-                        "so a rev cannot be specified.")
-                raise error.Abort(err)
-
         wlock = util.nullcontextmanager()
         if opts.get('update'):
             wlock = repo.wlock()
         with wlock:
             pullopargs.update(opts.get('opargs', {}))
-            modheads = exchange.pull(repo, other, heads=revs,
+            modheads = exchange.pull(repo, other, heads=nodes,
                                      force=opts.get('force'),
                                      bookmarks=opts.get('bookmark', ()),
                                      opargs=pullopargs).cgresult
 
             # brev is a name, which might be a bookmark to be activated at
             # the end of the update. In other words, it is an explicit
             # destination of the update
             brev = None

tests/test-bookmarks-pushpull.t

   Z 1:0d2164f0ce0d
   $ hg pull -r Y
   pulling from http://localhost:$HGPORT/
   searching for changes
   adding changesets
   adding manifests
   adding file changes
   added 1 changesets with 1 changes to 1 files
+  updating bookmark Y
   new changesets 0d60821d2197 (1 drafts)
   (run 'hg update' to get a working copy)
   $ hg book
   @ 1:0d2164f0ce0d
   X 1:0d2164f0ce0d
-  * Y 5:35d1ef0a8d1b
+  * Y 6:0d60821d2197
   Z 1:0d2164f0ce0d
   $ hg -R $TESTTMP/pull-race book
   @ 1:0d2164f0ce0d
   X 1:0d2164f0ce0d
   * Y 7:714424d9e8b8
   Z 1:0d2164f0ce0d
 
 (done with this section of the test)