This is an archive of the discontinued Mercurial Phabricator instance.

Differential D5496

revset: add "samebranch" keyword argument to the merge revset
Needs RevisionPublic

Authored by angel.ezquerra on Jan 6 2019, 2:03 PM.

Download Raw Diff

Details

Reviewers

baymax

Group Reviewers

hg-reviewers

Summary

By default all merges are shown but if "samebranch" is set to False then merges
with the same branch (i.e. where both parents belong to the same branch) will
be filtered out.

Conversely, if "samebranch" is set to True then only merges with the same branch
will be shown.

This is useful to visualize at a high level the relationships between different
branches and how they are merged with each other.

With the addition of the merge(withbranch) idiom on a previous revision this
could already be done in a quite complicated way, by doing something like:

merge() and branch(somebranch) and not merge(somebranch)

This is not very practical ano only works for a single branch. Thus this new
option is added.

Diff Detail

Repository

rHG Mercurial

Lint

Lint Skipped

Unit

Unit Tests Skipped

Event Timeline

angel.ezquerra created this revision.Jan 6 2019, 2:03 PM

Herald added a reviewer: hg-reviewers. · View Herald TranscriptJan 6 2019, 2:03 PM

Herald added a subscriber: mercurial-devel. · View Herald Transcript

angel.ezquerra added a child revision: D5497: revset: add tests for the new merge() arguments (withbranch and samebranch).Jan 6 2019, 2:03 PM

angel.ezquerra updated this revision to Diff 13205.Jan 13 2019, 5:45 PM

-@predicate('merge(withbranch)', safe=True)
+@predicate('merge(withbranch, samebranch=True)', safe=True)

[, samebranch] or [, samebranch=False]`.

withbranch = ''
if 'withbranch' in args:
    withbranch = getstring(args['withbranch'],
                           _('withbranch argument must be a string'))
    kind, branchname, branchmatcher = stringutil.stringmatcher(withbranch)
+ samebranch = None
+ if 'samebranch' in args:
+ # i18n: "samebranch" is a keyword
+ samebranch = getboolean(args['samebranch'],
+ _('samebranch argument must be a True or False'))
cl = repo.changelog
# create the function that will be used to filter the subset
if withbranch:
    # matchfn is a function that returns true when a revision
    # is a merge and the second parent belongs to a branch that
    # matches the withbranch pattern (which can be a literal or a regex)
    if kind == 'literal':
matchfn = lambda r: (cl.parentrevs(r)[1] != -1

and repo[r].p2().branch() == withbranch)

+ basematchfn = lambda r: (cl.parentrevs(r)[1] != -1
+ and repo[r].p2().branch() == withbranch)
else:
matchfn = lambda r: (cl.parentrevs(r)[1] != -1

and branchmatcher(repo[r].p2().branch()))

else:

# matchfn is a function that returns true when a revision is a merge

matchfn = lambda r: cl.parentrevs(r)[1] != -1

+ basematchfn = lambda r: (cl.parentrevs(r)[1] != -1
+ and branchmatcher(repo[r].p2().branch()))
+ else:
+ basematchfn = lambda r: cl.parentrevs(r)[1] != -1
+ if samebranch is None:
+ matchfn = basematchfn
+ else:
+ # if samebranch was specified, build a new match function
+ # that on top of basematch checks if the parents belong (or not)
+ # to the same branch (depending on the value of samebranch)
+ def matchfn(r):
+ c = repo[r]
+ if not basematchfn(r):
+ return False
+ issamebranchmerge = c.p1().branch() == c.p2().branch()
+ return issamebranchmerge if samebranch else not issamebranchmerge

These conditions can be formed as followed:

matchfns = [lambda r: cl.parentrevs(r)[1] != -1]
if withbranch:
    matchfns.append(lambda r: branchmatcher(repo[r].p2().branch()))
if samebranch:
    matchfns.append(samebranchmatchfn)

if len(matchfns) == 1:
    # fast path for common case
    return subset.filter(matchfn[0], ...)
else:
    return subset.filter(lambda r: all(p(r) for p in matchfn), ...)

angel.ezquerra updated this revision to Diff 13211.Jan 14 2019, 6:26 PM

In D5496#82394, @yuja wrote:

-@predicate('merge(withbranch)', safe=True)
+@predicate('merge(withbranch, samebranch=True)', safe=True)

[, samebranch] or [, samebranch=False]`.

I guess that means:

@predicate('merge([withbranch [, samebranch=None]])', safe=True)

Right? (I realized that it is incorrect to say that samebranch's default value is False).

withbranch = ''
if 'withbranch' in args:
    withbranch = getstring(args['withbranch'],
                           _('withbranch argument must be a string'))
    kind, branchname, branchmatcher = stringutil.stringmatcher(withbranch)
+ samebranch = None
+ if 'samebranch' in args:
+ # i18n: "samebranch" is a keyword
+ samebranch = getboolean(args['samebranch'],
+ _('samebranch argument must be a True or False'))
cl = repo.changelog
# create the function that will be used to filter the subset
if withbranch:
    # matchfn is a function that returns true when a revision
    # is a merge and the second parent belongs to a branch that
    # matches the withbranch pattern (which can be a literal or a regex)
    if kind == 'literal':
matchfn = lambda r: (cl.parentrevs(r)[1] != -1

and repo[r].p2().branch() == withbranch)

+ basematchfn = lambda r: (cl.parentrevs(r)[1] != -1
+ and repo[r].p2().branch() == withbranch)
else:
matchfn = lambda r: (cl.parentrevs(r)[1] != -1

and branchmatcher(repo[r].p2().branch()))

else:

# matchfn is a function that returns true when a revision is a merge

matchfn = lambda r: cl.parentrevs(r)[1] != -1

+ basematchfn = lambda r: (cl.parentrevs(r)[1] != -1
+ and branchmatcher(repo[r].p2().branch()))
+ else:
+ basematchfn = lambda r: cl.parentrevs(r)[1] != -1
+ if samebranch is None:
+ matchfn = basematchfn
+ else:
+ # if samebranch was specified, build a new match function
+ # that on top of basematch checks if the parents belong (or not)
+ # to the same branch (depending on the value of samebranch)
+ def matchfn(r):
+ c = repo[r]
+ if not basematchfn(r):
+ return False
+ issamebranchmerge = c.p1().branch() == c.p2().branch()
+ return issamebranchmerge if samebranch else not issamebranchmerge
These conditions can be formed as followed:
matchfns = [lambda r: cl.parentrevs(r)[1] != -1]
if withbranch:
    matchfns.append(lambda r: branchmatcher(repo[r].p2().branch()))
if samebranch:
    matchfns.append(samebranchmatchfn)
if len(matchfns) == 1:
    # fast path for common case
    return subset.filter(matchfn[0], ...)
else:
    return subset.filter(lambda r: all(p(r) for p in matchfn), ...)

Do you think this makes the code simpler? In any case, if you think this approach is best I can do it, but perhaps it would be a little better to keep a single subset.filter call as follows:

if len(matchfns) == 1:
    finalmatchfn = matchfns[0]
else:
    finalmatchfn = lambda r: all(p(r) for p in matchfns)
return subset.filter(finalmatchfn, condrepr='<merge>')

What do you think?

> `[, samebranch]` or [, samebranch=False]`.
I guess that means:
@predicate('merge([withbranch [, samebranch=None]])', safe=True)
Right? (I realized that it is incorrect to say that samebranch's default value is False).

Okay, I didn't notice that. And it's tricky to map samebranch=False to
"different branch" constraint. I would read it as "I don't care whether
the branches are the same or not."

We can instead express it as merge() - merge(samebranch=True).

>   if len(matchfns) == 1:
>       # fast path for common case
>       return subset.filter(matchfn[0], ...)
>   else:
>       return subset.filter(lambda r: all(p(r) for p in matchfn), ...)
Do you think this makes the code simpler?

Yes. The original version was hard to find all possible call paths.
Separate function per constraint is easier to follow.

In any case, if you think this approach is best I can do it, but perhaps it would be a little better to keep a single subset.filter call as follows:
if len(matchfns) == 1:
    finalmatchfn = matchfns[0]
else:
    finalmatchfn = lambda r: all(p(r) for p in matchfns)
return subset.filter(finalmatchfn, condrepr='<merge>')

I don't care about these differences.

In D5496#82671, @yuja wrote:
> `[, samebranch]` or [, samebranch=False]`.
I guess that means:
@predicate('merge([withbranch [, samebranch=None]])', safe=True)
Right? (I realized that it is incorrect to say that samebranch's default value is False).
Okay, I didn't notice that. And it's tricky to map samebranch=False to
"different branch" constraint. I would read it as "I don't care whether
the branches are the same or not."

In D5496#82671, @yuja wrote:
> `[, samebranch]` or [, samebranch=False]`.
I guess that means:
@predicate('merge([withbranch [, samebranch=None]])', safe=True)
Right? (I realized that it is incorrect to say that samebranch's default value is False).
Okay, I didn't notice that. And it's tricky to map samebranch=False to
"different branch" constraint. I would read it as "I don't care whether
the branches are the same or not."
We can instead express it as merge() - merge(samebranch=True).

Do you mean that the flag should only indicate whether you want to hide the same branch merges? I guess that is OK too, since the main use case for this flag is to hide the merge from the same branch. However I think we should change the flag name then. Perhaps "hidesame"? Or "includesame" or "includeself", defaulting to True? Any ideas?

> Okay, I didn't notice that. And it's tricky to map `samebranch=False` to
>  "different branch" constraint. I would read it as "I don't care whether
>  the branches are the same or not."
>
> We can instead express it as `merge() - merge(samebranch=True)`.
Do you mean that the flag should only indicate whether you want to hide the same branch merges?

I just mean tri-state bool is confusing. <whatever>=False sounds like we
don't care about the <whatever> condition.

I guess that is OK too, since the main use case for this flag is to hide the merge from the same branch. However I think we should change the flag name then. Perhaps "hidesame"? Or "includesame" or "includeself", defaulting to True? Any ideas?

It could be an argument taking a string like 'same', but I can't think
of nice names. What's the best term describing a merge between two named
branches?

In D5496#82908, @angel.ezquerra wrote:

In D5496#82671, @yuja wrote:

Do you mean that the flag should only indicate whether you want to hide the same branch merges? I guess that is OK too, since the main use case for this flag is to hide the merge from the same branch. However I think we should change the flag name then. Perhaps "hidesame"? Or "includesame" or "includeself", defaulting to True? Any ideas?

Maybe anonymous, defaulting to True? That's in the glossary under Branch, anonymous, so not technically a merge, but I think it still conveys the point.

I think I have a similar reaction as Yuya, but in the opposite direction- merge(anonymous=True) makes me think that's all that's of interest. So maybe withanonymous?

I do like the compactness of:

merge() => all merges
merge(anonymous=True) => only merges with matching (p1, p2) branch names
merge(anonymous=False) => only merges with different (p1, p2) branch names

Otherwise finding only anonymous merges is something like merge() - merge(anonymous=False), and it took some thinking to get there with the double negative.

But I have no idea how well that applies to other things if we set that precedent here, and don't feel that strongly about it.

Maybe `anonymous`, defaulting to True?

To all: the default can't be True. It would break the current merge()
revset behavior. Only viable choice is to set the default to "don't care".

And I think tri-state bool is confusing in this context. So, IMHO, it's
better to add an argument taking a keyword string specifying constraint
such as merge(between="a-keyword-to-select-merges-of-named-branches").

That's in the glossary under Branch, anonymous, so not technically
a merge, but I think it still conveys the point.

I disagree. Merges of the same branch are pretty common if your team preferred
merge-based strategy. I wouldn't explicitly call new head pulled from public
repo as an anonymous head.

It can also be wrong if you're using bookmarks.

Maybe we should not try to use boolean. we could have

merge(parentbranches="same")

with the possible value being: same, different, any (an possibly some constains about branch(p1(x)) == branch(x)

Gentle ping on this series, the feature sounds interesting.

Herald added a subscriber: mercurial-patches. · View Herald TranscriptApr 22 2020, 12:00 PM

There seems to have been no activities on this Diff for the past 3 Months.

By policy, we are automatically moving it out of the need-review state.

Please, move it back to need-review without hesitation if this diff should still be discussed.

:baymax:need-review-idle:

This revision now requires changes to proceed.Jul 31 2020, 1:56 PM

Revision Contents
Changeset List

			Path	Packages
M			mercurial/revset.py (27 lines)
M			tests/test-help.t (7 lines)
M			tests/test-revset.t (25 lines)

Commit	Parents	Author	Summary	Date
		Angel Ezquerra		Jul 29 2018, 3:37 PM

Status	Author	Revision
Abandoned	angel.ezquerra	D5497 revset: add tests for the new merge() arguments (withbranch and samebranch)
Needs Revision	angel.ezquerra	D5496 revset: add "samebranch" keyword argument to the merge revset
Needs Revision	angel.ezquerra	D5495 revset: add "branch" positional arguments to the merge revset

Diff 13211

mercurial/revset.py

	if m in subset:			if m in subset:
	return baseset([m], datarepr=('<max %r, %r>', subset, os))			return baseset([m], datarepr=('<max %r, %r>', subset, os))
	except ValueError:			except ValueError:
	# os.max() throws a ValueError when the collection is empty.			# os.max() throws a ValueError when the collection is empty.
	# Same as python's max().			# Same as python's max().
	pass			pass
	return baseset(datarepr=('<max %r, %r>', subset, os))			return baseset(datarepr=('<max %r, %r>', subset, os))

	@predicate('merge([withbranch])', safe=True)			@predicate('merge([withbranch], samebranch=True)', safe=True)
	def merge(repo, subset, x):			def merge(repo, subset, x):
	"""Changeset is a merge changeset			"""Changeset is a merge changeset

	All merge revisions are returned by default. If a "withbranch"			All merge revisions are returned by default. If a "withbranch"
	pattern is provided only merges with (i.e. whose second parent			pattern is provided only merges with (i.e. whose second parent
	belongs to) those branches that match the pattern will be returned.			belongs to) those branches that match the pattern will be returned.
	The simplest pattern is the name of a single branch. It is also			The simplest pattern is the name of a single branch. It is also
	possible to specify a regular expression by starting the pattern			possible to specify a regular expression by starting the pattern
	with "re:". This can be used to match more than one branch			with "re:". This can be used to match more than one branch
	(e.g. "re:branch1\|branch2").			(e.g. "re:branch1\|branch2").

				It is also possible to only return merges where both parents belong to
				the same branch by specifying samebranch=True. If samebranch=False is
				set then only merges where both parents do not belong to the same branch
				will be returned.
	"""			"""
	cl = repo.changelog			cl = repo.changelog
	# matchfn is a function that returns true when a revision is a merge			# matchfn is a function that returns true when a revision is a merge
	matchfn = lambda r: cl.parentrevs(r)[1] != -1			matchfn = lambda r: cl.parentrevs(r)[1] != -1

	# i18n: "merge" is a keyword			# i18n: "merge" is a keyword
	args = getargsdict(x, 'merge', 'withbranch')			args = getargsdict(x, 'merge', 'withbranch samebranch')
	if 'withbranch' in args:			if 'withbranch' in args:
	withbranch = getstring(args['withbranch'],			withbranch = getstring(args['withbranch'],
	_('withbranch argument must be a string'))			_('withbranch argument must be a string'))
	kind, branchname, branchmatcher = stringutil.stringmatcher(withbranch)			kind, branchname, branchmatcher = stringutil.stringmatcher(withbranch)
	if branchname:			if branchname:
	# create the function that will filter the subset			# create the function that will filter the subset
	# is a merge and the second parent belongs to a branch that			# is a merge and the second parent belongs to a branch that
	# matches the withbranch pattern (which can be a literal or a regex)			# matches the withbranch pattern (which can be a literal or a regex)
	matchfn = lambda r: (cl.parentrevs(r)[1] != -1			matchfn = lambda r: (cl.parentrevs(r)[1] != -1
	and branchmatcher(repo[r].p2().branch()))			and branchmatcher(repo[r].p2().branch()))
				samebranch = None
				if 'samebranch' in args:
				# i18n: "samebranch" is a keyword
				samebranch = getboolean(
				args['samebranch'],
				_('samebranch argument must be a True or False'))
				if samebranch is not None:
				basematchfn = matchfn
				# if samebranch was specified, build a new match function
				# that on top of basematch checks if the parents belong (or not)
				# to the same branch (depending on the value of samebranch)
				def matchfn(r):
				c = repo[r]
				if not basematchfn(r):
				return False
				issamebranchmerge = c.p1().branch() == c.p2().branch()
				return issamebranchmerge if samebranch else not issamebranchmerge

	return subset.filter(matchfn, condrepr='<merge>')			return subset.filter(matchfn, condrepr='<merge>')

	@predicate('branchpoint()', safe=True)			@predicate('branchpoint()', safe=True)
	def branchpoint(repo, subset, x):			def branchpoint(repo, subset, x):
	"""Changesets with more than one child.			"""Changesets with more than one child.
	"""			"""
	# i18n: "branchpoint" is a keyword			# i18n: "branchpoint" is a keyword
	getargs(x, 0, 0, _("branchpoint takes no arguments"))			getargs(x, 0, 0, _("branchpoint takes no arguments"))

tests/test-help.t

	This paragraph is omitted, if 'hg help' is invoked without "-v" (for			This paragraph is omitted, if 'hg help' is invoked without "-v" (for
	topic)			topic)

	This paragraph is never omitted, too (for topic)			This paragraph is never omitted, too (for topic)

	Test section lookup			Test section lookup

	$ hg help revset.merge			$ hg help revset.merge
	"merge([withbranch])"			"merge([withbranch], samebranch=True)"
	Changeset is a merge changeset			Changeset is a merge changeset

	All merge revisions are returned by default. If a "withbranch" pattern			All merge revisions are returned by default. If a "withbranch" pattern
	is provided only merges with (i.e. whose second parent belongs to) those			is provided only merges with (i.e. whose second parent belongs to) those
	branches that match the pattern will be returned. The simplest pattern			branches that match the pattern will be returned. The simplest pattern
	is the name of a single branch. It is also possible to specify a regular			is the name of a single branch. It is also possible to specify a regular
	expression by starting the pattern with "re:". This can be used to match			expression by starting the pattern with "re:". This can be used to match
	more than one branch (e.g. "re:branch1\|branch2").			more than one branch (e.g. "re:branch1\|branch2").

				It is also possible to only return merges where both parents belong to
				the same branch by specifying samebranch=True. If samebranch=False is
				set then only merges where both parents do not belong to the same branch
				will be returned.

	$ hg help glossary.dag			$ hg help glossary.dag
	DAG			DAG
	The repository of changesets of a distributed version control system			The repository of changesets of a distributed version control system
	(DVCS) can be described as a directed acyclic graph (DAG), consisting			(DVCS) can be described as a directed acyclic graph (DAG), consisting
	of nodes and edges, where nodes correspond to changesets and edges			of nodes and edges, where nodes correspond to changesets and edges
	imply a parent -> child relation. This graph can be visualized by			imply a parent -> child relation. This graph can be visualized by
	graphical tools such as 'hg log --graph'. In Mercurial, the DAG is			graphical tools such as 'hg log --graph'. In Mercurial, the DAG is
	limited by the requirement for children to have at most two parents.			limited by the requirement for children to have at most two parents.

tests/test-revset.t

	created new head			created new head
	$ hg up 17			$ hg up 17
	1 files updated, 0 files merged, 0 files removed, 0 files unresolved			1 files updated, 0 files merged, 0 files removed, 0 files unresolved
	$ hg merge 18			$ hg merge 18
	1 files updated, 0 files merged, 0 files removed, 0 files unresolved			1 files updated, 0 files merged, 0 files removed, 0 files unresolved
	(branch merge, don't forget to commit)			(branch merge, don't forget to commit)
	$ hg commit -m "different branch merge 2"			$ hg commit -m "different branch merge 2"

				test that the merge revisions can be split between those where
				samebranch is True and those where it is False
				$ log 'merge()'
				6
				11
				13
				15
				17
				19

				show merges with the same branch
				$ log 'merge(samebranch=True)'
				11
				13
				15
				19

				show merges with other branches (but not the same branch)
				$ log 'merge(samebranch=False)'
				6
				17

	show merges with a particular branch			show merges with a particular branch
	$ log 'merge(.a.b.c.)'			$ log 'merge(.a.b.c.)'
	11			11
	13			13
	15			15
	17			17
	$ log 'merge(-a-b-c-)'			$ log 'merge(-a-b-c-)'
	6			6

				$ log 'merge(".a.b.c.", samebranch=False)'
				17

	show merges with multiple branches using a regex			show merges with multiple branches using a regex
	$ log 'merge("re:.a.b.c.")'			$ log 'merge("re:.a.b.c.")'
	6			6
	11			11
	13			13
	15			15
	17			17
	19			19

Diff	ID	Description	Created	Lint	Unit
Base		Base
Diff 1	13018		Jan 6 2019, 2:03 PM	★	★
Diff 2	13205		Jan 13 2019, 5:45 PM	★	★
Diff 3	13211		Jan 14 2019, 6:26 PM	★	★