This is an archive of the discontinued Mercurial Phabricator instance.

Differential D451

revset: remove order information from tree
ClosedPublic

Authored by quark on Aug 19 2017, 1:42 PM.

Download Raw Diff

Details

Reviewers

yuja

Group Reviewers

hg-reviewers

Commits

rHG1b28525e6698: revset: remove order information from tree (API)

Summary

Keeping order in tree makes AST operation harder. And there could be
invalid cases if trees could be generated and compounded freely, like:

SetA(order=define) & SetB(order=define)
                                ^^^^^^ couldn't be satisfied

This patch changes the code to calculate order on the fly, during tree
traversal. Optimization of reordering and arguments is preserved by
introducing a new internal operation flipand.

Diff Detail

Repository

rHG Mercurial

Lint

Automatic diff as part of commit; lint not applicable.

Unit

Automatic diff as part of commit; unit tests not applicable.

Event Timeline

quark created this revision.Aug 19 2017, 1:42 PM

Herald added a reviewer: hg-reviewers. · View Herald TranscriptAug 19 2017, 1:42 PM

Herald added a subscriber: mercurial-devel. · View Herald Transcript

quark mentioned this in D441: revset: optimize "draft() & ::x" pattern.Aug 19 2017, 1:44 PM

quark updated this revision to Diff 1091.Aug 19 2017, 2:01 PM

@yuja Let me know if this can simplify matchtree, buildtree implementation.

I also wonder if it makes sense to move (part of, mostly weight related) _optimize to runtime (getset), since the revset functions could have more information. For example, if sort gets rev as sort key, it could use getset(order=any) instead of getset(order=define). Some functions like ancestors(revs) also do not care about the order of revs, and we are being conservative - using define for all function arguments now.

tests/test-revset.t
2466–2467	The new code is less efficient here. I guess we it might be solvable by having a `_reverseand` operator that `_optimize` may use.

quark updated this revision to Diff 1092.Aug 19 2017, 2:19 PM

quark edited the summary of this revision. (Show Details)Aug 19 2017, 2:30 PM

quark updated this revision to Diff 1093.Aug 19 2017, 2:52 PM

quark updated this revision to Diff 1094.Aug 19 2017, 2:58 PM

quark edited the summary of this revision. (Show Details)Aug 19 2017, 4:05 PM

quark updated this revision to Diff 1095.

quark added inline comments.Aug 19 2017, 4:06 PM

tests/test-revset.t
2827	This is caused by `fullreposet` having a default order. If we remove that, it would be optimized to `<baseset [1, 3, 5]>` here.

quark marked an inline comment as done.Aug 19 2017, 4:09 PM

quark updated this revision to Diff 1096.Aug 19 2017, 4:50 PM

quark updated this revision to Diff 1097.Aug 19 2017, 5:03 PM

quark retitled this revision from [RFC] revset: remove order information from tree to revset: remove order information from tree.Aug 19 2017, 5:17 PM

quark updated this revision to Diff 1098.

quark added inline comments.Aug 19 2017, 10:58 PM

mercurial/revset.py
58–59	TODO can be removed.
126	In theory this should be: if order == defineorder: return xs & subset else: return subset & xs But it's a bit tricky to find a counterexample. I'm still trying.

Clever. I haven't looked this carefully, but the general direction seems fine.

@yuja Let me know if this can simplify matchtree, buildtree implementation.

Actually matchtree can ignore extra elements in a node tuple, so the existence
of order flag is acceptable, though this will slightly simplify the match function.

I also wonder if it makes sense to move (part of, mostly weight related) _optimize to runtime (getset), since the revset functions could have more information. For example, if sort gets rev as sort key, it could use getset(order=any) instead of getset(order=define). Some functions like ancestors(revs) also do not care about the order of revs, and we are being conservative - using define for all function arguments now.

That could be, perhaps.

quark updated this revision to Diff 1099.Aug 20 2017, 12:12 AM

quark marked an inline comment as done.Aug 20 2017, 12:13 AM

quark added a child revision: D452: revset: add an order-aware intersect helper function.Aug 20 2017, 3:26 AM

quark updated this revision to Diff 1100.

quark updated this revision to Diff 1103.Aug 20 2017, 4:43 AM

quark updated this revision to Diff 1108.Aug 20 2017, 3:58 PM

quark added a parent revision: D455: test-revset: make it work with chg.Aug 20 2017, 3:58 PM

(just scanned the series; no careful review yet)

Perhaps the name defineorder was misleading. As of lazyset was introduced,
most revset predicates "follow"ed the order of the input subset, which was by
default rev-ascending. This is also true now. And there are a few exceptions
(e.g. rangeset), which MAY enforce their ordering scheme if the flag is defineorder.

So, anyorder for not x would be my mistake because x and not y could be
theoretically flipped.

We could change this policy so all predicates must "define" their own ordering
schemes (as you did, I think), but I guess that would lead to bugs that are hardly
noticed. So I'm against to bring more "any" ordering without noticeable
performance win. Can you measure it?

mercurial/revset.py
126	`subset & xs` should be correct since `dagrange` doesn't have its own order unlike `rangeset`. Most revset functions "follow" the default order even if they are used where they may "define" order.
901	IIUC, `followorder` is correct because the ordering flags of `x and y` are flipped as if they were `y and x`.
1825	Can you split this to new patch, and preferably include a micro benchmark? Revset had historically lots of subtle ordering bugs, and I believe there are still some. Fewer "if"s should be better in general.

I think it's more correct if all core revsets support defineorder explicitly. The current code depends on revset.makematcher using ascending fullreposet as default to make defineorder implicitly functional. If the callsite passes an non-ascending (unordered, or descending) set to the returned matcher, the code would behave wrong (ex. _flipand(1:0, _flipand(1:0, 0::1)) would be wrong if a descending fullreposet is passed).

It seems to me that the reason why people can get ordering wrong is because of legacy APIs (repo, subset, x). I tried to address that by introducing intersect(subset, xs, order) and suggest repo, x API (D453). If people write code using the new API, it's much harder to have ordering issues.

D456 provides a good test coverage about core revset ordering issues. If we really want to address ordering issues of 3rd party code, maybe we can deprecate repo, subset, x API and force people to either take subset and order, or none of them. Or require an explicit supportorder flag to be defined to mark it as order-safe.

So I'm against to bring more "any" ordering without noticeable
performance win. Can you measure it?

Probably hard to measure. Since it's in theory useful for performance, I'm leaning towards keeping it. Test can random-shuffle anyorder sets and expect the result to still be correct (I didn't include that test change since D456 has better coverage with less code).

I'll wait for some comments before updating the series. The planned changes are:

move micro optimizations like anyorder in sort to a single patch.
move D456 to maybe the first patch in the series so we can track ordering correctness changes closely.

quark added inline comments.Aug 21 2017, 1:46 PM

mercurial/revset.py
126	For `subset & xs` to be correct, `subset` needs to be in ascending order. That is true currently. But it is not very obvious why `subset` is in ascending order here (or, the question is, who is responsible to sort it?). I think it's simpler to not depend on it and make every revset respect `defineorder` explicitly. That also allows us to remove some unnecessary sorting.
901	In this case, `y` is expected to completely redefine the order. So `y`'s `subset`'s order does not matter.
1825	I can do that.

quark added inline comments.Aug 21 2017, 1:48 PM

mercurial/revset.py
901	By "y's subset", I mean "getset(repo, subset, x, xorder)".

In D451#7281, @quark wrote:

I think it's more correct if all core revsets support defineorder explicitly. The current code depends on revset.makematcher using ascending fullreposet as default to make defineorder implicitly functional. If the callsite passes an non-ascending (unordered, or descending) set to the returned matcher, the code would behave wrong (ex. _flipand(1:0, _flipand(1:0, 0::1)) would be wrong if a descending fullreposet is passed).

Perhaps _flipand(1:0, _flipand(1:0, 0::1)) would return [1, 0] if the input set
were reversed. IMHO, that's correct under the original design.

It seems to me that the reason why people can get ordering wrong is because of legacy APIs (repo, subset, x). I tried to address that by introducing intersect(subset, xs, order) and suggest repo, x API (D453). If people write code using the new API, it's much harder to have ordering issues.
D456 provides a good test coverage about core revset ordering issues. If we really want to address ordering issues of 3rd party code, maybe we can deprecate repo, subset, x API and force people to either take subset and order, or none of them. Or require an explicit supportorder flag to be defined to mark it as order-safe.

I agree new decorator API would be slightly better for trivial cases, but the situation
for 3rd-party extensions would be worse. Before, subset was the canonical
source of ordering. Since most revset predicates should not have their own order,
they just needed to return a set in subset's order.

return subset.filter(...)

IIUC, this will be no longer be valid. All revset predicates will have to take care
of order flag if they need a subset argument, just because few predicates
want to enforce their order. This seems not a good balance.

mercurial/revset.py
55	Perhaps the default `order=defineorder` would be safer at this point.
901	So in your proposed design, that's true. x's order doesn't matter. I just meant, in the original design, `x` should follow the subset's order because `y` could have no explicit ordering (so `y` follows `x`, which follows `subset`.)

In D451#7257, @yuja wrote:

So, anyorder for not x would be my mistake because x and not y could be
theoretically flipped.

FWIW, this should be okay since not x follows the order of the input subset.
The order of the x doesn't matter.

In D451#7456, @yuja wrote:

Perhaps _flipand(1:0, _flipand(1:0, 0::1)) would return [1, 0] if the input set
were reversed. IMHO, that's correct under the original design.

I see. The old code allows "weak define" that "define" becomes "follow". There is no way to tell if a revset is "strong define" or "weak define" from the help text. So it could be confusing sometimes.

list(revset.match(None, '_flipand(1:0, _flipand(1:0, 0::1))', order=ORDER)(repo, revset.baseset(INITSET)))

OLD CODE	ORDER=define	ORDER=follow
INITSET=[0,1]	[0,1]	[0,1]
INITSET=[1,0]	[1,0]	[1,0]

NEW CODE	ORDER=define	ORDER=follow
INITSET=[0,1]	[0,1]	[0,1]
INITSET=[1,0]	[0,1]	[1,0]

Since set.sort() is lazy and optimized to a no-op. Migrating everything to "strong define" does not seem to hurt performance in the default use-case. So I'd like to do that for core revsets.

but the situation for 3rd-party extensions would be worse

I think not all 3rd-party extensions need subset. subset is only required if they need filter the subset. There are many cases where just defining a small set is enough (ex. remotenames()). I'd like to show another "define a small set" example: d6708c20, where it's easier to make mistake using the old API (well, it's the reviewer of that change to blame).

All revset predicates will have to take care of order flag if they need a subset argument, just because few predicates want to enforce their order.

For the old code, all predicates must not change order if it's not supposed to define. I now see that's why anyorder is less useful in the old code - not seems to be the only valid case. The new code would allow anyorder to be used in wider cases.

Not directly related to this series, I think some revsets might want a non-ascending "define" order. For example, it seems more natural if p1(A+B) could be equivalent to p1(A)+p1(B). Same applies to p2, parents, children and maybe roots, heads.

quark added inline comments.Aug 22 2017, 12:19 PM

mercurial/revset.py
55	Agree `anyorder` could be a surprise. I was trying to optimize aggressively. Maybe `followorder` is a better default since `subset & x` is the old default.

Interestingly, I checked some well known 3rd-party extensions:

hgsubversion:

svnrev() uses x for x in myset if x in subset therefore enforces "defineorder" and wrong.
fromsvn() is also wrong similarly.

hg-git:

fromgit(), gitnode(): happened to be "followorder" since it uses x for x in subset if ...

mutable-history:

troubled(), suspended(), precursors(), ...: all of them use subset & ... , therefore enforces "followorder"

remotenames:

remotenames(): uses subset & ... pattern
upstream(): uses smartset.filteredset(subset, ...), which could be changed to subset.filter(...). But it's literally subset & tipancestors and does not have to use subset.filter.
pushed(): same as upstream()

All of them could just remove subset argument without hurting performance. They may not be able to do that because they need to support older Mercurial. But I think that could be seen as an indication that new code probably does not need the subset argument.

I had also wanted to remove the need to pass subset, so I'd be happy so that change.

tests/test-revset.t
2827	Does that mean you'll remove the other.sort() in fullreposet.and?

quark added inline comments.Aug 22 2017, 2:28 PM

tests/test-revset.t
2827	In another way. With the new code (`anyorder` gets aggressively used), `fullrepo & xs` would be optimized to `xs & fullrepo` and the latter does not have the sort.

quark added inline comments.Aug 22 2017, 4:58 PM

mercurial/revset.py
55	Actually, `defineorder` as default also makes sense and seems to be better. I'll use it.

In D451#7499, @quark wrote:

hgsubversion:

svnrev() uses x for x in myset if x in subset therefore enforces "defineorder" and wrong.

fromsvn() is also wrong similarly.

Yes, they are wrong (and I've pointed out that before in hgsubversion thread.)
They could benefit from new subset-less API.

hg-git:

fromgit(), gitnode(): happened to be "followorder" since it uses x for x in subset if ...

mutable-history:

troubled(), suspended(), precursors(), ...: all of them use subset & ... , therefore enforces "followorder"

remotenames:

remotenames(): uses subset & ... pattern

upstream(): uses smartset.filteredset(subset, ...), which could be changed to subset.filter(...). But it's literally subset & tipancestors and does not have to use subset.filter.

pushed(): same as upstream()

All of them could just remove subset argument without hurting performance. They may not be able to do that because they need to support older Mercurial. But I think that could be seen as an indication that new code probably does not need the subset argument.

The point is these implementations would become invalid if subset were
optimized to "any" order. That's breaking change, but which is likely to be
not covered by tests.

As I said, most revset predicates have no explicit order. I introduced the
order flag in order to work around a few exceptions, which are x:y,
x + y, sort() and reverse(), IIRC. A strong "define" is exceptional.

So, is it really make sense to revamp the revset ordering rules? I don't
think so. I generally like this series, but -1 for bringing "any" order
everywhere.

In D451#7698, @yuja wrote:

The point is these implementations would become invalid if subset were
optimized to "any" order. That's breaking change, but which is likely to be
not covered by tests.

How about making registrar.revsetpredicate conservative? If it sees any
predicate registered with the old API, Disable anyorder optimization and
change it to followorder in runtime?

That seems to address the concern about correctness of legacy code.

As I said, most revset predicates have no explicit order. I introduced the
order flag in order to work around a few exceptions, which are x:y,
x + y, sort() and reverse(), IIRC. A strong "define" is exceptional.

I feel it inconsistent if x:y has a strong order but x::y does not.
I also want to change p1, etc's define order as mentioned above.

So, is it really make sense to revamp the revset ordering rules? I don't
think so. I generally like this series, but -1 for bringing "any" order
everywhere.

I think in general, the new code is more explicit and better testable.
I'd like to move forward and not get blocked by legacy code.

I understand "anyorder" in the old code is almost a mistake. But with
the new code, and suppose no legacy code is used, "anyoder" is safe.
It seems to be simple (only used in a few places), less error-prone
(core hg has and encourages strong defineorder), and do have value
for optimization. I'd like to keep it. I can gate it by a config
option but I don't think that's necessary if we make registrar handle
it automatically.

How about making registrar.revsetpredicate conservative? If it sees any
predicate registered with the old API, Disable anyorder optimization and
change it to followorder in runtime?

Sounds unnecessarily complicated. How fast is the anyorder set in real-world
queries? Is that worth introducing more "if"s and corresponding tests?

We can apply anyorder optimization to inner queries even if we decided to not
optimize _flipand(). For example, think ancestors(complex_query), where
complex_query can be computed in anyorder constraint because we can be
sure the order of head revisions doesn't matter. OTOH, _flipand(x, y) is hard
because x is passed to arbitrary sub-expression y.

As I said, most revset predicates have no explicit order. I introduced the
order flag in order to work around a few exceptions, which are x:y,
x + y, sort() and reverse(), IIRC. A strong "define" is exceptional.

I feel it inconsistent if x:y has a strong order but x::y does not.

It would be surprising if x::parent suddenly starts listing parent's children
in reverse order.

I also want to change p1, etc's define order as mentioned above.

I don't like this idea because it would make things more inconsistent.
For instance, branch(a + b) could be branch(a) + branch(b), and ancestors(a:b)
could be ancestors(a) + ... + ancestors(b), but is that really what we want?

So, is it really make sense to revamp the revset ordering rules? I don't
think so. I generally like this series, but -1 for bringing "any" order
everywhere.

I think in general, the new code is more explicit and better testable.
I'd like to move forward and not get blocked by legacy code.
I understand "anyorder" in the old code is almost a mistake. But with
the new code, and suppose no legacy code is used, "anyoder" is safe.
It seems to be simple (only used in a few places), less error-prone
(core hg has and encourages strong defineorder), and do have value
for optimization. I'd like to keep it. I can gate it by a config
option but I don't think that's necessary if we make registrar handle
it automatically.

I doubt if it is really error-prone. We'll have to make sure that a revset
function returns a set in the right "defined" order (or an unordered set
which will be sorted implicitly by intersect().)

The current rule is IMHO, simpler. Almost all revsets are in the same
order unless they are explicitly sorted or concatenated.

In D451#8029, @yuja wrote:

How about making registrar.revsetpredicate conservative? If it sees any
predicate registered with the old API, Disable anyorder optimization and
change it to followorder in runtime?

Sounds unnecessarily complicated.

I just got a new idea: Wrap those 3-argument predicate in a function that
does an extra sort if it's defineorder [1]. That seems to solve things
cleanly.

How fast is the anyorder set in real-world
queries? Is that worth introducing more "if"s and corresponding tests?

mozilla-central % hg.old perfrevset 'sort(public() & 1:3)'
! wall 0.193297 comb 0.190000 user 0.190000 sys 0.000000 (best of 51)
mozilla-central % hg.new perfrevset 'sort(public() & 1:3)'
! wall 0.000188 comb 0.000000 user 0.000000 sys 0.000000 (best of 13504)

So I think anyorder definitely has value (not that much to a revset expert though).

We can apply anyorder optimization to inner queries even if we decided to not
optimize _flipand(). For example, think ancestors(complex_query), where
complex_query can be computed in anyorder constraint because we can be
sure the order of head revisions doesn't matter. OTOH, _flipand(x, y) is hard
because x is passed to arbitrary sub-expression y.

I hope the word "hard" here could suggest that the new code does make things simpler at least in this case.

It would be surprising if x::parent suddenly starts listing parent's children
in reverse order.
I don't like this idea because it would make things more inconsistent.
For instance, branch(a + b) could be branch(a) + branch(b), and ancestors(a:b)
could be ancestors(a) + ... + ancestors(b), but is that really what we want?

We can say revsets taking N revs returning O(N) revs would maintain
the order, and not for other revsets. That'd be clearly defined.
But I don't feel strong. We can keep the existing behavior here.

I doubt if it is really error-prone. We'll have to make sure that a revset
function returns a set in the right "defined" order (or an unordered set
which will be sorted implicitly by intersect().)

(The enforce-define [1] idea seems to address this well)

The current rule is IMHO, simpler. Almost all revsets are in the same
order unless they are explicitly sorted or concatenated.

Maybe. But if you insist defineorder does not always need to "define" order.
I hope I can rename it to maybedefineorder, which will make it less confusing
for developers. That said, I still prefer strong "define"s. It seems Martin
also likes it. I think it's simpler because both developers and end-users
won't need to worry about the "weak define" concept.

First, I'm tired of discussing this. Perhaps, you would be the same (guessing
from the initial reply.) I don't think this should be a blocker of your previous
series which optimizes draft() ... something.

If you want to move things forward, please split non-controversial parts,
which I think are:

remove "order" from parsed tree, use _flipand() instead
make subset argument optional

Alternatively, I could send my build/matchtree patch without respecting to
this series to unblock your original patch.

I just got a new idea: Wrap those 3-argument predicate in a function that
does an extra sort if it's defineorder [1]. That seems to solve things
cleanly.

I don't think it's clean, but yeah doable. If I had to take this series, that would
be the safest workaround for old code.

mozilla-central % hg.old perfrevset 'sort(public() & 1:3)'
! wall 0.193297 comb 0.190000 user 0.190000 sys 0.000000 (best of 51)
mozilla-central % hg.new perfrevset 'sort(public() & 1:3)'
! wall 0.000188 comb 0.000000 user 0.000000 sys 0.000000 (best of 13504)

So I think anyorder definitely has value (not that much to a revset expert though).

In this example, public() & 1:3 could be fully optimized to anyorder
without the help of _flipand(). It's obvious that the order of public() & 1:3
doesn't matter.

We can apply anyorder optimization to inner queries even if we decided to not
optimize _flipand(). For example, think ancestors(complex_query), where
complex_query can be computed in anyorder constraint because we can be
sure the order of head revisions doesn't matter. OTOH, _flipand(x, y) is hard
because x is passed to arbitrary sub-expression y.

I hope the word "hard" here could suggest that the new code does make things simpler at least in this case.

I think it's "simpler" at the cost of bringing the "order" concept everywhere.

We can say revsets taking N revs returning O(N) revs would maintain
the order, and not for other revsets. That'd be clearly defined.
But I don't feel strong. We can keep the existing behavior here.

Yep. I don't want to think about the complexity to determine if a revset
predicate may define its own order.

Maybe. But if you insist defineorder does not always need to "define" order.
I hope I can rename it to maybedefineorder, which will make it less confusing
for developers.

Seems good.

That said, I still prefer strong "define"s. It seems Martin
also likes it.

I guess he liked it since the new registrar would get rid of the subset
argument. So do I in that regard.

I think it's simpler because both developers and end-users
won't need to worry about the "weak define" concept.

Users and (most) developers don't have to think about it in either case.
Almost all revsets should be in revision order.

Don't worry about that optimization patch. I'll refactor this series and use maybedefine. Thanks!

quark removed a parent revision: D455: test-revset: make it work with chg.Aug 25 2017, 1:09 PM

quark removed a child revision: D452: revset: add an order-aware intersect helper function.

quark added a child revision: D523: revset: improve documentation about ordering handling.Aug 25 2017, 8:14 PM

quark updated this revision to Diff 1317.

It seems _optimize() has more bugs, so I decided to not queue this.
I have partially updated patch, but how can I collaborate with you?

FWIW, I think _optimize() could use defineorder/anyorder constants
in place of True/False for readablity.

mercurial/revset.py
63	`methods` is the table of operators, not functions, which wouldn't be modified by extensions. I'll drop this change.
144	right-hand side was originally `anyorder`, so updated in flight.
164	This, too. added `anyorder` in flight.
895	This should be undocumented. Changed to a comment.
mercurial/revsetlang.py
368	Split this back to unary/binary/ternary cases since I slightly prefer explicit handling of node tuples.
440	Perhaps this should keep the current `preserveorder` since `not public()` is fully replaced with `_notpublic()`.
448	The order matters. Try `(contains("glob:") & 2:0):1` for example.
463	Perhaps this is `preserveorder` since `keyvalue` node isn't a standalone expression.

yuja added inline comments.Aug 27 2017, 8:42 AM

mercurial/revsetlang.py
440	This is an existing bug, btw. $ hg debugrevspec -p analyzed -p optimized 'not public()' * analyzed: (not (func ('symbol', 'public') None any) define) * optimized: (func ('symbol', '_notpublic') None any)

It seems _optimize() has more bugs

Perhaps we can eliminate preserveorder flag from _optimize() at all.
flipand can always be inserted since and(x, y) is exactly the same as
flipand(x, y) if order != defineorder. For or, we could

backout c63cb2d10d6d assuming the optimization wouldn't be that useful
or add an internal method to select define/anyorder tree at runtime, e.g. (switch-by-order (or original-tree) (or sorted-tree))

One of the drawback of the current patch is we have to resolve the order
constraint twice, at parsing phase and evaluation phase. That's probably why
I decided to embed the flag to parsed tree, though I don't remember it. :-)

mercurial/revset.py
895	Nit: this could be an internal method (= operator) like `difference` since we don't need to embed it in revset expression.

I have partially updated patch, but how can I collaborate with you?

You can phabsend . which will update this patch (assuming Differential Revision: .../D451 line exists), and I can fix the rest of things.

Perhaps we can eliminate preserveorder flag from _optimize() at all.

I like that simplicity. I don't think or optimization is that important (so there was no _flipor).

If we can attach extra hints (not affecting correctness if get lost) to tree nodes (ex. not tuple, but a TreeNode object with a estimated_weight property), the optimization about and and or could be moved to runtime and _flipand becomes unnecessary.

mercurial/revset.py
63	hgsubversion replaces `stringset` to support names like `r123`. We did similar things to support `D123`. I guess the most correct way would be using `repo.names` somehow. I don't feel strong. Dropping this makes core code cleaner. I can fix existing extensions we use.
895	Since we had `_notpublic`, I felt the practice here was to not "pollute" the "methods" table. i.e. "methods" can only contain names outputted directly from the parser. So I still slightly prefer not using an operator. What do you think?
mercurial/revsetlang.py
368	Ha, that was an older version of this patch.
440	I thought since `_notpublic` is atomic and cannot be further split. `preserveorder` does not matter here.
448	Good catch. I realized this when working on `revset.py` but forgot to update it here.
463	Not sure. I think it's similar to function argument (which we preserves order by default in line 463).

yuja updated this revision to Diff 1347.Aug 28 2017, 9:00 AM

You can phabsend . which will update this patch

Done.

Perhaps we can eliminate preserveorder flag from _optimize() at all.

I like that simplicity. I don't think or optimization is that important (so there was no _flipor).

Let's revert c63cb2d10d6d then.

mercurial/revset.py
895	I don't have strong preference as we already have pseudo operator `difference`.
mercurial/revsetlang.py
440	It's just for logical correctness. Since `not public()` is replaced with `_notpublic()`, the ordering constraint should be derived from `not public()`, not from `public()`.
463	`preserveorder` is switched to `True` at `func` node, and its child `list` just passes around it. The same rule should apply to `keyvalue`. Anyway, we should get rid of it from `_optimize()`.

quark mentioned this in D543: revset: accept additional arguments for stringset.Aug 28 2017, 2:09 PM

quark edited the summary of this revision. (Show Details)Aug 29 2017, 10:02 AM

quark updated this revision to Diff 1389.

quark added a parent revision: D561: revset: drop optimization about reordering "or" set elements.Aug 29 2017, 10:03 AM

Flagged as (API) and queued the series, thanks!

This revision is now accepted and ready to land.Aug 30 2017, 9:21 AM

Closed by commit rHG1b28525e6698: revset: remove order information from tree (API) (authored by quark). · Explain WhyAug 30 2017, 9:53 AM

This revision was automatically updated to reflect the committed changes.

martinvonz added inline comments.Aug 30 2017, 1:10 PM

mercurial/revset.py
136	I think using the regular (x,y) order would be clearer. You'd need to rename it then. Maybe something like: # 'smallyand(x, y)' is equivalent to 'and(x, y)', but faster when y is small Feel free to ignore.

quark added inline comments.Aug 30 2017, 3:46 PM

mercurial/revset.py
136	I came up with a similar idea yesterday but didn't write a patch. I think maintaining AST node order could be less confusing for developers. So I'll send a patch. Maybe this could be an additional option of the "andset".

quark mentioned this in rFBHGX2ce6a458869c: revset: accept additional arguments for stringset.Aug 31 2017, 2:30 PM

Revision Contents
Changeset List

			Path	Packages
M			mercurial/revset.py (61 lines)
M			mercurial/revsetlang.py (121 lines)
M			tests/test-revset.t (322 lines)

Diff	ID	Description	Created	Lint	Unit
Base		Base
Diff 1	1090		Aug 19 2017, 1:42 PM	★	★
Diff 2	1091		Aug 19 2017, 2:01 PM	★	★
Diff 3	1092		Aug 19 2017, 2:18 PM	★	★
Diff 4	1093		Aug 19 2017, 2:52 PM	★	★
Diff 5	1094		Aug 19 2017, 2:58 PM	★	★
Diff 6	1095		Aug 19 2017, 4:05 PM	★	★
Diff 7	1096		Aug 19 2017, 4:50 PM	★	★
Diff 8	1097		Aug 19 2017, 5:03 PM	★	★
Diff 9	1098		Aug 19 2017, 5:17 PM	★	★
Diff 10	1099		Aug 20 2017, 12:12 AM	★	★
Diff 11	1100		Aug 20 2017, 3:26 AM	★	★
Diff 12	1103		Aug 20 2017, 4:43 AM	★	★
Diff 13	1108		Aug 20 2017, 3:58 PM	★	★
Diff 14	1317		Aug 25 2017, 8:14 PM	★	★
Diff 15	1347		Aug 28 2017, 9:00 AM	★	★
Diff 16	1389		Aug 29 2017, 10:02 AM	★	★
Diff 17	1434	rHG1b28525e66982a50c33a7163228afdc785e8ca58	Aug 20 2017, 1:55 PM	★	★

Status	Author	Revision
Closed	quark	D523 revset: improve documentation about ordering handling
Closed	quark	D451 revset: remove order information from tree
Closed	quark	D561 revset: drop optimization about reordering "or" set elements

Diff 1434

mercurial/revset.py


	baseset = smartset.baseset			baseset = smartset.baseset
	generatorset = smartset.generatorset			generatorset = smartset.generatorset
	spanset = smartset.spanset			spanset = smartset.spanset
	fullreposet = smartset.fullreposet			fullreposet = smartset.fullreposet

	# helpers			# helpers

	def getset(repo, subset, x):			def getset(repo, subset, x, order=defineorder):
				yujaUnsubmitted Not Done Perhaps the default `order=defineorder` would be safer at this point. yuja: Perhaps the default `order=defineorder` would be safer at this point.
				quarkAuthorUnsubmitted Not Done Agree `anyorder` could be a surprise. I was trying to optimize aggressively. Maybe `followorder` is a better default since `subset & x` is the old default. quark: Agree `anyorder` could be a surprise. I was trying to optimize aggressively. Maybe…
				quarkAuthorUnsubmitted Not Done Actually, `defineorder` as default also makes sense and seems to be better. I'll use it. quark: Actually, `defineorder` as default also makes sense and seems to be better. I'll use it.
	if not x:			if not x:
	raise error.ParseError(_("missing argument"))			raise error.ParseError(_("missing argument"))
	return methods[x[0]](repo, subset, *x[1:])			return methods[x[0]](repo, subset, *x[1:], order=order)

				quarkAuthorUnsubmitted Done TODO can be removed. quark: TODO can be removed.
	def _getrevsource(repo, r):			def _getrevsource(repo, r):
	extra = repo[r].extra()			extra = repo[r].extra()
	for label in ('source', 'transplant_source', 'rebase_source'):			for label in ('source', 'transplant_source', 'rebase_source'):
	if label in extra:			if label in extra:
				yujaUnsubmitted Not Done `methods` is the table of operators, not functions, which wouldn't be modified by extensions. I'll drop this change. yuja: `methods` is the table of operators, not functions, which wouldn't be modified by extensions.
				quarkAuthorUnsubmitted Not Done hgsubversion replaces `stringset` to support names like `r123`. We did similar things to support `D123`. I guess the most correct way would be using `repo.names` somehow. I don't feel strong. Dropping this makes core code cleaner. I can fix existing extensions we use. quark: hgsubversion replaces `stringset` to support names like `r123`. We did similar things to…
	try:			try:
	return repo[extra[label]].rev()			return repo[extra[label]].rev()
	except error.RepoLookupError:			except error.RepoLookupError:
	pass			pass
	return None			return None

	# operator methods			# operator methods

	def stringset(repo, subset, x):			def stringset(repo, subset, x, order):
	x = scmutil.intrev(repo[x])			x = scmutil.intrev(repo[x])
	if (x in subset			if (x in subset
	or x == node.nullrev and isinstance(subset, fullreposet)):			or x == node.nullrev and isinstance(subset, fullreposet)):
	return baseset([x])			return baseset([x])
	return baseset()			return baseset()

	def rangeset(repo, subset, x, y, order):			def rangeset(repo, subset, x, y, order):
	m = getset(repo, fullreposet(repo), x)			m = getset(repo, fullreposet(repo), x)
	else:			else:
	# carrying the sorting over when possible would be more efficient			# carrying the sorting over when possible would be more efficient
	return subset & r			return subset & r

	def dagrange(repo, subset, x, y, order):			def dagrange(repo, subset, x, y, order):
	r = fullreposet(repo)			r = fullreposet(repo)
	xs = dagop.reachableroots(repo, getset(repo, r, x), getset(repo, r, y),			xs = dagop.reachableroots(repo, getset(repo, r, x), getset(repo, r, y),
	includepath=True)			includepath=True)
	return subset & xs			return subset & xs
				quarkAuthorUnsubmitted Not Done In theory this should be: if order == defineorder: return xs & subset else: return subset & xs But it's a bit tricky to find a counterexample. I'm still trying. quark: In theory this should be: if order == defineorder: return xs & subset else…
				yujaUnsubmitted Not Done `subset & xs` should be correct since `dagrange` doesn't have its own order unlike `rangeset`. Most revset functions "follow" the default order even if they are used where they may "define" order. yuja: `subset & xs` should be correct since `dagrange` doesn't have its own order unlike `rangeset`.
				quarkAuthorUnsubmitted Not Done For `subset & xs` to be correct, `subset` needs to be in ascending order. That is true currently. But it is not very obvious why `subset` is in ascending order here (or, the question is, who is responsible to sort it?). I think it's simpler to not depend on it and make every revset respect `defineorder` explicitly. That also allows us to remove some unnecessary sorting. quark: For `subset & xs` to be correct, `subset` needs to be in ascending order. That is true…

	def andset(repo, subset, x, y, order):			def andset(repo, subset, x, y, order):
	return getset(repo, getset(repo, subset, x), y)			if order == anyorder:
				yorder = anyorder
				else:
				yorder = followorder
				return getset(repo, getset(repo, subset, x, order), y, yorder)

				def flipandset(repo, subset, y, x, order):
				# 'flipand(y, x)' is equivalent to 'and(x, y)', but faster when y is small
				martinvonzUnsubmitted Not Done I think using the regular (x,y) order would be clearer. You'd need to rename it then. Maybe something like: # 'smallyand(x, y)' is equivalent to 'and(x, y)', but faster when y is small Feel free to ignore. martinvonz: I think using the regular (x,y) order would be clearer. You'd need to rename it then. Maybe…
				quarkAuthorUnsubmitted Not Done I came up with a similar idea yesterday but didn't write a patch. I think maintaining AST node order could be less confusing for developers. So I'll send a patch. Maybe this could be an additional option of the "andset". quark: I came up with a similar idea yesterday but didn't write a patch. I think maintaining AST node…
				if order == anyorder:
				yorder = anyorder
				else:
				yorder = followorder
				return getset(repo, getset(repo, subset, y, yorder), x, order)

	def differenceset(repo, subset, x, y, order):			def differenceset(repo, subset, x, y, order):
	return getset(repo, subset, x) - getset(repo, subset, y)			return getset(repo, subset, x, order) - getset(repo, subset, y, anyorder)
				yujaUnsubmitted Not Done right-hand side was originally `anyorder`, so updated in flight. yuja: right-hand side was originally `anyorder`, so updated in flight.

	def _orsetlist(repo, subset, xs):			def _orsetlist(repo, subset, xs, order):
	assert xs			assert xs
	if len(xs) == 1:			if len(xs) == 1:
	return getset(repo, subset, xs[0])			return getset(repo, subset, xs[0], order)
	p = len(xs) // 2			p = len(xs) // 2
	a = _orsetlist(repo, subset, xs[:p])			a = _orsetlist(repo, subset, xs[:p], order)
	b = _orsetlist(repo, subset, xs[p:])			b = _orsetlist(repo, subset, xs[p:], order)
	return a + b			return a + b

	def orset(repo, subset, x, order):			def orset(repo, subset, x, order):
	xs = getlist(x)			xs = getlist(x)
	if order == followorder:			if order == followorder:
	# slow path to take the subset order			# slow path to take the subset order
	return subset & _orsetlist(repo, fullreposet(repo), xs)			return subset & _orsetlist(repo, fullreposet(repo), xs, anyorder)
	else:			else:
	return _orsetlist(repo, subset, xs)			return _orsetlist(repo, subset, xs, order)

	def notset(repo, subset, x, order):			def notset(repo, subset, x, order):
	return subset - getset(repo, subset, x)			return subset - getset(repo, subset, x, anyorder)
				yujaUnsubmitted Not Done This, too. added `anyorder` in flight. yuja: This, too. added `anyorder` in flight.

	def relationset(repo, subset, x, y, order):			def relationset(repo, subset, x, y, order):
	raise error.ParseError(_("can't use a relation in this context"))			raise error.ParseError(_("can't use a relation in this context"))

	def relsubscriptset(repo, subset, x, y, z, order):			def relsubscriptset(repo, subset, x, y, z, order):
	# this is pretty basic implementation of 'x#y[z]' operator, still			# this is pretty basic implementation of 'x#y[z]' operator, still
	# experimental so undocumented. see the wiki for further ideas.			# experimental so undocumented. see the wiki for further ideas.
	# https://www.mercurial-scm.org/wiki/RevsetOperatorPlan			# https://www.mercurial-scm.org/wiki/RevsetOperatorPlan
	else:			else:
	return _descendants(repo, subset, x, startdepth=n, stopdepth=n + 1)			return _descendants(repo, subset, x, startdepth=n, stopdepth=n + 1)

	raise error.UnknownIdentifier(rel, ['generations'])			raise error.UnknownIdentifier(rel, ['generations'])

	def subscriptset(repo, subset, x, y, order):			def subscriptset(repo, subset, x, y, order):
	raise error.ParseError(_("can't use a subscript in this context"))			raise error.ParseError(_("can't use a subscript in this context"))

	def listset(repo, subset, *xs):			def listset(repo, subset, xs, *opts):
	raise error.ParseError(_("can't use a list in this context"),			raise error.ParseError(_("can't use a list in this context"),
	hint=_('see hg help "revsets.x or y"'))			hint=_('see hg help "revsets.x or y"'))

	def keyvaluepair(repo, subset, k, v):			def keyvaluepair(repo, subset, k, v, order):
	raise error.ParseError(_("can't use a key-value pair in this context"))			raise error.ParseError(_("can't use a key-value pair in this context"))

	def func(repo, subset, a, b, order):			def func(repo, subset, a, b, order):
	f = getsymbol(a)			f = getsymbol(a)
	if f in symbols:			if f in symbols:
	func = symbols[f]			func = symbols[f]
	if getattr(func, '_takeorder', False):			if getattr(func, '_takeorder', False):
	return func(repo, subset, b, order)			return func(repo, subset, b, order)
	s.add(fctx.introrev())			s.add(fctx.introrev())
	else:			else:
	s = dagop.revancestors(repo, baseset([c.rev()]), followfirst)			s = dagop.revancestors(repo, baseset([c.rev()]), followfirst)

	return subset & s			return subset & s

	@predicate('follow([pattern[, startrev]])', safe=True)			@predicate('follow([pattern[, startrev]])', safe=True)
	def follow(repo, subset, x):			def follow(repo, subset, x):
	"""			"""
				yujaUnsubmitted Not Done This should be undocumented. Changed to a comment. yuja: This should be undocumented. Changed to a comment.
				yujaUnsubmitted Not Done Nit: this could be an internal method (= operator) like `difference` since we don't need to embed it in revset expression. yuja: Nit: this could be an internal method (= operator) like `difference` since we don't need to…
				quarkAuthorUnsubmitted Not Done Since we had `_notpublic`, I felt the practice here was to not "pollute" the "methods" table. i.e. "methods" can only contain names outputted directly from the parser. So I still slightly prefer not using an operator. What do you think? quark: Since we had `_notpublic`, I felt the practice here was to not "pollute" the "methods" table. i.
				yujaUnsubmitted Not Done I don't have strong preference as we already have pseudo operator `difference`. yuja: I don't have strong preference as we already have pseudo operator `difference`.
	An alias for ``::.`` (ancestors of the working directory's first parent).			An alias for ``::.`` (ancestors of the working directory's first parent).
	If pattern is specified, the histories of files matching given			If pattern is specified, the histories of files matching given
	pattern in the revision given by startrev are followed, including copies.			pattern in the revision given by startrev are followed, including copies.
	"""			"""
	return _follow(repo, subset, x, 'follow')			return _follow(repo, subset, x, 'follow')

				yujaUnsubmitted Not Done IIUC, `followorder` is correct because the ordering flags of `x and y` are flipped as if they were `y and x`. yuja: IIUC, `followorder` is correct because the ordering flags of `x and y` are flipped as if they…
				quarkAuthorUnsubmitted Not Done In this case, `y` is expected to completely redefine the order. So `y`'s `subset`'s order does not matter. quark: In this case, `y` is expected to completely redefine the order. So `y`'s `subset`'s order does…
				quarkAuthorUnsubmitted Not Done By "y's subset", I mean "getset(repo, subset, x, xorder)". quark: By "y's subset", I mean "getset(repo, subset, x, xorder)".
				yujaUnsubmitted Not Done So in your proposed design, that's true. x's order doesn't matter. I just meant, in the original design, `x` should follow the subset's order because `y` could have no explicit ordering (so `y` follows `x`, which follows `subset`.) yuja: So in your proposed design, that's true. x's order doesn't matter. I just meant, in the…
	@predicate('_followfirst', safe=True)			@predicate('_followfirst', safe=True)
	def _followfirst(repo, subset, x):			def _followfirst(repo, subset, x):
	# ``followfirst([pattern[, startrev]])``			# ``followfirst([pattern[, startrev]])``
	# Like ``follow([pattern[, startrev]])`` but follows only the first parent			# Like ``follow([pattern[, startrev]])`` but follows only the first parent
	# of every revisions or files revisions.			# of every revisions or files revisions.
	return _follow(repo, subset, x, '_followfirst', followfirst=True)			return _follow(repo, subset, x, '_followfirst', followfirst=True)

	@predicate('followlines(file, fromline:toline[, startrev=., descend=False])',			@predicate('followlines(file, fromline:toline[, startrev=., descend=False])',
	if parents[1] != node.nullrev:			if parents[1] != node.nullrev:
	ps.add(parents[1])			ps.add(parents[1])
	except error.WdirUnsupported:			except error.WdirUnsupported:
	parents = repo[r].parents()			parents = repo[r].parents()
	if len(parents) == 2:			if len(parents) == 2:
	ps.add(parents[1].rev())			ps.add(parents[1].rev())
	return subset & ps			return subset & ps

	@predicate('present(set)', safe=True)			@predicate('present(set)', safe=True, takeorder=True)
	def present(repo, subset, x):			def present(repo, subset, x, order):
	"""An empty set, if any revision in set isn't found; otherwise,			"""An empty set, if any revision in set isn't found; otherwise,
	all revisions in set.			all revisions in set.

	If any of specified revisions is not present in the local repository,			If any of specified revisions is not present in the local repository,
	the query is normally aborted. But this predicate allows the query			the query is normally aborted. But this predicate allows the query
	to continue even in such cases.			to continue even in such cases.
	"""			"""
	try:			try:
	return getset(repo, subset, x)			return getset(repo, subset, x, order)
	except error.RepoLookupError:			except error.RepoLookupError:
	return baseset()			return baseset()

	# for internal use			# for internal use
	@predicate('_notpublic', safe=True)			@predicate('_notpublic', safe=True)
	def _notpublic(repo, subset, x):			def _notpublic(repo, subset, x):
	getargs(x, 0, 0, "_notpublic takes no arguments")			getargs(x, 0, 0, "_notpublic takes no arguments")
	return _phase(repo, subset, phases.draft, phases.secret)			return _phase(repo, subset, phases.draft, phases.secret)
	return False			return False

	return subset.filter(matches, condrepr=('<matching%r %r>', fields, revs))			return subset.filter(matches, condrepr=('<matching%r %r>', fields, revs))

	@predicate('reverse(set)', safe=True, takeorder=True)			@predicate('reverse(set)', safe=True, takeorder=True)
	def reverse(repo, subset, x, order):			def reverse(repo, subset, x, order):
	"""Reverse order of set.			"""Reverse order of set.
	"""			"""
	l = getset(repo, subset, x)			l = getset(repo, subset, x, order)
	if order == defineorder:			if order == defineorder:
	l.reverse()			l.reverse()
	return l			return l

	@predicate('roots(set)', safe=True)			@predicate('roots(set)', safe=True)
	def roots(repo, subset, x):			def roots(repo, subset, x):
	"""Changesets in set with no parent changeset in set.			"""Changesets in set with no parent changeset in set.
	"""			"""
	- ``topo`` for a reverse topographical sort			- ``topo`` for a reverse topographical sort

	The ``topo`` sort order cannot be combined with other sort keys. This sort			The ``topo`` sort order cannot be combined with other sort keys. This sort
	takes one optional argument, ``topo.firstbranch``, which takes a revset that			takes one optional argument, ``topo.firstbranch``, which takes a revset that
	specifies what topographical branches to prioritize in the sort.			specifies what topographical branches to prioritize in the sort.

	"""			"""
	s, keyflags, opts = _getsortargs(x)			s, keyflags, opts = _getsortargs(x)
	revs = getset(repo, subset, s)			revs = getset(repo, subset, s, order)

	if not keyflags or order != defineorder:			if not keyflags or order != defineorder:
	return revs			return revs
	if len(keyflags) == 1 and keyflags[0][0] == "rev":			if len(keyflags) == 1 and keyflags[0][0] == "rev":
	revs.sort(reverse=keyflags[0][1])			revs.sort(reverse=keyflags[0][1])
	return revs			return revs
	elif keyflags[0][0] == "topo":			elif keyflags[0][0] == "topo":
	firstbranch = ()			firstbranch = ()
				yujaUnsubmitted Not Done Can you split this to new patch, and preferably include a micro benchmark? Revset had historically lots of subtle ordering bugs, and I believe there are still some. Fewer "if"s should be better in general. yuja: Can you split this to new patch, and preferably include a micro benchmark? Revset had…
				quarkAuthorUnsubmitted Not Done I can do that. quark: I can do that.
	if 'topo.firstbranch' in opts:			if 'topo.firstbranch' in opts:
	firstbranch = getset(repo, subset, opts['topo.firstbranch'])			firstbranch = getset(repo, subset, opts['topo.firstbranch'])
	revs = baseset(dagop.toposort(revs, repo.changelog.parentrevs,			revs = baseset(dagop.toposort(revs, repo.changelog.parentrevs,
	firstbranch),			firstbranch),
	istopo=True)			istopo=True)
	if keyflags[0][1]:			if keyflags[0][1]:
	revs.reverse()			revs.reverse()
	return revs			return revs
	for t in s.split('\0'):			for t in s.split('\0'):
	try:			try:
	# fast path for integer revision			# fast path for integer revision
	r = int(t)			r = int(t)
	if str(r) != t or r not in cl:			if str(r) != t or r not in cl:
	raise ValueError			raise ValueError
	revs = [r]			revs = [r]
	except ValueError:			except ValueError:
	revs = stringset(repo, subset, t)			revs = stringset(repo, subset, t, defineorder)

	for r in revs:			for r in revs:
	if r in seen:			if r in seen:
	continue			continue
	if (r in subset			if (r in subset
	or r == node.nullrev and isinstance(subset, fullreposet)):			or r == node.nullrev and isinstance(subset, fullreposet)):
	ls.append(r)			ls.append(r)
	seen.add(r)			seen.add(r)
	"range": rangeset,			"range": rangeset,
	"rangeall": rangeall,			"rangeall": rangeall,
	"rangepre": rangepre,			"rangepre": rangepre,
	"rangepost": rangepost,			"rangepost": rangepost,
	"dagrange": dagrange,			"dagrange": dagrange,
	"string": stringset,			"string": stringset,
	"symbol": stringset,			"symbol": stringset,
	"and": andset,			"and": andset,
				"flipand": flipandset,
	"or": orset,			"or": orset,
	"not": notset,			"not": notset,
	"difference": differenceset,			"difference": differenceset,
	"relation": relationset,			"relation": relationset,
	"relsubscript": relsubscriptset,			"relsubscript": relsubscriptset,
	"subscript": subscriptset,			"subscript": subscriptset,
	"list": listset,			"list": listset,
	"keyvalue": keyvaluepair,			"keyvalue": keyvaluepair,
	if ui:			if ui:
	aliases.extend(ui.configitems('revsetalias'))			aliases.extend(ui.configitems('revsetalias'))
	warn = ui.warn			warn = ui.warn
	if localalias:			if localalias:
	aliases.extend(localalias.items())			aliases.extend(localalias.items())
	if aliases:			if aliases:
	tree = revsetlang.expandaliases(tree, aliases, warn=warn)			tree = revsetlang.expandaliases(tree, aliases, warn=warn)
	tree = revsetlang.foldconcat(tree)			tree = revsetlang.foldconcat(tree)
	tree = revsetlang.analyze(tree, order)			tree = revsetlang.analyze(tree)
	tree = revsetlang.optimize(tree)			tree = revsetlang.optimize(tree)
	posttreebuilthook(tree, repo)			posttreebuilthook(tree, repo)
	return makematcher(tree)			return makematcher(tree, order)

	def makematcher(tree):			def makematcher(tree, order=defineorder):
	"""Create a matcher from an evaluatable tree"""			"""Create a matcher from an evaluatable tree"""
	def mfunc(repo, subset=None):			def mfunc(repo, subset=None):
	if subset is None:			if subset is None:
	subset = fullreposet(repo)			subset = fullreposet(repo)
	return getset(repo, subset, tree)			return getset(repo, subset, tree, order)
	return mfunc			return mfunc

	def loadpredicate(ui, extname, registrarobj):			def loadpredicate(ui, extname, registrarobj):
	"""Load revset predicates from specified registrarobj			"""Load revset predicates from specified registrarobj
	"""			"""
	for name, func in registrarobj._table.iteritems():			for name, func in registrarobj._table.iteritems():
	symbols[name] = func			symbols[name] = func
	if func._safe:			if func._safe:
	safesymbols.add(name)			safesymbols.add(name)

	# load built-in predicates explicitly to setup safesymbols			# load built-in predicates explicitly to setup safesymbols
	loadpredicate(None, None, predicate)			loadpredicate(None, None, predicate)

	# tell hggettext to extract docstrings from these functions:			# tell hggettext to extract docstrings from these functions:
	i18nfunctions = symbols.values()			i18nfunctions = symbols.values()

mercurial/revsetlang.py


	This can't be used for testing a nullary function since its args tree			This can't be used for testing a nullary function since its args tree
	is also None. Use _isnamedfunc() instead.			is also None. Use _isnamedfunc() instead.
	"""			"""
	if not _isnamedfunc(x, funcname):			if not _isnamedfunc(x, funcname):
	return			return
	return x[2]			return x[2]

	# Constants for ordering requirement, used in _analyze():			# Constants for ordering requirement, used in getset():
	#			#
	# If 'define', any nested functions and operations can change the ordering of			# If 'define', any nested functions and operations can change the ordering of
	# the entries in the set. If 'follow', any nested functions and operations			# the entries in the set. If 'follow', any nested functions and operations
	# should take the ordering specified by the first operand to the '&' operator.			# should take the ordering specified by the first operand to the '&' operator.
	#			#
	# For instance,			# For instance,
	#			#
	# X & (Y \| Z)			# X & (Y \| Z)
	# ^ ^^^^^^^			# ^ ^^^^^^^
	# \| follow			# \| follow
	# define			# define
	#			#
	# will be evaluated as 'or(y(x()), z(x()))', where 'x()' can change the order			# will be evaluated as 'or(y(x()), z(x()))', where 'x()' can change the order
	# of the entries in the set, but 'y()', 'z()' and 'or()' shouldn't.			# of the entries in the set, but 'y()', 'z()' and 'or()' shouldn't.
	#			#
	# 'any' means the order doesn't matter. For instance,			# 'any' means the order doesn't matter. For instance,
	#			#
	# X & !Y			# X & !Y
	# ^			# ^
	# any			# any
	#			#
	# 'y()' can either enforce its ordering requirement or take the ordering			# 'y()' can either enforce its ordering requirement or take the ordering
	# specified by 'x()' because 'not()' doesn't care the order.			# specified by 'x()' because 'not()' doesn't care the order.
	#
	# Transition of ordering requirement:
	#
	# 1. starts with 'define'
	# 2. shifts to 'follow' by 'x & y'
	# 3. changes back to 'define' on function call 'f(x)' or function-like
	# operation 'x (f) y' because 'f' may have its own ordering requirement
	# for 'x' and 'y' (e.g. 'first(x)')
	#
	anyorder = 'any' # don't care the order			anyorder = 'any' # don't care the order
	defineorder = 'define' # should define the order			defineorder = 'define' # should define the order
	followorder = 'follow' # must follow the current order			followorder = 'follow' # must follow the current order

	# transition table for 'x & y', from the current expression 'x' to 'y'
	_tofolloworder = {
	anyorder: anyorder,
	defineorder: followorder,
	followorder: followorder,
	}

	def _matchonly(revs, bases):			def _matchonly(revs, bases):
	"""			"""
	>>> f = lambda args: _matchonly(map(parse, args))			>>> f = lambda args: _matchonly(map(parse, args))
	>>> f('ancestors(A)', 'not ancestors(B)')			>>> f('ancestors(A)', 'not ancestors(B)')
	('list', ('symbol', 'A'), ('symbol', 'B'))			('list', ('symbol', 'A'), ('symbol', 'B'))
	"""			"""
	ta = _matchnamedfunc(revs, 'ancestors')			ta = _matchnamedfunc(revs, 'ancestors')
	tb = bases and bases[0] == 'not' and _matchnamedfunc(bases[1], 'ancestors')			tb = bases and bases[0] == 'not' and _matchnamedfunc(bases[1], 'ancestors')
	# x + y + z -> (or x y z) -> (or (list x y z))			# x + y + z -> (or x y z) -> (or (list x y z))
	return (op, _fixops(('list',) + x[1:]))			return (op, _fixops(('list',) + x[1:]))
	elif op == 'subscript' and x[1][0] == 'relation':			elif op == 'subscript' and x[1][0] == 'relation':
	# x#y[z] ternary			# x#y[z] ternary
	return _fixops(('relsubscript', x[1][1], x[1][2], x[2]))			return _fixops(('relsubscript', x[1][1], x[1][2], x[2]))

	return (op,) + tuple(_fixops(y) for y in x[1:])			return (op,) + tuple(_fixops(y) for y in x[1:])

	def _analyze(x, order):			def _analyze(x):
	if x is None:			if x is None:
	return x			return x

	op = x[0]			op = x[0]
	if op == 'minus':			if op == 'minus':
	return _analyze(('and', x[1], ('not', x[2])), order)			return _analyze(('and', x[1], ('not', x[2])))
	elif op == 'only':			elif op == 'only':
	t = ('func', ('symbol', 'only'), ('list', x[1], x[2]))			t = ('func', ('symbol', 'only'), ('list', x[1], x[2]))
	return _analyze(t, order)			return _analyze(t)
	elif op == 'onlypost':			elif op == 'onlypost':
	return _analyze(('func', ('symbol', 'only'), x[1]), order)			return _analyze(('func', ('symbol', 'only'), x[1]))
	elif op == 'dagrangepre':			elif op == 'dagrangepre':
	return _analyze(('func', ('symbol', 'ancestors'), x[1]), order)			return _analyze(('func', ('symbol', 'ancestors'), x[1]))
	elif op == 'dagrangepost':			elif op == 'dagrangepost':
	return _analyze(('func', ('symbol', 'descendants'), x[1]), order)			return _analyze(('func', ('symbol', 'descendants'), x[1]))
	elif op == 'negate':			elif op == 'negate':
	s = getstring(x[1], _("can't negate that"))			s = getstring(x[1], _("can't negate that"))
	return _analyze(('string', '-' + s), order)			return _analyze(('string', '-' + s))
	elif op in ('string', 'symbol'):			elif op in ('string', 'symbol'):
	return x			return x
	elif op == 'and':
	ta = _analyze(x[1], order)
	tb = _analyze(x[2], _tofolloworder[order])
	return (op, ta, tb, order)
	elif op == 'or':
	return (op, _analyze(x[1], order), order)
	elif op == 'not':
	return (op, _analyze(x[1], anyorder), order)
	elif op == 'rangeall':			elif op == 'rangeall':
	return (op, None, order)			return (op, None)
	elif op in ('rangepre', 'rangepost', 'parentpost'):			elif op in {'or', 'not', 'rangepre', 'rangepost', 'parentpost'}:
	return (op, _analyze(x[1], defineorder), order)			return (op, _analyze(x[1]))
	elif op == 'group':			elif op == 'group':
	return _analyze(x[1], order)			return _analyze(x[1])
	elif op in ('dagrange', 'range', 'parent', 'ancestor', 'relation',			elif op in {'and', 'dagrange', 'range', 'parent', 'ancestor', 'relation',
	'subscript'):			'subscript'}:
	ta = _analyze(x[1], defineorder)			ta = _analyze(x[1])
	tb = _analyze(x[2], defineorder)			tb = _analyze(x[2])
	return (op, ta, tb, order)			return (op, ta, tb)
	elif op == 'relsubscript':			elif op == 'relsubscript':
	ta = _analyze(x[1], defineorder)			ta = _analyze(x[1])
	tb = _analyze(x[2], defineorder)			tb = _analyze(x[2])
	tc = _analyze(x[3], defineorder)			tc = _analyze(x[3])
	return (op, ta, tb, tc, order)			return (op, ta, tb, tc)
	elif op == 'list':			elif op == 'list':
	return (op,) + tuple(_analyze(y, order) for y in x[1:])			return (op,) + tuple(_analyze(y) for y in x[1:])
	elif op == 'keyvalue':			elif op == 'keyvalue':
	return (op, x[1], _analyze(x[2], order))			return (op, x[1], _analyze(x[2]))
	elif op == 'func':			elif op == 'func':
				yujaUnsubmitted Not Done Split this back to unary/binary/ternary cases since I slightly prefer explicit handling of node tuples. yuja: Split this back to unary/binary/ternary cases since I slightly prefer explicit handling of node…
				quarkAuthorUnsubmitted Not Done Ha, that was an older version of this patch. quark: Ha, that was an older version of this patch.
	f = getsymbol(x[1])			return (op, x[1], _analyze(x[2]))
	d = defineorder
	if f == 'present':
	# 'present(set)' is known to return the argument set with no
	# modification, so forward the current order to its argument
	d = order
	return (op, x[1], _analyze(x[2], d), order)
	raise ValueError('invalid operator %r' % op)			raise ValueError('invalid operator %r' % op)

	def analyze(x, order=defineorder):			def analyze(x):
	"""Transform raw parsed tree to evaluatable tree which can be fed to			"""Transform raw parsed tree to evaluatable tree which can be fed to
	optimize() or getset()			optimize() or getset()

	All pseudo operations should be mapped to real operations or functions			All pseudo operations should be mapped to real operations or functions
	defined in methods or symbols table respectively.			defined in methods or symbols table respectively.

	'order' specifies how the current expression 'x' is ordered (see the
	constants defined above.)
	"""			"""
	return _analyze(x, order)			return _analyze(x)

	def _optimize(x, small):			def _optimize(x, small):
	if x is None:			if x is None:
	return 0, x			return 0, x

	smallbonus = 1			smallbonus = 1
	if small:			if small:
	smallbonus = .5			smallbonus = .5

	op = x[0]			op = x[0]
	if op in ('string', 'symbol'):			if op in ('string', 'symbol'):
	return smallbonus, x # single revisions are small			return smallbonus, x # single revisions are small
	elif op == 'and':			elif op == 'and':
	wa, ta = _optimize(x[1], True)			wa, ta = _optimize(x[1], True)
	wb, tb = _optimize(x[2], True)			wb, tb = _optimize(x[2], True)
	order = x[3]
	w = min(wa, wb)			w = min(wa, wb)

	# (::x and not ::y)/(not ::y and ::x) have a fast path			# (::x and not ::y)/(not ::y and ::x) have a fast path
	tm = _matchonly(ta, tb) or _matchonly(tb, ta)			tm = _matchonly(ta, tb) or _matchonly(tb, ta)
	if tm:			if tm:
	return w, ('func', ('symbol', 'only'), tm, order)			return w, ('func', ('symbol', 'only'), tm)

	if tb is not None and tb[0] == 'not':			if tb is not None and tb[0] == 'not':
	return wa, ('difference', ta, tb[1], order)			return wa, ('difference', ta, tb[1])

	if wa > wb:			if wa > wb:
	return w, (op, tb, ta, order)			return w, ('flipand', tb, ta)
	return w, (op, ta, tb, order)			return w, (op, ta, tb)
	elif op == 'or':			elif op == 'or':
	# fast path for machine-generated expression, that is likely to have			# fast path for machine-generated expression, that is likely to have
	# lots of trivial revisions: 'a + b + c()' to '_list(a b) + c()'			# lots of trivial revisions: 'a + b + c()' to '_list(a b) + c()'
	order = x[2]
	ws, ts, ss = [], [], []			ws, ts, ss = [], [], []
	def flushss():			def flushss():
	if not ss:			if not ss:
	return			return
	if len(ss) == 1:			if len(ss) == 1:
	w, t = ss[0]			w, t = ss[0]
	else:			else:
	s = '\0'.join(t[1] for w, t in ss)			s = '\0'.join(t[1] for w, t in ss)
	y = ('func', ('symbol', '_list'), ('string', s), order)			y = ('func', ('symbol', '_list'), ('string', s))
	w, t = _optimize(y, False)			w, t = _optimize(y, False)
	ws.append(w)			ws.append(w)
	ts.append(t)			ts.append(t)
	del ss[:]			del ss[:]
	for y in getlist(x[1]):			for y in getlist(x[1]):
	w, t = _optimize(y, False)			w, t = _optimize(y, False)
	if t is not None and (t[0] == 'string' or t[0] == 'symbol'):			if t is not None and (t[0] == 'string' or t[0] == 'symbol'):
	ss.append((w, t))			ss.append((w, t))
	continue			continue
	flushss()			flushss()
	ws.append(w)			ws.append(w)
	ts.append(t)			ts.append(t)
	flushss()			flushss()
	if len(ts) == 1:			if len(ts) == 1:
	return ws[0], ts[0] # 'or' operation is fully optimized out			return ws[0], ts[0] # 'or' operation is fully optimized out
	return max(ws), (op, ('list',) + tuple(ts), order)			return max(ws), (op, ('list',) + tuple(ts))
	elif op == 'not':			elif op == 'not':
	# Optimize not public() to _notpublic() because we have a fast version			# Optimize not public() to _notpublic() because we have a fast version
	if x[1][:3] == ('func', ('symbol', 'public'), None):			if x[1][:3] == ('func', ('symbol', 'public'), None):
	order = x[1][3]			newsym = ('func', ('symbol', '_notpublic'), None)
	newsym = ('func', ('symbol', '_notpublic'), None, order)
	o = _optimize(newsym, not small)			o = _optimize(newsym, not small)
	return o[0], o[1]			return o[0], o[1]
				yujaUnsubmitted Not Done Perhaps this should keep the current `preserveorder` since `not public()` is fully replaced with `_notpublic()`. yuja: Perhaps this should keep the current `preserveorder` since `not public()` is fully replaced…
				yujaUnsubmitted Not Done This is an existing bug, btw. $ hg debugrevspec -p analyzed -p optimized 'not public()' * analyzed: (not (func ('symbol', 'public') None any) define) * optimized: (func ('symbol', '_notpublic') None any) yuja: This is an existing bug, btw. ``` $ hg debugrevspec -p analyzed -p optimized 'not public()' *…
				quarkAuthorUnsubmitted Not Done I thought since `_notpublic` is atomic and cannot be further split. `preserveorder` does not matter here. quark: I thought since `_notpublic` is atomic and cannot be further split. `preserveorder` does not…
				yujaUnsubmitted Not Done It's just for logical correctness. Since `not public()` is replaced with `_notpublic()`, the ordering constraint should be derived from `not public()`, not from `public()`. yuja: It's just for logical correctness. Since `not public()` is replaced with `_notpublic()`, the…
	else:			else:
	o = _optimize(x[1], not small)			o = _optimize(x[1], not small)
	order = x[2]			return o[0], (op, o[1])
	return o[0], (op, o[1], order)
	elif op == 'rangeall':			elif op == 'rangeall':
	return smallbonus, x			return smallbonus, x
	elif op in ('rangepre', 'rangepost', 'parentpost'):			elif op in ('rangepre', 'rangepost', 'parentpost'):
	o = _optimize(x[1], small)			o = _optimize(x[1], small)
	order = x[2]			return o[0], (op, o[1])
				yujaUnsubmitted Not Done The order matters. Try `(contains("glob:") & 2:0):1` for example. yuja: The order matters. Try `(contains("glob:") & 2:0):1` for example.
				quarkAuthorUnsubmitted Not Done Good catch. I realized this when working on `revset.py` but forgot to update it here. quark: Good catch. I realized this when working on `revset.py` but forgot to update it here.
	return o[0], (op, o[1], order)
	elif op in ('dagrange', 'range'):			elif op in ('dagrange', 'range'):
	wa, ta = _optimize(x[1], small)			wa, ta = _optimize(x[1], small)
	wb, tb = _optimize(x[2], small)			wb, tb = _optimize(x[2], small)
	order = x[3]			return wa + wb, (op, ta, tb)
	return wa + wb, (op, ta, tb, order)
	elif op in ('parent', 'ancestor', 'relation', 'subscript'):			elif op in ('parent', 'ancestor', 'relation', 'subscript'):
	w, t = _optimize(x[1], small)			w, t = _optimize(x[1], small)
	order = x[3]			return w, (op, t, x[2])
	return w, (op, t, x[2], order)
	elif op == 'relsubscript':			elif op == 'relsubscript':
	w, t = _optimize(x[1], small)			w, t = _optimize(x[1], small)
	order = x[4]			return w, (op, t, x[2], x[3])
	return w, (op, t, x[2], x[3], order)
	elif op == 'list':			elif op == 'list':
	ws, ts = zip(*(_optimize(y, small) for y in x[1:]))			ws, ts = zip(*(_optimize(y, small) for y in x[1:]))
	return sum(ws), (op,) + ts			return sum(ws), (op,) + ts
	elif op == 'keyvalue':			elif op == 'keyvalue':
	w, t = _optimize(x[2], small)			w, t = _optimize(x[2], small)
				yujaUnsubmitted Not Done Perhaps this is `preserveorder` since `keyvalue` node isn't a standalone expression. yuja: Perhaps this is `preserveorder` since `keyvalue` node isn't a standalone expression.
				quarkAuthorUnsubmitted Not Done Not sure. I think it's similar to function argument (which we preserves order by default in line 463). quark: Not sure. I think it's similar to function argument (which we preserves order by default in…
				yujaUnsubmitted Not Done `preserveorder` is switched to `True` at `func` node, and its child `list` just passes around it. The same rule should apply to `keyvalue`. Anyway, we should get rid of it from `_optimize()`. yuja: `preserveorder` is switched to `True` at `func` node, and its child `list` just passes around…
	return w, (op, x[1], t)			return w, (op, x[1], t)
	elif op == 'func':			elif op == 'func':
	f = getsymbol(x[1])			f = getsymbol(x[1])
	wa, ta = _optimize(x[2], small)			wa, ta = _optimize(x[2], small)
	if f in ('author', 'branch', 'closed', 'date', 'desc', 'file', 'grep',			if f in ('author', 'branch', 'closed', 'date', 'desc', 'file', 'grep',
	'keyword', 'outgoing', 'user', 'destination'):			'keyword', 'outgoing', 'user', 'destination'):
	w = 10 # slow			w = 10 # slow
	elif f in ('modifies', 'adds', 'removes'):			elif f in ('modifies', 'adds', 'removes'):
	w = 30 # slower			w = 30 # slower
	elif f == "contains":			elif f == "contains":
	w = 100 # very slow			w = 100 # very slow
	elif f == "ancestor":			elif f == "ancestor":
	w = 1 * smallbonus			w = 1 * smallbonus
	elif f in ('reverse', 'limit', 'first', 'wdir', '_intlist'):			elif f in ('reverse', 'limit', 'first', 'wdir', '_intlist'):
	w = 0			w = 0
	elif f == "sort":			elif f == "sort":
	w = 10 # assume most sorts look at changelog			w = 10 # assume most sorts look at changelog
	else:			else:
	w = 1			w = 1
	order = x[3]			return w + wa, (op, x[1], ta)
	return w + wa, (op, x[1], ta, order)
	raise ValueError('invalid operator %r' % op)			raise ValueError('invalid operator %r' % op)

	def optimize(tree):			def optimize(tree):
	"""Optimize evaluatable tree			"""Optimize evaluatable tree

	All pseudo operations should be transformed beforehand.			All pseudo operations should be transformed beforehand.
	"""			"""
	_weight, newtree = _optimize(tree, small=True)			_weight, newtree = _optimize(tree, small=True)

tests/test-revset.t

	<spanset+ 0:2>			<spanset+ 0:2>
	0			0
	1			1
	$ try --optimize :			$ try --optimize :
	(rangeall			(rangeall
	None)			None)
	* optimized:			* optimized:
	(rangeall			(rangeall
	None			None)
	define)
	* set:			* set:
	<spanset+ 0:10>			<spanset+ 0:10>
	0			0
	1			1
	2			2
	3			3
	4			4
	5			5
	(func			(func
	('symbol', 'public')			('symbol', 'public')
	None))))			None))))
	* optimized:			* optimized:
	(keyvalue			(keyvalue
	('symbol', 'foo')			('symbol', 'foo')
	(func			(func
	('symbol', '_notpublic')			('symbol', '_notpublic')
	None			None))
	any))
	hg: parse error: can't use a key-value pair in this context			hg: parse error: can't use a key-value pair in this context
	[255]			[255]

	relation-subscript operator has the highest binding strength (as function call):			relation-subscript operator has the highest binding strength (as function call):

	$ hg debugrevspec -p parsed 'tip:tip^#generations[-1]'			$ hg debugrevspec -p parsed 'tip:tip^#generations[-1]'
	* parsed:			* parsed:
	(range			(range

	$ hg debugrevspec -p analyzed -p optimized --no-show-revs \			$ hg debugrevspec -p analyzed -p optimized --no-show-revs \
	> '(not public())#generations[0]'			> '(not public())#generations[0]'
	* analyzed:			* analyzed:
	(relsubscript			(relsubscript
	(not			(not
	(func			(func
	('symbol', 'public')			('symbol', 'public')
	None			None))
	any)
	define)
	('symbol', 'generations')			('symbol', 'generations')
	('symbol', '0')			('symbol', '0'))
	define)
	* optimized:			* optimized:
	(relsubscript			(relsubscript
	(func			(func
	('symbol', '_notpublic')			('symbol', '_notpublic')
	None			None)
	any)
	('symbol', 'generations')			('symbol', 'generations')
	('symbol', '0')			('symbol', '0'))
	define)

	resolution of subscript and relation-subscript ternary operators:			resolution of subscript and relation-subscript ternary operators:

	$ hg debugrevspec -p analyzed 'tip[0]'			$ hg debugrevspec -p analyzed 'tip[0]'
	* analyzed:			* analyzed:
	(subscript			(subscript
	('symbol', 'tip')			('symbol', 'tip')
	('symbol', '0')			('symbol', '0'))
	define)
	hg: parse error: can't use a subscript in this context			hg: parse error: can't use a subscript in this context
	[255]			[255]

	$ hg debugrevspec -p analyzed 'tip#rel[0]'			$ hg debugrevspec -p analyzed 'tip#rel[0]'
	* analyzed:			* analyzed:
	(relsubscript			(relsubscript
	('symbol', 'tip')			('symbol', 'tip')
	('symbol', 'rel')			('symbol', 'rel')
	('symbol', '0')			('symbol', '0'))
	define)
	hg: parse error: unknown identifier: rel			hg: parse error: unknown identifier: rel
	[255]			[255]

	$ hg debugrevspec -p analyzed '(tip#rel)[0]'			$ hg debugrevspec -p analyzed '(tip#rel)[0]'
	* analyzed:			* analyzed:
	(subscript			(subscript
	(relation			(relation
	('symbol', 'tip')			('symbol', 'tip')
	('symbol', 'rel')			('symbol', 'rel'))
	define)			('symbol', '0'))
	('symbol', '0')
	define)
	hg: parse error: can't use a subscript in this context			hg: parse error: can't use a subscript in this context
	[255]			[255]

	$ hg debugrevspec -p analyzed 'tip#rel[0][1]'			$ hg debugrevspec -p analyzed 'tip#rel[0][1]'
	* analyzed:			* analyzed:
	(subscript			(subscript
	(relsubscript			(relsubscript
	('symbol', 'tip')			('symbol', 'tip')
	('symbol', 'rel')			('symbol', 'rel')
	('symbol', '0')			('symbol', '0'))
	define)			('symbol', '1'))
	('symbol', '1')
	define)
	hg: parse error: can't use a subscript in this context			hg: parse error: can't use a subscript in this context
	[255]			[255]

	$ hg debugrevspec -p analyzed 'tip#rel0#rel1[1]'			$ hg debugrevspec -p analyzed 'tip#rel0#rel1[1]'
	* analyzed:			* analyzed:
	(relsubscript			(relsubscript
	(relation			(relation
	('symbol', 'tip')			('symbol', 'tip')
	('symbol', 'rel0')			('symbol', 'rel0'))
	define)
	('symbol', 'rel1')			('symbol', 'rel1')
	('symbol', '1')			('symbol', '1'))
	define)
	hg: parse error: unknown identifier: rel1			hg: parse error: unknown identifier: rel1
	[255]			[255]

	$ hg debugrevspec -p analyzed 'tip#rel0[0]#rel1[1]'			$ hg debugrevspec -p analyzed 'tip#rel0[0]#rel1[1]'
	* analyzed:			* analyzed:
	(relsubscript			(relsubscript
	(relsubscript			(relsubscript
	('symbol', 'tip')			('symbol', 'tip')
	('symbol', 'rel0')			('symbol', 'rel0')
	('symbol', '0')			('symbol', '0'))
	define)
	('symbol', 'rel1')			('symbol', 'rel1')
	('symbol', '1')			('symbol', '1'))
	define)
	hg: parse error: unknown identifier: rel1			hg: parse error: unknown identifier: rel1
	[255]			[255]

	parse errors of relation, subscript and relation-subscript operators:			parse errors of relation, subscript and relation-subscript operators:

	$ hg debugrevspec '[0]'			$ hg debugrevspec '[0]'
	hg: parse error at 0: not a prefix: [			hg: parse error at 0: not a prefix: [
	[255]			[255]
	('symbol', '0')			('symbol', '0')
	('symbol', '1'))))			('symbol', '1'))))
	('symbol', '1'))			('symbol', '1'))
	* analyzed:			* analyzed:
	(and			(and
	(or			(or
	(list			(list
	('symbol', '0')			('symbol', '0')
	('symbol', '1'))			('symbol', '1')))
	define)
	(not			(not
	('symbol', '1')			('symbol', '1')))
	follow)
	define)
	* optimized:			* optimized:
	(difference			(difference
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x001')			('string', '0\x001'))
	define)			('symbol', '1'))
	('symbol', '1')
	define)
	0			0

	$ hg debugrevspec -p unknown '0'			$ hg debugrevspec -p unknown '0'
	abort: invalid stage name: unknown			abort: invalid stage name: unknown
	[255]			[255]

	$ hg debugrevspec -p all --optimize '0'			$ hg debugrevspec -p all --optimize '0'
	abort: cannot use --optimize with --show-stage			abort: cannot use --optimize with --show-stage
	[255]			[255]

	verify optimized tree:			verify optimized tree:

	$ hg debugrevspec --verify '0\|1'			$ hg debugrevspec --verify '0\|1'

	$ hg debugrevspec --verify -v -p analyzed -p optimized 'r3232() & 2'			$ hg debugrevspec --verify -v -p analyzed -p optimized 'r3232() & 2'
	* analyzed:			* analyzed:
	(and			(and
	(func			(func
	('symbol', 'r3232')			('symbol', 'r3232')
	None			None)
	define)			('symbol', '2'))
	('symbol', '2')
	define)
	* optimized:			* optimized:
	(and			(flipand
	('symbol', '2')			('symbol', '2')
	(func			(func
	('symbol', 'r3232')			('symbol', 'r3232')
	None			None))
	define)
	define)
	* analyzed set:			* analyzed set:
	<baseset [2]>			<baseset [2]>
	* optimized set:			* optimized set:
	<baseset [2, 2]>			<baseset [2, 2]>
	--- analyzed			--- analyzed
	+++ optimized			+++ optimized
	2			2
	+2			+2
	may be hidden (issue5385)			may be hidden (issue5385)

	$ try -p parsed -p analyzed ':'			$ try -p parsed -p analyzed ':'
	* parsed:			* parsed:
	(rangeall			(rangeall
	None)			None)
	* analyzed:			* analyzed:
	(rangeall			(rangeall
	None			None)
	define)
	* set:			* set:
	<spanset+ 0:10>			<spanset+ 0:10>
	0			0
	1			1
	2			2
	3			3
	4			4
	5			5
	6			6
	7			7
	8			8
	9			9
	$ try -p analyzed ':1'			$ try -p analyzed ':1'
	* analyzed:			* analyzed:
	(rangepre			(rangepre
	('symbol', '1')			('symbol', '1'))
	define)
	* set:			* set:
	<spanset+ 0:2>			<spanset+ 0:2>
	0			0
	1			1
	$ try -p analyzed ':(1\|2)'			$ try -p analyzed ':(1\|2)'
	* analyzed:			* analyzed:
	(rangepre			(rangepre
	(or			(or
	(list			(list
	('symbol', '1')			('symbol', '1')
	('symbol', '2'))			('symbol', '2'))))
	define)
	define)
	* set:			* set:
	<spanset+ 0:3>			<spanset+ 0:3>
	0			0
	1			1
	2			2
	$ try -p analyzed ':(1&2)'			$ try -p analyzed ':(1&2)'
	* analyzed:			* analyzed:
	(rangepre			(rangepre
	(and			(and
	('symbol', '1')			('symbol', '1')
	('symbol', '2')			('symbol', '2')))
	define)
	define)
	* set:			* set:
	<baseset []>			<baseset []>

	infix/suffix resolution of ^ operator (issue2884):			infix/suffix resolution of ^ operator (issue2884):

	x^:y means (x^):y			x^:y means (x^):y

	$ try '1^:2'			$ try '1^:2'
	('symbol', '9'))			('symbol', '9'))
	('symbol', '8')))			('symbol', '8')))
	* optimized:			* optimized:
	(func			(func
	('symbol', 'only')			('symbol', 'only')
	(difference			(difference
	(range			(range
	('symbol', '8')			('symbol', '8')
	('symbol', '9')			('symbol', '9'))
	define)			('symbol', '8')))
	('symbol', '8')
	define)
	define)
	* set:			* set:
	<baseset+ [8, 9]>			<baseset+ [8, 9]>
	8			8
	9			9
	$ try --optimize '(9)%(5)'			$ try --optimize '(9)%(5)'
	(only			(only
	(group			(group
	('symbol', '9'))			('symbol', '9'))
	(group			(group
	('symbol', '5')))			('symbol', '5')))
	* optimized:			* optimized:
	(func			(func
	('symbol', 'only')			('symbol', 'only')
	(list			(list
	('symbol', '9')			('symbol', '9')
	('symbol', '5'))			('symbol', '5')))
	define)
	* set:			* set:
	<baseset+ [2, 4, 8, 9]>			<baseset+ [2, 4, 8, 9]>
	2			2
	4			4
	8			8
	9			9

	Test the order of operations			Test the order of operations
	'x:y' takes ordering parameter into account:			'x:y' takes ordering parameter into account:

	$ try -p optimized '3:0 & 0:3 & not 2:1'			$ try -p optimized '3:0 & 0:3 & not 2:1'
	* optimized:			* optimized:
	(difference			(difference
	(and			(and
	(range			(range
	('symbol', '3')			('symbol', '3')
	('symbol', '0')			('symbol', '0'))
	define)
	(range			(range
	('symbol', '0')			('symbol', '0')
	('symbol', '3')			('symbol', '3')))
	follow)
	define)
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '1')			('symbol', '1')))
	any)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<filteredset			<filteredset
	<spanset- 0:4>,			<spanset- 0:4>,
	<spanset+ 0:4>>,			<spanset+ 0:4>>,
	<not			<not
	<spanset+ 1:3>>>			<spanset+ 1:3>>>
	3			3
	(list			(list
	('symbol', '0')			('symbol', '0')
	('symbol', '1')			('symbol', '1')
	('symbol', '2')))))			('symbol', '2')))))
	* optimized:			* optimized:
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	define)
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x001\x002')			('string', '0\x001\x002')))
	follow)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset- 0:3>,			<spanset- 0:3>,
	<baseset [0, 1, 2]>>			<baseset [0, 1, 2]>>
	2			2
	1			1
	0			0

	(range			(range
	('symbol', '0')			('symbol', '0')
	('symbol', '1'))			('symbol', '1'))
	('symbol', '2')))))			('symbol', '2')))))
	* optimized:			* optimized:
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	define)
	(or			(or
	(list			(list
	(range			(range
	('symbol', '0')			('symbol', '0')
	('symbol', '1')			('symbol', '1'))
	follow)			('symbol', '2'))))
	('symbol', '2'))
	follow)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset- 0:3>,			<spanset- 0:3>,
	<addset			<addset
	<spanset+ 0:2>,			<spanset+ 0:2>,
	<baseset [2]>>>			<baseset [2]>>>
	2			2
	1			1
	0			0

	'_intlist(a b)' should behave like 'a + b':			'_intlist(a b)' should behave like 'a + b':

	$ trylist --optimize '2:0 & %ld' 0 1 2			$ trylist --optimize '2:0 & %ld' 0 1 2
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0'))			('symbol', '0'))
	(func			(func
	('symbol', '_intlist')			('symbol', '_intlist')
	('string', '0\x001\x002')))			('string', '0\x001\x002')))
	* optimized:			* optimized:
	(and			(flipand
	(func			(func
	('symbol', '_intlist')			('symbol', '_intlist')
	('string', '0\x001\x002')			('string', '0\x001\x002'))
	follow)
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0')))
	define)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset- 0:3>,			<spanset- 0:3>,
	<baseset+ [0, 1, 2]>>			<baseset+ [0, 1, 2]>>
	2			2
	1			1
	0			0

	$ trylist --optimize '%ld & 2:0' 0 2 1			$ trylist --optimize '%ld & 2:0' 0 2 1
	(and			(and
	(func			(func
	('symbol', '_intlist')			('symbol', '_intlist')
	('string', '0\x002\x001'))			('string', '0\x002\x001'))
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')))			('symbol', '0')))
	* optimized:			* optimized:
	(and			(and
	(func			(func
	('symbol', '_intlist')			('symbol', '_intlist')
	('string', '0\x002\x001')			('string', '0\x002\x001'))
	define)
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0')))
	follow)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<baseset [0, 2, 1]>,			<baseset [0, 2, 1]>,
	<spanset- 0:3>>			<spanset- 0:3>>
	0			0
	2			2
	1			1

	'_hexlist(a b)' should behave like 'a + b':			'_hexlist(a b)' should behave like 'a + b':

	$ trylist --optimize --bin '2:0 & %ln' `hg log -T '{node} ' -r0:2`			$ trylist --optimize --bin '2:0 & %ln' `hg log -T '{node} ' -r0:2`
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0'))			('symbol', '0'))
	(func			(func
	('symbol', '_hexlist')			('symbol', '_hexlist')
	('string', '*'))) (glob)			('string', '*'))) (glob)
	* optimized:			* optimized:
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	define)
	(func			(func
	('symbol', '_hexlist')			('symbol', '_hexlist')
	('string', '*') (glob)			('string', '*'))) (glob)
	follow)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset- 0:3>,			<spanset- 0:3>,
	<baseset [0, 1, 2]>>			<baseset [0, 1, 2]>>
	2			2
	1			1
	0			0

	$ trylist --optimize --bin '%ln & 2:0' `hg log -T '{node} ' -r0+2+1`			$ trylist --optimize --bin '%ln & 2:0' `hg log -T '{node} ' -r0+2+1`
	(and			(and
	(func			(func
	('symbol', '_hexlist')			('symbol', '_hexlist')
	('string', '*')) (glob)			('string', '*')) (glob)
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')))			('symbol', '0')))
	* optimized:			* optimized:
	(and			(flipand
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	follow)
	(func			(func
	('symbol', '_hexlist')			('symbol', '_hexlist')
	('string', '*') (glob)			('string', '*'))) (glob)
	define)
	define)
	* set:			* set:
	<baseset [0, 2, 1]>			<baseset [0, 2, 1]>
	0			0
	2			2
	1			1

	'_list' should not go through the slow follow-order path if order doesn't			'_list' should not go through the slow follow-order path if order doesn't
	matter:			matter:

	$ try -p optimized '2:0 & not (0 + 1)'			$ try -p optimized '2:0 & not (0 + 1)'
	* optimized:			* optimized:
	(difference			(difference
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	define)
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x001')			('string', '0\x001')))
	any)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset- 0:3>,			<spanset- 0:3>,
	<not			<not
	<baseset [0, 1]>>>			<baseset [0, 1]>>>
	2			2

	$ try -p optimized '2:0 & not (0:2 & (0 + 1))'			$ try -p optimized '2:0 & not (0:2 & (0 + 1))'
	* optimized:			* optimized:
	(difference			(difference
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	define)
	(and			(and
	(range			(range
	('symbol', '0')			('symbol', '0')
	('symbol', '2')			('symbol', '2'))
	any)
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x001')			('string', '0\x001'))))
	any)
	any)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset- 0:3>,			<spanset- 0:3>,
	<not			<not
	<baseset [0, 1]>>>			<baseset [0, 1]>>>
	2			2

	because 'present()' does nothing other than suppressing an error, the			because 'present()' does nothing other than suppressing an error, the
	ordering requirement should be forwarded to the nested expression			ordering requirement should be forwarded to the nested expression

	$ try -p optimized 'present(2 + 0 + 1)'			$ try -p optimized 'present(2 + 0 + 1)'
	* optimized:			* optimized:
	(func			(func
	('symbol', 'present')			('symbol', 'present')
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '2\x000\x001')			('string', '2\x000\x001')))
	define)
	define)
	* set:			* set:
	<baseset [2, 0, 1]>			<baseset [2, 0, 1]>
	2			2
	0			0
	1			1

	$ try --optimize '2:0 & present(0 + 1 + 2)'			$ try --optimize '2:0 & present(0 + 1 + 2)'
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0'))			('symbol', '0'))
	(func			(func
	('symbol', 'present')			('symbol', 'present')
	(or			(or
	(list			(list
	('symbol', '0')			('symbol', '0')
	('symbol', '1')			('symbol', '1')
	('symbol', '2')))))			('symbol', '2')))))
	* optimized:			* optimized:
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	define)
	(func			(func
	('symbol', 'present')			('symbol', 'present')
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x001\x002')			('string', '0\x001\x002'))))
	follow)
	follow)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset- 0:3>,			<spanset- 0:3>,
	<baseset [0, 1, 2]>>			<baseset [0, 1, 2]>>
	2			2
	1			1
	0			0

	'reverse()' should take effect only if it is the outermost expression:			'reverse()' should take effect only if it is the outermost expression:

	$ try --optimize '0:2 & reverse(all())'			$ try --optimize '0:2 & reverse(all())'
	(and			(and
	(range			(range
	('symbol', '0')			('symbol', '0')
	('symbol', '2'))			('symbol', '2'))
	(func			(func
	('symbol', 'reverse')			('symbol', 'reverse')
	(func			(func
	('symbol', 'all')			('symbol', 'all')
	None)))			None)))
	* optimized:			* optimized:
	(and			(and
	(range			(range
	('symbol', '0')			('symbol', '0')
	('symbol', '2')			('symbol', '2'))
	define)
	(func			(func
	('symbol', 'reverse')			('symbol', 'reverse')
	(func			(func
	('symbol', 'all')			('symbol', 'all')
	None			None)))
	define)
	follow)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset+ 0:3>,			<spanset+ 0:3>,
	<spanset+ 0:10>>			<spanset+ 0:10>>
	0			0
	1			1
	2			2

	('symbol', 'all')			('symbol', 'all')
	None)			None)
	(negate			(negate
	('symbol', 'rev')))))			('symbol', 'rev')))))
	* optimized:			* optimized:
	(and			(and
	(range			(range
	('symbol', '0')			('symbol', '0')
	('symbol', '2')			('symbol', '2'))
	define)
	(func			(func
	('symbol', 'sort')			('symbol', 'sort')
	(list			(list
	(func			(func
	('symbol', 'all')			('symbol', 'all')
	None			None)
	define)			('string', '-rev'))))
	('string', '-rev'))
	follow)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset+ 0:3>,			<spanset+ 0:3>,
	<spanset+ 0:10>>			<spanset+ 0:10>>
	0			0
	1			1
	2			2

	(list			(list
	('symbol', '1')			('symbol', '1')
	('symbol', '0')			('symbol', '0')
	('symbol', '2')))))			('symbol', '2')))))
	* optimized:			* optimized:
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	define)
	(func			(func
	('symbol', 'first')			('symbol', 'first')
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '1\x000\x002')			('string', '1\x000\x002'))))
	define)
	follow)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<baseset [1]>,			<baseset [1]>,
	<spanset- 0:3>>			<spanset- 0:3>>
	1			1

	$ try --optimize '2:0 & not last(0 + 2 + 1)'			$ try --optimize '2:0 & not last(0 + 2 + 1)'
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0'))			('symbol', '0'))
	(not			(not
	(func			(func
	('symbol', 'last')			('symbol', 'last')
	(or			(or
	(list			(list
	('symbol', '0')			('symbol', '0')
	('symbol', '2')			('symbol', '2')
	('symbol', '1'))))))			('symbol', '1'))))))
	* optimized:			* optimized:
	(difference			(difference
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	define)
	(func			(func
	('symbol', 'last')			('symbol', 'last')
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x002\x001')			('string', '0\x002\x001'))))
	define)
	any)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset- 0:3>,			<spanset- 0:3>,
	<not			<not
	<baseset [1]>>>			<baseset [1]>>>
	2			2
	0			0

	(list			(list
	('symbol', '0')			('symbol', '0')
	('symbol', '2')			('symbol', '2')
	('symbol', '1'))))))			('symbol', '1'))))))
	* optimized:			* optimized:
	(and			(and
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0'))
	define)
	(range			(range
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '1\x000\x002')			('string', '1\x000\x002'))
	define)
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x002\x001')			('string', '0\x002\x001'))))
	define)
	follow)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<spanset- 0:3>,			<spanset- 0:3>,
	<baseset [1]>>			<baseset [1]>>
	1			1

	'A & B' can be rewritten as 'B & A' by weight, but that's fine as long as			'A & B' can be rewritten as 'flipand(B, A)' by weight.
	the ordering rule is determined before the rewrite; in this example,
	'B' follows the order of the initial set, which is the same order as 'A'
	since 'A' also follows the order:

	$ try --optimize 'contains("glob:*") & (2 + 0 + 1)'			$ try --optimize 'contains("glob:*") & (2 + 0 + 1)'
	(and			(and
	(func			(func
	('symbol', 'contains')			('symbol', 'contains')
	('string', 'glob:*'))			('string', 'glob:*'))
	(group			(group
	(or			(or
	(list			(list
	('symbol', '2')			('symbol', '2')
	('symbol', '0')			('symbol', '0')
	('symbol', '1')))))			('symbol', '1')))))
	* optimized:			* optimized:
	(and			(flipand
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '2\x000\x001')			('string', '2\x000\x001'))
	follow)
	(func			(func
	('symbol', 'contains')			('symbol', 'contains')
	('string', 'glob:*')			('string', 'glob:*')))
	define)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<baseset+ [0, 1, 2]>,			<baseset+ [0, 1, 2]>,
	<contains 'glob:*'>>			<contains 'glob:*'>>
	0			0
	1			1
	2			2

	('string', 'glob:*')))			('string', 'glob:*')))
	(group			(group
	(or			(or
	(list			(list
	('symbol', '0')			('symbol', '0')
	('symbol', '2')			('symbol', '2')
	('symbol', '1')))))			('symbol', '1')))))
	* optimized:			* optimized:
	(and			(flipand
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x002\x001')			('string', '0\x002\x001'))
	follow)
	(func			(func
	('symbol', 'reverse')			('symbol', 'reverse')
	(func			(func
	('symbol', 'contains')			('symbol', 'contains')
	('string', 'glob:*')			('string', 'glob:*'))))
	define)
	define)
	define)
	* set:			* set:
	<filteredset			<filteredset
	<baseset- [0, 1, 2]>,			<baseset- [0, 1, 2]>,
	<contains 'glob:*'>>			<contains 'glob:*'>>
	2			2
	1			1
	0			0

	test sort revset			test sort revset
				quarkAuthorUnsubmitted Done The new code is less efficient here. I guess we it might be solvable by having a `_reverseand` operator that `_optimize` may use. quark: The new code is less efficient here. I guess we it might be solvable by having a `_reverseand`…
	--------------------------------------------			--------------------------------------------

	test when adding two unordered revsets			test when adding two unordered revsets

	$ log 'sort(keyword(issue) or modifies(b))'			$ log 'sort(keyword(issue) or modifies(b))'
	4			4
	6			6

	(func			(func
	('symbol', 'reverse')			('symbol', 'reverse')
	(dagrange			(dagrange
	('symbol', '1')			('symbol', '1')
	('symbol', '5'))))))			('symbol', '5'))))))
	* set:			* set:
	<addset+			<addset+
	<generatorset+>,			<generatorset+>,
	<baseset- [1, 3, 5]>>			<baseset- [1, 3, 5]>>
				quarkAuthorUnsubmitted Not Done This is caused by `fullreposet` having a default order. If we remove that, it would be optimized to `<baseset [1, 3, 5]>` here. quark: This is caused by `fullreposet` having a default order. If we remove that, it would be…
				martinvonzUnsubmitted Not Done Does that mean you'll remove the other.sort() in fullreposet.and? martinvonz: Does that mean you'll remove the other.sort() in fullreposet.__and__?
				quarkAuthorUnsubmitted Not Done In another way. With the new code (`anyorder` gets aggressively used), `fullrepo & xs` would be optimized to `xs & fullrepo` and the latter does not have the sort. quark: In another way. With the new code (`anyorder` gets aggressively used), `fullrepo & xs` would be…
	0			0
	1			1
	2			2
	3			3
	4			4
	5			5

	test optimization of trivial `or` operation			test optimization of trivial `or` operation

	$ try --optimize '0\|(1)\|"2"\|-2\|tip\|null'			$ try --optimize '0\|(1)\|"2"\|-2\|tip\|null'
	(or			(or
	(list			(list
	('symbol', '0')			('symbol', '0')
	(group			(group
	('symbol', '1'))			('symbol', '1'))
	('string', '2')			('string', '2')
	(negate			(negate
	('symbol', '2'))			('symbol', '2'))
	('symbol', 'tip')			('symbol', 'tip')
	('symbol', 'null')))			('symbol', 'null')))
	* optimized:			* optimized:
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x001\x002\x00-2\x00tip\x00null')			('string', '0\x001\x002\x00-2\x00tip\x00null'))
	define)
	* set:			* set:
	<baseset [0, 1, 2, 8, 9, -1]>			<baseset [0, 1, 2, 8, 9, -1]>
	0			0
	1			1
	2			2
	8			8
	9			9
	-1			-1

	$ try --optimize '0\|1\|2:3'			$ try --optimize '0\|1\|2:3'
	(or			(or
	(list			(list
	('symbol', '0')			('symbol', '0')
	('symbol', '1')			('symbol', '1')
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '3'))))			('symbol', '3'))))
	* optimized:			* optimized:
	(or			(or
	(list			(list
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '0\x001')			('string', '0\x001'))
	define)
	(range			(range
	('symbol', '2')			('symbol', '2')
	('symbol', '3')			('symbol', '3'))))
	define))
	define)
	* set:			* set:
	<addset			<addset
	<baseset [0, 1]>,			<baseset [0, 1]>,
	<spanset+ 2:4>>			<spanset+ 2:4>>
	0			0
	1			1
	2			2
	3			3
	('symbol', '4'))			('symbol', '4'))
	('symbol', '5')			('symbol', '5')
	('symbol', '6')))			('symbol', '6')))
	* optimized:			* optimized:
	(or			(or
	(list			(list
	(range			(range
	('symbol', '0')			('symbol', '0')
	('symbol', '1')			('symbol', '1'))
	define)
	('symbol', '2')			('symbol', '2')
	(range			(range
	('symbol', '3')			('symbol', '3')
	('symbol', '4')			('symbol', '4'))
	define)
	(func			(func
	('symbol', '_list')			('symbol', '_list')
	('string', '5\x006')			('string', '5\x006'))))
	define))
	define)
	* set:			* set:
	<addset			<addset
	<addset			<addset
	<spanset+ 0:2>,			<spanset+ 0:2>,
	<baseset [2]>>,			<baseset [2]>>,
	<addset			<addset
	<spanset+ 3:5>,			<spanset+ 3:5>,
	<baseset [5, 6]>>>			<baseset [5, 6]>>>
	$ try --no-optimized -p analyzed '0\|1\|2\|3\|4'			$ try --no-optimized -p analyzed '0\|1\|2\|3\|4'
	* analyzed:			* analyzed:
	(or			(or
	(list			(list
	('symbol', '0')			('symbol', '0')
	('symbol', '1')			('symbol', '1')
	('symbol', '2')			('symbol', '2')
	('symbol', '3')			('symbol', '3')
	('symbol', '4'))			('symbol', '4')))
	define)
	* set:			* set:
	<addset			<addset
	<addset			<addset
	<baseset [0]>,			<baseset [0]>,
	<baseset [1]>>,			<baseset [1]>>,
	<addset			<addset
	<baseset [2]>,			<baseset [2]>,
	<addset			<addset
	(list			(list
	('symbol', '0')			('symbol', '0')
	(group			(group
	None)))			None)))
	* optimized:			* optimized:
	(or			(or
	(list			(list
	('symbol', '0')			('symbol', '0')
	None)			None))
	define)
	hg: parse error: missing argument			hg: parse error: missing argument
	[255]			[255]

	test that chained `or` operations never eat up stack (issue4624)			test that chained `or` operations never eat up stack (issue4624)
	(uses `0:1` instead of `0` to avoid future optimization of trivial revisions)			(uses `0:1` instead of `0` to avoid future optimization of trivial revisions)

	$ hg log -T '{rev}\n' -r `$PYTHON -c "print '+'.join(['0:1'] * 500)"`			$ hg log -T '{rev}\n' -r `$PYTHON -c "print '+'.join(['0:1'] * 500)"`
	0			0
	('symbol', '3'))			('symbol', '3'))
	(dagrangepre			(dagrangepre
	('symbol', '1')))			('symbol', '1')))
	* optimized:			* optimized:
	(func			(func
	('symbol', 'only')			('symbol', 'only')
	(list			(list
	('symbol', '3')			('symbol', '3')
	('symbol', '1'))			('symbol', '1')))
	define)
	* set:			* set:
	<baseset+ [3]>			<baseset+ [3]>
	3			3
	$ try --optimize 'ancestors(1) - ancestors(3)'			$ try --optimize 'ancestors(1) - ancestors(3)'
	(minus			(minus
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	('symbol', '1'))			('symbol', '1'))
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	('symbol', '3')))			('symbol', '3')))
	* optimized:			* optimized:
	(func			(func
	('symbol', 'only')			('symbol', 'only')
	(list			(list
	('symbol', '1')			('symbol', '1')
	('symbol', '3'))			('symbol', '3')))
	define)
	* set:			* set:
	<baseset+ []>			<baseset+ []>
	$ try --optimize 'not ::2 and ::6'			$ try --optimize 'not ::2 and ::6'
	(and			(and
	(not			(not
	(dagrangepre			(dagrangepre
	('symbol', '2')))			('symbol', '2')))
	(dagrangepre			(dagrangepre
	('symbol', '6')))			('symbol', '6')))
	* optimized:			* optimized:
	(func			(func
	('symbol', 'only')			('symbol', 'only')
	(list			(list
	('symbol', '6')			('symbol', '6')
	('symbol', '2'))			('symbol', '2')))
	define)
	* set:			* set:
	<baseset+ [3, 4, 5, 6]>			<baseset+ [3, 4, 5, 6]>
	3			3
	4			4
	5			5
	6			6
	$ try --optimize 'ancestors(6) and not ancestors(4)'			$ try --optimize 'ancestors(6) and not ancestors(4)'
	(and			(and
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	('symbol', '6'))			('symbol', '6'))
	(not			(not
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	('symbol', '4'))))			('symbol', '4'))))
	* optimized:			* optimized:
	(func			(func
	('symbol', 'only')			('symbol', 'only')
	(list			(list
	('symbol', '6')			('symbol', '6')
	('symbol', '4'))			('symbol', '4')))
	define)
	* set:			* set:
	<baseset+ [3, 5, 6]>			<baseset+ [3, 5, 6]>
	3			3
	5			5
	6			6

	no crash by empty group "()" while optimizing to "only()"			no crash by empty group "()" while optimizing to "only()"

	$ try --optimize '::1 and ()'			$ try --optimize '::1 and ()'
	(and			(and
	(dagrangepre			(dagrangepre
	('symbol', '1'))			('symbol', '1'))
	(group			(group
	None))			None))
	* optimized:			* optimized:
	(and			(flipand
	None			None
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	('symbol', '1')			('symbol', '1')))
	define)
	define)
	hg: parse error: missing argument			hg: parse error: missing argument
	[255]			[255]

	optimization to only() works only if ancestors() takes only one argument			optimization to only() works only if ancestors() takes only one argument

	$ hg debugrevspec -p optimized 'ancestors(6) - ancestors(4, 1)'			$ hg debugrevspec -p optimized 'ancestors(6) - ancestors(4, 1)'
	* optimized:			* optimized:
	(difference			(difference
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	('symbol', '6')			('symbol', '6'))
	define)
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	(list			(list
	('symbol', '4')			('symbol', '4')
	('symbol', '1'))			('symbol', '1'))))
	any)
	define)
	0			0
	1			1
	3			3
	5			5
	6			6
	$ hg debugrevspec -p optimized 'ancestors(6, 1) - ancestors(4)'			$ hg debugrevspec -p optimized 'ancestors(6, 1) - ancestors(4)'
	* optimized:			* optimized:
	(difference			(difference
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	(list			(list
	('symbol', '6')			('symbol', '6')
	('symbol', '1'))			('symbol', '1')))
	define)
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	('symbol', '4')			('symbol', '4')))
	any)
	define)
	5			5
	6			6

	optimization disabled if keyword arguments passed (because we're too lazy			optimization disabled if keyword arguments passed (because we're too lazy
	to support it)			to support it)

	$ hg debugrevspec -p optimized 'ancestors(set=6) - ancestors(set=4)'			$ hg debugrevspec -p optimized 'ancestors(set=6) - ancestors(set=4)'
	* optimized:			* optimized:
	(difference			(difference
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	(keyvalue			(keyvalue
	('symbol', 'set')			('symbol', 'set')
	('symbol', '6'))			('symbol', '6')))
	define)
	(func			(func
	('symbol', 'ancestors')			('symbol', 'ancestors')
	(keyvalue			(keyvalue
	('symbol', 'set')			('symbol', 'set')
	('symbol', '4'))			('symbol', '4'))))
	any)
	define)
	3			3
	5			5
	6			6

	invalid function call should not be optimized to only()			invalid function call should not be optimized to only()

	$ log '"ancestors"(6) and not ancestors(4)'			$ log '"ancestors"(6) and not ancestors(4)'
	hg: parse error: not a symbol			hg: parse error: not a symbol