This is an archive of the discontinued Mercurial Phabricator instance.

re2: feed unicode string to re2 module when necessary
ClosedPublic

Authored by marmoute on Mar 28 2021, 6:52 PM.

Details

Summary

My previous test were using the pyre2 Python project, that wrap the Google RE2 library
in python as a re2 module and accept bytes as input. However the fb-re2
Python project is also offering a wrapping of the Google RE2 library in python
as a re2 module ans accept only unicode on python3. So we detect this
situation and convert thing to unicode when necessary.

Hooray…

We should consider using a rust wrapping for regexp handling. We needs regexps
in Rust anyway and this give use more control with less variants and more
sanity.

Diff Detail

Repository
rHG Mercurial
Lint
Automatic diff as part of commit; lint not applicable.
Unit
Automatic diff as part of commit; unit tests not applicable.

Event Timeline

marmoute created this revision.Mar 28 2021, 6:52 PM
This revision was not accepted when it landed; it landed in state Needs Review.
This revision was automatically updated to reflect the committed changes.