Different behaviour of regexp in 3.6.0b2
Serhiy Storchaka
storchaka at gmail.com
Sat Oct 15 03:39:46 EDT 2016
More information about the Python-list mailing list
Sat Oct 15 03:39:46 EDT 2016
- Previous message (by thread): Different behaviour of regexp in 3.6.0b2
- Next message (by thread): Different behaviour of regexp in 3.6.0b2
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
On 14.10.16 18:40, Lele Gaifax wrote: > Hi all, > > trying out pgcli with Python 3.6.0b2 I got an error related to what seem a > different behaviour, or even a bug, of re.sub(). > > The original intent is to replace spaces within a string with the regular expression > \s+ (see https://github.com/dbcli/pgcli/blob/master/pgcli/packages/prioritization.py#L11, > ignore the fact that the re.sub() call seem underoptimal). > > With Python 3.5.2 is straightforward: > > $ python3.5 > Python 3.5.2+ (default, Sep 22 2016, 12:18:14) > [GCC 6.2.0 20160927] on linux > Type "help", "copyright", "credits" or "license" for more information. > >>> import re > >>> re.sub(r'\s+', r'\s+', 'foo bar') > 'foo\\s+bar' > > While Python 3.6.0b2 gives: > > $ python3.6 > Python 3.6.0b2+ (default, Oct 11 2016, 08:30:05) > [GCC 6.2.0 20160927] on linux > Type "help", "copyright", "credits" or "license" for more information. > >>> import re > >>> re.sub(r'\s+', r'\s+', 'foo bar') > Traceback (most recent call last): > File "/usr/local/python3.6/lib/python3.6/sre_parse.py", line 945, in parse_template > this = chr(ESCAPES[this][1]) > KeyError: '\\s' > > During handling of the above exception, another exception occurred: > > Traceback (most recent call last): > File "<stdin>", line 1, in <module> > File "/usr/local/python3.6/lib/python3.6/re.py", line 191, in sub > return _compile(pattern, flags).sub(repl, string, count) > File "/usr/local/python3.6/lib/python3.6/re.py", line 326, in _subx > template = _compile_repl(template, pattern) > File "/usr/local/python3.6/lib/python3.6/re.py", line 317, in _compile_repl > return sre_parse.parse_template(repl, pattern) > File "/usr/local/python3.6/lib/python3.6/sre_parse.py", line 948, in parse_template > raise s.error('bad escape %s' % this, len(this)) > sre_constants.error: bad escape \s at position 0 > > Accordingly to the documentation (https://docs.python.org/3.6/library/re.html#re.sub) > “unknown escapes [in the repl argument] such as \& are left alone”. > > Am I missing something, or is this a regression? Unknown escapes consisting of "\" following by ASCII letter are errors in 3.6 (and warnings in 3.5). Seems the documentation is not accurate. Could you file a report on https://bugs.python.org/ ?
- Previous message (by thread): Different behaviour of regexp in 3.6.0b2
- Next message (by thread): Different behaviour of regexp in 3.6.0b2
- Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]
More information about the Python-list mailing list