encoding - Python "string_escape" vs "unicode_escape"

Question

Welcome To Ask or Share your Answers For Others

encoding - Python "string_escape" vs "unicode_escape"

posted Oct 24, 2021 in Technique[技术] by 深蓝 (71.8m points)

encoding - Python "string_escape" vs "unicode_escape"

According to the docs, the builtin string encoding string_escape:

Produce[s] a string that is suitable as string literal in Python source code

...while the unicode_escape:

Produce[s] a string that is suitable as Unicode literal in Python source code

So, they should have roughly the same behaviour. BUT, they appear to treat single quotes differently:

>>> print """before '"  after""".encode('string-escape')
before '" x00 after
>>> print """before '"  after""".encode('unicode-escape')
before '" x00 after

The string_escape escapes the single quote while the Unicode one does not. Is it safe to assume that I can simply:

>>> escaped = my_string.encode('unicode-escape').replace("'", "\'")

...and get the expected behaviour?

Edit: Just to be super clear, the expected behavior is getting something suitable as a literal.

See Question&Answers more detail:os

与恶龙缠斗过久,自身亦成为恶龙；凝视深渊过久,深渊将回以凝视…

1 Reply

深蓝 · Answer 1 · 2021-10-23T18:28:27+0000

According to my interpretation of the implementation of unicode-escape and the unicode repr in the CPython 2.6.5 source, yes; the only difference between repr(unicode_string) and unicode_string.encode('unicode-escape') is the inclusion of wrapping quotes and escaping whichever quote was used.

They are both driven by the same function, unicodeescape_string. This function takes a parameter whose sole function is to toggle the addition of the wrapping quotes and escaping of that quote.

Categories

encoding - Python "string_escape" vs "unicode_escape"

encoding - Python "string_escape" vs "unicode_escape"

Please log in or register to add a comment.

Please log in or register to reply this article.

1 Reply

Please log in or register to add a comment.

Just Browsing Browsing

Most popular tags