The content of a file is like following, and the file encoding is utf-8:
cd232704-a46f-3d9d-97f6-67edb897d65f b'this Friday, Gerda Scheuers will be excited xe2x80x94 but shexe2x80x99s most excited about the merchandise the movie will bring.'
Here is my code:
with open(file, 'r') as f_in:
for line in f_in:
tokens = line.split('')
print(tokens[1])
I want to get the right answer - "this Friday, Gerda Scheuers will be excited - but she's most excited about the merchandise the movie will bring."
print(b'xe2x80x94'.decode('utf-8')) #convert into ASCII
But I can't read the bytes from a file. If I open a file with bytes, I need to decode the line to splite it.
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…