Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others


python - UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa0 in position 187: invalid start byte

I have a .csv file in a blob container on Azure, and I am trying to read it into a DataFrame with the line of code below. It fails with the error shown in the title. Code:

parts = pd.read_csv(StringIO(downloaded_blob.content_as_text()), delimiter=',', encoding= 'unicode_escape')

Can someone shed some light on this issue?



1 Reply


I tried various encodings, but none of them worked. When I opened the file in VS Code, I saw some special characters that were not visible elsewhere. I removed them and saved the file with UTF-8 encoding, and the read now works. Issue closed.
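
For anyone who cannot edit the file by hand, here is a minimal sketch of a programmatic alternative. It rests on assumptions that are not confirmed in this thread: that the file was saved in a single-byte Windows encoding such as cp1252, where byte 0xa0 is a non-breaking space, and that the raw bytes of the blob are available (for example from content_as_bytes() rather than content_as_text()). The helper name decode_csv_bytes and the sample data are made up for illustration.

```python
import csv
import io


def decode_csv_bytes(raw, encodings=("utf-8", "cp1252")):
    """Try each encoding in order and return (text, encoding_used).

    Hypothetical helper, not part of pandas or the Azure SDK.
    """
    for enc in encodings:
        try:
            return raw.decode(enc), enc
        except UnicodeDecodeError:
            continue
    # latin-1 maps every byte to a character, so this last resort cannot fail.
    return raw.decode("latin-1"), "latin-1"


# Made-up sample: 0xa0 is not a valid UTF-8 start byte,
# but it is a non-breaking space in cp1252.
raw = b"part_id,name\n1,Bolt\xa0M8\n"

text, used = decode_csv_bytes(raw)

# Optional cleanup: turn non-breaking spaces into ordinary spaces.
clean = text.replace("\u00a0", " ")

rows = list(csv.reader(io.StringIO(clean)))
```

The decoded string can then be passed to pd.read_csv(StringIO(clean), delimiter=','). Note that the encoding argument of read_csv has no effect on a StringIO, because the text has already been decoded by that point; the decoding step is where the error is raised.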



...