I am retrieving Twitter data with a Python tool and dump these in JSON format to my disk. I noticed an unintended escaping of the entire data-string for a tweet being enclosed in double quotes. Furthermore, all double quotes of the actual JSON formatting are escaped with a backslash.
They look like this:
"{"created_at":"Fri Aug 08 11:04:40 +0000
2014","id":497699913925292032,
How do I avoid that? It should be:
{"created_at":"Fri Aug 08 11:04:40 +0000 2014" .....
My file-out code looks like this:
with io.open('data'+self.timestamp+'.txt', 'a', encoding='utf-8') as f:
f.write(unicode(json.dumps(data, ensure_ascii=False)))
f.write(unicode('
'))
The unintended escaping causes problems when reading in the JSON file in a later processing step.
See Question&Answers more detail:
os 与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…