Earlier there was a question How can I convert JSON to CSV? and there were lots of answers, however none of them explains how to convert non-latin1 data.
Let's say I have a JSON file like the following:
[
{"id":123,"FullName":"Иванов Иван Иванович"},
{"id":124,"FullName":"Петров Петр Петрович"}
]
And I try to use a script like that:
#!/usr/bin/env python2.7
# -*- coding: utf-8 -*-
import sys
import codecs
import json
import unicodecsv as csv
if __name__ == '__main__':
fin = codecs.open(sys.argv[1], encoding='utf-8')
data = json.load(fin)
fin.close()
with codecs.open('test.csv', encoding='utf-8', mode='wb') as csv_file:
w = csv.writer(csv_file, encoding='utf-8')
w.writerow(data[0].keys()) # header row
for row in data:
w.writerow(row.values())
Which gives me the following error:
UnicodeDecodeError: 'ascii' codec can't decode byte 0xd0 in position 32: ordinal not in range(128)
First of all it is not clear what is there at position 32, but the most interesting question is if there is a way to save UTF-8 encoded strings to CSV file.
from How to convert JSON to CSV? (with UTF-8 support)
No comments:
Post a Comment