Patch #25861

CSV Importer - handle UndefinedConversionErrors

Added by Jens Krämer 8 days ago. Updated 8 days ago.

Status: New
Start date:
Priority: Normal
Due date:
Assignee: -
% Done: 0%

Category: Importers
Target version: 3.2.7

Description

The CSV import already handles a couple of exceptions that can occur when the user-selected encoding does not match the actual encoding of the CSV file (or when the file contains otherwise illegal byte sequences). We recently encountered Encoding::UndefinedConversionError, which that error handling does not yet catch. The attached patches add a test case and a fix for that.
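For illustration only, here is a minimal sketch of the kind of rescue described above. It is not the code from the attached patch: the helper name `to_utf8_or_nil` and its nil-on-failure behaviour are assumptions made up for this example, and only the two exception classes come from Ruby itself.

```ruby
# Hypothetical helper (not from the attached patch): interpret raw bytes
# in the encoding the user selected and transcode them to UTF-8.
# Returns nil when the bytes do not fit the selected encoding.
def to_utf8_or_nil(raw_bytes, declared_encoding)
  raw_bytes.dup.force_encoding(declared_encoding).encode("UTF-8")
rescue Encoding::InvalidByteSequenceError, Encoding::UndefinedConversionError
  # InvalidByteSequenceError: the bytes are malformed for the encoding.
  # UndefinedConversionError: the bytes are well formed, but the character
  # they denote has no mapping in the conversion table.
  nil
end

# The CP932 bytes for the circled digit one, as a raw binary string.
circled_one = [0x87, 0x40].pack("C*")

plain_result     = to_utf8_or_nil("a,b", "Shift_JIS")
cp932_result     = to_utf8_or_nil(circled_one, "Windows-31J")
shift_jis_result = to_utf8_or_nil(circled_one, "Shift_JIS")
```

Rescuing only `Encoding::InvalidByteSequenceError` would let the last call raise, which is the gap this ticket describes.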

0001-adds-failing-test-case-for-import-with-wrong-encodin.patch - test case (1.69 KB) Jens Krämer, 2017-05-16 11:06

0002-adds-rescue-from-Encoding-UndefinedConversionError.patch - fix (1.05 KB) Jens Krämer, 2017-05-16 11:06

History

#1 Updated by Go MAEDA 8 days ago

  • Target version set to 3.2.7

We can reproduce the problem with the test in 0001-adds-failing-test-case-for-import-with-wrong-encodin.patch. Setting target version to 3.2.7.

By the way, the encoding of the file invalid-Shift_JIS.csv is valid CP932. Shift_JIS and CP932 are almost identical, but CP932 includes more characters.

$ iconv -f cp932 -t utf8 test/fixtures/files/invalid-Shift_JIS.csv
①
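The same difference can be checked from Ruby. The sketch below assumes the fixture contains the two bytes 0x87 0x40, which is the CP932 code for ①. The fixture itself is not read here, and the variable names are illustrative.

```ruby
# Assumed content of the fixture: the CP932 bytes for the circled digit one.
bytes = [0x87, 0x40].pack("C*")

# Windows-31J is Ruby's name for CP932; the conversion succeeds.
as_cp932 = bytes.dup.force_encoding("Windows-31J").encode("UTF-8")

# Under Shift_JIS the same bytes are a well-formed double-byte sequence,
# but the conversion table has no character for them.
shift_jis_error =
  begin
    bytes.dup.force_encoding("Shift_JIS").encode("UTF-8")
    nil
  rescue Encoding::UndefinedConversionError => e
    e.class
  end
```

This would explain why the fixture triggers Encoding::UndefinedConversionError rather than an invalid byte sequence error when Shift_JIS is selected.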
