Defect #6370

UTF-8 BOM only partially hidden

Added by Barnabás Králik over 2 years ago. Updated about 2 years ago.

Status: New
Start date: 2010-09-12
Priority: Normal
Due date:
Assignee: -
% Done: 0%
Category: -
Target version: -
Affected version: devel
Resolution:

Description

We are using revision r4083.

When displaying a text file (one whose extension is not known to CodeRay) that is encoded as UTF-8 with the standard 3-byte BOM, only the first 2 bytes are cut off; the third byte, 0xBF, is still rendered as a stray character.

The standard Unix xxd tool outputs the following for the file visible in the attached screenshot:

$:~/aarghr$ head -n3 Parser.fs | xxd
0000000: efbb bf6f 7065 6e20 5379 7374 656d 0d0a  ...open System..
0000010: 6f70 656e 2053 7973 7465 6d2e 586d 6c0d  open System.Xml.
0000020: 0a0d 0a 
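
For illustration only, here is a minimal Python sketch of the behaviour I would expect, using the first 16 bytes from the dump above. This is not Redmine's or CodeRay's actual code; the helper name strip_utf8_bom is hypothetical, and it assumes the file content is available as raw bytes before rendering.

```python
import codecs


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM (EF BB BF), if one is present."""
    # Hypothetical helper: codecs.BOM_UTF8 is the 3-byte sequence b"\xef\xbb\xbf".
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


# First line of the hex dump above: BOM followed by "open System\r\n".
sample = bytes.fromhex("efbbbf6f70656e2053797374656d0d0a")

# Removing all three BOM bytes leaves only the text.
clean = strip_utf8_bom(sample)

# Cutting only two bytes, as the viewer appears to do, leaves 0xBF in front.
partial = sample[2:]

print(clean)
print(partial)
```

The second result still starts with 0xBF, which matches the extra character seen in the screenshot.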

Névtelen.png - Screenshot showing the problem (17.8 KB) Barnabás Králik, 2010-09-12 15:31


Related issues

Related to Feature #2209: MIME type support for repository Entries (New, 2008-11-17)

History

#1 Updated by Toshi MARUYAMA about 2 years ago

  • Category deleted (Core Plugins)
