When reading (Chinese) .msg files, HTML converted from RTF is completely garbled (encoding issue) #161

@bbottema

Description

The problem is that the code page declared in the RTF is ignored and the hex bytes for text are converted one at a time. However, code page 936 (a Chinese charset) is a double-byte character set (DBCS) in which most characters take two bytes, so decoding byte by byte produces garbage. More generally, any code page defined in the RTF header should be honored when parsing user text.
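
To illustrate the idea, here is a minimal Python sketch, not the library's actual code. It assumes the code page is declared with the `\ansicpg` control word and that text bytes appear as `\'xx` hex escapes. The helper names `detect_codec` and `decode_hex_escapes` are hypothetical, and mapping the number to a Python codec named `cp<number>` is an assumption that does not hold for every code page.

```python
import re

# A run of consecutive RTF hex escapes, e.g. \'c4\'e3
HEX_RUN = re.compile(r"(?:\\'[0-9a-fA-F]{2})+")
# The code page declaration in the RTF header, e.g. \ansicpg936
ANSICPG = re.compile(r"\\ansicpg(\d+)")


def detect_codec(rtf: str, default: str = "cp1252") -> str:
    """Return a codec name for the declared code page (hypothetical helper).

    Assumption: Python ships a codec called cp<number> for this code page.
    """
    match = ANSICPG.search(rtf)
    return f"cp{match.group(1)}" if match else default


def decode_hex_escapes(rtf: str, codec: str) -> str:
    """Decode each run of hex escapes as one byte string, not byte by byte."""
    def replace_run(match: re.Match) -> str:
        raw = bytes.fromhex(match.group(0).replace("\\'", ""))
        return raw.decode(codec, errors="replace")

    return HEX_RUN.sub(replace_run, rtf)


sample = r"{\rtf1\ansi\ansicpg936 \'c4\'e3\'ba\'c3}"

# Honoring the declared code page yields the two intended characters.
decoded = decode_hex_escapes(sample, detect_codec(sample))

# Ignoring it and treating every byte as a single-byte character garbles them.
garbled = decode_hex_escapes(sample, "cp1252")
```

The important design point is that consecutive escapes are collected into one buffer before decoding, so a two-byte character is never split across decode calls.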
