View Issue Details

IDProjectCategoryView StatusLast Update
0011562phplist applicationInternationalization (l18n)public17-05-11 15:00
Reportercoren 
PrioritynormalSeveritymajorReproducibilityalways
Status resolvedResolutionfixed 
Product Version2.10.4 
Target Version4.0.xFixed in Version2.11.6 
Summary0011562: Random Character Encoding Bug in SHIFT-JIS Japanese emails body & Subject
DescriptionHi,

I have been using PHPLIST now for 3 years, from version 2.8 something, to send mail magazines in Japanese.

Japanese email clients have a poor support of UTF-8, sadly, so at the setup/test stage, I had to abandon this first obvious solution in favor of the SHIFT-JIS encoding.

This works well, except for a very problematic bug: some characters (only some, on an apparently random fashion) in the email body and subject are badly converted at some point when saving or sending the email. Instead of the real character,

This forces me to triple check all emails before I can actually send them, and to use alternative expressions when a given character bugs.

I never took the time to make tests to see if the same characters always bugged, on much it depended on context, etc... (I WANT TO USE THE SOFT, NOT DEBUG IT MYSELF !!!).

If someone could dig into this problem and solve it, it would be sooooo cool.

Thanks in advance to all,

Sebastien

PS: if you need something from my install, tell me which files and database tables in details and I'll provide them.
Additional InformationExamples from my last (yesterday) mailing:

アレクサンドル・ソクーロフ becomes アレクサンドル・ャNーロフ
( ソク => ャN )

来日を予定しています becomes 来日を嵐閧オています
( 予定 => 嵐閧オ )
TagsNo tags attached.

Relationships

related to 0002705 closed PHPList v2.11 release 
related to 0015551 resolvedmichiel Backslashes are stripped from HTML messages 
related to 0001644 resolvedmichiel Subject error with UTF-8 encode in Traditional Chinese 

Activities

h2b2

10-10-10 11:01

manager   ~0051126

One possible explanation for this apparent random behavior might be this: Some SHIFT-JIS and BIG5 encoded characters contain backslashes which may get stripped when saving the message, thus breaking the character's encoding. See http://mantis.phplist.com/view.php?id=15551#c51125