Mantis - Resin
|
|||||
Viewing Issue Advanced Details | |||||
|
|||||
ID: | Category: | Severity: | Reproducibility: | Date Submitted: | Last Update: |
507 | minor | always | 12-17-05 19:42 | 01-27-06 16:48 | |
|
|||||
Reporter: | anonymous | Platform: | |||
Assigned To: | ferg | OS: | |||
Priority: | normal | OS Version: | |||
Status: | closed | Product Version: | 3.0.16 | ||
Product Build: | Resolution: | fixed | |||
Projection: | none | ||||
ETA: | none | Fixed in Version: | 3.0.18 | ||
|
|||||
Summary: | 0000507: UTF-8 handling of codepoints above the Basic Multilingual Plane broken | ||||
Description: |
Codepoints about 0xFFFF are not being handled correctly. For example, 0x10001 in UTF-8 is F0 90 80 81 Resin is writing ED A0 80 ED B0 81 Here is a sample JSP. If you have a font that supports it, the character looks like an "A" with an extra horizontal line. <%@ page contentType="text/html;charset=utf-8" %><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> [^] <html xmlns="http://www.w3.org/1999/xhtml" [^] xml:lang="en" lang="en"> <head> <title>Test Page</title> </head> <body> Should look like 𐀁 Raw bytes should be: <% Does look like <%=new String(Character.toChars(0x10001))%> Does look like <%=Character.toChars(0x10001)%> Does look like <%="\uD800\uDC01"%> </body> </html> |
||||
Steps To Reproduce: | |||||
Additional Information: | |||||
Relationships | |||||
Attached Files: |
Notes | |||||
|
|||||
|
|
||||
|
|||||
|
|