Anonymous | Login | Signup for a new account | 12-17-2024 11:54 PST |
Main | My View | View Issues | Change Log | Docs |
Viewing Issue Simple Details [ Jump to Notes ] | [ View Advanced ] [ Issue History ] [ Print ] | ||||||||
ID | Category | Severity | Reproducibility | Date Submitted | Last Update | ||||
0000507 | [Resin] | minor | always | 12-17-05 19:42 | 01-27-06 16:48 | ||||
Reporter | anonymous | View Status | public | ||||||
Assigned To | ferg | ||||||||
Priority | normal | Resolution | fixed | ||||||
Status | closed | Product Version | 3.0.16 | ||||||
Summary | 0000507: UTF-8 handling of codepoints above the Basic Multilingual Plane broken | ||||||||
Description |
Codepoints about 0xFFFF are not being handled correctly. For example, 0x10001 in UTF-8 is F0 90 80 81 Resin is writing ED A0 80 ED B0 81 Here is a sample JSP. If you have a font that supports it, the character looks like an "A" with an extra horizontal line. <%@ page contentType="text/html;charset=utf-8" %><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> [^] <html xmlns="http://www.w3.org/1999/xhtml" [^] xml:lang="en" lang="en"> <head> <title>Test Page</title> </head> <body> Should look like 𐀁 Raw bytes should be: <% Does look like <%=new String(Character.toChars(0x10001))%> Does look like <%=Character.toChars(0x10001)%> Does look like <%="\uD800\uDC01"%> </body> </html> |
||||||||
Additional Information | |||||||||
Attached Files | |||||||||
|
Notes | |
(0000597) anonymous 12-29-05 09:32 |
This works correctly in Tomcat. |
(0000811) ferg 01-27-06 16:48 |
server/1280 |
Mantis 1.0.0rc3[^]
Copyright © 2000 - 2005 Mantis Group
30 total queries executed. 26 unique queries executed. |