Java Solaris Communities Sun Store Join SDN My Profile Why Join?
 
Bug Database
Bug Detail
Quick Lists
Top 25 Bugs
Top 25 RFE's
Recently Closed Bugs
Printable Page Printable Page


Bug Database
Bug ID: 4499288
Votes 0
Synopsis (cs spec) Charset terminology problems
Category java:classes_nio
Reported Against merlin-beta2
Release Fixed 7(b25)
State 10-Fix Delivered, bug
Priority: 3-Medium
Related Bugs
Submit Date 04-SEP-2001
Description
In the java.nio.charset.Charset description, there are some inaccurate
definitions.

  - "a charset is defined as the combination of a coded character set and
    a character-encoding scheme" assumes one coded character set. But it
    should be one or multiple coded character sets.

  - "full Unicode, which is the same as ISO 10646-1" is not accurate. They 
    are not the same thing as Unicode.org says. If it means only in the
    context of a coded character set, the description should clarify the
    revisions.

  - In "A character-encoding scheme is a mapping between a coded character
    set and a set of octet (eight-bit byte) sequences.", "a coded character
    set" should be "a coded character set or several coded character
    sets" as defined in RFC 2278. The examples clarify it, though.

  - "Some schemes, however, are associated with multiple character sets;
    EUC, for example, can be used to encode characters in a variety of
    Asian character sets." should be "Some schemes, however, are
    associated with multiple coded character sets; EUC, for example, can
    be used to encode characters in a variety of Asian coded character
    sets."
Work Around
N/A
Evaluation
Yes.  We can try to be a little more precise here,
although charset terminology will always be confusing.
Posted Date : 2007-11-07 01:13:49.0
Comments
  
  Include a link with my name & email   


PLEASE NOTE: JDK6 is formerly known as Project Mustang