using icu::LocalUConverterPointer; using icu::UnicodeSet;
staticconstchar *kConverter = "ibm-1047";
int main(int argc, constchar *argv[]) {
printf("// %s\n", U_COPYRIGHT_STRING);
printf("// generated by tblgen. You weren't going to edit it by hand, were you?\n");
printf("\n");
UErrorCode status = U_ZERO_ERROR;
LocalUConverterPointer cnv(ucnv_open(kConverter, &status));
if(U_FAILURE(status)) {
fprintf(stderr, "Failed to open %s: %s\n", kConverter, u_errorName(status)); return 1;
}
1 The basic source character set consists of 96 characters: the space character, the control characters repre- 15) senting horizontal tab, vertical tab, form feed, and new-line, plus the following 91 graphical characters: a b c d e f g h i j k l m n opqrstuvwxyz A B C D E F G H I J K L M N OPQRSTUVWXYZ 0 12 3 4 5 6 7 8 9 _ { } [ ] # ( ) < > % : ; . ?*+-/^&|~!=,\" 2 The universal-character-name construct provides a way to name other characters. hex-quad: hexadecimal-digit hexadecimal-digit hexadecimal-digit hexadecimal-digit universal-character-name: \u hex-quad \U hex-quad hex-quad The character designated by the universal-character-name \UNNNNNNNN is that character whose character short name in ISO/IEC 10646 is NNNNNNNN; the character designated by the universal-character-name \uNNNN is that character whose character short name in ISO/IEC 10646 is 0000NNNN. If the hexadecimal value for a universal character name is less than 0x20 or in the range 0x7F-0x9F (inclusive), or if the uni- versal character name designates a character in the basic source character set, then the program is ill- formed.
So basically: printable ASCII plus 0x00-0x1F, 0x7F-0x9F, was all illegal.
Die Informationen auf dieser Webseite wurden
nach bestem Wissen sorgfältig zusammengestellt. Es wird jedoch weder Vollständigkeit, noch Richtigkeit,
noch Qualität der bereit gestellten Informationen zugesichert.
Bemerkung:
Die farbliche Syntaxdarstellung und die Messung sind noch experimentell.