Thread Unicode (25 answers)
Opened by ptk at 2007-08-12 20:38

kristian
 2007-08-13 03:40
#98017 #98017
User since
2005-04-14
684 Artikel
BenutzerIn
[Homepage] [default_avatar]
Hallo

ptk+2007-08-12 20:20:18--
Und es wird ein "falsches" Euro-Zeichen (0x80), da in iso-8859-1 gar kein Euro-Zeichen definiert ist.


Vergiss ISO-8859-1, es benutzen eh alle ne Dose...

MS macht da teils komische Dinge...
1252 != iso-8859-1
Die eigentlich reservierten Zeichen 128 - 159 sehen (Auszug einer Routine die 1252 nach UTF-8 umsetzt) so aus:
Code (perl): (dl )
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
    $self->{'utf_8_sign'}->{chr('128')} = '€';
    $self->{'utf_8_sign'}->{chr('130')} = '‚';
    $self->{'utf_8_sign'}->{chr('131')} = 'ƒ';
    $self->{'utf_8_sign'}->{chr('132')} = '„';
    $self->{'utf_8_sign'}->{chr('133')} = '…';
    $self->{'utf_8_sign'}->{chr('134')} = '†';
    $self->{'utf_8_sign'}->{chr('135')} = '‡';
    $self->{'utf_8_sign'}->{chr('136')} = 'ˆ';
    $self->{'utf_8_sign'}->{chr('137')} = '‰';
    $self->{'utf_8_sign'}->{chr('138')} = 'Š';
    $self->{'utf_8_sign'}->{chr('139')} = '‹';
    $self->{'utf_8_sign'}->{chr('140')} = 'Œ';
    $self->{'utf_8_sign'}->{chr('145')} = '‘';
    $self->{'utf_8_sign'}->{chr('146')} = '’';
    $self->{'utf_8_sign'}->{chr('147')} = '“';
    $self->{'utf_8_sign'}->{chr('148')} = '”';
    $self->{'utf_8_sign'}->{chr('149')} = '•';
    $self->{'utf_8_sign'}->{chr('150')} = '–';
    $self->{'utf_8_sign'}->{chr('151')} = '—';
    $self->{'utf_8_sign'}->{chr('152')} = '˜';
    $self->{'utf_8_sign'}->{chr('153')} = '™';
    $self->{'utf_8_sign'}->{chr('154')} = 'š';
    $self->{'utf_8_sign'}->{chr('155')} = '›';
    $self->{'utf_8_sign'}->{chr('156')} = 'œ';
    $self->{'utf_8_sign'}->{chr('159')} = 'Ÿ';


Ein Euro sieht so aus:
Code (perl): (dl )
$self->{'utf_8_win_euro'} = join("", chr('226'), chr('130'), chr('172'));


Wenn du also von ISO äh 1252 auf UTF-8 willst nimmst du die Tabelle...
wobei der "alte" Euro chr(128) ist.

Gruss
Kristian

View full thread Unicode