Peter Dyballa
Peter_Dybal****@Web*****
Mon Sep 12 08:10:59 JST 2005
Am 10.09.2005 um 13:00 schrieb Peter Dyballa: > I cannot i-search for words with or for the characters ö or ß: is this > a known bug? I need to correct myself: in a dired buffer (latin9, Encoded-Kbd, Unify-8859-On-Encoding) with file names containing ö or ß I can i-search and find these glyphs, in *Messages* buffer (latin9, Encoded-Kbd, Unify-8859-On-Encoding) this does not work -- because these glyphs are output with message ()? View-lossage shows that I do i-search for \232, OK! But I also get a message like this when i-search does not start at once (reason unknown): encoded-kbd-self-insert-sjis: Invalid Shift JIS code: 9a0d. I then see in mini-buffer \236º ... ⁄ (Opt-i) is handled as being character: レ (06332, 3290, 0xcda, U+FF9A) charset: [katakana-jisx0201] (Katakana Part of JISX0201.1976.) code point: [90] syntax: w which means: word category: j:Japanese k:Japanese katakana |:While filling, we can break a line at this character. Properties: hiragana: 53868; katakana: 53996; jisx0208: 53996; buffer code: 0x89 0xDA file code: 0xEF 0xBE 0x9A (encoded by coding system mule-utf-8-unix) display: no font available Setting the coding systems to UTF-8 Carbon Emacs has a problem with some UTF-8 glyphs in file names: -------------- next part -------------- A non-text attachment was scrubbed... Name: pastedGraphic1.tiff Type: image/tiff Size: 23570 bytes Desc: not available Url : http://lists.sourceforge.jp/mailman/archives/macemacsjp-english/attachments/20050912/e28b3669/attachment.tiff -------------- next part -------------- C-u C-x = explains about the de-composed Ǔ (LATIN CAPITAL LETTER U WITH CARON, U+01D3): character: ̌ (01211314, 332492, 0x512cc, U+030C) charset: [mule-unicode-0100-24ff] (Unicode characters of the range U+0100..U+24FF.) code point: [37 76] syntax: w which means: word category: ^:Combining diacritic or mark u:Mule unicode characters buffer code: 0x9C 0xF4 0xA5 0xCC file code: 0xCC 0x8C (encoded by coding system mule-utf-8-unix) display: no font available /System/Library/Fonts/Courier.dfont has this glyph -- I think that's the font that is referred as "-apple-courier-medium-r-normal--14-140-75-75-m-140-mac-roman (0xF3)" when I examine Û (and it also contains the ready to use Ǔ). Checking with C-u C-x = that in UTF-8 encoding the one ß has a buffer code of 0x8E 0xDF, the other one 0xC3 0xDF and both time the same correct file code 0xC3 0x9F, ö is 0x8E 0xF6 vs. 0xC3 0xF6 resp. 0xC3 0xB6. This looks to be the typical GNU Emacs behaviour not being able to handle glyphs outside US-ASCII consistently. -- Greetings Pete <\ _\ O _ |o \ _\\_/-\=' _____________(_)|-(_) (_)___________________________________