[macemacsjp-english 198] Re: Encoding problems (this time with the attachments!)

Back to archive index

Peter Dyballa Peter_Dybal****@Web*****
Tue Sep 13 20:36:35 JST 2005


Am 13.09.2005 um 12:20 schrieb Jose Figueroa-O'Farrill:

> Here are two problems that I'm experiencing:
>

Both of your problems will vanish when you notice that all Carbon 
Emacsen work with QuickDraw which was developed before Unicode came. 
And was a means against ISO standards. So they are optimized for Mac 
encodings. If you can live in this restriction, try these settings:

         (set-language-environment               'Brazilian Portuguese) 
;? there is no Portuguese
         (setq default-file-name-coding-system   'utf-8)
         (setq default-buffer-file-coding-system 'mac-roman-unix)
         (set-default-coding-systems             'mac-roman-unix)
         (set-keyboard-coding-system             'mac-roman)
         (prefer-coding-system                   'mac-roman-unix)
         (set-buffer-file-coding-system          'mac-roman-unix)
         (modify-coding-system-alist 'file "\\.html\\'" 'mac-roman-unix)
         (modify-coding-system-alist 'file "\\.tex\\'"  'mac-roman-unix)
         (modify-coding-system-alist 'file "\\.xml\\'"  'mac-roman-unix)

TeX supports mac-roman as an input encoding. In case any file has the 
wrong encoding (the mode-line would show this as the second most left 
character) you can make that conform by either adding to its header 
something like

	;;; -*- mode: LaTeX; coding: iso-8859-16; -*-

or putting at the end

	%
	% Local Variables:
	% mode: LaTeX
	% fill-column: 160
	% coding-system: iso-latin-9
	% End:
	%
	%%

There are a few functions invoked with C-x RET:

	C-x RET l		set-language-environment
	C-x RET c		universal-coding-system-argument
	C-x RET C-\	set-input-method
	C-x RET X		set-next-selection-coding-system
	C-x RET x		set-selection-coding-system
	C-x RET p		set-buffer-process-coding-system
	C-x RET k		set-keyboard-coding-system
	C-x RET t		set-terminal-coding-system
	C-x RET F		set-file-name-coding-system
	C-x RET r		revert-buffer-with-coding-system
	C-x RET f		set-buffer-file-coding-system

They can be helpful, particularly revert-buffer-with-coding-system in 
case you see strange glyphs.


José is in UTF-8: U+4A U+6F U+73 U+C3A9. In an 8 bit environment like 
ISO Latin-1 the last 'wide character' becomes represented as these two 
bytes à and ©. If still necessary try to experiment with these two 
settings:

	(set-clipboard-coding-system		'utf-8)		    ; import into Carbon Emacs
	(set-selection-coding-system		'mac-roman-unix)  ; export from Carbon 
Emacs


Isn't the € coming to you too? Then you would need ISO Latin-9/ISO 
8859-15.

--
Greetings

   Pete

These are my principles and if you don't like them... well, I have 
others. - Groucho Marx




More information about the macemacsjp-english mailing list
Back to archive index