[texhax] Using accented characters in source

Michael Barr barr at math.mcgill.ca
Sat May 1 20:46:34 CEST 2010

On Sat, 1 May 2010, Lars Madsen wrote:

> Michael Barr wrote:
>>  Now I have a simple way of actually entering an accented character in my
>>  source code (using AllChars) I began to wonder if it is easy to get TeX to
>>  interpret them correctly.  This is obviously a Windows-specific question.
>>  If I compile the file
>>  \documentclass{article}
>>  \usepackage[french]{babel}
>>  \begin{document}
>>  Université
>>  \end{document}
>>  with or without the second line, the output is simply Universit.  This
>>  doesn't surprise me because the Windows code page does not match any of
>>  the font encodings, as far as I know.  Still I expected that the é would
>>  generate some output.  It is hex 82 and, from the table on page 261 of The
>>  LaTeX Companion, first edition, I would have expected C with an acute
>>  accent.
>>  It is not really important; only for a few foreign words in an English
>>  language text, but I am still curious.  I can always say Univerist\'e (or
>>  I could make the è active and define it to be that, but then it wouldn't
>>  hyphenate.
>>  Michael Barr
> you need inputenc, LaTeX use ansii by default, så the é is ignored. Next you 
> need ti figure out which input encoding your file is written in. This depends 
> on the editor.
> In try one of the following
> \usepackage[ansinew]{inputenc} % std win encoding
> \usepackage[latin1]{inputenc} % latin 1
> \usepackage[applemac]{inputenc} % this is the default in TeXShop on MAC
> \usepackage[utf8]{inputenc} % default in many linux dists, and in
> % TeXWorks on all platforms

None of those encodings worked.  The first gave what looked like a comma 
(but wasn't quite) and the other three caused a character undefined error. 
But the log file told me that was being loaded was xxxx.def and the 
directory that it was located in.  So I looked at those files and the 
names cpxxx.def looked interesting.  In particular cp437.def seemed to 
give the correct definitions.  Later I poked around in the windows help 
file and discovered that code page 437 is the standard US encoding.  I 
didn't test anything besides my original file, but with that encoding, it 
seems to work.

TeXhax works again!

Michael Barr

More information about the texhax mailing list