[texworks] inputenc error

David Carlisle d.p.carlisle at gmail.com
Sat Jun 8 10:45:23 CEST 2019


the code you post there is all ASCII so totally unaffected by any
inputenc arguments or the 2018 change to default to UTF-8.

David

On Sat, 8 Jun 2019 at 09:15, Taylor, P <P.Taylor at rhul.ac.uk> wrote:
>
> Thank you, Reinhard.  And of course we will convert to UTF-8 for the
> next phase of the project, but we also need to be able to process the
> legacy version (so that we can compare the outputs of the two, for
> example).  And there are no "exotic" characters in the  input at all —
> it looks, in part, like the following :
>
> \Byzantine 2272 0 inc-auto {{}} {a>ut`o} {1600} {P. Stephanus,
> Alphabetum Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
> \Byzantine 2274 0 inc-autou1 {{}} {a>uto~u} {1600} {P. Stephanus,
> Alphabetum Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
> \Byzantine 2275 0 inc-autwi {{}} {a>ut~w|} {1600} {P. Stephanus,
> Alphabetum Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
>
> \Byzantine 2831 0 inc-u {{}} {~u} {1600} {P. Stephanus, Alphabetum
> Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
> \Byzantine 2833 0 inc-ui {{}} {ui} {1600} {P. Stephanus, Alphabetum
> Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
> \Byzantine 2834 0 inc-un1 {{}} {un} {1600} {P. Stephanus, Alphabetum
> Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
> \Byzantine 2836 0 inc-un3 {{}} {un} {1600} {P. Stephanus, Alphabetum
> Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
> \Byzantine 2835 0 inc-un2 {{}} {un} {1600} {P. Stephanus, Alphabetum
> Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
> \Byzantine 2837 0 inc-un4 {{}} {>~un} {16\th c.} {W.H. Ingram, ‘The
> Ligatures of Early Printed Greek', GRBS 7 (1966), 382-89} {}
> \Byzantine 2838 0 inc-up {{}} {<up} {1600} {P. Stephanus, Alphabetum
> Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
> \Byzantine 2841 0 inc-upo {{}} {<upo} {1600} {P. Stephanus, Alphabetum
> Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
>
> \Byzantine 2849 0 inc-ws1 {{}} {~ws} {16\th c.} {W.H. Ingram, ‘The
> Ligatures of Early Printed Greek', GRBS 7 (1966), 382-89} {}
> \Byzantine 2246 0 inc-aitiatikh {{a>itia\raise {th}}} {a>itiatik`h}
> {1600} {P. Stephanus, Alphabetum Graecum \& Hebraicum (Paris, 1600), BL
> 12904.b.11} {}
> \Byzantine 2256 0 inc-aoristos {{>a'o\raise {os}}} {>a'oristos} {1600}
> {P. Stephanus, Alphabetum Graecum \& Hebraicum (Paris, 1600), BL
> 12904.b.11 } {}
> \Byzantine 2257 0 inc-aoristou {{>ao\raise {ou}}} {>aor'istou} {1600}
> {P. Stephanus, Alphabetum Graecum \& Hebraicum (Paris, 1600), BL
> 12904.b.11 } {}
> \Byzantine 2254 0 inc-antitou {{>a\raise {t}n\raise {t}}} {>ant`i to~u}
> {1600} {P. Stephanus, Alphabetum Graecum \& Hebraicum (Paris, 1600), BL
> 12904.b.11 } {}
> \Byzantine 2273 0 inc-autou {{>auo~u\raise {t}}} {a>uto~u} {1600} {P.
> Stephanus, Alphabetum Graecum \& Hebraicum (Paris, 1600), BL 12904.b.11 } {}
>
> ** Phil
> --------
> Reinhard Kotucha wrote:
> > On 2019-06-07 at 15:56:36 +0000, Taylor, P wrote:
> >
> >   > [...] Are you in a position to advise which variant of \usepackage
> >   > [latin1] {inputenc} / \usepackage [latin9] {inputenc} / \usepackage
> >   > [latin$n$] {inputenc} might best address this issue ?
> >
> > The short answer is "none of them" in favor of
> >
> >    \usepackage[utf8]{inputenc}
> >
> > If you are really forced to use old-fashioned 8-bit input encodings,
> > in most cases you would prefer latin9 (ISO 8895-15) because this makes
> > more frequently used characters accessible directly from Western
> > European keyboards.  For other keyboards one needs other input
> > encodings, for instance latin5 for Russian, viscii for Vietnamese,...
> >
> > For ISO 8895 specifications consult
> >
> >    https://en.wikipedia.org/wiki/ISO/IEC_8859
> >
> > LaTeX converts all non-ASCII characters to LICR (LaTeX Internal
> > Character Representation) so that all non-ASCII characters are
> > assigned to control sequences.  Thus the choice of a paricular input
> > encoding is just a matter of convenience.  If you want to allow users
> > to use different input encodings, all strings in macro packages should
> > be in LICR.
> >
> > The best choice today is definitely UTF-8 and I strongly recommend to
> > use it and to avoid national character sets due to their limitations.
> >
> > Phil, I'm aware that I don't have to tell you what Unicode is good
> > for but you asked on a public mailing list.  Thus the more verbose
> > answer.
> >
> > Regards,
> >    Reinhard
> >
> > --
> > ------------------------------------------------------------------
> > Reinhard Kotucha                            Phone: +49-511-3373112
> > Marschnerstr. 25
> > D-30167 Hannover                    mailto:reinhard.kotucha at web.de
> > ------------------------------------------------------------------
>
>



More information about the texhax mailing list