pdftotext removing "fi" from a recent pdf I made with latex,

Ulrike Fischer news3 at nililand.de
Tue Nov 26 17:31:58 CET 2019

On Sun, 24 Nov 2019 10:47:21 +0000, Mike Marchywka wrote:

>> but it is likely to be common with many PDFs now. If I just run
>> "pdftotext" on my output, I get weird boxes where each "fi"
>> is.

> Never mind, I figured it out :) I added this stupid thing:
> \usepackage[T1]{fontenc}

This is not stupid: T1 is a much better encoding than OT1. But it
also changes the font that is used.
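
A minimal test file for checking this could look like the sketch
below. It is not the original poster's document. Loading lmodern is an
assumption: it is one common way to get a scalable T1-encoded font
that stays close to the default Computer Modern look. The commented
glyphtounicode lines are a further optional step, and they assume the
file is compiled with pdflatex.

```latex
% Hypothetical minimal example, not the file from the original report.
\documentclass{article}

\usepackage[T1]{fontenc} % the encoding change discussed above
\usepackage{lmodern}     % assumption: Latin Modern as the T1 font

% Optional, pdflatex only: add ToUnicode mappings so that ligatures
% such as "fi" can be mapped back to plain characters on extraction.
% \input{glyphtounicode}
% \pdfgentounicode=1

\begin{document}
test a word that defines the problem, d e f i n e s
\end{document}
```

Compiling this and running pdftotext on the result should show
whether "defines" survives intact with a given pdftotext version.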

> Compiling to PDF and converting it back to text gives this:
> cat schumann.pdf | pdftotext - - 
> test a word that denes the problem, d e f i n e s
>> pdftotext -v
>> pdftotext version 0.41.0

I have no problem on Windows with TeX Live 2019 and
pdftotext version 0.75.0.

Ulrike Fischer 

More information about the texhax mailing list