[tex4ht] Conversion to ODT missing several features

Paulo Matos paulo at matos-sorge.com
Tue Jan 8 10:12:14 CET 2019


Hi,

This is my first time using tex4ht. I found it while looking for a way to
convert LaTeX to ODT, and after having tried pandoc it seems to be the
best solution I have come across so far.

Scribble, which I am using, is a typesetting system written in the Racket
programming language. Scribble generates LaTeX, which I then pass
through 'tex4ht oolatex'.

I have attached the original LaTeX source, the PDF (from pdflatex) and the
mk4ht oolatex output. I am using mk4ht 1.1.

If you look at the results, there are a few issues:
1. Strange image artifacts at the beginning of the document;
2. No colors;
3. Chapter headings lack proper formatting.

I also noticed a few errors on the command line:
Execute script `test.lg'
----------------------------
t4ht.c (2012-07-25-19:28 kpathsea)
t4ht -f/test.tex
  -cooxtpipes
  -coo
(/usr/share/texmf-dist/tex4ht/base/unix/tex4ht.env)
Entering test.lg
System call: mv test.4oo test.tmp
System return: 0
System call: java -classpath /usr/share/texmf-dist/tex4ht/bin/tex4ht.jar
xtpipes -i /usr/share/texmf-dist/tex4ht/xtpipes/ -o test.4oo test.tmp
--- OoFilter Error 6 --- Improper record: Font("ptmr8t","","10","100")
...

System call: java -classpath /usr/share/texmf-dist/tex4ht/bin/tex4ht.jar
xtpipes -i /usr/share/texmf-dist/tex4ht/xtpipes/ -o test-m2.4om test-m2.tmp
org.xml.sax.SAXParseException; lineNumber: 10; columnNumber: 3; An
invalid XML character (Unicode: 0x0) was found in the element content of
the document.
        at
java.xml/com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1239)
        at
java.xml/com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(SAXParserImpl.java:635)
        at
java.xml/org.xml.sax.helpers.XMLFilterImpl.parse(XMLFilterImpl.java:357)
        at xtpipes.Xtpipes.execute(Xtpipes.java:1300)
        at xtpipes.Xtpipes.execute(Xtpipes.java:783)
        at xtpipes.Xtpipes.mainMethod(Xtpipes.java:665)
        at xtpipes.Xtpipes.main(Xtpipes.java:70)
        at xtpipes.main(xtpipes.java:5)
--- xtpipes error 29 --- At <sax
content-handler="xtpipes.util.ScriptsManager,tex4ht.OomFilter"
lexical-handler="xtpipes.util.ScriptsManagerLH" > : While parsing file
file:/home/pmatos/Projects/practical-racket/tmp/test-m2.tmp: :
org.xml.sax.SAXParseException; lineNumber: 10; columnNumber: 3; An
invalid XML character (Unicode: 0x0) was found in the element content of
the document.
--- Warning --- System return: 256
...
System call: java -classpath /usr/share/texmf-dist/tex4ht/bin/tex4ht.jar
xtpipes -i /usr/share/texmf-dist/tex4ht/xtpipes/ -o test-m3.4om test-m3.tmp
org.xml.sax.SAXParseException; lineNumber: 10; columnNumber: 3; An
invalid XML character (Unicode: 0x0) was found in the element content of
the document.
        at
java.xml/com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1239)
        at
java.xml/com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(SAXParserImpl.java:635)
        at
java.xml/org.xml.sax.helpers.XMLFilterImpl.parse(XMLFilterImpl.java:357)
        at xtpipes.Xtpipes.execute(Xtpipes.java:1300)
        at xtpipes.Xtpipes.execute(Xtpipes.java:783)
        at xtpipes.Xtpipes.mainMethod(Xtpipes.java:665)
        at xtpipes.Xtpipes.main(Xtpipes.java:70)
        at xtpipes.main(xtpipes.java:5)
--- xtpipes error 29 --- At <sax
content-handler="xtpipes.util.ScriptsManager,tex4ht.OomFilter"
lexical-handler="xtpipes.util.ScriptsManagerLH" > : While parsing file
file:/home/pmatos/Projects/practical-racket/tmp/test-m3.tmp: :
org.xml.sax.SAXParseException; lineNumber: 10; columnNumber: 3; An
invalid XML character (Unicode: 0x0) was found in the element content of
the document.
--- Warning --- System return: 256

The Java version is:
$ java --version
openjdk 10.0.2 2018-07-17
OpenJDK Runtime Environment (build 10.0.2+13)
OpenJDK 64-Bit Server VM (build 10.0.2+13, mixed mode)
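
To narrow down the SAXParseException in the log above, something like the
following could be used to check whether the intermediate files (for
example test-m2.tmp and test-m3.tmp) really contain NUL bytes, and where.
This is only a rough sketch: the function name find_nul_bytes is made up
for illustration, and columns are counted in bytes, so they may not match
the column the XML parser reports if the line contains multi-byte
characters.

```python
import sys
from pathlib import Path


def find_nul_bytes(data: bytes):
    """Return 1-based (line, column) pairs for every NUL byte in data.

    Hypothetical helper for illustration; columns are byte offsets
    within the line, not character offsets.
    """
    positions = []
    line, col = 1, 1
    for byte in data:
        if byte == 0:
            positions.append((line, col))
        if byte == 0x0A:  # newline: move to the start of the next line
            line += 1
            col = 1
        else:
            col += 1
    return positions


if __name__ == "__main__":
    # Usage: python find_nul.py test-m2.tmp test-m3.tmp
    for name in sys.argv[1:]:
        for line, col in find_nul_bytes(Path(name).read_bytes()):
            print(f"{name}: NUL byte at line {line}, column {col}")
```

If this reports a position near line 10, column 3, the bad character is
already present in the file handed to xtpipes rather than being introduced
by the Java side.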

I couldn't find much documentation on the oolatex part specifically. Is
there anything out there that explains how to improve the conversion?

Any help with the above would be greatly appreciated.

Kind regards,

-- 
Paulo Matos
-------------- next part --------------
A non-text attachment was scrubbed...
Name: test.odt
Type: application/vnd.oasis.opendocument.text
Size: 7863 bytes
Desc: not available
URL: <https://tug.org/pipermail/tex4ht/attachments/20190108/4b158a12/attachment-0001.odt>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: test.pdf
Type: application/pdf
Size: 77022 bytes
Desc: not available
URL: <https://tug.org/pipermail/tex4ht/attachments/20190108/4b158a12/attachment-0001.pdf>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: test.tex
Type: text/x-tex
Size: 25914 bytes
Desc: not available
URL: <https://tug.org/pipermail/tex4ht/attachments/20190108/4b158a12/attachment-0001.bin>

