This bug was originally reported at https://bugs.launchpad.net/ubuntu/+source/poppler/+bug/993292
pdftotext -htmlmeta output is missing metadata from PDF catalog. pdfinfo does output all values known:
e.g. a pdfinfo output:
Title: Titel Author: Word Creator: WordToPDF 2.4 build 127 Producer: AFPL Ghostscript 8.54 CreationDate: Fri Jul 2 09:14:02 2007 ModDate: Fri Jul 2 09:14:02 2007 Tagged: no Pages: 6 Encrypted: no Page size: 595 x 842 pts (A4) File size: 104664 bytes Optimized: no PDF version: 1.3
in contrast the meta section of the pdftotext -htmlmeta output:
<head> <title>Titel</title> <meta name="Author" content="Word"/> <meta name="Creator" content="WordToPDF 2.4 build 127"/> <meta name="Producer" content="AFPL Ghostscript 8.54"/> <meta name="CreationDate" content=""/> </head>
This bug was originally reported at https:/ /bugs.launchpad .net/ubuntu/ +source/ poppler/ +bug/993292
pdftotext -htmlmeta output is missing metadata from PDF catalog. pdfinfo does output all values known:
e.g. a pdfinfo output:
Title: Titel
Author: Word
Creator: WordToPDF 2.4 build 127
Producer: AFPL Ghostscript 8.54
CreationDate: Fri Jul 2 09:14:02 2007
ModDate: Fri Jul 2 09:14:02 2007
Tagged: no
Pages: 6
Encrypted: no
Page size: 595 x 842 pts (A4)
File size: 104664 bytes
Optimized: no
PDF version: 1.3
in contrast the meta section of the pdftotext -htmlmeta output:
<head> Titel</ title>
<title>
<meta name="Author" content="Word"/>
<meta name="Creator" content="WordToPDF 2.4 build 127"/>
<meta name="Producer" content="AFPL Ghostscript 8.54"/>
<meta name="CreationDate" content=""/>
</head>