Generate PDF/A compliant documents

Bug #1640607 reported by Robert Ancell
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Simple Scan
Triaged
Wishlist
Unassigned
simple-scan (Ubuntu)
New
Undecided
Unassigned

Bug Description

[Impact]
PDF/A [1] is a standard for PDF documents to make them suitable for archiving. The PDF files that simple-scan produces are PDF compliant, but missing some optional fields that are required for PDF/A.

[Test Case]
1. Scan a document
2. Save to a PDF file
3. Check document through a PDF validator (e.g. [2])

Expected result:
The generated document is compliant with PDF/A

Observed result:
The document is not compliant.

[Regression potential]
Some risk of PDF changes making files saved from Simple Scan not viewable in a PDF reader. This seems very low as PDF/A is a very commonly used standard.

[1] https://en.wikipedia.org/wiki/PDF/A
[2] http://www.pdf-tools.com/pdf/validate-pdfa-online.aspx

Revision history for this message
Robert Ancell (robert-ancell) wrote :

I've added some more metadata into trunk that makes us more compliant. The current output from http://www.pdf-tools.com/pdf/validate-pdfa-online.aspx is:

Validating file "test.pdf" for conformance level pdfa-1a
The key StructTreeRoot is required but missing.
A device-specific color space (DeviceRGB) without an appropriate output intent is used.
The document does not conform to the requested standard.
The document contains device-specific color spaces.
The document doesn't provide appropriate logical structure information.
Done.

Revision history for this message
Washington Campos (wlcampos) wrote :

Hello Robert

Thank you very much for replying to my message, I did another search and found a command line that converts the PDF file to PDF / A, I'm sure you should have already found it, this is ghostscript, with this command line: gs -dPDFA -dBATCH -dNOPAUSE -dUseCIEColor -sProcessColorModel = DeviceCMYK -sDEVICE = pdfwrite -sPDFACompatibilityPolicy = 1 -sOutputFile = output_filename.pdf input_filename.pdf.

Again, thank you very much for the response.

Revision history for this message
Robert Ancell (robert-ancell) wrote :

Simple Scan 3.23.2 has some PDF/A improvements. The main issue remaining the the "device-specific color spaces" which I haven't worked out how to correctly mark yet..

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.