PCL to PDF Conversion with JBIG2 Compression Benefits

Question:  How can PCL to PDF conversion with JBIG2 compression benefit my organization?

Answer: When images began to populate PDF files, file size quickly became a problem.  As users added high-resolution images to PDF files, transition times  increased and disk space became an issue.  JBIG2 technology addresses both of these fundamental issues that all organizations face in today’s modern business environment — storage and portability.

JBIG2 is an image compression standard for bi-level images.  It was developed by the Joint Bi-level Image Experts Group. It is compatible with both lossless and lossy compression; capable of generating files 30% the size the size of Group 4; generating files one-half to one-quarter the size of JBIG, the previous bi-level compression standard.

JBIG2’s main features:

JBIG2 Feature
Benefit
Example
Higher compression rates than its predecessors (e.g., JBIG1, TIFF G3, and G4). File size reduction capabilities up to 90% or higher.Reduction in storage space and transmission bandwidth.With JBIG2 compression, a 78 MB uncompressed 500-page PDF document, would see its file size drop to 12.7 MB. An equivalent TIFF file would be approx. 15.8 MB.
Lossy and lossless compression methods.Lossy yields a higher compression rate without any perceivable information loss.Lossy yields a higher compression rate without any perceivable information loss.
Use of symbol dictionaries• For the compression of other images within the same document.
• Eventually one symbol dictionary could be used to recognize the text in the image. It contains the building blocks of a possible OCR procedure to help rebuild font information (if lost).
• Unique to one PDF document, a global JBIG2 stream can contain a dictionary of symbols used for all the pages of the document.
• Once the dictionary is built, software attempts to recognize letters and build legible text from them.
Use of arithmetic and Huffman coding schemes for bit representation.Huffman coding takes less page memory and has faster compression and decompression than arithmetic coding. However, arithmetic compression is slower, uses more memory but yields better compression results.JBIG2 can support the Huffman and the arithmetic coding algorithms for image structure information such as encoding schemes, references, indexes, sizes, offsets, and popular symbol identities.
ITU-T T.6 facsimile coding schemes and coding control functions for Group 4 facsimile functionalities, which is activated by a MMR (Modified READ (Relative Element Access Designate)) flag.Use of the latest facsimile logic for the compression of building block images.Any image leaf can be coded using MMR logic. In addition, a symbol in a dictionary or whole page can be found in the JBIG2 stream as a MMR image.
Stripped-page compression.JBIG2 can compress uninterrupted image flows.Under specific circumstances, if a scanner sends image information without a page cut, a JBIG2 stream can still take the data and compress it.
Most PDF viewers support reading JBIG2 (ver. 1.4 and higher).JBIG2 technology can be easily integrated into the PDF’s established technologies.Most of the PDF documents produced by high-end scanners with professional drivers are compressed with JBIG2 technologies.

Whom Does JBIG2 Benefit?

Government, financial, judicial, and medical sectors are some examples of PDF-intensive industries where implementation of the JBIG2 format into their workflows would help IT Managers reap noticeable cost-saving benefits.

  • State-of-the-art, accurate, device-independent representation of rasterized documents
  • Efficient, high-quality communication of text and graphics
  • Formatted documents that are suitable for high-speed production printing environments
  • Conformance with JBIG2′s emerging new standard for Internet publishing tools, wireless applications, Internet fax, new-generation fax machines, multi-function peripherals and printer controllers.

Because JBIG2 is symbol-based compression of black-and-white (bitonal) images of text, PageTech’s JBIG2 compression has a distinct advantage over all known competitors implementing JBIG2 who must start with a scanned image of a page (BMP, TIFF or other raster format).

We directly transform the print stream that produces the documents.  This eliminates the need for expensive scanning equipment, personnel to perform the scanning and the time required for an intermediate step.  While interpreting the PCL, Pagetech’s PCL technology provides immediate access to all text characters, symbols and glyphs.

This is a distinct advantage over our competitors who must make several pattern-matching passes through a document’s scanned raster image which consists of text, raster, and grayscale and linedraw objects.  Because the objects in a typical business document are almost all text, PageTech’s tools require significantly less time to build a dictionary to achieve maximum efficiency.

 We can also add color, Intelligent Mail barcodes, QR codes, logos or form overlays to the PCL prior to printing or during the conversion to JBIG2/PDF!

PCLTool SDK LIVE Evaluation: http://www.pcltools.com/pt700d/

VN:F [1.9.22_1171]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.22_1171]
Rating: 0 (from 0 votes)
Share
Posted in PCL Conversion, PCL to PDF, PCL to PDF/A | Tagged , , , , , | Leave a comment

PCL to PDF File Conversion – PDF File Types

Question:  What types of PDF files can PCLTool SDK create when converting PCL to PDF?

Answer:  Depending on the nature and composition of the input PCL, you will be able to convert it into five different types of PDF files:

PDF
Type
Raster
Text
Searchable
Vector
All .TTFs
Embedded
Some .TTFs
Embedded
No .TTFs Embedded
ROR
RTRT
VRTEARTVEA
VRTESRTVES
VRTNERTVNE

(RO) Raster Only without Searchable Text: Each PCL page is converted into a compressed raster image PDF page without the ASCII text from the PCL layered underneath the page image. This worst case scenario happens when the PCL consists of all raster data with no text.

(RT) Raster Only with Searchable Text: Each PCL page is converted into a compressed raster image PDF page with the ASCII text from the PCL layered underneath the page image. This happens when the PCL consists of a raster page of the form with text overlay or has complex PCL objects that cannot be converted to vector PDF.

(VRTEA) Vector and Raster Objects with Searchable Text & All Fonts Embedded: All PCL vector line draw and text objects are translated into PDF vector and text. All the fonts referenced in the PCL file are embedded with their TrueType equivalents.

(VRTES) Vector and Raster Objects w/Searchable Text & Some Fonts Embedded: All PCL vector line draw and text objects are translated into PDF vector and text. All the non-Windows core set fonts referenced in the PCL file are embedded with their TrueType equivalents. Window core set fonts consist of: Arial, Courier New, Times New Roman, Symbol and Wingdings.

(VRTNE) Vector and Raster Objects with Searchable Text and No Embedded Fonts: All PCL vector line draw and text objects are translated into PDF vector and text. All the fonts referenced in the PCL file are substituted with Windows resident TrueType fonts. This is usually accomplished by created the appropriate entries in the Font Substitution Map file (PTC32.map). Even when the Courier font (#3) is referenced in the PCL file, it’s better to force a font substitution to the Windows Core Set equivalent Courier New (#4099) because it saves file size by not having to embed our HPCOB .TTF and Courier New is a higher quality font with Delta hinting. This requires expert level knowledge of PCL and our product.

PCLTool SDK Live Evaluation: http://www.pcltools.com/pt700d/

VN:F [1.9.22_1171]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.22_1171]
Rating: 0 (from 0 votes)
Share
Posted in PCL Conversion, PCL to PDF, PCL to PDF/A, Uncategorized | Tagged , , , , , , | Leave a comment

PDF to PCL Conversion – PCL to PDF Knowledge Base

Question:  How do I convert a PDF file into a PCL print file?

Answer:  PCL file format is the format for printers and computers to talk to each other.  PCL format, which stands for Printer Command Language, was developed by Hewlett-Packard.  PCL is the standard print format for HP LaserJet-compatible printers.

Most people ask how to go from PDF to PCL because they want to create a PCL Form Overlay from a form in PDF format.  If you have a PDF file that you want to convert to PCL format, you can convert from PDF to PCL by using the print to file option in the PCL Pinter Driver.

Steps to create a generic Windows print (.PRN) file from your PDF file:

  1. Open the PDF file

    Printer Properties - Select Print to file

    Printer Properties – Select Print to file

  2. Click to print the file.  When a Print dialog window opens locate the ”Print to file” option and select it.
  3. Click “OK” and a Print to File dialog box will open.  In the File Name box you will see your file’s name with a “.prn” extension following it.
  4. Select the location where you want your .prn file to be saved, and then click “Save.”
  5. This creates a Windows temporary print stream in PCL format, but is NOT SUITABLE for use as a PCL overlay.

To create a PCL Form Overlay or Logo Overlay refer to the instructions found at http://pclhelp.com/pcl-form-overlay-and-logo-macros/create-pcl-form-overlay-logo-overlay/

If you run into any issues or if you have any questions, please contact us as soon as possible.

VN:F [1.9.22_1171]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.22_1171]
Rating: 0 (from 0 votes)
Share
Posted in PCL Conversion, PDF to PCL | Tagged , , , , , | Leave a comment

What is Scrambled Text?

Question: What is Scrambled Text?

Answer: Most PCL printer drivers assign text characters to temporary download fonts in the order in which they are used in the document.  These text characters are unbound to a character set.  This means if the first character to be used in a particular font is capital “A”, it will be assigned to character cell position number 1 in the temporary download font.

In HP character sets, cell one is used by a unprintable character used for communication with the printer.  So, in the PCL print stream you will see a “smiley face” or some other non-text character.  Later on in a 100-page file the same capital “A” might be download many more times and assigned to whatever the next character might be at the time because the printer drivers do not use all the 224 possible text character cells in a single-byte HP font.  The printer drivers do this because the double-byte Arial TrueType typeface (for example) that comes with Windows has 1420 characters to support many single-byte HP character sets.

And, the HP printer resident Arial typeface does not contain the same number of characters.  So, rather than try to switch back and forth between character sets if the document happened to require text in more than one language in the same typeface and font attributes.  Microsoft decided that it would be a lot easier to just download the font characters as they appeared in the document into a temporary font not referencing any character set.  We refer to this text unbound to a character set as being “scrambled” text.

Our PCLMagic printer drivers are created from the Windows Device Driver Kit (DDK) from the Unidriver code for the HP monochrome LaserJet 4si (PCL5e) (Upon Request) and HP Color LaserJet (PCL5c) (default for both) printers.  They call a custom .dll that inserts the text for each page in Unicode and UTF8 format into hidden, encrypted text objects into the PCL before they are scrambled by the printer driver. We do this for several reasons:

  • Extract all the text.
  • Reference the text in the print stream to know when to split documents, name the output files, etc.
  • Create PDF files with searchable text.
  • Generate index files with keywords extracted from the PCL

PCLTool SDK Live Evaluation: http://www.pcltools.com/pt700d/

VN:F [1.9.22_1171]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.22_1171]
Rating: 0 (from 0 votes)
Share
Posted in PCL to PDF | Tagged , , , , , , , , , | Leave a comment

PCL to PDF/A Digital Document Archiving Standard

Question: Do any of your product convert PCL to PDF/A format?

Answer:

Tyler, thanks for your post, PCLTool SDK, PCLWorks, IMGCVT.exe, IMG2PDF.exe & PCLReader can produce PCL to PDF/A output.   By default, we support PDF/A-1b because we feel confident that we can generate this level of PDF/A format from virtually any PCL file.  However, you can switch to PDF/A-1a in order to see if we can achieve this level of compliance with your PCL.  PDF/A-1a compliance is more dependent on the structure and composition of the document.

Select PCL to PDF/A Settings via PDF Preferences

Select PCL to PDF/A Settings via PDF Preferences

Consequently, you may have PCL that we are unable to convert into PDF/A-1a format at this time.  We always want to see PCL that produces PDF/A files that do not comply with the ISO standard so please use our support form  to submit any files that you think we should review.

We always encourage customers to download the PCLTool SDK Live Eval and select the PCL to PDF/A setting via the PDF Preferences dialog and test your PCL.

PCL to PDF/A Digital Document Archiving Standard

PDF/A is an approved set of standards from the International Standards Organization (ISO).  The new ISO 19005-1 standard defines PDF/A format as the digital document standard for the long-term preservation of required tax, legal, historical, transactional and other important documents.

Archiving in PDF/A format to satisfy your corporate, HIPAA, Sarbanes-Oxley or other legal requirements, is the best way to ensure that you will be able to search, retrieve, view and print your documents in the future with the full, legal fidelity of the original.  PDF/A format will remain compatible as PDF technology evolves.

This is why ISO, in conjunction with the US Courts, Library of Congress, ARMA, AIIM, NARA and many more corporations, governments, and other major institutions around the world have approved this standard for long-term archiving of documents.  This new standard defines a subset of PDF 1.4 format and composition rules that will withstand the test of time.

PDF/A is designed to be:

  • Device and software independent
  • Self-contained: Includes all the resources needed to render each page (embedded fonts)
  • Self-documented: Capable of including sufficient XMP metadata for auto-classification and indexing
  • A framework for representing the logical document structure (PDF/A-1a)

Having rules like this in place inhibit the development of PDF functionality that could cause problems in future releases.  If you generate PDF documents from a product that complies with this new PDF/A standard, your files will be safe and viable for the years to come.

The best way to generate PDF/A documents is directly from Adobe Acrobat V7.08 or above. For those who have applications that cannot generate PDF/A directly, but can only generate HP PCL print streams (and, do not want to print, scan, convert to PDF/A and then OCR for indexing and text search-ability), you need one of PageTech’s product options to convert your PCL into PDF/A format.

VN:F [1.9.22_1171]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.22_1171]
Rating: 0 (from 0 votes)
Share
Posted in PCL Conversion, PCL to PDF/A | Tagged , , , , | Leave a comment

PCL to PDF Conversion Preferences

All of the PDF preferences that can be set when converting PCL to either raster or vector PDF format using PCLTool.exe or PCLWorks.exe are offered in the PCL to PDF Conversion Preferences dialog box. Select PDF Preferences from the Preferences menu or click Pref from the Convert Files dialog box.

PCL to PDF Conversion Preferences

PDF Preferences Dialog

This dialog provides options for converting the selected PCL files into Adobe Acrobat® Portable Document Format (.PDF) using PageTech’s native direct driver (PDFWrite.dll). We do not use Adobe Acrobat to create our PDF files. Our Direct Driver is faster and we have 100% control over it, unlike printing to a Windows printer driver.

The preferences set in this dialog are stored in the APP.ini file, [PCL2PDF] Section upon exit of PCLTool.exe.

VN:F [1.9.22_1171]
Rating: 0.0/10 (0 votes cast)
VN:F [1.9.22_1171]
Rating: 0 (from 0 votes)
Share
Posted in PCL Conversion, PCL to PDF | Tagged , , , , | Leave a comment