Download Overview - FineReader Engine 12 for Linux

Latest Release

FineReader Engine 12 for Linux - Release 4 (15.12.2019)

More info and download of the latest release:
FineReader Engine 12 for Linux
– Login needed

Free download of the latest distributive is available for ABBYY customers with a valid Software Maintenance Agreement.
Please contact your Account Manager for login information.

New Features and Improvements - currently copied from Win, TBD

  • New: 'Compare Documents' Module
    • The new 'Compare Documents' Module in ABBYY FineReader Engine provides your software with the ability to quickly and accurately compare two versions of the same document and detect possible differences in their content – for example to compare the originally created contract and its printed and signed version and automatically verify the document’s integrity >
    • The technology can analyze documents in different formats, i.e. Microsoft Word, PDF, JPEG, TIF and other document or image formats and can provide significant value to any document processing solution, especially to applications for the legal, government and business sector.

  • A new ready-to-use code sample helps to test the technology and speed up development work.
  • Higher accuracy for Arabic OCR: New AI-based algorithms
    • Arabic OCR: Significantly improved recognition on low quality images
    • To increase recognition accuracy on low-quality images, new smart AI-based algorithms were implemented for recognition of Arabic. In addition to the ‘standard’ OCR processing approach, a newly trained Recurrent Neural Network for Arabic enables End-to-End recognition processes delivering highly accurate recognition results – even on images with very low quality. To optimize recognition quality and processing speed, a new intelligent built-in classifier selects the appropriate processing methodology.
  • Higher accuracy for Korean OCR: New AI-based algorithms
    • Korean OCR: Significantly higher recognition accuracy
    • To further increase the recognition accuracy of the Korean language, a new Deep Learning Language model was trained on a large document amount. This intelligent model selects the best recognition variant from different recognition hypotheses based on his ‘knowledge’ acquired during an extensive training on documents in Korean. To optimize the balance between the recognition accuracy and OCR processing speed, a new smart built-in classifier selects the usage of this new model only under specific conditions
  • Enhanced: Text-based classifier with advanced security of training data
    • To further increase the security and protection of information contained in training documents for the text-based classification , hashing algorithms were implemented. Using information from checksums eliminates the possibility to recover information from the sample documents.
  • Enhanced: Classification Demo Sample - now with Office format documents
    • To reflect the FineReader Engine’s capability of processing Office documents in the classification process, the provided pre-compiled Demo Sample for classification was enhanced and allows now to display Office documents in the classification results (in addition to PDFs and image formats). Furthermore, the newly added sample documents can be used to test and demonstrate the classification capabilities.
  • Improved: Document layout preservation
    • To improve the detection and recreation of document layout, a new 'single-column' document model was introduced that provides more exact detection and analysis of tables and charts and significantly improves the detection and recreation of document layout.
  • Enhanced: Java wrapper documentation
    • To simplify the usage of the API, the documentation of the ABBYY FineReader Engine 12 has been extended and the documentation for the Java wrapper is now provided in JavaDoc format in addition to the HTML and PDF formats.
  • New code sample for document comparison
    • The extensive code samples library was extended by a new sample that allows testing and demonstrating the ability to compare two versions of the same document and detect differences in their content

  • Improved: Developer's Help documentation
    • The Developer’s Help of the FineReader Engine 12 has been extended by additional information about different possibilities of licensing the SDK, describing the individual types of licensing options in an easy-to-understand comparison table.

More details about individual features and the latest release distributive can be found on the download page.

IMPORTANT COMPATIBILITY INFORMATION

IMPORTANT COMPATIBILITY INFORMATION

  • GetEngine function was deprecated in R2 ⇒ To load the Engine object, please use the InitializeEngine function. It provides the unified Engine loading procedure for all license types (including the Online License).
  • Customers updating from Release 1 to Release 2 and higher who use the GetEngine function would receive an error message if they keep using it in later releases. Please update your code and replace the GetEngine with the InitializeEngine function.
  • Important note about license name change: To better reflect its functionality, the license type ‘Cloud License’ was renamed into ‘Online License’ in the R3 for Windows. This license supports deployment in virtual & cloud environments, usage with Docker containers as well as on premise installations.

More details about individual features, fixed bugs, and the latest release distributive can be found on the download page.

Previous Releases

Release Release 3

ABBYY FineReader Engine 12 for Linux - Release 3 (23.07.2019)

  • Part#: 1366/17
  • Build: 12.3.1.602

New Features and Improvements in this release:

[Added Intelligent Character Recognition Technology (ICR)]

  • The high-quality technology for recognition of hand-printed text (ICR) was added to the FineReader Engine for Linux (previously, this technology was only available in the FineReader Engine for Windows). The technology allows to extract information entered per Hand in individual fields, as used for example on application forms or customer onboarding documents.

[Added Optical Mark Recognition Technology (OMR)]

  • The advanced technology for recognition of optical marks was added to the FineReader Engine for Linux (previously, this technology was only available in the FineReader Engine for Windows). The technology allows to extract information about selected fields on surveys, questionaires or multiple choice exam sheets.

In the past, the ICR and OMR technologies were only available in the Windows version. With this step, it is as well available in the Linux version.

[New input formats: Office documents]

  • A new set of input formats was added to ABBYY FineReader Engine. In addition to image formats, the ABBYY FineReader Engine can now as well open and process the most common Office documents such as text documents, spreadsheets, and presentations:
    • Text documents: doc, docx, rtf, htm / html, txt, odt
    • Spreadsheets: xls, xlsx, ods
    • Presentations: ppt, pptx, odp

This new support of different types of input documents - scanned paper documents, different types of PDF, and digitally-born documents in Office formats - allows processing documents from multiple channels in a single flow.

[Ability to extract information from MRZ in ID documents]

  • Machine-readable zones (MRZ) are used in ID documents to encode personal information. With the ability to extract information from the machine-readable zone, FineReader Engine can be used in ID verification systems that are often used by government organizations as well as in solutions for customer onboarding used by employees of hotels, car rental companies, banks or insurance firms who need to quickly enter and verify their clients' personal data.
  • The personal information in machine-readable zone in ID documents is encoded as 2 or 3 lines of text according to specification of the ICAO Document 9303.

[New OCR languages:]

  • Georgian OCR: The Georgian language was added as a new OCR language
  • OCR for Simple Mathematical Formulas: ABBYY FineReader Engine now allows extracting characters of simple mathematical formulas. With the new OCR language Simple Mathematical Formulas it is possible to process scientific documents containing simple single-line mathematical formulas inside the text.

[Enhanced OCR languages] - with the support of Artificial Intelligence algorithms:

  • The newly trained Convolutional Neural Network for recognition of Asian languages provides following improvements:
    • Significantly faster recognition of Korean
    • Faster recognition of Chinese
    • Increased speed & accuracy in recognition of Japanese (Modern)

[Ability to recognize documents with different text types]

  • If documents contain text areas in different text types on one page, the correct text type will be detected and used.

In the past, this functionality was only available in the Windows version. With this step, the detection of individual text types on one page is as well available in the Linux version.

[Document processing in memory for the Batch Processor]

  • This new feature allows to process documents in memory using the BatchProcessor Object. The new approach can decrease the requirements for free hard disk drive and increase the overall processing speed. Previously, document processing in memory was only available for the FRDocument Object.

[Enhancements in PDF export]

  • Extended set of tags on export to tagged PDF allows creating PDFs that are compliant with the Web Content Accessibility Guidelines in the European Union and the Section 508 Amendment to the Rehabilitation Act of 1973 in the USA.
  • Ability to save information about creation and editing dates during the export to PDF allows recording following information:
    • Date of creation
    • Date of modification
    • Both information (creation and modification dates)

[Enhanced documentation]

  • New article in product documentation describes how to deploy FineReader Engine in Docker containers.
  • EULA file is available in FineReader Engine files
  • Two new documents - these documents were available in the Windows version only. Since this release, following documents are available in the Linux version as well:
    • New Admin's Guide
    • New Licensing Server Guide

[Online License supporting proxy servers]

  • The Cloud-based License was renamed into Online License. This type of license supports FineReader Engine's deployment in the Cloud. In addition, it allows using the FineReader Engine with Docker containers as well as deploying it within virtual environments. The Online License requires permanent internet connection. The licensing mechanism now supports proxy servers allowing to deploy FineReader Engine in systems with high security requirements, where the internet connection is allowed only via a proxy server.

[Other improvements]

  • Important: To better reflect its functionality, the license type ‘Cloud License’ was renamed into ‘Online License’ in the R3 for Windows. This license supports deployment in virtual & cloud environments, usage with Docker containers as well as on premise installations.

Release Release 2 Patch 1

ABBYY FineReader Engine 12 for Linux - Release 2 Patch 1 (22.06.2018)

  • Part#: 1366/8
  • Build: 12.2.27.2249

New Features and Improvements in this release:

[Improved classification]:

Advanced classification algorithms in FineReader Engine 12 leverage advanced technologies such as machine learning, natural language processing technologies, improved document classification quality together with more flexible and fine-tuning options. The customer can choose between new intelligent Image, advanced Text Classifiers or a combination of them:

  • Image Classifier - collects and processes visual information about document images and delivers fast classification results.
  • Text Classifier - extracts and processes information about the documents’ content, which increases the classification accuracy

FineReader Engine 12 also offers new classification modes which help to optimize the classification for high precision, high recall or a balance between these:

  • High precision mode - recommended in scenarios, where it is important to precisely classify documents into the right categories and limit wrong class assignment to a minimum.
  • High recall mode - recommended in scenarios, in which it is important to detect all documents belonging into a certain category among all available documents, and limit the risk that they might be missed.

The new version also features significantly improved and reworked classification API. New improved algorithms and built-in cross-validation techniques make the improved classification API more convenient and easy.

[Improved layout reconstruction]:

  • Improved layout retention on TXT export

New export mode which simulates the original layout by inserting spaces and has the following features:

  • emulation of the paragraph indentation and central alignment with spaces.
  • emulation of spaces between paragraphs with empty lines.
  • special processing of frames and footnotes.
  • translation of characters in upper and lower cases into special Unicode characters.

[New saving options]:

  • PDF 2.0 - the latest PDF standard includes the following updates:
    • encryption - the producer may embed the encrypted PDF document within an unencrypted PDF document;
    • support of the new types of the digital signatures - based on CAdES standard, LTV and certificates based on elliptic curves;
    • new types of annotations: projections, 3D and rich media;
    • accessibility - pronunciation hints.
  • PDF/UA - export to PDF in accordance with PDF/UA standard is available now. Legislation of most countries requires state and federal authorities to provide accessible versions of their websites and PDF documents. Conformance with PDF/UA ensures accessibility for people with disabilities who use assistive technology such as screen readers, screen magnifiers, joysticks and other technologies to navigate and read electronic content.
  • HTML 5 - New export format - as an alternative to PDF for client-server mobile cross-platform applications. The developers can now grant their users an ability to use FineReader Engine 12 without client-program installation, increasing the conversion, avoid the security limitations and the necessity to have the single environment to run the program on user’s PC.

[Possibility to manage images in memory]:

The new ways to manage images in memory are:

  • Internal storage of images in memory (analogue of HBitmap in Windows with restrictions: only uncompressed HBitmap with 24 bits per pixels, works for color images only. As HBitmap is a Windows-specific entity, it could be implemented in different ways on Linux. We decided to support only basic types, others could be supported in future in case of demand).
  • Ability to get HBitmap for Image.
  • Ability to load HBitmap into FRE.

FineReader Engine also includes a demo sample demonstrating how to work with the new methods.

[Other improvements]:

  • An ability to detect FRE build number before Engine is loaded
  • Export of information about tab-space characters to XML
  • Methods using SAFEARRAY parameters support:
    • IReadStream
    • IFileWriter
    • FRDocument::ExportToMemory
    • FRDocument::AddImageFileFromStream
    • Engine::IsPdfWithTextualContentFromStream

IMPORTANT COMPATIBILITY INFORMATION

  • GetEngine function was deprecated in R2 ⇒ To load the Engine object, please use the InitializeEngine function. It provides the unified Engine loading procedure for all license types (including the Online License).
  • Customers updating from Release 1 to Release 2 and higher who use the GetEngine function would receive an error message if they keep using it in later releases. Please update your code and replace the GetEngine with the InitializeEngine function.

Release 2

ABBYY FineReader Engine 12 for Linux - Release 2 (22.06.2018)

  • Part#: 1366/4
  • Build: 12.2.27.1845
  • For added features please refer to release 2 patch 1

Release 1

FineReader Engine 12 Release 1 (available since 08.06.2018)

  • Part#: 1366/3
  • Build: 12.2.19.1858

[New deployment in the Cloud]

  • Cloud-enabled licensing allows integrating FineReader Engine in applications deployed within the Cloud environment (e.g. services like Amazon EC2 and Microsoft Azure) and using the SDK on workstations or virtual machines (containers)

[New & improved OCR languages]

  • IMPROVED: Japanese OCR accuracy - a new OCR language was added: Modern Japanese.
  • Farsi OCR - official support
  • Burmese OCR - technical preview
  • Japanese and Arabic documents: Enhanced recognition of dates, times, addresses and names when using new 'special predefined languages'
  • Improved recognition of right-to-left written languages: Two new properties help to detect the leftmost and the rightmost character in the word and to correctly designate the first and the last letters of the word during recognition and parsing.

[Improved layout reconstruction]

  • IMPROVED: Tables and layout reconstruction
  • Detection and recreation of balanced text columns
  • Dashed table borders on export to DOCX
  • Recreating cell borders color during the export to XLSX

[New PDF/A saving options]

  • PDF/A-2b and PDF/A-3b - new options for PDF/A-2 and for PDF/A-3 are supported for the level B conformance

[New XML saving options and improvements]

  • ALTO 3.1 - New export format
  • Faster export to XML, when saving information about character coordinates prior to image deskewing
  • Direct export of list elements to XML

[Other features and improvements]

  • New entity: Customer Project-ID decreases the numbers of errors during Engine object initialization.
  • New HTML-based Help file
  • 256-bit AES encryption with the possibility to use Unicode characters during PDF encryption
  • Accessing information about individual recognition variants

Note: Please note, that in the Release 1, Classification is not available. It will be implemented in Release 2.

Code Samples

This website uses cookies which enable you to see pages or use other functions of our websites. You can turn off such cookies in your browser’s settings. If you continue to use these pages, you consent to the use of cookies.
  • No tags, yet