ABBYY OCR & NLP

This site is a source of technical information about ABBYY technologies and toolkits for Optical Character Recognition (OCR) and advanced language technologies. The content is intended for:

  • Developers who
    • plan to integrate text recognition, classification and intelligent data extraction into their applications and systems
    • want to enhance their Search, ECM, Data Management or Email Management solutions with language technologies
    • are evaluating and testing a specific ABBYY SDK for the first time
  • Existing ABBYY SDK developers who
    • need the latest release
    • plan to upgrade or add new features
    • are interested in exploring further technologies from ABBYY
  • Technical audiences who are interested in advanced technologies that allow users to action information

Latest News & Info

WEBINAR: Add Value to Your Applications with OCR and Document Conversion Functionality

  • Date & Time: Wednesday, 8 June 2016 | 10:00 a.m. GMT | 11:00 a.m. CET
  • Location: Online Webinar
  • Language: English



Mobile OCR Engine 4.0 - Release 15 available (19.05.2016)

  • This maintenance release contains an Android library for the ARM-v7 and x86 processor architectures, improvements to Japanese OCR, and bug fixes.
  • iOS: Download (Login needed)
  • Android: Download (Login needed)

FineReader Engine 11 for Windows - Release 7 Update available (25.05.2016)
This is a maintenance release containing many new features and improvements:

  • Ability to convert documents containing an undefined number of pages to searchable PDFs.
  • Simultaneous use of network and standalone licenses within one installation.
  • Garbage removal from color images.
  • ... more details

Download (Login needed)

FineReader Engine 11 for Mac - Release 6 Update available (30.03.2016)
This release is a patch to the latest official maintenance release (FineReader Engine 11 R6):


FineReader Engine 11 for Linux - Release 6 Update available (21.03.2016)
This release is a patch to the latest official maintenance release (FineReader Engine 11 R6):


News Archive

Smart Classifier

  • A new server-based classification appliance that classifies unstructured content into predefined categories.
    • Smart Classifier supports many file formats natively: TXT, Office, HTML, XML, PDF, images, others…
    • The system learns the characteristics of each document class from document content (machine learning). No rules are needed, and it works for all kinds of unstructured documents.
    • Content experts can train and maintain the models via a modern, web-based model editor
    • Smart Classifier's built-in artificial intelligence selects and optimizes the algorithms to deliver high-quality results
    • Integration via REST API
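
As a rough illustration of what integration via a REST API could look like from client code, here is a minimal Python sketch. The host, the `/classify` path, the request fields (`model`, `text`) and the response shape are all assumptions made for this example; none of them come from the Smart Classifier documentation, which should be consulted for the real endpoints and schema. The sketch only builds the request and parses a sample response, so it runs without a server.

```python
import json
import urllib.request

# Hypothetical values: the real Smart Classifier host, path and payload
# schema are not given on this page and must be taken from the product docs.
BASE_URL = "http://classifier.example.com/api"  # placeholder host
CLASSIFY_PATH = "/classify"                      # assumed endpoint name


def build_classify_request(text: str, model: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for classifying one text."""
    payload = json.dumps({"model": model, "text": text}).encode("utf-8")
    return urllib.request.Request(
        url=BASE_URL + CLASSIFY_PATH,
        data=payload,
        headers={"Content-Type": "application/json", "Accept": "application/json"},
        method="POST",
    )


def top_category(response_body: bytes) -> str:
    """Return the highest-scoring category from an assumed response shape:
    {"categories": [{"name": ..., "score": ...}, ...]}
    """
    categories = json.loads(response_body)["categories"]
    return max(categories, key=lambda c: c["score"])["name"]
```

In a real integration the request would be sent with `urllib.request.urlopen(request)` and the response body passed to a parser like `top_category`, adapted to whatever fields the actual service returns.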



InfoExtractor 2.0

  • ABBYY InfoExtractor is an information extraction technology based on natural language processing (NLP)
  • InfoExtractor SDK reveals entities, events and relations across unstructured texts
  • ABBYY InfoExtractor is based on ABBYY's Compreno syntactic & semantic analysis platform


© 2016 ABBYY. All rights reserved.
