Tuesday 29 October 2013

Captiva Dispatcher- Document Classification

Dispatcher is a very strong tool used for document classification.        
     
  • Classify/Identify document type  à  to route the document to the correct workflow 
  • Index images  à  to deliver images to the correct repository
  • Extract business data  à  to manage transactional information
  • Validate information  à  to control your process
Capture--->Classify--->Data Extract---> Data validation--->Document Export


Classification technologies: There are 5 technologies which can be used for different types of documents classification. You can use any one or combination of some of them.

Type/ No
Global Image Analysis(Automatic Template Creation)
Local Image Analysis(HPA-High Precision Anchors)
Keyword Analysis
Text Matching Analysis
Handwritten Detection
1
Used when large number of documents present for classification. Document language independent. Categorizes documents based on similarities ,Like document structure. Automatic learning and builds a dynamic knowledge base
Here sample document is provided with anchors marked on places like header, document name etc.  It can be used along with Global Image Analysis to sub group classified documents.
Keyword match is used to classify document. It’s irrespective of any specific area. i.e full text search done on the document.
Useful when documents have different layouts but same text/data.
Used for handwritten documents
2
It works on ‘Fuzzy Logic’ algorithm
High precision anchor concept
Regular expressions used for search text pattern
Full text OCR used to extract information.
Uses Fuzzy Logic for document processing and learining.
3
Structured(Forms) /semi structured documents(Bank Cheques)
Structured /semi structured documents
semi structured /Unstructured documents
Unstructured document classification
Unstructured document classification(Patient Records)


2 comments:

  1. This can be one particular of the most useful blogs We’ve ever arrive across on this subject. Basically Wonderful. I am also a specialist in this topic so I can understand your hard work.
    Document Management Software
    Document Management System
    Electronic Document Management Software
    Best Document Management Software

    ReplyDelete
  2. Content management (CM) is the process for collection, delivery, retrieval, governance and overall management of information in any format. The term is typically used in reference to administration of the digital content lifecycle, from creation to permanent storage or deletion.

    ReplyDelete