Dispatcher is a powerful tool for document classification. It is used to:
- Classify/identify the document type → to route the document to the correct workflow
- Index images → to deliver images to the correct repository
- Extract business data → to manage transactional information
- Validate information → to control your process
Capture ---> Classify ---> Data Extract ---> Data Validation ---> Document Export
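The flow above can be pictured as a chain of stages that each take a document and pass it on. The following is only a minimal sketch of that idea and not Dispatcher's actual API: the `Document` class, the stage functions, and the rules inside them (for example, looking for the word "invoice" or a `Total:` token) are all hypothetical placeholders.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Document:
    """A captured document moving through the pipeline (hypothetical model)."""
    text: str
    doc_type: str = "unknown"
    data: Dict[str, str] = field(default_factory=dict)
    valid: bool = False
    exported: bool = False


def classify(doc: Document) -> Document:
    # Placeholder rule; a real system would call a classification engine here.
    doc.doc_type = "invoice" if "invoice" in doc.text.lower() else "unknown"
    return doc


def extract(doc: Document) -> Document:
    # Placeholder extraction: take the token that follows "Total:" as the amount.
    tokens = doc.text.split()
    if "Total:" in tokens and tokens.index("Total:") + 1 < len(tokens):
        doc.data["total"] = tokens[tokens.index("Total:") + 1]
    return doc


def validate(doc: Document) -> Document:
    # Valid only if the document was classified and the total looks numeric.
    total = doc.data.get("total", "")
    doc.valid = doc.doc_type != "unknown" and total.replace(".", "", 1).isdigit()
    return doc


def export(doc: Document) -> Document:
    # Only validated documents are marked as exported.
    doc.exported = doc.valid
    return doc


# Classify ---> Data Extract ---> Data Validation ---> Document Export
STAGES: List[Callable[[Document], Document]] = [classify, extract, validate, export]


def run_pipeline(text: str) -> Document:
    doc = Document(text=text)  # "Capture": wrap the incoming text
    for stage in STAGES:
        doc = stage(doc)
    return doc
```

Calling `run_pipeline("Invoice 42 Total: 199.50")` walks the text through every stage in order. Keeping the stages in a list makes it easy to see where a different classification technology could be swapped in.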
Classification technologies: There are five technologies that can be used to classify different types of documents. You can use any one of them, or a combination of several.
| Type / No. | 1 | 2 | 3 |
|---|---|---|---|
| Global Image Analysis (Automatic Template Creation) | Used when a large number of documents must be classified. Independent of document language. Categorizes documents based on similarities, such as document structure. Learns automatically and builds a dynamic knowledge base. | Works on a 'Fuzzy Logic' algorithm. | Structured (forms) / semi-structured documents (bank cheques) |
| Local Image Analysis (HPA - High Precision Anchors) | A sample document is provided with anchors marked on places such as the header, document name, etc. It can be used along with Global Image Analysis to subgroup classified documents. | High precision anchor concept. | Structured / semi-structured documents |
| Keyword Analysis | Keyword matching is used to classify the document. It is not restricted to any specific area, i.e. a full-text search is done on the document. | Regular expressions are used to search for text patterns. | Semi-structured / unstructured documents |
| Text Matching Analysis | Useful when documents have different layouts but the same text/data. | Full-text OCR is used to extract information. | Unstructured document classification |
| Handwritten Detection | Used for handwritten documents. | Uses Fuzzy Logic for document processing and learning. | Unstructured document classification (patient records) |
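To make the idea of combining technologies more concrete, here is a small sketch of keyword analysis with regular expressions, falling back to a text-similarity check when no keyword rule fires. It is an illustration under stated assumptions, not how Dispatcher is implemented: the rule table, the reference texts, the 0.6 threshold, and the function names are all hypothetical, and Python's `difflib.SequenceMatcher` stands in for the product's own text matching on OCR output.

```python
import re
from difflib import SequenceMatcher
from typing import Dict, Optional

# Hypothetical keyword rules: document type -> regular expression.
# The search runs over the full text, not a specific area of the page.
KEYWORD_RULES: Dict[str, re.Pattern] = {
    "invoice": re.compile(r"\binvoice\s+(no|number)\b", re.IGNORECASE),
    "cheque": re.compile(r"\bpay\s+to\s+the\s+order\s+of\b", re.IGNORECASE),
}

# Hypothetical reference texts, one per document type (lowercase).
REFERENCE_TEXTS: Dict[str, str] = {
    "patient_record": "patient name date of birth diagnosis treatment",
    "invoice": "invoice number bill to total amount due",
}


def classify_by_keyword(text: str) -> Optional[str]:
    """Return the first document type whose pattern occurs in the text."""
    for doc_type, pattern in KEYWORD_RULES.items():
        if pattern.search(text):
            return doc_type
    return None


def classify_by_text_match(text: str, threshold: float = 0.6) -> Optional[str]:
    """Return the most similar reference type, if it reaches the threshold."""
    best_type: Optional[str] = None
    best_score = 0.0
    for doc_type, reference in REFERENCE_TEXTS.items():
        score = SequenceMatcher(None, text.lower(), reference).ratio()
        if score > best_score:
            best_type, best_score = doc_type, score
    return best_type if best_score >= threshold else None


def classify(text: str) -> str:
    """Try keyword analysis first, then fall back to text matching."""
    return classify_by_keyword(text) or classify_by_text_match(text) or "unknown"
```

For example, `classify("INVOICE No 1234")` is resolved by the keyword rule alone, while a text with no matching keyword is compared against the reference texts instead. The order of the two steps is a design choice made for this sketch; a real configuration may weigh the technologies differently.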