Dispatcher is a very strong tool used for document
classification.
- Classify/Identify document type à to route the document to the correct workflow
- Index images à to deliver images to the correct repository
- Extract business data à to manage transactional information
- Validate information à to control your process
Capture--->Classify--->Data
Extract---> Data validation--->Document Export
Classification technologies: There
are 5 technologies which can be used for different types of documents
classification. You can use any one or combination of some of them.
Type/ No
|
Global Image Analysis(Automatic
Template Creation)
|
Local Image Analysis(HPA-High
Precision Anchors)
|
Keyword Analysis
|
Text Matching Analysis
|
Handwritten Detection
|
1
|
Used when large number of documents
present for classification. Document language independent. Categorizes
documents based on similarities ,Like document structure. Automatic learning
and builds a dynamic knowledge base
|
Here sample document is provided with
anchors marked on places like header, document name etc. It can be used along with Global Image
Analysis to sub group classified documents.
|
Keyword match is used to classify
document. It’s irrespective of any specific area. i.e full text search done
on the document.
|
Useful when documents have different
layouts but same text/data.
|
Used for handwritten documents
|
2
|
It works on ‘Fuzzy Logic’ algorithm
|
High precision anchor concept
|
Regular expressions used for search
text pattern
|
Full text OCR used to extract
information.
|
Uses Fuzzy Logic for document
processing and learining.
|
3
|
Structured(Forms) /semi structured documents(Bank Cheques)
|
Structured /semi structured documents
|
semi structured /Unstructured documents
|
Unstructured document classification
|
Unstructured document classification(Patient Records)
|
This can be one particular of the most useful blogs We’ve ever arrive across on this subject. Basically Wonderful. I am also a specialist in this topic so I can understand your hard work.
ReplyDeleteDocument Management Software
Document Management System
Electronic Document Management Software
Best Document Management Software
Content management (CM) is the process for collection, delivery, retrieval, governance and overall management of information in any format. The term is typically used in reference to administration of the digital content lifecycle, from creation to permanent storage or deletion.
ReplyDelete