Case Study

Quick PoC to demonstrate Document Classification Global Information Services and Publishing Company

Quick PoC to demonstrate Document Classification Global Information Services and Publishing Company

Pages 1 Pages

CIGNEX Confidential 97 Quick PoC to demonstrate Document Classification Business Need: Their current classification process of 10-K forms was manual, error prone and not scalable. With 10M documents and 36 target categories they wanted an intelligent classification model. Global Information Services and Publishing Company With over 15,000 employees, active in across 150 countries offering expertise in Health, Tax and Accounting, Governance, Risk and Compliance Key Features: • Input: XML files / Output: Text files using Parser – Apache Tika + Custom • Evaluated DL4J, Naïve Baiyes and TensorFlow as Classifiers that run models and test set • Reviewer – results of classifier including audit logs, docs parsed and reviewing outliers • Custom Reviewer to capture r

Join for free to read