Automating Document Data Extraction

Posted February 24, 2021 in Cutter Business Technology Journal
This Advisor explores an intelligent system that uses AI technologies to automate data extraction from documents into any one of many structured formats. For any particular document template, the system requires only minimal manual annotation to capture the semantics of specific sections. Once that has been done, millions of documents can be fed through the system to extract information automatically.
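The annotate-once, extract-many workflow described above can be illustrated with a minimal sketch. This is not the authors' system: it swaps the AI components for plain regular expressions, and every name in it (FieldAnnotation, compile_template, extract, the invoice labels, and the sample documents) is a hypothetical chosen for illustration. It only shows the overall shape: a template is annotated a single time, then applied to each incoming document to produce structured output, here JSON.

```python
import json
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldAnnotation:
    """One manual annotation: a field name plus the label that precedes its value."""
    name: str
    label: str


def compile_template(annotations):
    """Turn the one-time annotations into reusable patterns of the form 'Label: value'."""
    return {
        a.name: re.compile(rf"^{re.escape(a.label)}\s*:\s*(.+)$", re.MULTILINE)
        for a in annotations
    }


def extract(template, text):
    """Apply a compiled template to one document; fields not found map to None."""
    record = {}
    for name, pattern in template.items():
        match = pattern.search(text)
        record[name] = match.group(1).strip() if match else None
    return record


# Hypothetical invoice template, annotated once (assumption, not from the article).
INVOICE_TEMPLATE = compile_template([
    FieldAnnotation("invoice_number", "Invoice No"),
    FieldAnnotation("total", "Total Due"),
])

# Made-up sample documents that follow the template.
documents = [
    "Invoice No: A-1001\nCustomer: Acme\nTotal Due: 250.00",
    "Invoice No: A-1002\nTotal Due: 75.50",
]

# The same template is reused for every document, with no further annotation.
records = [extract(INVOICE_TEMPLATE, doc) for doc in documents]
print(json.dumps(records, indent=2))
```

In a real pipeline the regular expressions would presumably be replaced by learned models that generalize across layout variations; the sketch keeps only the separation between the one-time annotation step and the repeated extraction step.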
About The Author
Shahane Eksuzyan
Shahane Eksuzyan is a research scientist at Labs. Dr. Eksuzyan’s research interests include ML and AI. She is an expert in using Python for prototyping and developing models. Prior to getting involved in computer science, Dr. Eksuzyan earned a PhD in chemical physics from the National Academy of Sciences of the Republic of Armenia. She can be reached at
Sedrak Vardanyan
Sedrak Vardanyan is a team leader and senior research scientist at Labs. He is an experienced researcher focused on applied science. For nearly 20 years, Dr. Vardanyan has been dealing with mathematical modeling problems through different fields of applied science: mechanical engineering, economics, university education, and business. He can be reached at
Raj Ramesh
Raj Ramesh is a transformation expert who helps individuals adapt and organizations transform in this new world where AI is becoming commonplace. He is Chief AI Officer at DataFoundry and has worked with senior-level business and IT leaders in Fortune 500 companies as an advisor and architect to foster deeper business-IT collaboration through consulting, advising, and building customized frameworks. Dr. Ramesh has written extensively and spoken…