SlideShare a Scribd company logo
1 of 22
Machine Teaching for workflow
automation
Ariadna Kramkovska
Muntis Rudzitis
RIGA COMM 2020
Agenda
• Introductions
• What is machine teaching?
• Benefits, our unique tool, use cases
• Change of Paradigm
• Demo 1: trained model
• Machine teaching quick wins
• Automation, document extraction
• Demo 2: document classification
Ariadna Kramkovska
Machine Learning Developer
at Emergn
Muntis Rudzitis
Lead Data Scientist
at Emergn
Presenters
Emergn’s Machine
Learning Lab
Our team's core competencies:
• Business understanding of how to profit
from ML
• Machine learning development
• Deep learning and reinforcement learning
• Data visualization
• Algorithms and ML techniques
• Data processing, cleaning and preparation
OUR TECHNOLOGY STACK
We have established and run the largest ML community
in the Baltics with 1260 field experts as members.
All these activities currently makes our company the
No. 1 Choice for young and experience ML talents.
Machine learning models
Tools and programming languages:
• Python
• TensorFlow, TensorBoard
• R ,OpenCV, Caffe2
• KNIME
• Azure Machine Learning
Studio
• C, C++
Deployment techniques
Platforms and environments:
• Stand alone models
• SAP Hana2
• Microsoft Azure (Cortana
intelligences suite)
• SQL Server
We are partnering with GDEXA to help enable the young
generation with highly demanded skills like applied
AI/ML, Big Data Analytics and Cloud Applications.
What is machine teaching?
Why use it?
Business automation
challenge
From automation with replacement of
humans to augmentation and
empowerment of subject matter experts.
We predict that companies who use
augmented automation technologies to
empower their environments and educate
their people on how to use them for better,
more predictable outcomes, will win by
providing the best service and building
better products.
• Wheels for the mind
• Find a comfortable level of automation
Why are we looking for
machine learning
(ML) alternatives?
ML for automation and
workflows should:
• Be transparent and interactive for
business users
• Include natural language-based
solutions where humans have better
comprehension
• Understand context and learning
from smaller data sets
• ML models should be verified and
monitored
AI director at
Facebook
YANN LECUN
ATARI GAME
Self-driving cars need
millions of hours of
training to reach
human level trained in
about 20 hours.
In 80 hours machine
will reach human level
aquired by 15 minutes.
Change of paradigm
Key differences
• Role of subject matter expert (SME) changes – using our tool, SME trains/provides the logic to improve the model.
• SME is integral to the success, needs to be empowered and have the tools to do their work better.
• The "one and done" approach is not flexible/doesn't allow for market changes nor incremental knowledge.
MACHINE TEACHING
MACHINE LEARNING
Iterative machine teaching process
For use cases such as:
• Email/text/document
classification
• Email/text/document
anonymization
• Entity extraction
SME
DATA SCIENTIST
Initial
model
training
Models could be
regularly monitored
by SME
AUTOMATION
WORKFLOW
CLASSIFICATION
MODEL
Tools for model quality inspection
Machine Teaching
administration tools
help business users
and ML Power Users
control classification
model quality.
Demo 1
Monitoring of the trained model and quality check
Finding quick wins
ROUTINE OPERATIONS
• RPA Robots
• Machine Learning
• OCR + Data Extraction
REPETITIVE
COGNITIVE TASKS
COLLABORATION
WORKFLOWS
• Machine Learning
• Rule Engines • Interactive applications
• Workflows
• Data Enrichment
DECISION MAKING
AND SUPPORT
• AI
• Process Mining
• Document classification
• Automation translation
• Collaboration apps
• Approvals
• Case management
• Prioritization of work
• Analysis
• Data Extraction
• Copy data
• Enter data
• Sort documents
FREE UP PEOPLE TIME SUPPORT SHIFT TO DIGITAL OPERATING MODEL
TASK AUTOMATION WORKFLOW AUTOMATION DECISION AUTOMATION
• Machine Learning
Natural
Language
based
use-cases
Automation of the document flow
DOCUMENT LIFE-CYCLE
RECEIVE
DOCUMENT
PROCESS
DOCUMENT
ARCHIVING
• Highly manual
• Need decision making, involving knowledge worker to do manual tasks
• Text comprehension (SME)
• Fraught with errors
• Difficult to research/go back, to find things
• Time consuming
• Not possible when scale is large
Automation of the document flow
DOCUMENT LIFE-CYCLE
RECEIVE
DOCUMENT
SAVE
DOCUMENT
EXTRA
META DATA
Document and
form recognition
using OCR
• Be physical
document or
email or video
any format can
be input
DOCUMENT
ROUTING
DOCUMENT
PUBLISHING
ARCHIVING
Meta-data
extraction
• Key fields
• Key words
• In text
• e.g. PO #
• Subject
• Amount
Document
classification
and routing e.g.
• Classification
• Invoice routing
• Approval,
level, direct
forward to
finance
Document
anonymization
and masking e.g.
• Bank and
personal info
Document
archiving by
classification e.g.
• Archiving based
on classification: in
cloud or on-site
HOT WARM COLD
Information extraction from documents
CLIENT RECORD
CLIENT RELATED
DOCUMENTS AND KEYWORDS DATA
EXTRACTION
MODEL
*Client information is
automatically enriched.
Company: BNP Paribas*
Title:
Name: Tom Barnes*
Other information:
Meta data
Retrieving information from text and documents helps you to obtain data
that can and must supplement the information of certain accounting cards.
DATA
EXTRACTION
CONFIG
Document classification
Email
• By title
• By object
• By action
Mail
• A
• B
• C
Contract
Application
Invoice
CLASS CATALOG
The document class is
determined by the
organizational
documents/text
classification catalog
Training of document class catalogues and document
classification algorithm by business user – specialist
The classification of documents results in the identification of the document class and can use it to
further process the document, usually routing and storing it in an appropriate document process.
DOCUMENT
CLASSIFICATION
MODEL
DOCUMENT
CLASS
Document anonymization
The purpose of anonymizing documents/text is to cover all or part
of sensitive information prior to publication of the document/text.
Person 1, social security number,
living, Address 1, closed contract
Contact Number, with Company
ABC for Contract Title
DATA
MASKING
George Bennett,
Social security number
989384843*****, address,
***** street *
DOCUMENT
ANONYMIZATION
ANONYMIZATED
DOCUMENT
ANONYMIZATED
TEXT
ANONYMIZATION
CONFIGURATION
ANONYMIZATED
DOCUMENT
Machine
Teaching (MT)
Tool
Machine
Learning (ML)
Solution
Technical architecture
SET UP AS AN INTERNAL TOOL
ON-PREMISE SERVICE
MODULAR STRUCTURE
SME/IT:
• MT Tool with UI for training & analytics
• Custom integrations possible
• Endpoints for production scenario provided
Data Scientist:
• ML model exchangeable - has standardized endpoints
• Data and integration – as needed
PYTHON
CAN BE INTEGRATED INTO EXISTING WORKFLOW MODEL
MT DATABASE
MODEL DATA
ML SOLUTION
MT SERVICE
CUSTOM
INTEGRATION &
PRODUCTION
MT TOOL
Machine teaching process
PERS LOC ORG DATE PROD
LABELING
CHANGES
VERIFICATION
TRAINING/
RETRAINING
Demo 2
Text classification example
How to get most value from machine
teaching for Natural Language
Processing (NLP) and Non-NLP models
IN NLP MODELS
• Objective – Create
Optimal data set for best
known optimal models
• Use for data labeling
and data preparation
IN NON-NLP MODELS
• Objective – Find
optimal/best model with
existing data
• Use for model monitoring
and verification
Machine Learning is the subfield of
computer science that, according to Arthur
Samuel in 1959, gives “computers the ability
to learn without being explicitly programmed.”
NLP
Artificial
Intelligence
Machine
Learning
Statistics
Let's stay connected
Contact us at info@emergn.com
for an individual demo.
Follow us: @emergn
MUNTIS RUDZITIS
Lead Data Scientist
at Emergn
https://www.linkedin.com/in/muntis-rudzitis/
ARIADNA KRAMKOVSKA
Machine Learning Developer
at Emergn
https://www.linkedin.com/in/ariadnakramkovska/

More Related Content

Similar to Machine Teaching for workflow automation RIGA COMM 2020

Ipsita_Informatica_9Year
Ipsita_Informatica_9YearIpsita_Informatica_9Year
Ipsita_Informatica_9Year
ipsita mohanty
 

Similar to Machine Teaching for workflow automation RIGA COMM 2020 (20)

Padmini Parmar
Padmini ParmarPadmini Parmar
Padmini Parmar
 
Design Like a Pro: Machine Learning Basics
Design Like a Pro: Machine Learning BasicsDesign Like a Pro: Machine Learning Basics
Design Like a Pro: Machine Learning Basics
 
Global M365 Developer Bootcamp 2020 Hyderabad: KEYNOTE
Global M365 Developer Bootcamp 2020 Hyderabad: KEYNOTEGlobal M365 Developer Bootcamp 2020 Hyderabad: KEYNOTE
Global M365 Developer Bootcamp 2020 Hyderabad: KEYNOTE
 
Choosing the right IDP Solution
Choosing the right IDP SolutionChoosing the right IDP Solution
Choosing the right IDP Solution
 
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
TAUS Machine Translation Showcase, The Simplified Guide to Getting Started in...
 
Challenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in ProductionChallenges of Operationalising Data Science in Production
Challenges of Operationalising Data Science in Production
 
(Recent) technology trends and bridges to gap in the localization industry
(Recent) technology trends and bridges to gap in the localization industry(Recent) technology trends and bridges to gap in the localization industry
(Recent) technology trends and bridges to gap in the localization industry
 
Open, Secure & Transparent AI Pipelines
Open, Secure & Transparent AI PipelinesOpen, Secure & Transparent AI Pipelines
Open, Secure & Transparent AI Pipelines
 
Resume
ResumeResume
Resume
 
Airtel-BML
Airtel-BMLAirtel-BML
Airtel-BML
 
Design Like a Pro: Machine Learning Basics
Design Like a Pro: Machine Learning BasicsDesign Like a Pro: Machine Learning Basics
Design Like a Pro: Machine Learning Basics
 
Ipsita_Informatica_9Year
Ipsita_Informatica_9YearIpsita_Informatica_9Year
Ipsita_Informatica_9Year
 
How I became ML Engineer
How I became ML Engineer How I became ML Engineer
How I became ML Engineer
 
GenerativeAI and Automation - IEEE ACSOS 2023.pptx
GenerativeAI and Automation - IEEE ACSOS 2023.pptxGenerativeAI and Automation - IEEE ACSOS 2023.pptx
GenerativeAI and Automation - IEEE ACSOS 2023.pptx
 
Digitization solutions
Digitization solutionsDigitization solutions
Digitization solutions
 
An AI Maturity Roadmap for Becoming a Data-Driven Organization
An AI Maturity Roadmap for Becoming a Data-Driven OrganizationAn AI Maturity Roadmap for Becoming a Data-Driven Organization
An AI Maturity Roadmap for Becoming a Data-Driven Organization
 
Practical Applications of AI: Real World Examples
Practical Applications of AI: Real World ExamplesPractical Applications of AI: Real World Examples
Practical Applications of AI: Real World Examples
 
UiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptxUiPath Document Understanding_Day 2.pptx
UiPath Document Understanding_Day 2.pptx
 
Delivering Machine Learning Solutions by fmr Sears Dir of PM
Delivering Machine Learning Solutions by fmr Sears Dir of PMDelivering Machine Learning Solutions by fmr Sears Dir of PM
Delivering Machine Learning Solutions by fmr Sears Dir of PM
 
UTICamp-2020. Is translation automation a myth or a reality?
UTICamp-2020. Is translation automation a myth or a reality?UTICamp-2020. Is translation automation a myth or a reality?
UTICamp-2020. Is translation automation a myth or a reality?
 

Recently uploaded

Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
drm1699
 
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Lisi Hocke
 

Recently uploaded (20)

Abortion Clinic In Johannesburg ](+27832195400*)[ 🏥 Safe Abortion Pills in Jo...
Abortion Clinic In Johannesburg ](+27832195400*)[ 🏥 Safe Abortion Pills in Jo...Abortion Clinic In Johannesburg ](+27832195400*)[ 🏥 Safe Abortion Pills in Jo...
Abortion Clinic In Johannesburg ](+27832195400*)[ 🏥 Safe Abortion Pills in Jo...
 
The Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdf
The Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdfThe Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdf
The Evolution of Web App Testing_ An Ultimate Guide to Future Trends.pdf
 
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit MilanWorkshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
Workshop: Enabling GenAI Breakthroughs with Knowledge Graphs - GraphSummit Milan
 
Abortion Pill Prices Germiston ](+27832195400*)[ 🏥 Women's Abortion Clinic in...
Abortion Pill Prices Germiston ](+27832195400*)[ 🏥 Women's Abortion Clinic in...Abortion Pill Prices Germiston ](+27832195400*)[ 🏥 Women's Abortion Clinic in...
Abortion Pill Prices Germiston ](+27832195400*)[ 🏥 Women's Abortion Clinic in...
 
The Strategic Impact of Buying vs Building in Test Automation
The Strategic Impact of Buying vs Building in Test AutomationThe Strategic Impact of Buying vs Building in Test Automation
The Strategic Impact of Buying vs Building in Test Automation
 
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
Abortion Pills For Sale WhatsApp[[+27737758557]] In Birch Acres, Abortion Pil...
 
From Theory to Practice: Utilizing SpiraPlan's REST API
From Theory to Practice: Utilizing SpiraPlan's REST APIFrom Theory to Practice: Utilizing SpiraPlan's REST API
From Theory to Practice: Utilizing SpiraPlan's REST API
 
Entropy, Software Quality, and Innovation (presented at Princeton Plasma Phys...
Entropy, Software Quality, and Innovation (presented at Princeton Plasma Phys...Entropy, Software Quality, and Innovation (presented at Princeton Plasma Phys...
Entropy, Software Quality, and Innovation (presented at Princeton Plasma Phys...
 
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale IbridaUNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
UNI DI NAPOLI FEDERICO II - Il ruolo dei grafi nell'AI Conversazionale Ibrida
 
Novo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMsNovo Nordisk: When Knowledge Graphs meet LLMs
Novo Nordisk: When Knowledge Graphs meet LLMs
 
Community is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea GouletCommunity is Just as Important as Code by Andrea Goulet
Community is Just as Important as Code by Andrea Goulet
 
Encryption Recap: A Refresher on Key Concepts
Encryption Recap: A Refresher on Key ConceptsEncryption Recap: A Refresher on Key Concepts
Encryption Recap: A Refresher on Key Concepts
 
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
Abortion Pill Prices Jane Furse ](+27832195400*)[ 🏥 Women's Abortion Clinic i...
 
Software Engineering - Introduction + Process Models + Requirements Engineering
Software Engineering - Introduction + Process Models + Requirements EngineeringSoftware Engineering - Introduction + Process Models + Requirements Engineering
Software Engineering - Introduction + Process Models + Requirements Engineering
 
Auto Affiliate AI Earns First Commission in 3 Hours..pdf
Auto Affiliate  AI Earns First Commission in 3 Hours..pdfAuto Affiliate  AI Earns First Commission in 3 Hours..pdf
Auto Affiliate AI Earns First Commission in 3 Hours..pdf
 
BusinessGPT - Security and Governance for Generative AI
BusinessGPT  - Security and Governance for Generative AIBusinessGPT  - Security and Governance for Generative AI
BusinessGPT - Security and Governance for Generative AI
 
GraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with Graph
GraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with GraphGraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with Graph
GraphSummit Milan & Stockholm - Neo4j: The Art of the Possible with Graph
 
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
Team Transformation Tactics for Holistic Testing and Quality (NewCrafts Paris...
 
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-CloudAlluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
Alluxio Monthly Webinar | Simplify Data Access for AI in Multi-Cloud
 
Wired_2.0_CREATE YOUR ULTIMATE LEARNING ENVIRONMENT_JCON_16052024
Wired_2.0_CREATE YOUR ULTIMATE LEARNING ENVIRONMENT_JCON_16052024Wired_2.0_CREATE YOUR ULTIMATE LEARNING ENVIRONMENT_JCON_16052024
Wired_2.0_CREATE YOUR ULTIMATE LEARNING ENVIRONMENT_JCON_16052024
 

Machine Teaching for workflow automation RIGA COMM 2020

  • 1. Machine Teaching for workflow automation Ariadna Kramkovska Muntis Rudzitis RIGA COMM 2020
  • 2. Agenda • Introductions • What is machine teaching? • Benefits, our unique tool, use cases • Change of Paradigm • Demo 1: trained model • Machine teaching quick wins • Automation, document extraction • Demo 2: document classification
  • 3. Ariadna Kramkovska Machine Learning Developer at Emergn Muntis Rudzitis Lead Data Scientist at Emergn Presenters
  • 4. Emergn’s Machine Learning Lab Our team's core competencies: • Business understanding of how to profit from ML • Machine learning development • Deep learning and reinforcement learning • Data visualization • Algorithms and ML techniques • Data processing, cleaning and preparation OUR TECHNOLOGY STACK We have established and run the largest ML community in the Baltics with 1260 field experts as members. All these activities currently makes our company the No. 1 Choice for young and experience ML talents. Machine learning models Tools and programming languages: • Python • TensorFlow, TensorBoard • R ,OpenCV, Caffe2 • KNIME • Azure Machine Learning Studio • C, C++ Deployment techniques Platforms and environments: • Stand alone models • SAP Hana2 • Microsoft Azure (Cortana intelligences suite) • SQL Server We are partnering with GDEXA to help enable the young generation with highly demanded skills like applied AI/ML, Big Data Analytics and Cloud Applications.
  • 5. What is machine teaching? Why use it?
  • 6. Business automation challenge From automation with replacement of humans to augmentation and empowerment of subject matter experts. We predict that companies who use augmented automation technologies to empower their environments and educate their people on how to use them for better, more predictable outcomes, will win by providing the best service and building better products. • Wheels for the mind • Find a comfortable level of automation
  • 7. Why are we looking for machine learning (ML) alternatives? ML for automation and workflows should: • Be transparent and interactive for business users • Include natural language-based solutions where humans have better comprehension • Understand context and learning from smaller data sets • ML models should be verified and monitored AI director at Facebook YANN LECUN ATARI GAME Self-driving cars need millions of hours of training to reach human level trained in about 20 hours. In 80 hours machine will reach human level aquired by 15 minutes.
  • 8. Change of paradigm Key differences • Role of subject matter expert (SME) changes – using our tool, SME trains/provides the logic to improve the model. • SME is integral to the success, needs to be empowered and have the tools to do their work better. • The "one and done" approach is not flexible/doesn't allow for market changes nor incremental knowledge. MACHINE TEACHING MACHINE LEARNING
  • 9. Iterative machine teaching process For use cases such as: • Email/text/document classification • Email/text/document anonymization • Entity extraction SME DATA SCIENTIST Initial model training Models could be regularly monitored by SME AUTOMATION WORKFLOW CLASSIFICATION MODEL
  • 10. Tools for model quality inspection Machine Teaching administration tools help business users and ML Power Users control classification model quality.
  • 11. Demo 1 Monitoring of the trained model and quality check
  • 12. Finding quick wins ROUTINE OPERATIONS • RPA Robots • Machine Learning • OCR + Data Extraction REPETITIVE COGNITIVE TASKS COLLABORATION WORKFLOWS • Machine Learning • Rule Engines • Interactive applications • Workflows • Data Enrichment DECISION MAKING AND SUPPORT • AI • Process Mining • Document classification • Automation translation • Collaboration apps • Approvals • Case management • Prioritization of work • Analysis • Data Extraction • Copy data • Enter data • Sort documents FREE UP PEOPLE TIME SUPPORT SHIFT TO DIGITAL OPERATING MODEL TASK AUTOMATION WORKFLOW AUTOMATION DECISION AUTOMATION • Machine Learning Natural Language based use-cases
  • 13. Automation of the document flow DOCUMENT LIFE-CYCLE RECEIVE DOCUMENT PROCESS DOCUMENT ARCHIVING • Highly manual • Need decision making, involving knowledge worker to do manual tasks • Text comprehension (SME) • Fraught with errors • Difficult to research/go back, to find things • Time consuming • Not possible when scale is large
  • 14. Automation of the document flow DOCUMENT LIFE-CYCLE RECEIVE DOCUMENT SAVE DOCUMENT EXTRA META DATA Document and form recognition using OCR • Be physical document or email or video any format can be input DOCUMENT ROUTING DOCUMENT PUBLISHING ARCHIVING Meta-data extraction • Key fields • Key words • In text • e.g. PO # • Subject • Amount Document classification and routing e.g. • Classification • Invoice routing • Approval, level, direct forward to finance Document anonymization and masking e.g. • Bank and personal info Document archiving by classification e.g. • Archiving based on classification: in cloud or on-site HOT WARM COLD
  • 15. Information extraction from documents CLIENT RECORD CLIENT RELATED DOCUMENTS AND KEYWORDS DATA EXTRACTION MODEL *Client information is automatically enriched. Company: BNP Paribas* Title: Name: Tom Barnes* Other information: Meta data Retrieving information from text and documents helps you to obtain data that can and must supplement the information of certain accounting cards. DATA EXTRACTION CONFIG
  • 16. Document classification Email • By title • By object • By action Mail • A • B • C Contract Application Invoice CLASS CATALOG The document class is determined by the organizational documents/text classification catalog Training of document class catalogues and document classification algorithm by business user – specialist The classification of documents results in the identification of the document class and can use it to further process the document, usually routing and storing it in an appropriate document process. DOCUMENT CLASSIFICATION MODEL DOCUMENT CLASS
  • 17. Document anonymization The purpose of anonymizing documents/text is to cover all or part of sensitive information prior to publication of the document/text. Person 1, social security number, living, Address 1, closed contract Contact Number, with Company ABC for Contract Title DATA MASKING George Bennett, Social security number 989384843*****, address, ***** street * DOCUMENT ANONYMIZATION ANONYMIZATED DOCUMENT ANONYMIZATED TEXT ANONYMIZATION CONFIGURATION ANONYMIZATED DOCUMENT
  • 18. Machine Teaching (MT) Tool Machine Learning (ML) Solution Technical architecture SET UP AS AN INTERNAL TOOL ON-PREMISE SERVICE MODULAR STRUCTURE SME/IT: • MT Tool with UI for training & analytics • Custom integrations possible • Endpoints for production scenario provided Data Scientist: • ML model exchangeable - has standardized endpoints • Data and integration – as needed PYTHON CAN BE INTEGRATED INTO EXISTING WORKFLOW MODEL MT DATABASE MODEL DATA ML SOLUTION MT SERVICE CUSTOM INTEGRATION & PRODUCTION MT TOOL
  • 19. Machine teaching process PERS LOC ORG DATE PROD LABELING CHANGES VERIFICATION TRAINING/ RETRAINING
  • 21. How to get most value from machine teaching for Natural Language Processing (NLP) and Non-NLP models IN NLP MODELS • Objective – Create Optimal data set for best known optimal models • Use for data labeling and data preparation IN NON-NLP MODELS • Objective – Find optimal/best model with existing data • Use for model monitoring and verification Machine Learning is the subfield of computer science that, according to Arthur Samuel in 1959, gives “computers the ability to learn without being explicitly programmed.” NLP Artificial Intelligence Machine Learning Statistics
  • 22. Let's stay connected Contact us at info@emergn.com for an individual demo. Follow us: @emergn MUNTIS RUDZITIS Lead Data Scientist at Emergn https://www.linkedin.com/in/muntis-rudzitis/ ARIADNA KRAMKOVSKA Machine Learning Developer at Emergn https://www.linkedin.com/in/ariadnakramkovska/