Smokey and the Multi-Armed Bandit Featuring BERT Reynolds Updated

•

1 recomendación•1,943 vistas

Chris Fregly

Software

Chris Fregly
Developer Advocate
AI and Machine Learning
@AWS
Smokey and the Multi-Armed Bandit
Featuring BERT Reynolds

Abstract
First, I will train and deploy multiple natural language understanding
(NLU) models and compare them in live production using reinforcement
learning to dynamically shift traffic to the winning model.
Second, I will describe the differences between A/B and multi-armed
bandit tests including exploration-exploitation, reward-maximization, and
regret-minimization.
Third, I will dive deep into the details of building and scaling a multi-
armed bandit deployment on AWS using a real-time, stream-based text
classifier with TensorFlow, PyTorch, and BERT on 150+ million reviews
from the Amazon Customer Reviews Dataset.

Me Developer Advocate
AI and Machine Learning @ AWS
(Based in San Francisco)
Co-Author of the O'Reilly Book,
"Data Science on AWS."
Founder of the Advanced
Kubeflow Meetup (Global)
https://www.datascienceonaws.com
data-science-on-aws
@cfregly
linkedin.com/in/cfregly
https://meetup.com/Data-Science-on-AWS

Data Science on AWS – Book and Workshop Outline
https://www.datascienceonaws.com/

Agenda
• Compare A/B Tests vs. Multi-Armed Bandit Tests
• Optimize Bandits with Reinforcement Learning
• Train 2 BERT Languge Models with TensorFlow
• Train a Multi-Armed Bandit Model with Vowpal Wabbit
• Test 2 BERT Models with a Bandit
• DEMO: Scale Multi-Armed Bandits on AWS

Traditional A/B Tests
• Static
• Cannot Add New Models After Test Begins
• Static Traffic Split Between Models A and B
• May Negatively Impact Business Metrics
• Must Run Experiment to Completion
• No Concept of Reward for Winning Model

Multi-Armed Bandit Tests
• Add New Models
• Dynamically Shift Traffic
• Explore-Exploit Strategy
• Finish Experiment Early - or Run Longer!
• Minimize Regret (Business Impact)
• Maximize Reward

Train 2 BERT Models with TensorFlow (Models A & B)
• BERT Mania!
• Fine-Tuning BERT

Train a Bandit Model with Reinforcement Learning (RL)
• Popular Reinforcement Learning Strategies
• Epsilon Greedy
• Thompson’s Sampling
• Online Cover
• Bagging
• Implemented in Vowpal Wabbit (VW)!
• Try Our Open Source RL Containers
• https://github.com/aws/sagemaker-rl-container

Test 2 BERT Models with a Multi-Armed Bandit Model

DEMO: Scale Multi-Armed Bandits on AWS
• BERT Model 1: TensorFlow
• BERT Model 2: PyTorch

© 2020, Amazon Web Services, Inc. or its affiliates. All rights reserved.
DEMO!

More Resources
• O’Reilly Book - Data Science on AWS – Early Release Available!
• https://datascienceonaws.com
• GitHub Repo
• https://github.com/data-science-on-aws/workshop
• AWS Blog Post on Multi-Armed Bandits
• https://aws.amazon.com/blogs/machine-learning/power-contextual-bandits-using-continual-learning-
with-amazon-sagemaker-rl/
• Bandit Algorithms
• https://github.com/VowpalWabbit/vowpal_wabbit/wiki/Contextual-Bandit-algorithms
• Open Source SageMaker Reinforcement Learning Containers
• https://github.com/aws/sagemaker-rl-container

Thank you!
© 2020, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Chris Fregly
data-science-on-aws
@cfregly
linkedin.com/in/cfregly

Más contenido relacionado

La actualidad más candente

Tensorflow in production with AWS LambdaFabian Dubois

PipelineAI Optimizes Your Enterprise AI Pipeline from Distributed Training to...Chris Fregly

Yannis Zarkadas. Enterprise data science workflows on kubeflowMarynaHoldaieva

End to end Machine Learning using Kubeflow - Build, Train, Deploy and ManageAnimesh Singh

Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017Codemotion

How to deploy machine learning models in the CloudAlex Casalboni

Where should I run my code? Serverless, Containers, Virtual Machines and moreBret McGowen - NYC Google Developer Advocate

Serverless with Google CloudBret McGowen - NYC Google Developer Advocate

Whizlabs webinar - Deploying Portfolio Site with AWS ServerlessDhaval Nagar

Infrastructure as Code and AWS CDKSupratipBanerjee

Serving models using KFServingTheofilos Papapanagiotou

Dip into prometheusZaar Hai

TensorFlow London 14: Ben Hall 'Machine Learning Workloads with Kubernetes an...Seldon

IaC on AWS CloudBhuvaneswari Subramani

Automated Testing for Terraform, Docker, Packer, Kubernetes, and MoreC4Media

Introducing Kubeflow (w. Special Guests Tensorflow and Apache Spark)DataWorks Summit

Hands-on Learning with KubeFlow + Keras/TensorFlow 2.0 + TF Extended (TFX) + ...Chris Fregly

Kubeflow Distributed Training and HPOAnimesh Singh

Rene GroeschkeCodeFest

100% Puppet Cloud Deployment of Legacy SoftwarePuppet

La actualidad más candente (20)

Tensorflow in production with AWS Lambda

PipelineAI Optimizes Your Enterprise AI Pipeline from Distributed Training to...

Yannis Zarkadas. Enterprise data science workflows on kubeflow

End to end Machine Learning using Kubeflow - Build, Train, Deploy and Manage

Gabriele Nocco - Massive distributed processing with H2O - Codemotion Milan 2017

How to deploy machine learning models in the Cloud

Where should I run my code? Serverless, Containers, Virtual Machines and more

Serverless with Google Cloud

Whizlabs webinar - Deploying Portfolio Site with AWS Serverless

Infrastructure as Code and AWS CDK

Serving models using KFServing

Dip into prometheus

TensorFlow London 14: Ben Hall 'Machine Learning Workloads with Kubernetes an...

IaC on AWS Cloud

Automated Testing for Terraform, Docker, Packer, Kubernetes, and More

Introducing Kubeflow (w. Special Guests Tensorflow and Apache Spark)

Hands-on Learning with KubeFlow + Keras/TensorFlow 2.0 + TF Extended (TFX) + ...

Kubeflow Distributed Training and HPO

Rene Groeschke

100% Puppet Cloud Deployment of Legacy Software

Similar a Smokey and the Multi-Armed Bandit Featuring BERT Reynolds Updated

Atmosphere Conference 2015: The 10 Myths of DevOpsPROIDEA

MongoDB & ChirpAntonio Di Motta

Building an MLOps Stack for Companies at Reasonable ScaleMerelda

Amazon SageMaker (December 2018)Julien SIMON

Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...Codiax

Micro services may not be the best ideaSamuel ROZE

From Notebook to production with Amazon SageMakerAmazon Web Services

LLMOps for Your Data: Best Practices to Ensure Safety, Quality, and CostAggregage

Enterprise Architectures with Ruby (and Rails)Konstantin Gredeskoul

Simplify Machine Learning with the Deep Learning AMI | AWS Floor28Amazon Web Services

APIdays Paris 2019 - Maintain & Evolve a Public GraphQL API by Aurélien Davi...apidays

Deep AutoViML For Tensorflow Models and MLOps WorkflowsBill Liu

MLOps and Reproducible ML on AWS with Kubeflow and SageMakerProvectus

Serverless projects at MyplanetDaniel Zivkovic

Scaling Machine Learning from zero to millions of users (May 2019)Julien SIMON

ML Best Practices: Prepare Data, Build Models, and Manage Lifecycle (AIM396-S...Amazon Web Services

Build 2019 RecapEran Stiller

Technology Based TestingAlan Richardson

An Introduction to Amazon SageMaker (October 2018)Julien SIMON

Use Case Patterns for LLM Applications (1).pdfM Waleed Kadous

Similar a Smokey and the Multi-Armed Bandit Featuring BERT Reynolds Updated (20)

Atmosphere Conference 2015: The 10 Myths of DevOps

MongoDB & Chirp

Building an MLOps Stack for Companies at Reasonable Scale

Amazon SageMaker (December 2018)

Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...

Micro services may not be the best idea

From Notebook to production with Amazon SageMaker

LLMOps for Your Data: Best Practices to Ensure Safety, Quality, and Cost

Enterprise Architectures with Ruby (and Rails)

Simplify Machine Learning with the Deep Learning AMI | AWS Floor28

APIdays Paris 2019 - Maintain & Evolve a Public GraphQL API by Aurélien Davi...

Deep AutoViML For Tensorflow Models and MLOps Workflows

MLOps and Reproducible ML on AWS with Kubeflow and SageMaker

Serverless projects at Myplanet

Scaling Machine Learning from zero to millions of users (May 2019)

ML Best Practices: Prepare Data, Build Models, and Manage Lifecycle (AIM396-S...

Build 2019 Recap

Technology Based Testing

An Introduction to Amazon SageMaker (October 2018)

Use Case Patterns for LLM Applications (1).pdf

Más de Chris Fregly

AWS reInvent 2022 reCap AI/ML and DataChris Fregly

Pandas on AWS - Let me count the ways.pdfChris Fregly

Ray AI Runtime (AIR) on AWS - Data Science On AWS MeetupChris Fregly

Amazon reInvent 2020 Recap: AI and Machine LearningChris Fregly

Waking the Data Scientist at 2am: Detect Model Degradation on Production Mod...Chris Fregly

Quantum Computing with Amazon BraketChris Fregly

15 Tips to Scale a Large AI/ML Workshop - Both Online and In-PersonChris Fregly

AWS Re:Invent 2019 Re:CapChris Fregly

Swift for TensorFlow - Tanmay Bakshi - Advanced Spark and TensorFlow Meetup -...Chris Fregly

Hyper-Parameter Tuning Across the Entire AI Pipeline GPU Tech Conference San ...Chris Fregly

Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...Chris Fregly

High Performance Distributed TensorFlow in Production with GPUs - NIPS 2017 -...Chris Fregly

PipelineAI + TensorFlow AI + Spark ML + Kuberenetes + Istio + AWS SageMaker +...Chris Fregly

High Performance TensorFlow in Production - Big Data Spain - Madrid - Nov 15 ...Chris Fregly

Optimizing, Profiling, and Deploying TensorFlow AI Models with GPUs - San Fra...Chris Fregly

Building Google's ML Engine from Scratch on AWS with GPUs, Kubernetes, Istio,...Chris Fregly

Nvidia GPU Tech Conference - Optimizing, Profiling, and Deploying TensorFlow...Chris Fregly

Building Google Cloud ML Engine From Scratch on AWS with PipelineAI - ODSC Lo...Chris Fregly

Optimizing, Profiling, and Deploying TensorFlow AI Models in Production with ...Chris Fregly

High Performance TensorFlow in Production -- Sydney ML / AI Train Workshop @ ...Chris Fregly

Más de Chris Fregly (20)

AWS reInvent 2022 reCap AI/ML and Data

Pandas on AWS - Let me count the ways.pdf

Ray AI Runtime (AIR) on AWS - Data Science On AWS Meetup

Amazon reInvent 2020 Recap: AI and Machine Learning

Waking the Data Scientist at 2am: Detect Model Degradation on Production Mod...

Quantum Computing with Amazon Braket

15 Tips to Scale a Large AI/ML Workshop - Both Online and In-Person

AWS Re:Invent 2019 Re:Cap

Swift for TensorFlow - Tanmay Bakshi - Advanced Spark and TensorFlow Meetup -...

Hyper-Parameter Tuning Across the Entire AI Pipeline GPU Tech Conference San ...

Advanced Spark and TensorFlow Meetup - Dec 12 2017 - Dong Meng, MapR + Kubern...

High Performance Distributed TensorFlow in Production with GPUs - NIPS 2017 -...

PipelineAI + TensorFlow AI + Spark ML + Kuberenetes + Istio + AWS SageMaker +...

High Performance TensorFlow in Production - Big Data Spain - Madrid - Nov 15 ...

Optimizing, Profiling, and Deploying TensorFlow AI Models with GPUs - San Fra...

Building Google's ML Engine from Scratch on AWS with GPUs, Kubernetes, Istio,...

Nvidia GPU Tech Conference - Optimizing, Profiling, and Deploying TensorFlow...

Building Google Cloud ML Engine From Scratch on AWS with PipelineAI - ODSC Lo...

Optimizing, Profiling, and Deploying TensorFlow AI Models in Production with ...

High Performance TensorFlow in Production -- Sydney ML / AI Train Workshop @ ...

Último

%in kaalfontein+277-882-255-28 abortion pills for sale in kaalfonteinmasabamasaba

Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812

SHRMPro HRMS Software Solutions PresentationShrmpro

Software Quality Assurance Interview QuestionsArshad QA

%in Midrand+277-882-255-28 abortion pills for sale in midrandmasabamasaba

Crypto Cloud Review - How To Earn Up To $500 Per DAY Of Bitcoin 100% On AutoP...SelfMade bd

Announcing Codolex 2.0 from GDK SoftwareJim McKeeth

VTU technical seminar 8Th Sem on Scikit-learnAmarnathKambale

Chinsurah Escorts ☎️8617697112 Starting From 5K to 15K High Profile Escorts ...Nitya salvi

Generic or specific? Making sensible software design decisionsBert Jan Schrijver

%in Stilfontein+277-882-255-28 abortion pills for sale in Stilfonteinmasabamasaba

introduction-to-automotive Andoid os-csimmonds-ndctechtown-2021.pdfVishalKumarJha10

OpenChain - The Ramifications of ISO/IEC 5230 and ISO/IEC 18974 for Legal Pro...Shane Coughlan

Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave

Define the academic and professional writing..pdfPearlKirahMaeRagusta1

CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE9953056974 Low Rate Call Girls In Saket, Delhi NCR

The title is not connected to what is insideshinachiaurasa2

%in Bahrain+277-882-255-28 abortion pills for sale in Bahrainmasabamasaba

Right Money Management App For Your Financial GoalsJhone kinadey

call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls

Smokey and the Multi-Armed Bandit Featuring BERT Reynolds Updated

1. Chris Fregly Developer Advocate AI and Machine Learning @AWS Smokey and the Multi-Armed Bandit Featuring BERT Reynolds

2. Abstract First, I will train and deploy multiple natural language understanding (NLU) models and compare them in live production using reinforcement learning to dynamically shift traffic to the winning model. Second, I will describe the differences between A/B and multi-armed bandit tests including exploration-exploitation, reward-maximization, and regret-minimization. Third, I will dive deep into the details of building and scaling a multi- armed bandit deployment on AWS using a real-time, stream-based text classifier with TensorFlow, PyTorch, and BERT on 150+ million reviews from the Amazon Customer Reviews Dataset.

3. Me Developer Advocate AI and Machine Learning @ AWS (Based in San Francisco) Co-Author of the O'Reilly Book, "Data Science on AWS." Founder of the Advanced Kubeflow Meetup (Global) https://www.datascienceonaws.com data-science-on-aws @cfregly linkedin.com/in/cfregly https://meetup.com/Data-Science-on-AWS

4. Data Science on AWS – Book and Workshop Outline https://www.datascienceonaws.com/

5. Agenda • Compare A/B Tests vs. Multi-Armed Bandit Tests • Optimize Bandits with Reinforcement Learning • Train 2 BERT Languge Models with TensorFlow • Train a Multi-Armed Bandit Model with Vowpal Wabbit • Test 2 BERT Models with a Bandit • DEMO: Scale Multi-Armed Bandits on AWS

6. Traditional A/B Tests • Static • Cannot Add New Models After Test Begins • Static Traffic Split Between Models A and B • May Negatively Impact Business Metrics • Must Run Experiment to Completion • No Concept of Reward for Winning Model

7. Multi-Armed Bandit Tests • Add New Models • Dynamically Shift Traffic • Explore-Exploit Strategy • Finish Experiment Early - or Run Longer! • Minimize Regret (Business Impact) • Maximize Reward

8. Train 2 BERT Models with TensorFlow (Models A & B) • BERT Mania! • Fine-Tuning BERT

9. Train a Bandit Model with Reinforcement Learning (RL) • Popular Reinforcement Learning Strategies • Epsilon Greedy • Thompson’s Sampling • Online Cover • Bagging • Implemented in Vowpal Wabbit (VW)! • Try Our Open Source RL Containers • https://github.com/aws/sagemaker-rl-container

10. Test 2 BERT Models with a Multi-Armed Bandit Model

11. DEMO: Scale Multi-Armed Bandits on AWS

12. DEMO: Scale Multi-Armed Bandits on AWS • BERT Model 1: TensorFlow • BERT Model 2: PyTorch

14. More Resources • O’Reilly Book - Data Science on AWS – Early Release Available! • https://datascienceonaws.com • GitHub Repo • https://github.com/data-science-on-aws/workshop • AWS Blog Post on Multi-Armed Bandits • https://aws.amazon.com/blogs/machine-learning/power-contextual-bandits-using-continual-learning- with-amazon-sagemaker-rl/ • Bandit Algorithms • https://github.com/VowpalWabbit/vowpal_wabbit/wiki/Contextual-Bandit-algorithms • Open Source SageMaker Reinforcement Learning Containers • https://github.com/aws/sagemaker-rl-container

Smokey and the Multi-Armed Bandit Featuring BERT Reynolds Updated

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Similar a Smokey and the Multi-Armed Bandit Featuring BERT Reynolds Updated

Similar a Smokey and the Multi-Armed Bandit Featuring BERT Reynolds Updated (20)

Más de Chris Fregly

Más de Chris Fregly (20)

Último

Último (20)

Smokey and the Multi-Armed Bandit Featuring BERT Reynolds Updated