Targeted Audience Delivery with Presto

+
Lucas.Waye @ TiVo.com
April 5th, 2018

What’s using Presto:
Targeted Audience Delivery

TV networks, programmers,
and advertisers
What are my target
viewership segments?
Set-Top box data
Purchasing Behavior
Location-based Consumer Data
Program Metadata

TV networks, programmers,
and advertisers
What are my target
viewership segments?
Set-Top box data
Purchasing Behavior
Location-based Consumer Data
Program Metadata
brought to you (in part) by

looking to the past for inspiration for the future

Similar Products at TiVo
ETL
Amazon
S3 Java services
on EC2
ETL
Amazon
Redshift
MySQL
(RDS)

Similar Products at TiVo
ETL
Amazon
S3 Java services
on EC2
ETL
Amazon
Redshift
MySQL
(RDS)
transactional and customer-conﬁgurable data
semi-aggregated viewership data +
sets of households (e.g., “18-24 years old”, “owns minivan”)

New Product, New Challenges…
ETL
Amazon
S3 Java services
on EC2
ETL
Amazon
Redshift
MySQL
(RDS)
MySQL
MySQL
MySQL
Many new data marts
popping up in our tech stack

ETL
Amazon
S3 Java services
on EC2
ETL
Amazon
Redshift
MySQL
(RDS)
more viewership data
OK,
storage is cheap

ETL
Amazon
S3 Java services
on EC2
ETL
Amazon
Redshift
MySQL
(RDS)
more viewership data
storage is not cheap…

ETL
Amazon
S3 Java services
on EC2
ETL
Amazon
Redshift
MySQL
(RDS)
storage is not cheap… Need ﬁner
grain data!

ETL
Amazon
S3 Java services
on EC2
ETL
Amazon
Redshift
MySQL
(RDS)
storage is not cheap… Need ﬁner
grain data!
Can’t aggregate
as much

ETL
Amazon
S3 Java services
on EC2
ETL
Amazon
Redshift
MySQL
(RDS)
static,
hard to scale

Wait, what about Redshift Spectrum ?

Redshift Spectrum
Redshift: Pay per
node-hour
Spectrum: Pay per
data access

Experiment: join on two tables
• Small Joins: join small Redshift table with (ﬁltered-down) large table on S3

• Join across ~1M rows

• Large Joins: join large Redshift table with (unﬁltered) large table table on S3

• Join across ~10M rows

Compare to: both tables on Redshift
How Does it Scale?

Time
(sec)
Concurrent queries
Redshift Spectrum for “Simple" Queries
0
10
20
30
40
50
60
70
1 3 5 7 9 11 13 15
Latency (sec) vs. # Concurrent Requests
1 day 1 day (Spectrum)
Spectrum faster when cluster loaded
and can pre-filter/pre-aggregate data
small joins

Time
(sec)
Concurrent queries
Redshift Spectrum for “Simple" Queries
0
10
20
30
40
50
60
70
1 3 5 7 9 11 13 15
Latency (sec) vs. # Concurrent Requests
1 day 1 day (Spectrum)
Spectrum faster when cluster loaded
and can pre-filter/pre-aggregate data
small joins
Spectrum faster

Time
(sec)
Concurrent queries
Redshift Spectrum for Complex Queries

Time
(sec)
Concurrent queries
Redshift Spectrum for Complex Queries
Spectrum slower!

Memory for broadcast join on the cluster is a non-parallelizable resource in the cluster
Amdahl’s Law in Eﬀect

Memory for broadcast join on the cluster is a non-parallelizable resource in the cluster
Amdahl’s Law in Eﬀect
“Operations that can't be pushed to the Redshift Spectrum
layer include [JOIN], DISTINCT and ORDER BY. …
When large amounts of data are returned from Amazon S3,
the processing is limited by your cluster's resources.”
https://docs.aws.amazon.com/redshift/latest/dg/c-spectrum-external-performance.html

Wait, what about Redshift Spectrum ?
Our queries won’t work well on Spectrum.

Our Choice:
• Storage/Compute Separation

• Easy to add and remove worker nodes

• Query many diﬀerent data sources (inside our VPC)  
without separate load

• Good performance for analytical queries. 
Not so good for transactional and simple queries…

• Managed (e.g., Qubole, Starburst)

Coordinator
Worker Worker Worker
S3 / Hive
metastore
MySQL
Connector
Connector
SELECT SUM(v.seconds_viewed)
FROM hive.db.viewership v
JOIN mysql.db.audiences a ON a.hh_id = v.hh_id
WHERE audience_id = 42
mysql catalog à
hive catalog à
SELECT …
FROM db.audiences
WHERE audience_id = 42
DRAFT - TiVo Confidential 2018
How Presto Works
Data is streamed

back to the workers

First Challenge:
What instance types should we use?

Presto Worker Memory
System Memory
reserved-system-memory =
0.4 * JVM Max Memory
Reserved Memory
max-memory-per-node
General Memory
(the rest)
All Queries Start Using
Memory From Here

System Memory
Reserved Memory
max-memory-per-node
General Memory
(the rest)
All Queries Start Using
Memory From Here
Query

System Memory
Reserved Memory
max-memory-per-node
General Memory
(the rest)
Needs more memory than in
General Pool —> Switch to Reserved
Query

System Memory
Reserved Memory
max-memory-per-node
General Memory
(the rest)
General Pool —> Switch to Reserved
Query
Only one query allowed!

System Memory
Reserved Memory
max-memory-per-node
General Memory
(the rest)
Reserved Pool —> Fail
Query

System Memory
Reserved Memory
max-memory-per-node
General Memory
(the rest)
Reserved Pool —> Fail
Query
But there’s available
memory??

System Memory
Reserved Memory
max-memory-per-node
General Memory
(the rest)
Reserved Pool —> keep allocating
(resource overcommit)
Query

System Memory
Reserved Memory
max-memory-per-node
General Memory
(the rest)
Query
But now a single query can
hog the entire cluster!

Query Query
Multiple Workers

Query Query
Multiple Workers
Total Memory
max-memory=
max-memory-per-node * number of nodes

QueryQuery
Multiple Workers
Total Memory
max-memory=
QueryQuery

Total Memory
max-memory=
Query
Query
Multiple Workers
Query
Query

• What if memory usage varies a lot between diﬀerent queries? 
• Use many inexpensive instances, or a few expensive instances? 
• Compute optimized or memory optimized?
Working With Reserved Memory Pool
How do we achieve that?
Conceptually, reserved memory pool should be the “high water mark”

while most queries complete in the general pool.

• What if memory usage varies a lot between diﬀerent queries? 
• Use many inexpensive instances, or a few expensive instances? 
• Compute optimized or memory optimized?
Working With Reserved Memory Pool
Conceptually, reserved memory pool should be the “high water mark”

while most queries complete in the general pool.
Solution: multiple clusters based on workload
Empiric testing found smaller cluster size was slightly faster
Solution: Cost/Beneﬁt Analysis
How do we achieve that?

Choosing the Right Instance Type
r 4 . 4 x l a r g e
Instance
Class
Generation
Multiplier
For CPU and Mem
t 2 . 2 x l a r g e
c 5 . 16x l a r g e

r 4 . 4 x l a r g e
Instance
Class
Generation
Multiplier
For CPU and Mem
t 2 . 2 x l a r g e
c 5 . 16x l a r g e
Over 100 to choose from!

Credit: Willard Simmons (DataXu)

Older generations
are inefﬁcient

Better for larger
memory clusters
Older generations
are inefﬁcient

Better for smaller
memory clusters
Older generations
are inefﬁcient

Second Challenge:
Elastic Scaling

More Concurrency? Add More Nodes

Presto
Worker
Presto
Worker
Presto
Coordinator
1 Query
When will queries complete
at current rate?

Presto
Worker
Presto
Worker
Presto
Coordinator
10 Queries
at current rate?
Not fast enough!

Presto
Worker
Presto
Worker
Presto
Coordinator
10 Queries
at current rate?
Qubole provisions more nodes up to a limit
(around 3 minutes)
Presto
Worker
Presto
Worker

Presto
Worker
Presto
Worker
Presto
Coordinator
1 Query
at current rate?
Presto
Worker
Presto
Worker
Too fast!

Presto
Worker
Presto
Worker
Presto
Coordinator
1 Query
at current rate?
Qubole decommissions more nodes up to a limit

Not so fast…
Presto
Worker
Presto
Worker
Presto
Coordinator
1 Query
at current rate?
Not fast enough!
100% CPU 100% CPU

Presto
Worker
Presto
Worker
Presto
Coordinator
1 Query
at current rate?
Upscaling only works for new queries
Presto
Worker
Presto
Worker
100% CPU 100% CPUIdle Idle
Not so fast…
Not fast enough!

Presto
Worker
Presto
Worker
Presto
Coordinator
1 Query
at current rate?
Upscaling only works for new queries
Presto
Worker
Presto
Worker
100% CPU 100% CPUIdle Idle
Not so fast…
Not fast enough!
Maybe we should have sent this query
to a more powerful cluster?
Autoscaling is for concurrency

Query History
Presto UI is nice for watching queries as they’re happening, but not for historical auditing

Service administration portal tracks Qubole commands

(Presto queries) and links to the Qubole web site

View and download intermediate queries and results

Presto Query Auditing

• Oﬃcial Presto JDBC driver does not support Prepared Statements

• Worker loss not handled gracefully 
(if one task fails, all tasks fail — we take that risk with retry logic)

• No support for upper-case table names in MySQL (Issue 2863)

• TIMESTAMP behavior does not match SQL standard (Issue 7122)

• Naïve query optimizer (talk to Starburst!)
Speciﬁc Technical Presto Issues

• Oﬃcial Presto JDBC driver does not support Prepared Statements

• Worker loss not handled gracefully 
(if one task fails, all tasks fail — we take that risk with retry logic)

• No support for upper-case table names in MySQL (Issue 2863)

• TIMESTAMP behavior does not match SQL standard (Issue 7122)

• Naïve query optimizer (talk to Starburst!)
Moral: you may need to get creative with workarounds
Speciﬁc Technical Presto Issues

Presto Docker container
using memory connectors
Testing

Testing
Declarative syntax allows us to mock tables
in the Docker container

Testing
Declarative syntax allows us to mock tables
in the Docker container
…so we can test our generated queries in isolation
using Behavior-Driven Development.

Setting expectations: Make sure everyone knows Presto is 
not a full-ﬂedged database.

Providing one logical view of the data model across many databases is great! 
Favorite for many other workloads beyond its initial scope for this reason.

Presto’s simplicity resulted in widespread adoption.

Biggest (Positive) Surprise

Provocative Ending
Presto feels like an API gateway, but for data.
Behavioral Services Data Applications
Interface (REST, WSDL, Thrift, etc.) :: Data Definition Language (DDL)
Requests (HTTP, SOAP, etc.) :: Data Manipulation Language (DML)
Service implementation language :: Database technology
Publishing an endpoint :: Exposing a table or view
Service handler :: CREATE VIEW, CREATE TRIGGER
Service endpoint configuration :: Catalog/connector configuration

Provocative Ending
Presto feels like an API gateway, but for data.
Behavioral Services Data Applications
Interface (REST, WSDL, Thrift, etc.) :: Data Definition Language (DDL)
Requests (HTTP, SOAP, etc.) :: Data Manipulation Language (DML)
Service implementation language :: Database technology
Publishing an endpoint :: Exposing a table or view
Service handler :: CREATE VIEW, CREATE TRIGGER
Service endpoint configuration :: Catalog/connector configuration
What other engineering advancements can we push through the lens from
microservices (behaviors) to databases (state)?

Targeted Audience Delivery with Presto

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (20)

Similar a Targeted Audience Delivery with Presto

Similar a Targeted Audience Delivery with Presto (20)

Último

Último (20)

Targeted Audience Delivery with Presto