SlideShare a Scribd company logo
1 of 42
1© Cloudera, Inc. All rights reserved.
5th Annual
2© Cloudera, Inc. All rights reserved.
Perspectives on Ethical Big Data
Governance
Murthy Mathiprakasam, Principal Product Marketing Manager, Informatica
Steven Totman, Financial Services Industry Lead, Cloudera
3© Cloudera, Inc. All rights reserved.
Governance of Big Data?
44
What is Metadata
5
So what is “Metadata”?
5
Metadata enables you to put context and meaning to things.
It is generated and consumed by every organization and software product.
6
So what is “Metadata”?
6
Metadata enables you to put context and meaning to things.
It is generated and consumed by every organization and software product.
Varieties
Made Since
Made By
Contains
Volume
7
Enterprise Metadata
Confidential – Internal Cloudera only
Business Glossary
Enterprise Taxonomy
Ontology
Business Metadata
B
T
Technical Metadata Database Schema
File Definition
Data Flow Design
BI Report Definition
Data Model
O
Operational Metadata Job Run-time Stats
Report run information
Hardware Usage
Scheduler Stats
Enables Data Governance
Literally, “data about data” that describes your company’s
information from both a business and a technical perspective
Data Lineage
Impact Analysis
Topology Understanding
SOX Compliance
Auditing
88
What is Data Governance
9
What is Data Governance?
Confidential – Internal Cloudera only
People
ProcessTechnology
● Encompasses the People,
Processes and Information
Technology required to create a
consistent and proper handling
of an organization's data across
the business enterprise
● Goals may be defined at all levels
of the enterprise and doing so
may aid in acceptance of
processes by those who will use
them
10© Cloudera, Inc. All rights reserved.
Jake Sorofman | December 12, 2012
Jake Sorofman - Gartner: Don’t be creepy
I think personalization has the potential for unmistakable good—for both consumers
and brands. But the question remains: how do you hew the line between
personalized and downright creepy?
Comic book prophet Stan Lee told us “With great power comes great responsibility.”
Data Ethics
11© Cloudera, Inc. All rights reserved.
Millennials are not very worried about their privacy
online - Digital lives of millennials survey
http://www.americanpressinstitute.org/publications/reports/survey-research/digital-lives-of-millennials/
Only 2/10 Millennials worry
all / most of the time
Don’t worry at all 34%
Worry all/most of the time 20%
Worry a little 46%
Question: how much do you worry, if at all, about information about you being available online?
12© Cloudera, Inc. All rights reserved.
Ethical Data Usage
Good way to evaluate -
What would mum and dad
think?
13© 2016 Cloudera, Inc. All rights reserved.
Privacy by design https://www.cpdp.vic.gov.au/images/content/pdf/CPDP_Privacy_by_Design_Background_paper_Oct_2014.pdf
Based on 7 Foundational Principles,
PbD was first developed by Ontario’s
Information and Privacy
Commissioner, Dr. Ann Cavoukian, in
the 1990s.
Privacy by Design
14© Cloudera, Inc. All rights reserved.
Real Life Case Study:
Network Intelligence
15© 2016 Cloudera, Inc. All rights reserved.
These common policy guidelines from the “toolkit”
ought to inform data protection and usage
16© Cloudera, Inc. All rights reserved.
It’s Becoming Harder to Protect Your Organization
16 billion connected devices
generating more data
Attacker sophistication has
increased leading to 250% more
successful attacks
Protection against attacks with
known signatures no longer
sufficient
Threat Surface Expanding Attacks are Increasing Threats Are Highly Adaptive
Trends Driving Change
17© 2016 Cloudera, Inc. All rights reserved.
Data Free-for-All:
Available & Error-Prone
Basic Security Controls:
Authorization
Authentication
Comprehensive Auditing
Data Security &
Governance:
Lineage Visibility
Metadata Discovery
Encryption & Key
Management
Start with the Hadoop Security Maturity Model
Achieve Scale and Cost Effectiveness via a Secure Data Vault
Fully Compliance Ready:
Audit-Ready & Protected
Audit Ready For:
EU Data Protection Directive,
PCI DSS
HIPAA, FERPA
FISMA, PII
Full encryption, key
management, transparency,
and enforcement for all data-
at-rest and data-in-motion
DataVolume&Sensitivity
Security Compliance & Risk Mitigation
0 Highly Vulnerable
Data at Risk
1 Reduced Risk
Exposure
2 Managed, Secure,
Protected
3 Enterprise Data
Hub Secure Data
Vault
Perspectives on
Ethical Big Data Governance
Murthy Mathiprakasam
mmathipra@informatica.com
@mmathiprakasam
Big Data Can Be An Asset or a Liability
IT
efficiency
and
compliance
Risk
reduction,
cost
controls, &
business
efficiencies
Greater
compliance,
Efficiency,
and support
for revenue
growth
Strategic
differentiation
0: Unaware
• No activity
1: Initial
• Ad hoc
2: Repeatable
• Pilot
3: Defined
• Project
4: Managed
• Program
5: Optimized
• Function
Data Governance
Maturity
Data is an enterprise resource
like any other resource
(financial capital, human capital,
etc)
• Passively, enterprises must
comply with internal controls
and external regulations
• Strategically, enterprises
can drive greater analytical
value with governed assets
Is Data Becoming Too Pervasive?
Who Knows What About You?
Your Mission
Define and Enforce an Ethical
Policy of Governance That Delivers
A Great Public Outcomes While
Ensuring Trust By Understanding,
Protecting, and Tracking Your Data
Keys To Success
Train Your People on a Defined Code of Ethics
Best Practices for Big Data Governance - 1
▪ Define Big Data Governance Process –
Roles, Standards, Taxonomies, Business
Glossaries
▪ Discover Big Data Assets
▪ Apply, Measure, and Monitor Enforcement
▪ Leverage IT & Business Collaboration To
Balance Big Data Governance Objectives
With Agility
Apply
Data
Governance
Apply
Measure
and
Monitor
Define
Discover
IT Business
• Quality and Governance
are not fixed points
• Use flexibility of Hadoop
alongside curation
process and technology
for fit-for-purpose data
assets to get right data to
right people at the right
time
Best Practices for Big Data Governance - 2
Curation of Fit-for-Purpose Data Assets
Raw Prepared
Cleansed/
Matched
Hadoop Data Lake
What Technology Challenges Are Customers Facing?
“too many data silos making it impossible to
know what data can be trusted”
Pete, Chief Data Officer
“need to ensure confidence in
data integrity, accuracy, and
timeliness”
Ron, VP Global Information Systems
“need code re-usability and
code maintainability”
Ben, Director of Platform
Architecture
“regulations have become very strict
and very precise – lots of gaps in the
quality of the data”
Christine, Manager Data Management
“transforming data management from a labor
intensive, qualitative approach to a systematic
approach…to classify data and understand lineage”
Ned, Senior Vice President
Big Data Cannot Be Tackled Manually
The Race to Business Value Will Not Be Won By Hand
More
Volume
More
Variety
More
Velocity
More Data
Consumers
More Data
Silos
More Data
Platforms
Big Data Is Difficult To Trust
Changing
Needs for Quality
Same data used for
multiple purposes
Hidden
Relationships
Everything and everyone
is interconnected
Magnified
Trust Issues
New sources of
external data
And Regulations And Controls Are Harder To Meet
SOX
PCI
HIPAA
FISMA
ISO
GLBA
NIST
Perimeter Security Is Insufficient
Perimeter security: Outside in security
• Not if, but when
• Network focused
• Attacks will only grow
Big Data: Bigger Risk
• An exponential attack surface
• With exponential risks
In the Public Sector, This Means People Can Suffer Impact
Can’t make comprehensive decisions based on all
of the available data
Can’t make accurate decisions based on
high quality and secure data
Can’t make timely decisions based on
fresh and up-to-date data
Can’t operationalize data delivery to
fuel decisions repeatably and scalably
Find & Access
Any Data Centrally
Discover Data
Relationships
Prepare & Share
Relevant Data
Operationalize Business
Insights Quickly
“Project Sonoma”
PILLAR 1
Big Data
Integration
• Optimized Execution & Flexible
Deployment
• 100’s of Pre-built Transforms,
Connectors & Parsers
• Flexible Deployment to Cloud
PILLAR 2
Big Data
Governance & Quality
• Business Glossary
• Profiling and Data Quality
• 360° Relationship Views
PILLAR 3
Big Data
Security
• Discover sensitive data in Hive
• Dynamic data masking for Hive
• Proliferation Analysis
19
Informatica Big Data Management 10.1
NEW
Big Data Management Meets Data Governance
Informatica Big Data Management + Cloudera Navigator
• Seamless integration with Cloudera Enterprise
• Track the end-to-end lineage of data before, during, and after Hadoop processing with metadata
management and lifecycle management
• Use data profiling capabilities and pre-built business rules to evaluate and mitigate data quality
• Comprehensive platform for managing technical, business, and operational metadata
Your Mission
Define and Enforce an Ethical
Policy of Governance That Delivers
A Great Public Outcomes While
Ensuring Trust By Understanding,
Protecting, and Tracking Your Data
3
Great
Transportation
▪ Aspiration: Florida Turnpike
sought to improve emergency
preparedness and improve the
prepaid toll program
▪ Challenge: Data collection took
over one month leading to faulty
analytics
▪ Outcome: “Timely and accurate
traffic, revenue, and participation
reports help management make
good choices that will eventually
result in saving money.”
▪ — Bob Hartmann, IT Director,
Florida Turnpike Enterprise
3
Great
Environment
▪ Aspiration: US Geological Service
sought to improve the quality of
water in the United States
▪ Challenge: Collect distributed data
and build a centralized water quality
dataset
▪ Outcome: “We chose Informatica
as our data integration solution
because of its maturity, wide range
of features, ease of use and
industrial strength, integrated
architecture.”
▪ — Harry House, Data Warehouse
Practice Leader, USGS
3
Great
Education
▪ Aspiration: Rochester Institute of
Technology sought to understand how
it could improve student enrollment,
student housing, and student retention
▪ Challenge: Data was in disparate
systems
▪ Outcome: “We're becoming myth
busters. Informatica provides timely,
accurate information we need to spot
trends, improve the quality of our
academic learning, and reduce
attrition.”
▪ — Kim Sowers, Director of Application
Development, Rochester Institute of
Technology
3
Great
Healthcare
▪ Aspiration: Utah Dept of Health
sought to process healthcare claims
faster and improve public health
▪ Challenge: Manual effort to track
and link claims data over time
▪ Outcome: “We see the Informatica
as absolutely essential to
everything that we want to do, not
only to meet our mandate for the All
Payer Database,”
▪ — Dr. Keely Cofrin Allen, Director,
Office of Health Care Statistics,
State of Utah Department of Health
Thank You!
41© Cloudera, Inc. All rights reserved.
Getting Started is Easy
1. 2.
Inventory Data &
Understand Related
Legal frameworks
Define and publish
usage guidelines &
privacy policies
Contact us or a Partner
to Start a POC
3.
Define & share “what is legal and what is right” - for your organization
42© 2016 Cloudera, Inc. All rights reserved.
Final Thought
Ethical Big Data Governance today is
“Like kissing in the school yard”
• Everybody seems to be talking about it
• Very few people are actually doing it
• Even fewer are doing it well
Steven Totman

More Related Content

What's hot

Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Cloudera, Inc.
 
Relying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceRelying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceCloudera, Inc.
 
Harnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie MacHarnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie MacDataWorks Summit
 
Hortonworks Hybrid Cloud - Putting you back in control of your data
Hortonworks Hybrid Cloud - Putting you back in control of your dataHortonworks Hybrid Cloud - Putting you back in control of your data
Hortonworks Hybrid Cloud - Putting you back in control of your dataScott Clinton
 
How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18Cloudera, Inc.
 
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1
Cloudera, Inc.
 
Big Data Maturity Scorecard
Big Data Maturity ScorecardBig Data Maturity Scorecard
Big Data Maturity ScorecardDataWorks Summit
 
Detection of Anomalous Behavior
Detection of Anomalous BehaviorDetection of Anomalous Behavior
Detection of Anomalous BehaviorCapgemini
 
The 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: ExposedThe 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: ExposedCloudera, Inc.
 
Big Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial ServicesBig Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial ServicesCloudera, Inc.
 
Turning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformTurning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformCloudera, Inc.
 
From Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationFrom Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationCloudera, Inc.
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Cloudera, Inc.
 
RMS Security Breakfast
RMS Security BreakfastRMS Security Breakfast
RMS Security BreakfastRackspace
 
MapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike FergusonMapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike FergusonMapR Technologies
 
Security and Audit for Big Data
Security and Audit for Big DataSecurity and Audit for Big Data
Security and Audit for Big DataNicolas Morales
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyCloudera, Inc.
 
Big datacamp june14_alex_liu
Big datacamp june14_alex_liuBig datacamp june14_alex_liu
Big datacamp june14_alex_liuData Con LA
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)Cloudera, Inc.
 

What's hot (20)

Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
Turning Petabytes of Data into Profit with Hadoop for the World’s Biggest Ret...
 
Relying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services ExperienceRelying on Data for Strategic Decision-Making--Financial Services Experience
Relying on Data for Strategic Decision-Making--Financial Services Experience
 
Harnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie MacHarnessing the Power of Big Data at Freddie Mac
Harnessing the Power of Big Data at Freddie Mac
 
Hortonworks Hybrid Cloud - Putting you back in control of your data
Hortonworks Hybrid Cloud - Putting you back in control of your dataHortonworks Hybrid Cloud - Putting you back in control of your data
Hortonworks Hybrid Cloud - Putting you back in control of your data
 
How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18How Cloudera SDX can aid GDPR compliance 6.21.18
How Cloudera SDX can aid GDPR compliance 6.21.18
 
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1
Using Big Data to Transform Your Customer’s Experience - Part 1

Using Big Data to Transform Your Customer’s Experience - Part 1

 
Big Data Maturity Scorecard
Big Data Maturity ScorecardBig Data Maturity Scorecard
Big Data Maturity Scorecard
 
Detection of Anomalous Behavior
Detection of Anomalous BehaviorDetection of Anomalous Behavior
Detection of Anomalous Behavior
 
The 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: ExposedThe 5 Biggest Data Myths in Telco: Exposed
The 5 Biggest Data Myths in Telco: Exposed
 
Big Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial ServicesBig Data as Competitive Advantage in Financial Services
Big Data as Competitive Advantage in Financial Services
 
Turning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data PlatformTurning Data into Business Value with a Modern Data Platform
Turning Data into Business Value with a Modern Data Platform
 
From Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your OrganizationFrom Insight to Action: Using Data Science to Transform Your Organization
From Insight to Action: Using Data Science to Transform Your Organization
 
Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8Building a Modern Analytic Database with Cloudera 5.8
Building a Modern Analytic Database with Cloudera 5.8
 
RMS Security Breakfast
RMS Security BreakfastRMS Security Breakfast
RMS Security Breakfast
 
MapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike FergusonMapR Enterprise Data Hub Webinar w/ Mike Ferguson
MapR Enterprise Data Hub Webinar w/ Mike Ferguson
 
Security and Audit for Big Data
Security and Audit for Big DataSecurity and Audit for Big Data
Security and Audit for Big Data
 
The Five Markers on Your Big Data Journey
The Five Markers on Your Big Data JourneyThe Five Markers on Your Big Data Journey
The Five Markers on Your Big Data Journey
 
Big datacamp june14_alex_liu
Big datacamp june14_alex_liuBig datacamp june14_alex_liu
Big datacamp june14_alex_liu
 
Big Data Fundamentals
Big Data FundamentalsBig Data Fundamentals
Big Data Fundamentals
 
The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)The Vortex of Change - Digital Transformation (Presented by Intel)
The Vortex of Change - Digital Transformation (Presented by Intel)
 

Viewers also liked

Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7mmathipra
 
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...StampedeCon
 
Intelligence Types
Intelligence TypesIntelligence Types
Intelligence Typesishlive
 
Improving Model Predictions via Stacking and Hyper-parameters Tuning
Improving Model Predictions via Stacking and Hyper-parameters TuningImproving Model Predictions via Stacking and Hyper-parameters Tuning
Improving Model Predictions via Stacking and Hyper-parameters TuningJo-fai Chow
 
Securing the Hadoop Ecosystem
Securing the Hadoop EcosystemSecuring the Hadoop Ecosystem
Securing the Hadoop EcosystemDataWorks Summit
 
Продукты и решения Informatica
Продукты и решения  InformaticaПродукты и решения  Informatica
Продукты и решения InformaticaNatasha Zaverukha
 
Hadoop Security and Compliance - StampedeCon 2016
Hadoop Security and Compliance - StampedeCon 2016Hadoop Security and Compliance - StampedeCon 2016
Hadoop Security and Compliance - StampedeCon 2016StampedeCon
 
Cassandra as event sourced journal for big data analytics
Cassandra as event sourced journal for big data analyticsCassandra as event sourced journal for big data analytics
Cassandra as event sourced journal for big data analyticsAnirvan Chakraborty
 
Public Terabyte Dataset Project: Web crawling with Amazon Elastic MapReduce
Public Terabyte Dataset Project: Web crawling with Amazon Elastic MapReducePublic Terabyte Dataset Project: Web crawling with Amazon Elastic MapReduce
Public Terabyte Dataset Project: Web crawling with Amazon Elastic MapReduceHadoop User Group
 
Spend Analysis: What Your Data Is Telling You and Why It’s Worth Listening
Spend Analysis: What Your Data Is Telling You and Why It’s Worth ListeningSpend Analysis: What Your Data Is Telling You and Why It’s Worth Listening
Spend Analysis: What Your Data Is Telling You and Why It’s Worth ListeningSAP Ariba
 
Impression tecnique for implant supported rehabilitation/ dental courses
Impression tecnique for implant supported rehabilitation/ dental coursesImpression tecnique for implant supported rehabilitation/ dental courses
Impression tecnique for implant supported rehabilitation/ dental coursesIndian dental academy
 
Informatica Cloud Winter 2016 Release Webinar
Informatica Cloud Winter 2016 Release WebinarInformatica Cloud Winter 2016 Release Webinar
Informatica Cloud Winter 2016 Release WebinarInformatica Cloud
 
The Data Driven University - Automating Data Governance and Stewardship in Au...
The Data Driven University - Automating Data Governance and Stewardship in Au...The Data Driven University - Automating Data Governance and Stewardship in Au...
The Data Driven University - Automating Data Governance and Stewardship in Au...Pieter De Leenheer
 
Informatica big data and social media
Informatica big data and social mediaInformatica big data and social media
Informatica big data and social mediaRamy Mahrous
 
8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshareJulianna DeLua
 
Zen and the Art of Supplier Management
Zen and the Art of Supplier ManagementZen and the Art of Supplier Management
Zen and the Art of Supplier ManagementSAP Ariba
 

Viewers also liked (20)

Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7Meet the experts dwo bde vds v7
Meet the experts dwo bde vds v7
 
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
Using Multiple Persistence Layers in Spark to Build a Scalable Prediction Eng...
 
Presentacion
PresentacionPresentacion
Presentacion
 
Intelligence Types
Intelligence TypesIntelligence Types
Intelligence Types
 
Phrases
PhrasesPhrases
Phrases
 
Improving Model Predictions via Stacking and Hyper-parameters Tuning
Improving Model Predictions via Stacking and Hyper-parameters TuningImproving Model Predictions via Stacking and Hyper-parameters Tuning
Improving Model Predictions via Stacking and Hyper-parameters Tuning
 
Securing the Hadoop Ecosystem
Securing the Hadoop EcosystemSecuring the Hadoop Ecosystem
Securing the Hadoop Ecosystem
 
Spark tuning
Spark tuningSpark tuning
Spark tuning
 
Продукты и решения Informatica
Продукты и решения  InformaticaПродукты и решения  Informatica
Продукты и решения Informatica
 
Hadoop Security and Compliance - StampedeCon 2016
Hadoop Security and Compliance - StampedeCon 2016Hadoop Security and Compliance - StampedeCon 2016
Hadoop Security and Compliance - StampedeCon 2016
 
Cassandra as event sourced journal for big data analytics
Cassandra as event sourced journal for big data analyticsCassandra as event sourced journal for big data analytics
Cassandra as event sourced journal for big data analytics
 
Public Terabyte Dataset Project: Web crawling with Amazon Elastic MapReduce
Public Terabyte Dataset Project: Web crawling with Amazon Elastic MapReducePublic Terabyte Dataset Project: Web crawling with Amazon Elastic MapReduce
Public Terabyte Dataset Project: Web crawling with Amazon Elastic MapReduce
 
Spend Analysis: What Your Data Is Telling You and Why It’s Worth Listening
Spend Analysis: What Your Data Is Telling You and Why It’s Worth ListeningSpend Analysis: What Your Data Is Telling You and Why It’s Worth Listening
Spend Analysis: What Your Data Is Telling You and Why It’s Worth Listening
 
3.implant components and basic techniques3
3.implant components and basic techniques33.implant components and basic techniques3
3.implant components and basic techniques3
 
Impression tecnique for implant supported rehabilitation/ dental courses
Impression tecnique for implant supported rehabilitation/ dental coursesImpression tecnique for implant supported rehabilitation/ dental courses
Impression tecnique for implant supported rehabilitation/ dental courses
 
Informatica Cloud Winter 2016 Release Webinar
Informatica Cloud Winter 2016 Release WebinarInformatica Cloud Winter 2016 Release Webinar
Informatica Cloud Winter 2016 Release Webinar
 
The Data Driven University - Automating Data Governance and Stewardship in Au...
The Data Driven University - Automating Data Governance and Stewardship in Au...The Data Driven University - Automating Data Governance and Stewardship in Au...
The Data Driven University - Automating Data Governance and Stewardship in Au...
 
Informatica big data and social media
Informatica big data and social mediaInformatica big data and social media
Informatica big data and social media
 
8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare8.17.11 big data and hadoop with informatica slideshare
8.17.11 big data and hadoop with informatica slideshare
 
Zen and the Art of Supplier Management
Zen and the Art of Supplier ManagementZen and the Art of Supplier Management
Zen and the Art of Supplier Management
 

Similar to Perspectives on Ethical Big Data Governance

The value of big data analytics
The value of big data analyticsThe value of big data analytics
The value of big data analyticsMarc Vael
 
Understanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceUnderstanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceIBM Software India
 
From Near to Maturity - Presentation to European Data Forum
From Near to Maturity - Presentation to European Data ForumFrom Near to Maturity - Presentation to European Data Forum
From Near to Maturity - Presentation to European Data ForumCastlebridge Associates
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesDATAVERSITY
 
Data foundation for analytics excellence
Data foundation for analytics excellenceData foundation for analytics excellence
Data foundation for analytics excellenceMudit Mangal
 
Data Governance in a big data era
Data Governance in a big data eraData Governance in a big data era
Data Governance in a big data eraPieter De Leenheer
 
Hadoop: Making it work for the Business Unit
Hadoop: Making it work for the Business UnitHadoop: Making it work for the Business Unit
Hadoop: Making it work for the Business UnitDataWorks Summit
 
The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallTrillium Software
 
Slides: Taking an Active Approach to Data Governance
Slides: Taking an Active Approach to Data GovernanceSlides: Taking an Active Approach to Data Governance
Slides: Taking an Active Approach to Data GovernanceDATAVERSITY
 
Data Breaches and Security Rights in SharePoint Webinar
Data Breaches and Security Rights in SharePoint WebinarData Breaches and Security Rights in SharePoint Webinar
Data Breaches and Security Rights in SharePoint WebinarConcept Searching, Inc
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaCaserta
 
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World WebinarDiscovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World WebinarConcept Searching, Inc
 
Big data security
Big data securityBig data security
Big data securityAnne ndolo
 
Big data security
Big data securityBig data security
Big data securityAnne ndolo
 
Data Governance in the Big Data Era
Data Governance in the Big Data EraData Governance in the Big Data Era
Data Governance in the Big Data EraPieter De Leenheer
 
-Enrichment - Unlocking the value of data for digital transformation - Big Da...
-Enrichment - Unlocking the value of data for digital transformation - Big Da...-Enrichment - Unlocking the value of data for digital transformation - Big Da...
-Enrichment - Unlocking the value of data for digital transformation - Big Da...webwinkelvakdag
 
Big data and the data quality imperative
Big data and the data quality imperativeBig data and the data quality imperative
Big data and the data quality imperativeTrillium Software
 
Big Data Innovation
Big Data InnovationBig Data Innovation
Big Data Innovationpaul.hawking
 
Strata NYC 2015 - Transamerica and INFA v1
Strata NYC 2015 - Transamerica and INFA v1Strata NYC 2015 - Transamerica and INFA v1
Strata NYC 2015 - Transamerica and INFA v1Vishal Bamba
 

Similar to Perspectives on Ethical Big Data Governance (20)

The value of big data analytics
The value of big data analyticsThe value of big data analytics
The value of big data analytics
 
Understanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidenceUnderstanding Big Data so you can act with confidence
Understanding Big Data so you can act with confidence
 
From Near to Maturity - Presentation to European Data Forum
From Near to Maturity - Presentation to European Data ForumFrom Near to Maturity - Presentation to European Data Forum
From Near to Maturity - Presentation to European Data Forum
 
Data Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & ApproachesData Lake Architecture – Modern Strategies & Approaches
Data Lake Architecture – Modern Strategies & Approaches
 
Data foundation for analytics excellence
Data foundation for analytics excellenceData foundation for analytics excellence
Data foundation for analytics excellence
 
Data Governance in a big data era
Data Governance in a big data eraData Governance in a big data era
Data Governance in a big data era
 
Hadoop: Making it work for the Business Unit
Hadoop: Making it work for the Business UnitHadoop: Making it work for the Business Unit
Hadoop: Making it work for the Business Unit
 
The Bigger They Are The Harder They Fall
The Bigger They Are The Harder They FallThe Bigger They Are The Harder They Fall
The Bigger They Are The Harder They Fall
 
Slides: Taking an Active Approach to Data Governance
Slides: Taking an Active Approach to Data GovernanceSlides: Taking an Active Approach to Data Governance
Slides: Taking an Active Approach to Data Governance
 
Data Breaches and Security Rights in SharePoint Webinar
Data Breaches and Security Rights in SharePoint WebinarData Breaches and Security Rights in SharePoint Webinar
Data Breaches and Security Rights in SharePoint Webinar
 
Data Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with ClouderaData Governance, Compliance and Security in Hadoop with Cloudera
Data Governance, Compliance and Security in Hadoop with Cloudera
 
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World WebinarDiscovery, Risk, and Insight in a Metadata-Driven World Webinar
Discovery, Risk, and Insight in a Metadata-Driven World Webinar
 
Big data security
Big data securityBig data security
Big data security
 
Big data security
Big data securityBig data security
Big data security
 
Data Governance in the Big Data Era
Data Governance in the Big Data EraData Governance in the Big Data Era
Data Governance in the Big Data Era
 
-Enrichment - Unlocking the value of data for digital transformation - Big Da...
-Enrichment - Unlocking the value of data for digital transformation - Big Da...-Enrichment - Unlocking the value of data for digital transformation - Big Da...
-Enrichment - Unlocking the value of data for digital transformation - Big Da...
 
Big data and the data quality imperative
Big data and the data quality imperativeBig data and the data quality imperative
Big data and the data quality imperative
 
Big Data Innovation
Big Data InnovationBig Data Innovation
Big Data Innovation
 
Strata NYC 2015 - Transamerica and INFA v1
Strata NYC 2015 - Transamerica and INFA v1Strata NYC 2015 - Transamerica and INFA v1
Strata NYC 2015 - Transamerica and INFA v1
 
The value of our data
The value of our dataThe value of our data
The value of our data
 

More from Cloudera, Inc.

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxCloudera, Inc.
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera, Inc.
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards FinalistsCloudera, Inc.
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Cloudera, Inc.
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Cloudera, Inc.
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Cloudera, Inc.
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Cloudera, Inc.
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Cloudera, Inc.
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Cloudera, Inc.
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Cloudera, Inc.
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Cloudera, Inc.
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Cloudera, Inc.
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Cloudera, Inc.
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformCloudera, Inc.
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Cloudera, Inc.
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Cloudera, Inc.
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Cloudera, Inc.
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Cloudera, Inc.
 

More from Cloudera, Inc. (20)

Partner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptxPartner Briefing_January 25 (FINAL).pptx
Partner Briefing_January 25 (FINAL).pptx
 
Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists Cloudera Data Impact Awards 2021 - Finalists
Cloudera Data Impact Awards 2021 - Finalists
 
2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists2020 Cloudera Data Impact Awards Finalists
2020 Cloudera Data Impact Awards Finalists
 
Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019Edc event vienna presentation 1 oct 2019
Edc event vienna presentation 1 oct 2019
 
Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19Machine Learning with Limited Labeled Data 4/3/19
Machine Learning with Limited Labeled Data 4/3/19
 
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19Data Driven With the Cloudera Modern Data Warehouse 3.19.19
Data Driven With the Cloudera Modern Data Warehouse 3.19.19
 
Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19Introducing Cloudera DataFlow (CDF) 2.13.19
Introducing Cloudera DataFlow (CDF) 2.13.19
 
Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19Introducing Cloudera Data Science Workbench for HDP 2.12.19
Introducing Cloudera Data Science Workbench for HDP 2.12.19
 
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
Shortening the Sales Cycle with a Modern Data Warehouse 1.30.19
 
Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19Leveraging the cloud for analytics and machine learning 1.29.19
Leveraging the cloud for analytics and machine learning 1.29.19
 
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
Modernizing the Legacy Data Warehouse – What, Why, and How 1.23.19
 
Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18Leveraging the Cloud for Big Data Analytics 12.11.18
Leveraging the Cloud for Big Data Analytics 12.11.18
 
Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3Modern Data Warehouse Fundamentals Part 3
Modern Data Warehouse Fundamentals Part 3
 
Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2Modern Data Warehouse Fundamentals Part 2
Modern Data Warehouse Fundamentals Part 2
 
Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1Modern Data Warehouse Fundamentals Part 1
Modern Data Warehouse Fundamentals Part 1
 
Extending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the PlatformExtending Cloudera SDX beyond the Platform
Extending Cloudera SDX beyond the Platform
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
 
Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360Analyst Webinar: Doing a 180 on Customer 360
Analyst Webinar: Doing a 180 on Customer 360
 
Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18Build a modern platform for anti-money laundering 9.19.18
Build a modern platform for anti-money laundering 9.19.18
 
Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18Introducing the data science sandbox as a service 8.30.18
Introducing the data science sandbox as a service 8.30.18
 

Recently uploaded

Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odishasmiwainfosol
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsAhmed Mohamed
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaHanief Utama
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfMarharyta Nedzelska
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样umasea
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Velvetech LLC
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Developmentvyaparkranti
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Hr365.us smith
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxTier1 app
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesPhilip Schwarz
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsSafe Software
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanyChristoph Pohl
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf31events.com
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWave PLM
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineeringssuserb3a23b
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtimeandrehoraa
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationBradBedford3
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmSujith Sukumaran
 

Recently uploaded (20)

Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company OdishaBalasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
Balasore Best It Company|| Top 10 IT Company || Balasore Software company Odisha
 
Unveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML DiagramsUnveiling Design Patterns: A Visual Guide with UML Diagrams
Unveiling Design Patterns: A Visual Guide with UML Diagrams
 
React Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief UtamaReact Server Component in Next.js by Hanief Utama
React Server Component in Next.js by Hanief Utama
 
A healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdfA healthy diet for your Java application Devoxx France.pdf
A healthy diet for your Java application Devoxx France.pdf
 
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
办理学位证(UQ文凭证书)昆士兰大学毕业证成绩单原版一模一样
 
Advantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your BusinessAdvantages of Odoo ERP 17 for Your Business
Advantages of Odoo ERP 17 for Your Business
 
Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...Software Project Health Check: Best Practices and Techniques for Your Product...
Software Project Health Check: Best Practices and Techniques for Your Product...
 
VK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web DevelopmentVK Business Profile - provides IT solutions and Web Development
VK Business Profile - provides IT solutions and Web Development
 
Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)Recruitment Management Software Benefits (Infographic)
Recruitment Management Software Benefits (Infographic)
 
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptxKnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
KnowAPIs-UnknownPerf-jaxMainz-2024 (1).pptx
 
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort ServiceHot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Patel Nagar🔝 9953056974 🔝 escort Service
 
Folding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a seriesFolding Cheat Sheet #4 - fourth in a series
Folding Cheat Sheet #4 - fourth in a series
 
Powering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data StreamsPowering Real-Time Decisions with Continuous Data Streams
Powering Real-Time Decisions with Continuous Data Streams
 
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte GermanySuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
SuccessFactors 1H 2024 Release - Sneak-Peek by Deloitte Germany
 
Sending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdfSending Calendar Invites on SES and Calendarsnack.pdf
Sending Calendar Invites on SES and Calendarsnack.pdf
 
What is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need ItWhat is Fashion PLM and Why Do You Need It
What is Fashion PLM and Why Do You Need It
 
Software Coding for software engineering
Software Coding for software engineeringSoftware Coding for software engineering
Software Coding for software engineering
 
SpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at RuntimeSpotFlow: Tracking Method Calls and States at Runtime
SpotFlow: Tracking Method Calls and States at Runtime
 
How to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion ApplicationHow to submit a standout Adobe Champion Application
How to submit a standout Adobe Champion Application
 
Intelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalmIntelligent Home Wi-Fi Solutions | ThinkPalm
Intelligent Home Wi-Fi Solutions | ThinkPalm
 

Perspectives on Ethical Big Data Governance

  • 1. 1© Cloudera, Inc. All rights reserved. 5th Annual
  • 2. 2© Cloudera, Inc. All rights reserved. Perspectives on Ethical Big Data Governance Murthy Mathiprakasam, Principal Product Marketing Manager, Informatica Steven Totman, Financial Services Industry Lead, Cloudera
  • 3. 3© Cloudera, Inc. All rights reserved. Governance of Big Data?
  • 5. 5 So what is “Metadata”? 5 Metadata enables you to put context and meaning to things. It is generated and consumed by every organization and software product.
  • 6. 6 So what is “Metadata”? 6 Metadata enables you to put context and meaning to things. It is generated and consumed by every organization and software product. Varieties Made Since Made By Contains Volume
  • 7. 7 Enterprise Metadata Confidential – Internal Cloudera only Business Glossary Enterprise Taxonomy Ontology Business Metadata B T Technical Metadata Database Schema File Definition Data Flow Design BI Report Definition Data Model O Operational Metadata Job Run-time Stats Report run information Hardware Usage Scheduler Stats Enables Data Governance Literally, “data about data” that describes your company’s information from both a business and a technical perspective Data Lineage Impact Analysis Topology Understanding SOX Compliance Auditing
  • 8. 88 What is Data Governance
  • 9. 9 What is Data Governance? Confidential – Internal Cloudera only People ProcessTechnology ● Encompasses the People, Processes and Information Technology required to create a consistent and proper handling of an organization's data across the business enterprise ● Goals may be defined at all levels of the enterprise and doing so may aid in acceptance of processes by those who will use them
  • 10. 10© Cloudera, Inc. All rights reserved. Jake Sorofman | December 12, 2012 Jake Sorofman - Gartner: Don’t be creepy I think personalization has the potential for unmistakable good—for both consumers and brands. But the question remains: how do you hew the line between personalized and downright creepy? Comic book prophet Stan Lee told us “With great power comes great responsibility.” Data Ethics
  • 11. 11© Cloudera, Inc. All rights reserved. Millennials are not very worried about their privacy online - Digital lives of millennials survey http://www.americanpressinstitute.org/publications/reports/survey-research/digital-lives-of-millennials/ Only 2/10 Millennials worry all / most of the time Don’t worry at all 34% Worry all/most of the time 20% Worry a little 46% Question: how much do you worry, if at all, about information about you being available online?
  • 12. 12© Cloudera, Inc. All rights reserved. Ethical Data Usage Good way to evaluate - What would mum and dad think?
  • 13. 13© 2016 Cloudera, Inc. All rights reserved. Privacy by design https://www.cpdp.vic.gov.au/images/content/pdf/CPDP_Privacy_by_Design_Background_paper_Oct_2014.pdf Based on 7 Foundational Principles, PbD was first developed by Ontario’s Information and Privacy Commissioner, Dr. Ann Cavoukian, in the 1990s. Privacy by Design
  • 14. 14© Cloudera, Inc. All rights reserved. Real Life Case Study: Network Intelligence
  • 15. 15© 2016 Cloudera, Inc. All rights reserved. These common policy guidelines from the “toolkit” ought to inform data protection and usage
  • 16. 16© Cloudera, Inc. All rights reserved. It’s Becoming Harder to Protect Your Organization 16 billion connected devices generating more data Attacker sophistication has increased leading to 250% more successful attacks Protection against attacks with known signatures no longer sufficient Threat Surface Expanding Attacks are Increasing Threats Are Highly Adaptive Trends Driving Change
  • 17. 17© 2016 Cloudera, Inc. All rights reserved. Data Free-for-All: Available & Error-Prone Basic Security Controls: Authorization Authentication Comprehensive Auditing Data Security & Governance: Lineage Visibility Metadata Discovery Encryption & Key Management Start with the Hadoop Security Maturity Model Achieve Scale and Cost Effectiveness via a Secure Data Vault Fully Compliance Ready: Audit-Ready & Protected Audit Ready For: EU Data Protection Directive, PCI DSS HIPAA, FERPA FISMA, PII Full encryption, key management, transparency, and enforcement for all data- at-rest and data-in-motion DataVolume&Sensitivity Security Compliance & Risk Mitigation 0 Highly Vulnerable Data at Risk 1 Reduced Risk Exposure 2 Managed, Secure, Protected 3 Enterprise Data Hub Secure Data Vault
  • 18. Perspectives on Ethical Big Data Governance Murthy Mathiprakasam mmathipra@informatica.com @mmathiprakasam
  • 19. Big Data Can Be An Asset or a Liability IT efficiency and compliance Risk reduction, cost controls, & business efficiencies Greater compliance, Efficiency, and support for revenue growth Strategic differentiation 0: Unaware • No activity 1: Initial • Ad hoc 2: Repeatable • Pilot 3: Defined • Project 4: Managed • Program 5: Optimized • Function Data Governance Maturity Data is an enterprise resource like any other resource (financial capital, human capital, etc) • Passively, enterprises must comply with internal controls and external regulations • Strategically, enterprises can drive greater analytical value with governed assets
  • 20. Is Data Becoming Too Pervasive? Who Knows What About You?
  • 21. Your Mission Define and Enforce an Ethical Policy of Governance That Delivers A Great Public Outcomes While Ensuring Trust By Understanding, Protecting, and Tracking Your Data
  • 23. Train Your People on a Defined Code of Ethics
  • 24. Best Practices for Big Data Governance - 1 ▪ Define Big Data Governance Process – Roles, Standards, Taxonomies, Business Glossaries ▪ Discover Big Data Assets ▪ Apply, Measure, and Monitor Enforcement ▪ Leverage IT & Business Collaboration To Balance Big Data Governance Objectives With Agility Apply Data Governance Apply Measure and Monitor Define Discover IT Business
  • 25. • Quality and Governance are not fixed points • Use flexibility of Hadoop alongside curation process and technology for fit-for-purpose data assets to get right data to right people at the right time Best Practices for Big Data Governance - 2 Curation of Fit-for-Purpose Data Assets Raw Prepared Cleansed/ Matched Hadoop Data Lake
  • 26. What Technology Challenges Are Customers Facing? “too many data silos making it impossible to know what data can be trusted” Pete, Chief Data Officer “need to ensure confidence in data integrity, accuracy, and timeliness” Ron, VP Global Information Systems “need code re-usability and code maintainability” Ben, Director of Platform Architecture “regulations have become very strict and very precise – lots of gaps in the quality of the data” Christine, Manager Data Management “transforming data management from a labor intensive, qualitative approach to a systematic approach…to classify data and understand lineage” Ned, Senior Vice President
  • 27. Big Data Cannot Be Tackled Manually The Race to Business Value Will Not Be Won By Hand More Volume More Variety More Velocity More Data Consumers More Data Silos More Data Platforms
  • 28. Big Data Is Difficult To Trust Changing Needs for Quality Same data used for multiple purposes Hidden Relationships Everything and everyone is interconnected Magnified Trust Issues New sources of external data
  • 29. And Regulations And Controls Are Harder To Meet SOX PCI HIPAA FISMA ISO GLBA NIST
  • 30. Perimeter Security Is Insufficient Perimeter security: Outside in security • Not if, but when • Network focused • Attacks will only grow
  • 31. Big Data: Bigger Risk • An exponential attack surface • With exponential risks
  • 32. In the Public Sector, This Means People Can Suffer Impact Can’t make comprehensive decisions based on all of the available data Can’t make accurate decisions based on high quality and secure data Can’t make timely decisions based on fresh and up-to-date data Can’t operationalize data delivery to fuel decisions repeatably and scalably
  • 33. Find & Access Any Data Centrally Discover Data Relationships Prepare & Share Relevant Data Operationalize Business Insights Quickly “Project Sonoma” PILLAR 1 Big Data Integration • Optimized Execution & Flexible Deployment • 100’s of Pre-built Transforms, Connectors & Parsers • Flexible Deployment to Cloud PILLAR 2 Big Data Governance & Quality • Business Glossary • Profiling and Data Quality • 360° Relationship Views PILLAR 3 Big Data Security • Discover sensitive data in Hive • Dynamic data masking for Hive • Proliferation Analysis 19 Informatica Big Data Management 10.1 NEW
  • 34. Big Data Management Meets Data Governance Informatica Big Data Management + Cloudera Navigator • Seamless integration with Cloudera Enterprise • Track the end-to-end lineage of data before, during, and after Hadoop processing with metadata management and lifecycle management • Use data profiling capabilities and pre-built business rules to evaluate and mitigate data quality • Comprehensive platform for managing technical, business, and operational metadata
  • 35. Your Mission Define and Enforce an Ethical Policy of Governance That Delivers A Great Public Outcomes While Ensuring Trust By Understanding, Protecting, and Tracking Your Data
  • 36. 3 Great Transportation ▪ Aspiration: Florida Turnpike sought to improve emergency preparedness and improve the prepaid toll program ▪ Challenge: Data collection took over one month leading to faulty analytics ▪ Outcome: “Timely and accurate traffic, revenue, and participation reports help management make good choices that will eventually result in saving money.” ▪ — Bob Hartmann, IT Director, Florida Turnpike Enterprise
  • 37. 3 Great Environment ▪ Aspiration: US Geological Service sought to improve the quality of water in the United States ▪ Challenge: Collect distributed data and build a centralized water quality dataset ▪ Outcome: “We chose Informatica as our data integration solution because of its maturity, wide range of features, ease of use and industrial strength, integrated architecture.” ▪ — Harry House, Data Warehouse Practice Leader, USGS
  • 38. 3 Great Education ▪ Aspiration: Rochester Institute of Technology sought to understand how it could improve student enrollment, student housing, and student retention ▪ Challenge: Data was in disparate systems ▪ Outcome: “We're becoming myth busters. Informatica provides timely, accurate information we need to spot trends, improve the quality of our academic learning, and reduce attrition.” ▪ — Kim Sowers, Director of Application Development, Rochester Institute of Technology
  • 39. 3 Great Healthcare ▪ Aspiration: Utah Dept of Health sought to process healthcare claims faster and improve public health ▪ Challenge: Manual effort to track and link claims data over time ▪ Outcome: “We see the Informatica as absolutely essential to everything that we want to do, not only to meet our mandate for the All Payer Database,” ▪ — Dr. Keely Cofrin Allen, Director, Office of Health Care Statistics, State of Utah Department of Health
  • 41. 41© Cloudera, Inc. All rights reserved. Getting Started is Easy 1. 2. Inventory Data & Understand Related Legal frameworks Define and publish usage guidelines & privacy policies Contact us or a Partner to Start a POC 3. Define & share “what is legal and what is right” - for your organization
  • 42. 42© 2016 Cloudera, Inc. All rights reserved. Final Thought Ethical Big Data Governance today is “Like kissing in the school yard” • Everybody seems to be talking about it • Very few people are actually doing it • Even fewer are doing it well Steven Totman