SlideShare una empresa de Scribd logo
1 de 19
Descargar para leer sin conexión
Business Data Lake best practices
OOP Munich, 2017-01-31
2Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
The speaker – Arne Roßmann
!  Part of Insights & Data team
•  Global team delivering around BI, DWH, Information
Strategy & Big Data
!  Working in Business Intelligence since 2008
!  Delivering as Big Data architect & Project
Manager at our clients
•  Defining processes
•  Creating architectures
•  Leading projects
!  Worked in many industries
•  Retail, Chemical, Financial, Logistics, Automotive, ...
3Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
Capgemini’s Insights & Data Global Practice
With 15,000 experts globally, we are a recognized leader in information-led transformation
Capgemini’s Insights & Data Global Practice
Expertise in Big Data & Analytics Capgemini Solutions
!  Over 15,000 consultants
globally
!  Industrialized delivery
framework Next Gen Business
Insights Service Centre
!  CUBE lab on the cloud with
various demonstrations for
BI environments
!  Built-in Tools for interactive
agile BI and Devops
Partner Ecosystem
800+ Big Data & 400+ Data
Science Global Consultants
Customer Analytics
!  Segmentation &
Behavior Profiling
!  Behavior Propensity
scoring
!  Pricing Analytics
Marketing & Campaign
Analytics
!  Campaign
Recommendation
!  Cross Sell/Up Sell
!  Campaign
Measurement
!  Campaign Execution
Management
Operations Analytics
!  Sales/ Demand
Forecasting
!  Activity Based Costing
!  Call Center Analytics
Asset/ Equipment
Analytics
!  Warranty Analytics
!  Asset Performance
Monitoring
!  Predictive Asset
Maintenance
!  Insights from Connected
Equipment
Fraud Analytics
!  Fraud Scoring
!  Collusion Fraud
Identification
!  Fraud Framework for
Public Sector (Trouve)
Content Analytics
!  Text Mining Accelerators
!  Key Opinion Leader
!  Content Analytics for
Fraud Detection
Business Data Lake
offering
Data Warehouse
Optimization Solution
Strategic Alliances and
partnerships with major
vendors
Enabling Co-Innovation with
the CUBE lab
Experience in designing and
deploying big data analytics
solutions in a varied
ecosystems
4Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
Table of Contents
!  Why the Business Data Lake works
!  Services your Business Data Lake should provide
!  Standardize, Industrialize and Innovate!
Why the Business Data Lake works
6Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
Big Data creates opportunities but poses challenges as well
Where do
I start ?
“We know that Big Data can be
helpful but how do we quantify the
benefits and develop a
Business Case?”
“How do we know which Big Data
technology/platform(s) suits our
architecture and business
requirement? “
“How do I get all the unstructured data
(mainly images) out of my operational
processes, into an analytical
environment that allows me to
experiment with data?”
“Can we easily combine data from
multiple source systems into our Big
Data environment and visa versa?”
“Can I do it myself? What skills do I
need for Big Data? “
“How do I measure the effectiveness
or performance of my Big Data
initiative? How do I measure ROI?”
7Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
Businesses are looking to close the gap towards ‘insight driven’
Have not completely
integrated their data sources
across the organization
79%
Scattered data lying
in silos across the
organization
Do not have well-defined criteria
to measure the success of
their own Big Data initiatives
67%
Absence of clear
business case
for funding and
implementation
Dependence on
legacy systems
for data processing
and management
Use cloud based Big Data
and analytics platforms
36%
Have either scattered pockets
of resources or follow a
decentralized model for
analytics initiatives
Ineffective
co-ordination
of Big Data and
analytics teams
47%
8Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
The Business Data Lake delivers what we need for the new data
landscape.
Govern
Where it
matters
Encourage
local
requirements
Distill on
demand
Store
securely
!  Focus on MDM
!  Enforce only when sharing
!  Treat Corporate as
aggregation of Local.
!  Let the business decide
what they need
!  Build from the bottom
!  Enable traceability to
source disposable data
views.
!  Store everything ‘as is’
!  Include structured and
unstructured data
!  Store it cheaply were
possible
!  Select only what you want
!  Business friendly tooling
!  Re-usable information
maps
!  Rapid change cycle.
9Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
Business Challenges driving the need for BDL services
Business
Enablement
!  Achieve real-time
optimization of
business processes
through predictive
insights and
performance analytics
!  Enhance new services
and stay competitive in
the market
!  Be agile, get insights
fast
ControlControl
!  Ensure data security
and compliance with
EU data regulations
!  Enable up- and
downscaling
according to business
needs
ControlControl
!  Reduce costs
associated with the
governance and secure
storage of data
!  Control the costs of
running flexible data
services
!  Reduce Capex
Services your Business Data Lake
should provide
11Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
Capgemini can help accelerate clients’ journey to Insights..
A cloud powered, big data & insights service; bring all your data in one place,
deliver insights at the point of action and generate differentiated business
value.
‘Software-
Defined’’, full
stack cloud
infrastructure
Flexible
‘Pay-as-you-go’
Commercial
Model
Secure
as a Vault
‘Ready to Harvest’
Sector & Domain
Insights
Modular
Hybrid & Elastic
powered by
‘Intelligent
Automation’
Get started quickly: with our platform , tools and expertise we can support you at any
level to manage your data and harvest insights
Your ‘Lab in the
Cloud’
!  Experiment
!  Hypothesize
!  Simulate
12Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
The BDL architecture we built for our clients
Pla$orm		
as	a	Service	
Insights Platform UX Portal
HTML 5, CSS, Angular JS
Big Data Lab
Dataset Library
Data Science Lab
Models Library
Insights Lab
Ready Insights
Common Services Common Services
Ingest
Algorithm
Library
Sector
Insight Labs
Smart
Insights 360
Catalog &
Provision
Meter&Bill
ResourceMonitor
Provision
ServiceCatalog
IoTFramework
AccessMgmt
KnowledgeBase
Helpdesk
RESTful	Web	Services	
Infrastructure	
as	a	Service	
Hybrid	Cloud	Extensibility	-	(Bosh,	CF)	CG-CSB,	Virtustream	
Storage	and	ParallelizaIon	-	EMC	Isilon		
Compute	&	Memory	-	EMC	VCE	
Big	Data	Suite	–	Pivotal,	Cloudera,	Hortonworks	
VMware,	Cortex	
Data	Management	–	InformaIca,	Talend,	HDF,	Apache	Nify	
AnalyIcs	tools		-	SAS,	Madlib,	RStudio,Spark	
Vmware	
Security	&	Governance	
RSA,	AD,	Knox,	Ranger,	Kerberos,	Atlas,	TDE,	W2W,	Metron,	
Falcon	
ITSM		-	BMC	Remedy	
•  Common Web UI and
UX architecture
•  Fully Virtualized
compute, storage &
Network
•  Intelligent automation
of provisioning,
process, service and
support orchestration
•  Modular Component
Architecture
•  Multiple points of
presence
•  Seamless integration
between on-premise,
private & public cloud
•  Proven reference and
component
architecture for on-
premise builds
•  Professional Services
teams to build full stack
•  Demo of full stack
•  Accelerated Partner
enablement
MD&LM
Environment
Hadoop	DistribuIon	–	Hortonworks,	Cloudera	
RE&D,	Dev	Ops	-	Cloud	Foundry,	Jira,	Jit,		
Application LayerInfra Layer User Access LayerSoftware & Services
VisualisaIon	–	Qlik,	Tableau,	SAS	VA,	D3,	High	Charts		
VisualisationVisualisation Self Service Insights
Capgemini	Private	Cloud	 On	Premise	Cloud
13Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
BDLaaS – illustrative example service Dashboard
14Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
Standardize, Industrialize and Innovate!
15Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
Big data processing is done in three different stages and we have to
cater to each stage differently
!  Continuously running
analytics processes
!  Trust in data quality
!  Service levels secured
!  Managed by IT
Operationalize
!  Store everything: internal
and external, structured
and unstructured
!  Store granular data
!  Minimal effort on IT
Load “as-is”
!  Agile and explorative way
of work
!  Self service
!  Fail fast
Distill on demand
Time
Stage
Actors
Paradigms
IT implements data
integration process for
production
Data providers and IT
provide and store data
Data scientists and
engineers explore and
analyze data
1 2 3
Allow creativity
Encourage collaborationEnsure Business Meta
Data & Data Catalogue
Enable Data Masking
Industrialize!
Examples of technical metadata
!  Path (folder location)
!  Filename
!  File type
!  File size
!  Date of ingestion
!  Technical Owner / Group
!  For HIVE:
!  Nr of records / lines
!  Column number
!  Column names if available
!  Column data types
!  Value distribution
!  Min/Max
Examples of business metadata
!  Project (possibly automatic)
!  Data set name
!  Logical description of dataset
!  Data owner/data stewart
!  Confidentiality classification
!  Line of business
16Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
Start using ELT tools now!
Need for more platform updates
Need for more denormalization
Need for more specialized
Know-How
" Abstraction layer to
Hadoop processing
engines
" Abstraction layer to
NoSQL & SQL
databases
" Standardized control
flows
" Availability of
developers
ELT Tools offer:
17Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
17Copyright © Capgemini 2016. All Rights Reserved
Insights as a Service – Analytics Cloud for Oil & Gas major
Well Health
Dashboards
Equipment Performance
Disaster Management
Supply Chain Analytics
Predictive Maintenancezz
z
Device Data
Driving behavior, GPS, diagnostics, etc.
Real Time DataSystem Data
Environment DataProject Data
• 10 data points per
sec
• 40 GB per field
• 5-6 GB per day
per well,
• 80TB Well data
year
• 24x7x365 monitoring
usage
• Real time charts of
streaming data
• Real time alerts
• Thermal
Visualizations
18Copyright © Capgemini 2015. All Rights Reserved
OOP MUC 2017 - Business Data Lake best practices
We helped customers getting to real value within 12 weeks from idea to
production.
1 3
a
5 6 7 9 11
Business
Insights Need
Integrate
DataSet
Model Build and
Training
Iterate and
Tune
Data
Exploration
Test Data
Science Model
Apply Data
Science
12
Business
Validation
Publish
Insights
Weeks
Business Problem
Identified
Business Value
Delivered
The information contained in this presentation is proprietary.
Copyright © 2015 Capgemini. All rights reserved.
Rightshore® is a trademark belonging to Capgemini.
www.capgemini.com
About Capgemini
With more than 145,000 people in over 40 countries, Capgemini
is one of the world's foremost providers of consulting, technology
and outsourcing services. The Group reported 2014 global
revenues of EUR 10.573 billion.
Together with its clients, Capgemini creates and delivers
business and technology solutions that fit their needs and drive
the results they want. A deeply multicultural organization,
Capgemini has developed its own way of working, the
Collaborative Business Experience™, and draws on Rightshore®,
its worldwide delivery model
Learn more about us at www.capgemini.com.

Más contenido relacionado

La actualidad más candente

How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at ScaleDATAVERSITY
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for DinnerKent Graziano
 
Necessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorNecessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorDataWorks Summit
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data EngineeringDurga Gadiraju
 
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DATAVERSITY
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?DATAVERSITY
 
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Cathrine Wilhelmsen
 
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...HostedbyConfluent
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureDATAVERSITY
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Databricks
 
Gartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Emerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big ThingEmerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big ThingDATAVERSITY
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Cathrine Wilhelmsen
 
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsBuilding the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsKhalid Salama
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouseJames Serra
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptxAlex Ivy
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?DATAVERSITY
 

La actualidad más candente (20)

How a Semantic Layer Makes Data Mesh Work at Scale
How a Semantic Layer Makes  Data Mesh Work at ScaleHow a Semantic Layer Makes  Data Mesh Work at Scale
How a Semantic Layer Makes Data Mesh Work at Scale
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
Necessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services SectorNecessity of Data Lakes in the Financial Services Sector
Necessity of Data Lakes in the Financial Services Sector
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
 
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
DAS Slides: Building a Data Strategy - Practical Steps for Aligning with Busi...
 
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What is the Question?
 
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
Choosing Between Microsoft Fabric, Azure Synapse Analytics and Azure Data Fac...
 
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
How to Build the Data Mesh Foundation: A Principled Approach | Zhamak Dehghan...
 
Improving Data Literacy Around Data Architecture
Improving Data Literacy Around Data ArchitectureImproving Data Literacy Around Data Architecture
Improving Data Literacy Around Data Architecture
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
Gartner: Master Data Management Functionality
Gartner: Master Data Management FunctionalityGartner: Master Data Management Functionality
Gartner: Master Data Management Functionality
 
Lakehouse in Azure
Lakehouse in AzureLakehouse in Azure
Lakehouse in Azure
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Emerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big ThingEmerging Trends in Data Architecture – What’s the Next Big Thing
Emerging Trends in Data Architecture – What’s the Next Big Thing
 
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
Pipelines and Data Flows: Introduction to Data Integration in Azure Synapse A...
 
Building the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake AnalyticsBuilding the Data Lake with Azure Data Factory and Data Lake Analytics
Building the Data Lake with Azure Data Factory and Data Lake Analytics
 
Building a modern data warehouse
Building a modern data warehouseBuilding a modern data warehouse
Building a modern data warehouse
 
Databricks Platform.pptx
Databricks Platform.pptxDatabricks Platform.pptx
Databricks Platform.pptx
 
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?Emerging Trends in Data Architecture – What’s the Next Big Thing?
Emerging Trends in Data Architecture – What’s the Next Big Thing?
 

Destacado

Blockchain: How the bitcoin technology can change the public sector
Blockchain: How the bitcoin technology can change the public sectorBlockchain: How the bitcoin technology can change the public sector
Blockchain: How the bitcoin technology can change the public sectorCapgemini
 
Data- and database security & GDPR: end-to-end offer
Data- and database security & GDPR: end-to-end offerData- and database security & GDPR: end-to-end offer
Data- and database security & GDPR: end-to-end offerCapgemini
 
Cybersecurity-Anforderungen in IT-Sourcing-Projekten meistern – Ein Leitfaden...
Cybersecurity-Anforderungen in IT-Sourcing-Projekten meistern – Ein Leitfaden...Cybersecurity-Anforderungen in IT-Sourcing-Projekten meistern – Ein Leitfaden...
Cybersecurity-Anforderungen in IT-Sourcing-Projekten meistern – Ein Leitfaden...Capgemini
 
The Six Pillars of Knowledge Economics
The Six Pillars of Knowledge EconomicsThe Six Pillars of Knowledge Economics
The Six Pillars of Knowledge EconomicsCapgemini
 
Payments Trends 2017
Payments Trends 2017Payments Trends 2017
Payments Trends 2017Capgemini
 
Top Ten Trends in Banking 2017
Top Ten Trends in Banking 2017Top Ten Trends in Banking 2017
Top Ten Trends in Banking 2017Capgemini
 
La fabrication additive, c’est quoi?
La fabrication additive, c’est quoi?La fabrication additive, c’est quoi?
La fabrication additive, c’est quoi?Capgemini
 
Top Ten Trends in Lending and Leasing 2017
Top Ten Trends in Lending and Leasing 2017Top Ten Trends in Lending and Leasing 2017
Top Ten Trends in Lending and Leasing 2017Capgemini
 
Top Ten Trends in Insurance 2017
Top Ten Trends in Insurance 2017Top Ten Trends in Insurance 2017
Top Ten Trends in Insurance 2017Capgemini
 
Top Ten Trends in Wealth Management 2017
Top Ten Trends in Wealth Management 2017Top Ten Trends in Wealth Management 2017
Top Ten Trends in Wealth Management 2017Capgemini
 
Cwin16 - lyon - faurecia customer cockpit
Cwin16 - lyon - faurecia customer cockpitCwin16 - lyon - faurecia customer cockpit
Cwin16 - lyon - faurecia customer cockpitCapgemini
 
UNLIMITED by Capgemini: Foundation of Digital Business
UNLIMITED by Capgemini: Foundation of Digital BusinessUNLIMITED by Capgemini: Foundation of Digital Business
UNLIMITED by Capgemini: Foundation of Digital BusinessCapgemini
 
Cwin16 - lyon - customer journey
Cwin16 - lyon - customer journeyCwin16 - lyon - customer journey
Cwin16 - lyon - customer journeyCapgemini
 
Top Ten Trends in Capital Markets 2017
Top Ten Trends in Capital Markets 2017Top Ten Trends in Capital Markets 2017
Top Ten Trends in Capital Markets 2017Capgemini
 
Cwin16 - lyon - exploiter autrement la transformation digitale
Cwin16 - lyon - exploiter autrement la transformation digitaleCwin16 - lyon - exploiter autrement la transformation digitale
Cwin16 - lyon - exploiter autrement la transformation digitaleCapgemini
 
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...Capgemini
 
RDBMS oder NoSQL – warum nicht beides?
RDBMS oder NoSQL – warum nicht beides?RDBMS oder NoSQL – warum nicht beides?
RDBMS oder NoSQL – warum nicht beides?Capgemini
 
Cwin16 - Lyon - partner mark logic - the rise of nosql
Cwin16 - Lyon - partner mark logic - the rise of nosqlCwin16 - Lyon - partner mark logic - the rise of nosql
Cwin16 - Lyon - partner mark logic - the rise of nosqlCapgemini
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lakeJames Serra
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation Caserta
 

Destacado (20)

Blockchain: How the bitcoin technology can change the public sector
Blockchain: How the bitcoin technology can change the public sectorBlockchain: How the bitcoin technology can change the public sector
Blockchain: How the bitcoin technology can change the public sector
 
Data- and database security & GDPR: end-to-end offer
Data- and database security & GDPR: end-to-end offerData- and database security & GDPR: end-to-end offer
Data- and database security & GDPR: end-to-end offer
 
Cybersecurity-Anforderungen in IT-Sourcing-Projekten meistern – Ein Leitfaden...
Cybersecurity-Anforderungen in IT-Sourcing-Projekten meistern – Ein Leitfaden...Cybersecurity-Anforderungen in IT-Sourcing-Projekten meistern – Ein Leitfaden...
Cybersecurity-Anforderungen in IT-Sourcing-Projekten meistern – Ein Leitfaden...
 
The Six Pillars of Knowledge Economics
The Six Pillars of Knowledge EconomicsThe Six Pillars of Knowledge Economics
The Six Pillars of Knowledge Economics
 
Payments Trends 2017
Payments Trends 2017Payments Trends 2017
Payments Trends 2017
 
Top Ten Trends in Banking 2017
Top Ten Trends in Banking 2017Top Ten Trends in Banking 2017
Top Ten Trends in Banking 2017
 
La fabrication additive, c’est quoi?
La fabrication additive, c’est quoi?La fabrication additive, c’est quoi?
La fabrication additive, c’est quoi?
 
Top Ten Trends in Lending and Leasing 2017
Top Ten Trends in Lending and Leasing 2017Top Ten Trends in Lending and Leasing 2017
Top Ten Trends in Lending and Leasing 2017
 
Top Ten Trends in Insurance 2017
Top Ten Trends in Insurance 2017Top Ten Trends in Insurance 2017
Top Ten Trends in Insurance 2017
 
Top Ten Trends in Wealth Management 2017
Top Ten Trends in Wealth Management 2017Top Ten Trends in Wealth Management 2017
Top Ten Trends in Wealth Management 2017
 
Cwin16 - lyon - faurecia customer cockpit
Cwin16 - lyon - faurecia customer cockpitCwin16 - lyon - faurecia customer cockpit
Cwin16 - lyon - faurecia customer cockpit
 
UNLIMITED by Capgemini: Foundation of Digital Business
UNLIMITED by Capgemini: Foundation of Digital BusinessUNLIMITED by Capgemini: Foundation of Digital Business
UNLIMITED by Capgemini: Foundation of Digital Business
 
Cwin16 - lyon - customer journey
Cwin16 - lyon - customer journeyCwin16 - lyon - customer journey
Cwin16 - lyon - customer journey
 
Top Ten Trends in Capital Markets 2017
Top Ten Trends in Capital Markets 2017Top Ten Trends in Capital Markets 2017
Top Ten Trends in Capital Markets 2017
 
Cwin16 - lyon - exploiter autrement la transformation digitale
Cwin16 - lyon - exploiter autrement la transformation digitaleCwin16 - lyon - exploiter autrement la transformation digitale
Cwin16 - lyon - exploiter autrement la transformation digitale
 
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
The Currency of Trust: Why Banks and Insurers Must Make Customer Data Safer a...
 
RDBMS oder NoSQL – warum nicht beides?
RDBMS oder NoSQL – warum nicht beides?RDBMS oder NoSQL – warum nicht beides?
RDBMS oder NoSQL – warum nicht beides?
 
Cwin16 - Lyon - partner mark logic - the rise of nosql
Cwin16 - Lyon - partner mark logic - the rise of nosqlCwin16 - Lyon - partner mark logic - the rise of nosql
Cwin16 - Lyon - partner mark logic - the rise of nosql
 
Big data architectures and the data lake
Big data architectures and the data lakeBig data architectures and the data lake
Big data architectures and the data lake
 
The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation The Data Lake - Balancing Data Governance and Innovation
The Data Lake - Balancing Data Governance and Innovation
 

Similar a Business Data Lake Best Practices

Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaCloudera, Inc.
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemCapgemini
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXtsigitnist02
 
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1SAP Cloud Platform
 
Pivotal Big Data Roadshow
Pivotal Big Data Roadshow Pivotal Big Data Roadshow
Pivotal Big Data Roadshow VMware Tanzu
 
SAP IQ 16 Product Annoucement
SAP IQ 16 Product AnnoucementSAP IQ 16 Product Annoucement
SAP IQ 16 Product AnnoucementDobler Consulting
 
BIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceBIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceSkillspeed
 
Organising the Data Lake - Information Management in a Big Data World
Organising the Data Lake - Information Management in a Big Data WorldOrganising the Data Lake - Information Management in a Big Data World
Organising the Data Lake - Information Management in a Big Data WorldDataWorks Summit/Hadoop Summit
 
Noble-D, a cloud focused analytics consulting firm
Noble-D, a cloud focused analytics consulting firmNoble-D, a cloud focused analytics consulting firm
Noble-D, a cloud focused analytics consulting firmnoble-d
 
Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnectaDigital
 
Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI Holden Ackerman
 
Tableau reseller partner in Djibouti Bilytica Best business Intelligence Comp...
Tableau reseller partner in Djibouti Bilytica Best business Intelligence Comp...Tableau reseller partner in Djibouti Bilytica Best business Intelligence Comp...
Tableau reseller partner in Djibouti Bilytica Best business Intelligence Comp...Carie John
 
Tableau reseller partner in Fiji Bilytica Best business Intelligence Company ...
Tableau reseller partner in Fiji Bilytica Best business Intelligence Company ...Tableau reseller partner in Fiji Bilytica Best business Intelligence Company ...
Tableau reseller partner in Fiji Bilytica Best business Intelligence Company ...Carie John
 
Tableau reseller partner in Brunei Bilytica Best business Intelligence Compan...
Tableau reseller partner in Brunei Bilytica Best business Intelligence Compan...Tableau reseller partner in Brunei Bilytica Best business Intelligence Compan...
Tableau reseller partner in Brunei Bilytica Best business Intelligence Compan...Carie John
 
Tableau reseller partner in Cambodia Bilytica Best business Intelligence Comp...
Tableau reseller partner in Cambodia Bilytica Best business Intelligence Comp...Tableau reseller partner in Cambodia Bilytica Best business Intelligence Comp...
Tableau reseller partner in Cambodia Bilytica Best business Intelligence Comp...Carie John
 
Tableau reseller partner in Croatia Bilytica Best business Intelligence Compa...
Tableau reseller partner in Croatia Bilytica Best business Intelligence Compa...Tableau reseller partner in Croatia Bilytica Best business Intelligence Compa...
Tableau reseller partner in Croatia Bilytica Best business Intelligence Compa...Carie John
 
Tableau reseller partner in Ethiopia Bilytica Best business Intelligence Comp...
Tableau reseller partner in Ethiopia Bilytica Best business Intelligence Comp...Tableau reseller partner in Ethiopia Bilytica Best business Intelligence Comp...
Tableau reseller partner in Ethiopia Bilytica Best business Intelligence Comp...Carie John
 
Tableau reseller partner in Botswana Bilytica Best business Intelligence Comp...
Tableau reseller partner in Botswana Bilytica Best business Intelligence Comp...Tableau reseller partner in Botswana Bilytica Best business Intelligence Comp...
Tableau reseller partner in Botswana Bilytica Best business Intelligence Comp...Carie John
 
Tableau reseller partner in Germany Bilytica Best business Intelligence Compa...
Tableau reseller partner in Germany Bilytica Best business Intelligence Compa...Tableau reseller partner in Germany Bilytica Best business Intelligence Compa...
Tableau reseller partner in Germany Bilytica Best business Intelligence Compa...Carie John
 

Similar a Business Data Lake Best Practices (20)

Capgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with ClouderaCapgemini Leap Data Transformation Framework with Cloudera
Capgemini Leap Data Transformation Framework with Cloudera
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Informatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake EcosystemInformatica Becomes Part of the Business Data Lake Ecosystem
Informatica Becomes Part of the Business Data Lake Ecosystem
 
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTXCustomer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
Customer Presentation - IBM Cloud Pak for Data Overview (Level 100).PPTX
 
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
SAP Cloud Platform - Your Innovation Platform in the Cloud - L1
 
Pivotal Big Data Roadshow
Pivotal Big Data Roadshow Pivotal Big Data Roadshow
Pivotal Big Data Roadshow
 
SAP IQ 16 Product Annoucement
SAP IQ 16 Product AnnoucementSAP IQ 16 Product Annoucement
SAP IQ 16 Product Annoucement
 
BIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in FinanceBIG Data & Hadoop Applications in Finance
BIG Data & Hadoop Applications in Finance
 
Organising the Data Lake - Information Management in a Big Data World
Organising the Data Lake - Information Management in a Big Data WorldOrganising the Data Lake - Information Management in a Big Data World
Organising the Data Lake - Information Management in a Big Data World
 
Noble-D, a cloud focused analytics consulting firm
Noble-D, a cloud focused analytics consulting firmNoble-D, a cloud focused analytics consulting firm
Noble-D, a cloud focused analytics consulting firm
 
Connecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud PlatformConnecta Event: Big Query och dataanalys med Google Cloud Platform
Connecta Event: Big Query och dataanalys med Google Cloud Platform
 
Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI Top Trends in Building Data Lakes for Machine Learning and AI
Top Trends in Building Data Lakes for Machine Learning and AI
 
Tableau reseller partner in Djibouti Bilytica Best business Intelligence Comp...
Tableau reseller partner in Djibouti Bilytica Best business Intelligence Comp...Tableau reseller partner in Djibouti Bilytica Best business Intelligence Comp...
Tableau reseller partner in Djibouti Bilytica Best business Intelligence Comp...
 
Tableau reseller partner in Fiji Bilytica Best business Intelligence Company ...
Tableau reseller partner in Fiji Bilytica Best business Intelligence Company ...Tableau reseller partner in Fiji Bilytica Best business Intelligence Company ...
Tableau reseller partner in Fiji Bilytica Best business Intelligence Company ...
 
Tableau reseller partner in Brunei Bilytica Best business Intelligence Compan...
Tableau reseller partner in Brunei Bilytica Best business Intelligence Compan...Tableau reseller partner in Brunei Bilytica Best business Intelligence Compan...
Tableau reseller partner in Brunei Bilytica Best business Intelligence Compan...
 
Tableau reseller partner in Cambodia Bilytica Best business Intelligence Comp...
Tableau reseller partner in Cambodia Bilytica Best business Intelligence Comp...Tableau reseller partner in Cambodia Bilytica Best business Intelligence Comp...
Tableau reseller partner in Cambodia Bilytica Best business Intelligence Comp...
 
Tableau reseller partner in Croatia Bilytica Best business Intelligence Compa...
Tableau reseller partner in Croatia Bilytica Best business Intelligence Compa...Tableau reseller partner in Croatia Bilytica Best business Intelligence Compa...
Tableau reseller partner in Croatia Bilytica Best business Intelligence Compa...
 
Tableau reseller partner in Ethiopia Bilytica Best business Intelligence Comp...
Tableau reseller partner in Ethiopia Bilytica Best business Intelligence Comp...Tableau reseller partner in Ethiopia Bilytica Best business Intelligence Comp...
Tableau reseller partner in Ethiopia Bilytica Best business Intelligence Comp...
 
Tableau reseller partner in Botswana Bilytica Best business Intelligence Comp...
Tableau reseller partner in Botswana Bilytica Best business Intelligence Comp...Tableau reseller partner in Botswana Bilytica Best business Intelligence Comp...
Tableau reseller partner in Botswana Bilytica Best business Intelligence Comp...
 
Tableau reseller partner in Germany Bilytica Best business Intelligence Compa...
Tableau reseller partner in Germany Bilytica Best business Intelligence Compa...Tableau reseller partner in Germany Bilytica Best business Intelligence Compa...
Tableau reseller partner in Germany Bilytica Best business Intelligence Compa...
 

Más de Capgemini

Top Healthcare Trends 2022
Top Healthcare Trends 2022Top Healthcare Trends 2022
Top Healthcare Trends 2022Capgemini
 
Top P&C Insurance Trends 2022
Top P&C Insurance Trends 2022Top P&C Insurance Trends 2022
Top P&C Insurance Trends 2022Capgemini
 
Commercial Banking Trends book 2022
Commercial Banking Trends book 2022Commercial Banking Trends book 2022
Commercial Banking Trends book 2022Capgemini
 
Top Trends in Payments 2022
Top Trends in Payments 2022Top Trends in Payments 2022
Top Trends in Payments 2022Capgemini
 
Top Trends in Wealth Management 2022
Top Trends in Wealth Management 2022Top Trends in Wealth Management 2022
Top Trends in Wealth Management 2022Capgemini
 
Retail Banking Trends book 2022
Retail Banking Trends book 2022Retail Banking Trends book 2022
Retail Banking Trends book 2022Capgemini
 
Top Life Insurance Trends 2022
Top Life Insurance Trends 2022Top Life Insurance Trends 2022
Top Life Insurance Trends 2022Capgemini
 
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーですキャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーですCapgemini
 
Property & Casualty Insurance Top Trends 2021
Property & Casualty Insurance Top Trends 2021Property & Casualty Insurance Top Trends 2021
Property & Casualty Insurance Top Trends 2021Capgemini
 
Life Insurance Top Trends 2021
Life Insurance Top Trends 2021Life Insurance Top Trends 2021
Life Insurance Top Trends 2021Capgemini
 
Top Trends in Commercial Banking: 2021
Top Trends in Commercial Banking: 2021Top Trends in Commercial Banking: 2021
Top Trends in Commercial Banking: 2021Capgemini
 
Top Trends in Wealth Management: 2021
Top Trends in Wealth Management: 2021Top Trends in Wealth Management: 2021
Top Trends in Wealth Management: 2021Capgemini
 
Top Trends in Payments: 2021
Top Trends in Payments: 2021Top Trends in Payments: 2021
Top Trends in Payments: 2021Capgemini
 
Health Insurance Top Trends 2021
Health Insurance Top Trends 2021Health Insurance Top Trends 2021
Health Insurance Top Trends 2021Capgemini
 
Top Trends in Retail Banking: 2021
Top Trends in Retail Banking: 2021Top Trends in Retail Banking: 2021
Top Trends in Retail Banking: 2021Capgemini
 
Capgemini’s Connected Autonomous Planning
Capgemini’s Connected Autonomous PlanningCapgemini’s Connected Autonomous Planning
Capgemini’s Connected Autonomous PlanningCapgemini
 
Top Trends in Retail Banking: 2020
Top Trends in Retail Banking: 2020Top Trends in Retail Banking: 2020
Top Trends in Retail Banking: 2020Capgemini
 
Top Trends in Life Insurance: 2020
Top Trends in Life Insurance: 2020Top Trends in Life Insurance: 2020
Top Trends in Life Insurance: 2020Capgemini
 
Top Trends in Health Insurance: 2020
Top Trends in Health Insurance: 2020Top Trends in Health Insurance: 2020
Top Trends in Health Insurance: 2020Capgemini
 
Top Trends in Payments: 2020
Top Trends in Payments: 2020Top Trends in Payments: 2020
Top Trends in Payments: 2020Capgemini
 

Más de Capgemini (20)

Top Healthcare Trends 2022
Top Healthcare Trends 2022Top Healthcare Trends 2022
Top Healthcare Trends 2022
 
Top P&C Insurance Trends 2022
Top P&C Insurance Trends 2022Top P&C Insurance Trends 2022
Top P&C Insurance Trends 2022
 
Commercial Banking Trends book 2022
Commercial Banking Trends book 2022Commercial Banking Trends book 2022
Commercial Banking Trends book 2022
 
Top Trends in Payments 2022
Top Trends in Payments 2022Top Trends in Payments 2022
Top Trends in Payments 2022
 
Top Trends in Wealth Management 2022
Top Trends in Wealth Management 2022Top Trends in Wealth Management 2022
Top Trends in Wealth Management 2022
 
Retail Banking Trends book 2022
Retail Banking Trends book 2022Retail Banking Trends book 2022
Retail Banking Trends book 2022
 
Top Life Insurance Trends 2022
Top Life Insurance Trends 2022Top Life Insurance Trends 2022
Top Life Insurance Trends 2022
 
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーですキャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
キャップジェミニ、あなたの『RISE WITH SAP』のパートナーです
 
Property & Casualty Insurance Top Trends 2021
Property & Casualty Insurance Top Trends 2021Property & Casualty Insurance Top Trends 2021
Property & Casualty Insurance Top Trends 2021
 
Life Insurance Top Trends 2021
Life Insurance Top Trends 2021Life Insurance Top Trends 2021
Life Insurance Top Trends 2021
 
Top Trends in Commercial Banking: 2021
Top Trends in Commercial Banking: 2021Top Trends in Commercial Banking: 2021
Top Trends in Commercial Banking: 2021
 
Top Trends in Wealth Management: 2021
Top Trends in Wealth Management: 2021Top Trends in Wealth Management: 2021
Top Trends in Wealth Management: 2021
 
Top Trends in Payments: 2021
Top Trends in Payments: 2021Top Trends in Payments: 2021
Top Trends in Payments: 2021
 
Health Insurance Top Trends 2021
Health Insurance Top Trends 2021Health Insurance Top Trends 2021
Health Insurance Top Trends 2021
 
Top Trends in Retail Banking: 2021
Top Trends in Retail Banking: 2021Top Trends in Retail Banking: 2021
Top Trends in Retail Banking: 2021
 
Capgemini’s Connected Autonomous Planning
Capgemini’s Connected Autonomous PlanningCapgemini’s Connected Autonomous Planning
Capgemini’s Connected Autonomous Planning
 
Top Trends in Retail Banking: 2020
Top Trends in Retail Banking: 2020Top Trends in Retail Banking: 2020
Top Trends in Retail Banking: 2020
 
Top Trends in Life Insurance: 2020
Top Trends in Life Insurance: 2020Top Trends in Life Insurance: 2020
Top Trends in Life Insurance: 2020
 
Top Trends in Health Insurance: 2020
Top Trends in Health Insurance: 2020Top Trends in Health Insurance: 2020
Top Trends in Health Insurance: 2020
 
Top Trends in Payments: 2020
Top Trends in Payments: 2020Top Trends in Payments: 2020
Top Trends in Payments: 2020
 

Último

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 

Último (20)

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 

Business Data Lake Best Practices

  • 1. Business Data Lake best practices OOP Munich, 2017-01-31
  • 2. 2Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices The speaker – Arne Roßmann !  Part of Insights & Data team •  Global team delivering around BI, DWH, Information Strategy & Big Data !  Working in Business Intelligence since 2008 !  Delivering as Big Data architect & Project Manager at our clients •  Defining processes •  Creating architectures •  Leading projects !  Worked in many industries •  Retail, Chemical, Financial, Logistics, Automotive, ...
  • 3. 3Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices Capgemini’s Insights & Data Global Practice With 15,000 experts globally, we are a recognized leader in information-led transformation Capgemini’s Insights & Data Global Practice Expertise in Big Data & Analytics Capgemini Solutions !  Over 15,000 consultants globally !  Industrialized delivery framework Next Gen Business Insights Service Centre !  CUBE lab on the cloud with various demonstrations for BI environments !  Built-in Tools for interactive agile BI and Devops Partner Ecosystem 800+ Big Data & 400+ Data Science Global Consultants Customer Analytics !  Segmentation & Behavior Profiling !  Behavior Propensity scoring !  Pricing Analytics Marketing & Campaign Analytics !  Campaign Recommendation !  Cross Sell/Up Sell !  Campaign Measurement !  Campaign Execution Management Operations Analytics !  Sales/ Demand Forecasting !  Activity Based Costing !  Call Center Analytics Asset/ Equipment Analytics !  Warranty Analytics !  Asset Performance Monitoring !  Predictive Asset Maintenance !  Insights from Connected Equipment Fraud Analytics !  Fraud Scoring !  Collusion Fraud Identification !  Fraud Framework for Public Sector (Trouve) Content Analytics !  Text Mining Accelerators !  Key Opinion Leader !  Content Analytics for Fraud Detection Business Data Lake offering Data Warehouse Optimization Solution Strategic Alliances and partnerships with major vendors Enabling Co-Innovation with the CUBE lab Experience in designing and deploying big data analytics solutions in a varied ecosystems
  • 4. 4Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices Table of Contents !  Why the Business Data Lake works !  Services your Business Data Lake should provide !  Standardize, Industrialize and Innovate!
  • 5. Why the Business Data Lake works
  • 6. 6Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices Big Data creates opportunities but poses challenges as well Where do I start ? “We know that Big Data can be helpful but how do we quantify the benefits and develop a Business Case?” “How do we know which Big Data technology/platform(s) suits our architecture and business requirement? “ “How do I get all the unstructured data (mainly images) out of my operational processes, into an analytical environment that allows me to experiment with data?” “Can we easily combine data from multiple source systems into our Big Data environment and visa versa?” “Can I do it myself? What skills do I need for Big Data? “ “How do I measure the effectiveness or performance of my Big Data initiative? How do I measure ROI?”
  • 7. 7Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices Businesses are looking to close the gap towards ‘insight driven’ Have not completely integrated their data sources across the organization 79% Scattered data lying in silos across the organization Do not have well-defined criteria to measure the success of their own Big Data initiatives 67% Absence of clear business case for funding and implementation Dependence on legacy systems for data processing and management Use cloud based Big Data and analytics platforms 36% Have either scattered pockets of resources or follow a decentralized model for analytics initiatives Ineffective co-ordination of Big Data and analytics teams 47%
  • 8. 8Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices The Business Data Lake delivers what we need for the new data landscape. Govern Where it matters Encourage local requirements Distill on demand Store securely !  Focus on MDM !  Enforce only when sharing !  Treat Corporate as aggregation of Local. !  Let the business decide what they need !  Build from the bottom !  Enable traceability to source disposable data views. !  Store everything ‘as is’ !  Include structured and unstructured data !  Store it cheaply were possible !  Select only what you want !  Business friendly tooling !  Re-usable information maps !  Rapid change cycle.
  • 9. 9Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices Business Challenges driving the need for BDL services Business Enablement !  Achieve real-time optimization of business processes through predictive insights and performance analytics !  Enhance new services and stay competitive in the market !  Be agile, get insights fast ControlControl !  Ensure data security and compliance with EU data regulations !  Enable up- and downscaling according to business needs ControlControl !  Reduce costs associated with the governance and secure storage of data !  Control the costs of running flexible data services !  Reduce Capex
  • 10. Services your Business Data Lake should provide
  • 11. 11Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices Capgemini can help accelerate clients’ journey to Insights.. A cloud powered, big data & insights service; bring all your data in one place, deliver insights at the point of action and generate differentiated business value. ‘Software- Defined’’, full stack cloud infrastructure Flexible ‘Pay-as-you-go’ Commercial Model Secure as a Vault ‘Ready to Harvest’ Sector & Domain Insights Modular Hybrid & Elastic powered by ‘Intelligent Automation’ Get started quickly: with our platform , tools and expertise we can support you at any level to manage your data and harvest insights Your ‘Lab in the Cloud’ !  Experiment !  Hypothesize !  Simulate
  • 12. 12Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices The BDL architecture we built for our clients Pla$orm as a Service Insights Platform UX Portal HTML 5, CSS, Angular JS Big Data Lab Dataset Library Data Science Lab Models Library Insights Lab Ready Insights Common Services Common Services Ingest Algorithm Library Sector Insight Labs Smart Insights 360 Catalog & Provision Meter&Bill ResourceMonitor Provision ServiceCatalog IoTFramework AccessMgmt KnowledgeBase Helpdesk RESTful Web Services Infrastructure as a Service Hybrid Cloud Extensibility - (Bosh, CF) CG-CSB, Virtustream Storage and ParallelizaIon - EMC Isilon Compute & Memory - EMC VCE Big Data Suite – Pivotal, Cloudera, Hortonworks VMware, Cortex Data Management – InformaIca, Talend, HDF, Apache Nify AnalyIcs tools - SAS, Madlib, RStudio,Spark Vmware Security & Governance RSA, AD, Knox, Ranger, Kerberos, Atlas, TDE, W2W, Metron, Falcon ITSM - BMC Remedy •  Common Web UI and UX architecture •  Fully Virtualized compute, storage & Network •  Intelligent automation of provisioning, process, service and support orchestration •  Modular Component Architecture •  Multiple points of presence •  Seamless integration between on-premise, private & public cloud •  Proven reference and component architecture for on- premise builds •  Professional Services teams to build full stack •  Demo of full stack •  Accelerated Partner enablement MD&LM Environment Hadoop DistribuIon – Hortonworks, Cloudera RE&D, Dev Ops - Cloud Foundry, Jira, Jit, Application LayerInfra Layer User Access LayerSoftware & Services VisualisaIon – Qlik, Tableau, SAS VA, D3, High Charts VisualisationVisualisation Self Service Insights Capgemini Private Cloud On Premise Cloud
  • 13. 13Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices BDLaaS – illustrative example service Dashboard
  • 14. 14Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices Standardize, Industrialize and Innovate!
  • 15. 15Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices Big data processing is done in three different stages and we have to cater to each stage differently !  Continuously running analytics processes !  Trust in data quality !  Service levels secured !  Managed by IT Operationalize !  Store everything: internal and external, structured and unstructured !  Store granular data !  Minimal effort on IT Load “as-is” !  Agile and explorative way of work !  Self service !  Fail fast Distill on demand Time Stage Actors Paradigms IT implements data integration process for production Data providers and IT provide and store data Data scientists and engineers explore and analyze data 1 2 3 Allow creativity Encourage collaborationEnsure Business Meta Data & Data Catalogue Enable Data Masking Industrialize! Examples of technical metadata !  Path (folder location) !  Filename !  File type !  File size !  Date of ingestion !  Technical Owner / Group !  For HIVE: !  Nr of records / lines !  Column number !  Column names if available !  Column data types !  Value distribution !  Min/Max Examples of business metadata !  Project (possibly automatic) !  Data set name !  Logical description of dataset !  Data owner/data stewart !  Confidentiality classification !  Line of business
  • 16. 16Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices Start using ELT tools now! Need for more platform updates Need for more denormalization Need for more specialized Know-How " Abstraction layer to Hadoop processing engines " Abstraction layer to NoSQL & SQL databases " Standardized control flows " Availability of developers ELT Tools offer:
  • 17. 17Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices 17Copyright © Capgemini 2016. All Rights Reserved Insights as a Service – Analytics Cloud for Oil & Gas major Well Health Dashboards Equipment Performance Disaster Management Supply Chain Analytics Predictive Maintenancezz z Device Data Driving behavior, GPS, diagnostics, etc. Real Time DataSystem Data Environment DataProject Data • 10 data points per sec • 40 GB per field • 5-6 GB per day per well, • 80TB Well data year • 24x7x365 monitoring usage • Real time charts of streaming data • Real time alerts • Thermal Visualizations
  • 18. 18Copyright © Capgemini 2015. All Rights Reserved OOP MUC 2017 - Business Data Lake best practices We helped customers getting to real value within 12 weeks from idea to production. 1 3 a 5 6 7 9 11 Business Insights Need Integrate DataSet Model Build and Training Iterate and Tune Data Exploration Test Data Science Model Apply Data Science 12 Business Validation Publish Insights Weeks Business Problem Identified Business Value Delivered
  • 19. The information contained in this presentation is proprietary. Copyright © 2015 Capgemini. All rights reserved. Rightshore® is a trademark belonging to Capgemini. www.capgemini.com About Capgemini With more than 145,000 people in over 40 countries, Capgemini is one of the world's foremost providers of consulting, technology and outsourcing services. The Group reported 2014 global revenues of EUR 10.573 billion. Together with its clients, Capgemini creates and delivers business and technology solutions that fit their needs and drive the results they want. A deeply multicultural organization, Capgemini has developed its own way of working, the Collaborative Business Experience™, and draws on Rightshore®, its worldwide delivery model Learn more about us at www.capgemini.com.