Talend Big Data - DCIA

January 30, 2018 | Author: Anonymous | Category: N/A
Share Embed


Short Description

Talend 2014. 3. “Big data is what happened when the cost of keeping information became less than the cost of throwing i...

Description

Talend Big Data Delivering instant value from all your data

© Talend 2014

1

“I may say that this is the greatest factor: the way in which the expedition is equipped.” Roald Amundsen race to the south pole, 1911

© Talend 2014 © Talend 2014

Source of Roal Amundsen portrait: Norwegian National Library

2

2

The New Data Integration Economics 45x

6x

savings. $1,000/TB for Hadoop vs $45,000/TB for traditional

faster ROI using big data analytics tools vs traditional EDW

600x “Big data is what happened when the cost of keeping information became less than the cost of throwing it away.”

active data. Neustar moved from storing 1% of data for 60 days to 100% for one year

$600B revenue shift by 2020 to companies that use big data effectively

– Technology Historian George Dyson

© Talend 2014

3

Macro Trends Revolutionizing the Integration Market The amount of data will grow 50X from 2010 to 2020

64% of enterprises surveyed indicate that they’re deploying or planning Big Data projects

By 2020, 55% of CIOs will source all their critical apps in the Cloud

Source: Gartner and Cisco reports

© Talend 2014

4

CIO: It’s tough at the top

No End-2-End meta-data visibility Expanding Data Volumes Master Data Consistency

Lack of Talent / Skills

© Talend 2014

Hadoop & NoSQL Siloed Data due to SAAS Data Quality Latency & Velocity

5

Existing Infrastructures Under Distress: Architecturally and Economically Metadata Standard Reports

Ad-hoc Query Tools

Weblogs

External Data Sources

Data Mining

Data explosion

Batch to real-time

Transform MDD/OLAP Relational Systems/ERP

Legacy Systems

© Talend 2014

Data Marts (the data warehouse)

Need more active data

Analytical Applications

6

Benefits of Hadoop and NoSQL

NoSQL

NoSQL

Standard Reports

Web Logs

Data explosion

Ad-hoc Query Tools

IOT

Data Mining

ERP

MDD/OLA P

DBMS /EDW Legacy Systems

Analytical Applications

Batch to Real-Time

Longer active data

Data Marts (the data warehouse)

© Talend 2014

7

Different flavors of Big Data across industries Manufacturing

Retail

Banking

• Product as a Service • Innovation in R&D • Preventive Maintenance

• Real time offers and personalization • In store customer experience and clienteling • Dynamic PRicing

• Multi Channel customer journeys • Fraud, anti money Laundering • Personalized offers

Insurance

Heathcare

Transports/Travel

• Frauds & Risk Mgmt • Customer recommendations • Pay per use and personalized services

• Adverse effects Mgmt • Personalized Healthcare. • Prevention and diagnoses • Genomic computation

• Planning and management of events related to logistics • Customer real-time service • Energy saving • Dynamic pricing

Public Sector

Telecom

Consumer Product

• Linked Data • Frauds, crime, Public Safety • Guided learning in Education • Citizen realtionship management

• Multi channel customer journeys • Big Data Monetization (e.g. geo localization) • Fraud and churn mgmt

• Sentiment analysis • Consumer Relationship management • Product as a service

How is this related to your world ? © Talend 2014

8

Top Big Data Challenges

“How To” Challenges

© Talend 2014

Source: Gartner - Survey Analysis: Big Data Adoption in 2013 Shows Substance Behind the Hype - 12 September 2013 - G00255160

9

A Brief History of Hadoop and Talend Apache Project Established

Enterprise Hadoop distribution Vendors Hortonworks, Cloudera, …

HDP 2.0 release include Hadoop2.0 and Yarn

2014 2004

2006

2008

2010

April 2010 v4 include Hive and HDFS support

1st release of Talend Open Studio

2012

Adopted technology

Talend support YARN /Hadoop2.0

2014 2005

2006

2008

2010

2012

Prefered solution for BigData integration

Talend is matching and supporting the Hadoop ecosystem © Talend 2014

10

What is Talend for Big Data? The best way to get rid of manual/hand coding script.  No need to learn : MapReduce, Pig, Hive, Spark, Flume, Kafka, Sqoop, Storm, etc….  Leverage a nice, user-friendly Designer Studio to create your Big Data integration 

© Talend 2014

11

Trying to get from this…

© Talend 2014

12

to this…

Why Talend… Talend generates code that is executed within map reduce. This open approach removes the limitation of a proprietary “engine” to provide a truly unique and powerful set of tools for big data.

© Talend 2014

13

The Talend Platform

© Talend 2014

14

Talend Big Data Sandbox Virtual Image installed with • Four scenarios for you to try: - Clickstream data - Twitter sentiment - Apache weblogs - ETL offload

© Talend 2014

15

View more...

Comments

Copyright © 2017 KINPDF Inc.