天天干天天操天天爱-天天干天天操天天操-天天干天天操天天插-天天干天天操天天干-天天干天天操天天摸

課程目錄: 用Scala和Spark進(jìn)行大數(shù)據(jù)分析培訓(xùn)

4401 人關(guān)注
(78637/99817)
課程大綱:

用Scala和Spark進(jìn)行大數(shù)據(jù)分析培訓(xùn)

 

 

 

WEEK 1

Getting Started + Spark Basics

Get up and running with Scala on your computer.

Complete an example assignment to familiarize yourself with our unique way of submitting assignments.

In this week, we'll bridge the gap between data parallelism

in the shared memory scenario (learned in the Parallel Programming course, prerequisite)

and the distributed scenario. We'll look at important concerns that arise in distributed systems,

like latency and failure. We'll go on to cover the basics of Spark,

a functionally-oriented framework for big data processing in Scala.

We'll end the first week by exercising what we learned about Spark

by immediately getting our hands dirty analyzing a real-world data set.

WEEK 2

Reduction Operations & Distributed Key-Value Pairs

This week, we'll look at a special kind of RDD called pair RDDs.

With this specialized kind of RDD in hand, we'll cover essential operations on large data sets,

such as reductions and joins.WEEK 3

Partitioning and Shuffling

This week we'll look at some of the performance implications of using operations like joins.

Is it possible to get the same result without having to pay for the overhead of moving data over the network?

We'll answer this question by delving into how we can partition our data to achieve better data locality,

in turn optimizing some of our Spark jobs.WEEK 4

Structured data: SQL, Dataframes, and Datasets

With our newfound understanding of the cost of data movement

in a Spark job, and some experience optimizing jobs for data locality last week,

this week we'll focus on how we can more easily achieve similar optimizations.

Can structured data help us? We'll look at Spark SQL and its powerful optimizer which uses structure

to apply impressive optimizations. We'll move on to cover DataFrames and Datasets,

which give us a way to mix RDDs with the powerful automatic optimizations behind Spark SQL.


 

主站蜘蛛池模板: 色视频免费网站 | 在线视频一区二区三区 | 成人亚洲精品777777 | 黄色一级视屏 | 五月四房播| 日韩黄 | 精品91自产拍在线观看99re | 九色国产在视频线精品视频 | 亚洲香蕉视频 | 久久国产精品无码网站 | 欧美性生活视频 | 亚洲五月花 | 日本成人一区二区三区 | 欧美一级视频精品观看 | 91在线免费看 | 国产逼逼视频 | 成年视频xxxxxx在线 | a级免费观看 | 亚洲国产精品成人综合久久久 | 欧美一区二区三区性 | 黄色毛片免费进入 | 91精品福利老司机在线观看 | 亚洲精品在线观看视频 | 欧美成人h精品网站 | 男女性高清爱潮视频免费观看 | 欲色影视天天一区二区三区色香欲 | 国产精品久久久久久久久鸭 | 欧美日韩黄色 | 国产色影院 | 美女被免费网站91 | 国产小视频2023 | 欧美一级毛片大片免费播放 | 伊人青青久 | 亚洲国产成人久久 | 久久国产精品免费专区 | 日本一级特黄特色大片免费视频 | 成人91视频| 国产日产欧产麻豆精品精品推荐 | 中文字幕成人在线 | 国产亚洲人成网站在线观看 | 久久午夜国产片 |