天天干天天操天天爱-天天干天天操天天操-天天干天天操天天插-天天干天天操天天干-天天干天天操天天摸

課程目錄: 用Scala和Spark進行大數據分析培訓

4401 人關注
(78637/99817)
課程大綱:

用Scala和Spark進行大數據分析培訓

 

 

 

WEEK 1

Getting Started + Spark Basics

Get up and running with Scala on your computer.

Complete an example assignment to familiarize yourself with our unique way of submitting assignments.

In this week, we'll bridge the gap between data parallelism

in the shared memory scenario (learned in the Parallel Programming course, prerequisite)

and the distributed scenario. We'll look at important concerns that arise in distributed systems,

like latency and failure. We'll go on to cover the basics of Spark,

a functionally-oriented framework for big data processing in Scala.

We'll end the first week by exercising what we learned about Spark

by immediately getting our hands dirty analyzing a real-world data set.

WEEK 2

Reduction Operations & Distributed Key-Value Pairs

This week, we'll look at a special kind of RDD called pair RDDs.

With this specialized kind of RDD in hand, we'll cover essential operations on large data sets,

such as reductions and joins.WEEK 3

Partitioning and Shuffling

This week we'll look at some of the performance implications of using operations like joins.

Is it possible to get the same result without having to pay for the overhead of moving data over the network?

We'll answer this question by delving into how we can partition our data to achieve better data locality,

in turn optimizing some of our Spark jobs.WEEK 4

Structured data: SQL, Dataframes, and Datasets

With our newfound understanding of the cost of data movement

in a Spark job, and some experience optimizing jobs for data locality last week,

this week we'll focus on how we can more easily achieve similar optimizations.

Can structured data help us? We'll look at Spark SQL and its powerful optimizer which uses structure

to apply impressive optimizations. We'll move on to cover DataFrames and Datasets,

which give us a way to mix RDDs with the powerful automatic optimizations behind Spark SQL.


 

主站蜘蛛池模板: 一级黄色网址 | 18hd xxxx国产在线 | 国产免费一级视频 | 加勒比色老久久爱综合网 | 日韩中文字幕高清在线专区 | 日本人一级毛片视频 | 在线观看高清视频bbixx | 亚洲国产精品免费视频 | 成人aaaa | 国产三级视频在线播放 | 午夜一级毛片看看 | 麻豆网址在线观看 | 男女爱爱免费 | 一级片aaaa | 大学生毛片a左线播放 | 狠狠色婷婷综合天天久久丁香 | 欧美日韩亚洲区久久综合 | 国产国语一级毛片 | 国产精品小黄鸭一区二区三区 | 九色在线免费观看 | 亚洲女人性视频 | 久久er精品| 欧美黄色录像 | 国产午夜亚洲精品久久www | 96精品视频 | 老妇xxxxbbbb| 爱爱www在线观看视频高清 | 野外啪啪抽搐一进一出 | 国产小视频在线观看 | 婷婷在线成人免费观看搜索 | 人成xxxwww免费视频 | 亚洲最大黄色网址 | 国产黄网在线 | 日本成本人啪啪黄3d动漫 | 免费国产免费福利视频 | 国内精品网站 | 99久久中文字幕 | 欧美精品毛片 | 操碰在线视频 | 国产尤物二区三区在线观看 | 日本免费看片在线播放 |