
Learning Spark PDF Download

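After some time spent organizing the material, adjusting its formatting, and correcting a number of errors, I have compiled it into a PDF for download. Download link:
CSDN (free download, no points required)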

The complete version can be downloaded here: Learning Spark complete version download
Appendix: Learning Spark table of contents

Chapter 1 Introduction to Data Analysis with Spark
  What Is Apache Spark?
  A Unified Stack
  Who Uses Spark, and for What?
  A Brief History of Spark
  Spark Versions and Releases
  Storage Layers for Spark
Chapter 2 Downloading Spark and Getting Started
  Downloading Spark
  Introduction to Spark’s Python and Scala Shells
  Introduction to Core Spark Concepts
  Standalone Applications
  Conclusion
Chapter 3 Programming with RDDs
  RDD Basics
  Creating RDDs
  RDD Operations
  Passing Functions to Spark
  Common Transformations and Actions
  Persistence (Caching)
  Conclusion
Chapter 4 Working with Key/Value Pairs
  Motivation
  Creating Pair RDDs
  Transformations on Pair RDDs
  Actions Available on Pair RDDs
  Data Partitioning (Advanced)
  Conclusion
Chapter 5 Loading and Saving Your Data
  Motivation
  File Formats
  Filesystems
  Structured Data with Spark SQL
  Databases
  Conclusion
Chapter 6 Advanced Spark Programming
  Introduction
  Accumulators
  Broadcast Variables
  Working on a Per-Partition Basis
  Piping to External Programs
  Numeric RDD Operations
  Conclusion
Chapter 7 Running on a Cluster
  Introduction
  Spark Runtime Architecture
  Deploying Applications with spark-submit
  Packaging Your Code and Dependencies
  Scheduling Within and Between Spark Applications
  Cluster Managers
  Which Cluster Manager to Use?
  Conclusion
Chapter 8 Tuning and Debugging Spark
  Configuring Spark with SparkConf
  Components of Execution: Jobs, Tasks, and Stages
  Finding Information
  Key Performance Considerations
  Conclusion
Chapter 9 Spark SQL
  Linking with Spark SQL
  Using Spark SQL in Applications
  Loading and Saving Data
  JDBC/ODBC Server
  User-Defined Functions
  Spark SQL Performance
  Conclusion
Chapter 10 Spark Streaming
  A Simple Example
  Architecture and Abstraction
  Transformations
  Output Operations
  Input Sources
  24/7 Operation
  Streaming UI
  Performance Considerations
  Conclusion
Chapter 11 Machine Learning with MLlib
  Overview
  System Requirements
  Machine Learning Basics
  Data Types
  Algorithms
  Tips and Performance Considerations
  Pipeline API
  Conclusion
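Unless otherwise stated, all articles on this blog are original.
When reposting this article, please add: reposted from 过往记忆 (https://www.iteblog.com/)
Link to this article: [Learning Spark PDF Download] (https://www.iteblog.com/archives/1249.html)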
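6 comments
  1. Let me see whether it's any good.
    guugerer, 2015-02-13 10:33
  2. Very well written.
    kay13, 2015-02-04 10:05
    • Yes, this book really is quite good.
      w397090770, 2015-02-04 10:14
  3. Requesting the download.
    lidl, 2015-01-23 14:55
  4. A great resource!
    fan807017570, 2015-01-23 10:09
  5. Excellent, exactly the study material I needed.
    东林奥术, 2015-01-13 17:25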