Learning Spark PDF Download

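  After spending some time tidying up the text, adjusting the formatting, and correcting a number of errors, I have compiled a PDF for download. Download link:
CSDN (free download, no points required)

  The complete version is available here: Learning Spark complete version download
Appendix: Learning Spark table of contents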

Chapter 1 Introduction to Data Analysis with Spark
  What Is Apache Spark?
  A Unified Stack
  Who Uses Spark, and for What?
  A Brief History of Spark
  Spark Versions and Releases
  Storage Layers for Spark
Chapter 2 Downloading Spark and Getting Started
  Downloading Spark
  Introduction to Spark’s Python and Scala Shells
  Introduction to Core Spark Concepts
  Standalone Applications
  Conclusion
Chapter 3 Programming with RDDs
  RDD Basics
  Creating RDDs
  RDD Operations
  Passing Functions to Spark
  Common Transformations and Actions
  Persistence (Caching)
  Conclusion
Chapter 4 Working with Key/Value Pairs
  Motivation
  Creating Pair RDDs
  Transformations on Pair RDDs
  Actions Available on Pair RDDs
  Data Partitioning (Advanced)
  Conclusion
Chapter 5 Loading and Saving Your Data
  Motivation
  File Formats
  Filesystems
  Structured Data with Spark SQL
  Databases
  Conclusion
Chapter 6 Advanced Spark Programming
  Introduction
  Accumulators
  Broadcast Variables
  Working on a Per-Partition Basis
  Piping to External Programs
  Numeric RDD Operations
  Conclusion
Chapter 7 Running on a Cluster
  Introduction
  Spark Runtime Architecture
  Deploying Applications with spark-submit
  Packaging Your Code and Dependencies
  Scheduling Within and Between Spark Applications
  Cluster Managers
  Which Cluster Manager to Use?
  Conclusion
Chapter 8 Tuning and Debugging Spark
  Configuring Spark with SparkConf
  Components of Execution: Jobs, Tasks, and Stages
  Finding Information
  Key Performance Considerations
  Conclusion
Chapter 9 Spark SQL
  Linking with Spark SQL
  Using Spark SQL in Applications
  Loading and Saving Data
  JDBC/ODBC Server
  User-Defined Functions
  Spark SQL Performance
  Conclusion
Chapter 10 Spark Streaming
  A Simple Example
  Architecture and Abstraction
  Transformations
  Output Operations
  Input Sources
  24/7 Operation
  Streaming UI
  Performance Considerations
  Conclusion
Chapter 11 Machine Learning with MLlib
  Overview
  System Requirements
  Machine Learning Basics
  Data Types
  Algorithms
  Tips and Performance Considerations
  Pipeline API
  Conclusion
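Unless otherwise stated, all articles on this blog are original.
Copyright of original articles belongs to 过往记忆大数据 (过往记忆); reproduction without permission is prohibited.
Link to this article: 【Learning Spark PDF Download】(https://www.iteblog.com/archives/1249.html)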
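6 comments
  1. Let me see whether it's any good.

    guugerer, 2015-02-13 10:33
  2. Very well written.

    kay13, 2015-02-04 10:05
    • Yes, this book really is very good.

      w397090770, 2015-02-04 10:14
  3. Requesting the download.

    lidl, 2015-01-23 14:55
  4. A great resource!

    fan807017570, 2015-01-23 10:09
  5. Very good, exactly the study material I needed.

    东林奥术, 2015-01-13 17:25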