欢迎关注大数据技术架构与案例微信公众号:过往记忆大数据
过往记忆博客公众号iteblog_hadoop
欢迎关注微信公众号:
过往记忆大数据

[电子书]Learning Apache Spark 2 PDF下载

本书于2017-03由Packt Publishing出版,作者Muhammad Asif Abbasi,全书356页。

通过本书你将学到以下知识:

  • Get an overview of big data analytics and its importance for organizations and data professionals
  • Delve into Spark to see how it is different from existing processing platforms
  • Understand the intricacies of various file formats, and how to process them with Apache Spark.
  • Realize how to deploy Spark with YARN, MESOS or a Stand-alone cluster manager.
  • Learn the concepts of Spark SQL, SchemaRDD, Caching and working with Hive and Parquet file formats
  • Understand the architecture of Spark MLLib while discussing some of the off-the-shelf algorithms that come with Spark.
  • Introduce yourself to the deployment and usage of SparkR.
  • Walk through the importance of Graph computation and the graph processing systems available in the market
  • Check the real world example of Spark by building a recommendation engine with Spark using ALS.
  • Use a Telco data set, to predict customer churn using Random Forests.
Learning-Apache-Spark-2
如果想及时了解Spark、Hadoop或者Hbase相关的文章,欢迎关注微信公共帐号:iteblog_hadoop

本书的章节

  1. Architecture and Installation
  2. Transformations and Actions with Spark RDDs
  3. ETL with Spark
  4. Spark SQL
  5. Spark Streaming
  6. Machine Learning with Spark
  7. GraphX
  8. Operating in Clustered Mode
  9. Building a Recommendation System
  10. Customer Churn Prediction
  11. There's More with Spark

下载地址

提供了PDF、azw3 以及 epub 二种格式的下载。

点击进入下载

本博客文章除特别声明,全部都是原创!
原创文章版权归过往记忆大数据(过往记忆)所有,未经许可不得转载。
本文链接: 【[电子书]Learning Apache Spark 2 PDF下载】(https://www.iteblog.com/archives/2214.html)
喜欢 (29)
分享 (0)
发表我的评论
取消评论

表情
本博客评论系统带有自动识别垃圾评论功能,请写一些有意义的评论,谢谢!