Follow the WeChat official account iteblog_hadoop for shared resources on Hadoop, Spark, Flink, Hive, HBase, Flume, and other big data topics.

[E-book] Scala and Spark for Big Data Analytics PDF Download

This book was published by Packt Publishing in July 2017. It was written by Md. Rezaul Karim and Sridhar Alla and runs to 1587 pages.

Follow the 大数据猿 (bigdata_ai) WeChat official account for the latest big data e-books and news.

What you will learn from this book

  • Understand object-oriented & functional programming concepts of Scala
  • In-depth understanding of Scala collection APIs
  • Work with RDD and DataFrame to learn Spark’s core abstractions
  • Analyse structured and unstructured data using Spark SQL and GraphX
  • Develop scalable and fault-tolerant streaming applications using Spark Structured Streaming
  • Learn machine-learning best practices for classification, regression, dimensionality reduction, and recommendation systems to build predictive models with widely used algorithms in Spark MLlib & ML
  • Build clustering models to cluster a vast amount of data
  • Understand tuning, debugging, and monitoring Spark applications
  • Deploy Spark applications on real clusters in Standalone, Mesos, and YARN

Chapters

  1. INTRODUCTION TO SCALA
  2. OBJECT-ORIENTED SCALA
  3. FUNCTIONAL PROGRAMMING CONCEPTS
  4. COLLECTION APIS
  5. TACKLE BIG DATA – SPARK COMES TO THE PARTY
  6. START WORKING WITH SPARK – REPL AND RDDS
  7. SPECIAL RDD OPERATIONS
  8. INTRODUCE A LITTLE STRUCTURE - SPARK SQL
  9. STREAM ME UP, SCOTTY - SPARK STREAMING
  10. EVERYTHING IS CONNECTED - GRAPHX
  11. LEARNING MACHINE LEARNING - SPARK MLLIB AND SPARK ML
  12. ADVANCED MACHINE LEARNING BEST PRACTICES
  13. MY NAME IS BAYES, NAIVE BAYES
  14. TIME TO PUT SOME ORDER - CLUSTER YOUR DATA WITH SPARK MLLIB
  15. TEXT ANALYTICS USING SPARK ML
  16. SPARK TUNING
  17. TIME TO GO TO CLUSTERLAND - DEPLOYING SPARK ON A CLUSTER
  18. TESTING AND DEBUGGING SPARK
  19. PYSPARK AND SPARKR
  20. ACCELERATING SPARK WITH ALLUXIO
  21. INTERACTIVE DATA ANALYTICS WITH APACHE ZEPPELIN

Download

The book is available for download in three formats: PDF, azw3, and epub.
Click to go to the download page

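Unless otherwise stated, all articles on this blog are original.
When reposting this article, please add: reposted from 过往记忆 (https://www.iteblog.com/)
Link to this article: [E-book] Scala and Spark for Big Data Analytics PDF Download (https://www.iteblog.com/archives/2241.html)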