Reading Learning Spark: Lightning-Fast Big Data Analysis

Chapter 1: Introduction to Data Analysis with Spark

The components of Spark

  Spark Core: contains the basic functionality of Spark. Spark Core is also home to the API that defines resilient distributed datasets (RDDs).

  Spark SQL (structured data): the package for working with structured data. It allows querying data via SQL as well as the Apache Hive variant of SQL, and it supports many sources of data, including Hive tables, Parquet, and JSON. Spark SQL also lets developers intermix SQL queries with the programmatic data manipulation supported by RDDs in Python, Java, and Scala (see the Python sketch after this component list).

  Spark Streaming (real-time): enables processing of live streams of data.

  MLlib (machine learning): a library of common machine learning functionality.

  GraphX (graph processing): a library for manipulating graphs.
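
  To make the Spark SQL point above concrete (mixing SQL queries with programmatic RDD manipulation), here is a minimal Python sketch, assuming a local installation with PySpark available and Spark 1.3 or later; the table name people and the sample rows are illustrative only.

    from pyspark import SparkContext
    from pyspark.sql import SQLContext, Row

    sc = SparkContext("local", "SqlAndRddDemo")
    sqlContext = SQLContext(sc)

    # Build an RDD programmatically, then register it as a table for SQL.
    rows = sc.parallelize([Row(name="Alice", age=34), Row(name="Bob", age=19)])
    people = sqlContext.createDataFrame(rows)   # assumes the Spark 1.3+ API
    people.registerTempTable("people")

    # Query the table with SQL ...
    adults = sqlContext.sql("SELECT name FROM people WHERE age >= 21")

    # ... and keep manipulating the result with RDD operations.
    print(adults.rdd.map(lambda r: r.name.upper()).collect())

    sc.stop()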

A Brief History of Spark

  Spark is an open source project that has been built and is maintained by a thriving and diverse community of developers.

Chapter 2: Downloading Spark and Getting Started

  Walking through the process of downloading and running Spark in local mode on a single computer.

  You don't need to master Scala, Java, or Python. Spark itself is written in Scala, and runs on the Java Virtual Machine (JVM). To run Spark on either your laptop or a cluster, all you need is an installation of Java 6 or newer. If you wish to use the Python API you will also need a Python interpreter (version 2.6 or newer). Spark does not yet work with Python 3.

Downloading Spark: select the "Pre-built for Hadoop 2.4 and later" package.

Tips:

Windows users may run into issues installing Spark. You can use a zip/tar extraction tool to untar the .tar file. Note: install Spark in a directory whose path contains no spaces (e.g., C:\spark).

After you untar the file you will get a new directory with the same name but without the final .tar suffix.

Damn it (quoting the book):

Most of this book includes code in all of Spark’s languages, but interactive shells are
available only in Python and Scala. Because a shell is very useful for learning the API, we recommend using one of these languages for these examples even if you are a Java
developer. The API is similar in every language.

Change into the Spark directory and type bin\pyspark; you will see the Spark logo and the Python interpreter prompt.
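
For example, once the shell is up you can run a first job right away. This is a minimal interactive session sketch; it assumes you launched the shell from the Spark directory so the bundled README.md is present, and it uses the SparkContext the shell already provides as sc:

    >>> lines = sc.textFile("README.md")   # create an RDD of the file's lines
    >>> lines.count()                      # action: number of lines in the RDD
    >>> lines.first()                      # action: the first line of README.md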
