专栏首页实时计算NoteBook学习(二)-------- Zeppelin简介与安装

NoteBook学习(二)-------- Zeppelin简介与安装

Zeppelin官网地址:

http://zeppelin.apache.org/

Github地址:

https://github.com/apache/zeppelin

(参照官网)

1、什么是zeppelin

  多用途的笔记本。数据的采集 发现 分析 可视化 协作。。 支持20+种后端语言,支持多种解释器 内置集成Spark

2、安装

  这里安装zeppelin0.8.0

Name

Value

Oracle JDK

1.7 (set JAVA_HOME)

OS

Mac OSX Ubuntu 14.X CentOS 6.X Windows 7 Pro SP1

需要环境:JDK 7 以上 centos6以上

  下载完整包,下载地址:

  http://www.apache.org/dyn/closer.cgi/zeppelin/zeppelin-0.8.0/zeppelin-0.8.0-bin-all.tgz

  下载完成后,上传centos服务器 解压:

  tar -zxvf zeppelin-0.8.0-bin-all.tgz

  在目录下启动,注意默认端口是8080:

  bin/zeppelin-daemon.sh start

  停止进程 zeppelin-daemon.sh stop

  注意此时配置没有修改,用的都是默认的 java_home xmx xms ...

   启动成功后,浏览器访问:localhost:8080

   就能看到UI页面

   bin/zeppelin-daemon.sh stop 停止进程
也可以注册成服务
3、配置

zeppelin的配置主要是两个:
配置环境变量   conf/zeppelin-env.sh
配置java属性  conf/zeppelin-site.xml

环境变量优先

官网表格如下:

zeppelin-env.sh

zeppelin-site.xml

Default value

Description

ZEPPELIN_PORT

zeppelin.server.port

8080

Zeppelin server port Note: Please make sure you're not using the same port with Zeppelin web application development port (default: 9000).

ZEPPELIN_SSL_PORT

zeppelin.server.ssl.port

8443

Zeppelin Server ssl port (used when ssl environment/property is set to true)

ZEPPELIN_MEM

N/A

-Xmx1024m -XX:MaxPermSize=512m

JVM mem options

ZEPPELIN_INTP_MEM

N/A

ZEPPELIN_MEM

JVM mem options for interpreter process

ZEPPELIN_JAVA_OPTS

N/A

JVM options

ZEPPELIN_ALLOWED_ORIGINS

zeppelin.server.allowed.origins

*

Enables a way to specify a ',' separated list of allowed origins for REST and websockets. e.g. http://localhost:8080

ZEPPELIN_CREDENTIALS_PERSIST

zeppelin.credentials.persist

true

Persist credentials on a JSON file (credentials.json)

ZEPPELIN_CREDENTIALS_ENCRYPT_KEY

zeppelin.credentials.encryptKey

If provided, encrypt passwords on the credentials.json file (passwords will be stored as plain-text otherwise

N/A

zeppelin.anonymous.allowed

true

The anonymous user is allowed by default.

ZEPPELIN_SERVER_CONTEXT_PATH

zeppelin.server.context.path

/

Context path of the web application

ZEPPELIN_SSL

zeppelin.ssl

false

ZEPPELIN_SSL_CLIENT_AUTH

zeppelin.ssl.client.auth

false

ZEPPELIN_SSL_KEYSTORE_PATH

zeppelin.ssl.keystore.path

keystore

ZEPPELIN_SSL_KEYSTORE_TYPE

zeppelin.ssl.keystore.type

JKS

ZEPPELIN_SSL_KEYSTORE_PASSWORD

zeppelin.ssl.keystore.password

ZEPPELIN_SSL_KEY_MANAGER_PASSWORD

zeppelin.ssl.key.manager.password

ZEPPELIN_SSL_TRUSTSTORE_PATH

zeppelin.ssl.truststore.path

ZEPPELIN_SSL_TRUSTSTORE_TYPE

zeppelin.ssl.truststore.type

ZEPPELIN_SSL_TRUSTSTORE_PASSWORD

zeppelin.ssl.truststore.password

ZEPPELIN_NOTEBOOK_HOMESCREEN

zeppelin.notebook.homescreen

Display note IDs on the Apache Zeppelin homescreen e.g. 2A94M5J1Z

ZEPPELIN_NOTEBOOK_HOMESCREEN_HIDE

zeppelin.notebook.homescreen.hide

false

Hide the note ID set by ZEPPELIN_NOTEBOOK_HOMESCREEN on the Apache Zeppelin homescreen. For the further information, please read Customize your Zeppelin homepage.

ZEPPELIN_WAR_TEMPDIR

zeppelin.war.tempdir

webapps

Location of the jetty temporary directory

ZEPPELIN_NOTEBOOK_DIR

zeppelin.notebook.dir

notebook

The root directory where notebook directories are saved

ZEPPELIN_NOTEBOOK_S3_BUCKET

zeppelin.notebook.s3.bucket

zeppelin

S3 Bucket where notebook files will be saved

ZEPPELIN_NOTEBOOK_S3_USER

zeppelin.notebook.s3.user

user

User name of an S3 buckete.g. bucket/user/notebook/2A94M5J1Z/note.json

ZEPPELIN_NOTEBOOK_S3_ENDPOINT

zeppelin.notebook.s3.endpoint

s3.amazonaws.com

Endpoint for the bucket

ZEPPELIN_NOTEBOOK_S3_KMS_KEY_ID

zeppelin.notebook.s3.kmsKeyID

AWS KMS Key ID to use for encrypting data in S3 (optional)

ZEPPELIN_NOTEBOOK_S3_EMP

zeppelin.notebook.s3.encryptionMaterialsProvider

Class name of a custom S3 encryption materials provider implementation to use for encrypting data in S3 (optional)

ZEPPELIN_NOTEBOOK_S3_SSE

zeppelin.notebook.s3.sse

false

Save notebooks to S3 with server-side encryption enabled

ZEPPELIN_NOTEBOOK_S3_SIGNEROVERRIDE

zeppelin.notebook.s3.signerOverride

Optional override to control which signature algorithm should be used to sign AWS requests

ZEPPELIN_NOTEBOOK_AZURE_CONNECTION_STRING

zeppelin.notebook.azure.connectionString

The Azure storage account connection stringe.g. DefaultEndpointsProtocol=https;AccountName=<accountName>;AccountKey=<accountKey>

ZEPPELIN_NOTEBOOK_AZURE_SHARE

zeppelin.notebook.azure.share

zeppelin

Azure Share where the notebook files will be saved

ZEPPELIN_NOTEBOOK_AZURE_USER

zeppelin.notebook.azure.user

user

Optional user name of an Azure file sharee.g. share/user/notebook/2A94M5J1Z/note.json

ZEPPELIN_NOTEBOOK_STORAGE

zeppelin.notebook.storage

org.apache.zeppelin.notebook.repo.GitNotebookRepo

Comma separated list of notebook storage locations

ZEPPELIN_NOTEBOOK_ONE_WAY_SYNC

zeppelin.notebook.one.way.sync

false

If there are multiple notebook storage locations, should we treat the first one as the only source of truth?

ZEPPELIN_NOTEBOOK_PUBLIC

zeppelin.notebook.public

true

Make notebook public (set only owners) by default when created/imported. If set to falsewill add user to readers and writers as well, making it private and invisible to other users unless permissions are granted.

ZEPPELIN_INTERPRETERS

zeppelin.interpreters

org.apache.zeppelin.spark.SparkInterpreter,org.apache.zeppelin.spark.PySparkInterpreter,org.apache.zeppelin.spark.SparkSqlInterpreter,org.apache.zeppelin.spark.DepInterpreter,org.apache.zeppelin.markdown.Markdown,org.apache.zeppelin.shell.ShellInterpreter,...

Comma separated interpreter configurations [Class] NOTE: This property is deprecated since Zeppelin-0.6.0 and will not be supported from Zeppelin-0.7.0.

ZEPPELIN_INTERPRETER_DIR

zeppelin.interpreter.dir

interpreter

Interpreter directory

ZEPPELIN_INTERPRETER_DEP_MVNREPO

zeppelin.interpreter.dep.mvnRepo

http://repo1.maven.org/maven2/

Remote principal repository for interpreter's additional dependency loading

ZEPPELIN_INTERPRETER_OUTPUT_LIMIT

zeppelin.interpreter.output.limit

102400

Output message from interpreter exceeding the limit will be truncated

ZEPPELIN_INTERPRETER_CONNECT_TIMEOUT

zeppelin.interpreter.connect.timeout

30000

Output message from interpreter exceeding the limit will be truncated

ZEPPELIN_DEP_LOCALREPO

zeppelin.dep.localrepo

local-repo

Local repository for dependency loader.ex)visualiztion modules of npm.

ZEPPELIN_HELIUM_NODE_INSTALLER_URL

zeppelin.helium.node.installer.url

https://nodejs.org/dist/

Remote Node installer url for Helium dependency loader

ZEPPELIN_HELIUM_NPM_INSTALLER_URL

zeppelin.helium.npm.installer.url

http://registry.npmjs.org/

Remote Npm installer url for Helium dependency loader

ZEPPELIN_HELIUM_YARNPKG_INSTALLER_URL

zeppelin.helium.yarnpkg.installer.url

https://github.com/yarnpkg/yarn/releases/download/

Remote Yarn package installer url for Helium dependency loader

ZEPPELIN_WEBSOCKET_MAX_TEXT_MESSAGE_SIZE

zeppelin.websocket.max.text.message.size

1024000

Size(in characters) of the maximum text message that can be received by websocket.

ZEPPELIN_SERVER_DEFAULT_DIR_ALLOWED

zeppelin.server.default.dir.allowed

false

Enable directory listings on server.

ZEPPELIN_NOTEBOOK_GIT_REMOTE_URL

zeppelin.notebook.git.remote.url

GitHub's repository URL. It could be either the HTTP URL or the SSH URL. For example git@github.com:apache/zeppelin.git

ZEPPELIN_NOTEBOOK_GIT_REMOTE_USERNAME

zeppelin.notebook.git.remote.username

token

GitHub username. By default it is `token` to use GitHub's API

ZEPPELIN_NOTEBOOK_GIT_REMOTE_ACCESS_TOKEN

zeppelin.notebook.git.remote.access-token

token

GitHub access token to use GitHub's API. If username/password combination is used and not GitHub API, then this value is the password

ZEPPELIN_NOTEBOOK_GIT_REMOTE_ORIGIN

zeppelin.notebook.git.remote.origin

token

GitHub remote name. Default is `origin`

配置ssl需要更多的配置

4 Zeppelin UI


登录到localhost:8080后,就可以看到UI页面
可以参照官方文档学习相关操作:
https://zeppelin.apache.org/docs/latest/quickstart/explore_ui.html

 左侧 import note 可以导入笔记本

create note 是创建笔记本  也可以看到之前的笔记本  他们默认保存在$ZEPPELIN_HOME/notebook下

右上角可以设置shiro配置   配置信息  凭证  解释器  解释器可以编辑  比如spark的路径

创建好后进入编辑页面,可以编写代码 ,执行。

本文参与腾讯云自媒体分享计划,欢迎正在阅读的你也加入,一起分享。

我来说两句

0 条评论
登录 后参与评论

相关文章

  • 数据架构的未来——浅谈流处理架构

    ​ 业务数据系统存储事务性数据,比如SQL, NOSQL数据库,这种数据拥有准确的数据,比如用户业务,支付业务等体系都可以这样实现,这类需要经常更新,是整体业务...

    实时计算
  • Kafka学习(二)-------- 什么是Kafka

    通过Kafka的快速入门 https://www.cnblogs.com/tree1123/p/11150927.html

    实时计算
  • Kafka2.6.0发布——性能大幅提升

    如果要从2.1.x之前的版本升级,请参阅以下注释,以了解用于存储使用者偏移量的架构的更改。将inter.broker.protocol.version更改为最新...

    实时计算
  • Apache Zeppelin配置

    Zeppelin Properties 有两个位置可以配置Apache Zeppelin。 环境变量可以被定义conf/zeppelin-env.sh(co...

    片刻
  • 【渗透技巧】XSS三重URL编码绕过实例

    跨站脚本攻击(Cross Site Scripting),缩写为XSS,恶意攻击者往Web页面里插入恶意Script代码,当用户浏览该页之时,嵌入其中...

    Bypass
  • 软件工程师需要了解的网络知识:从铜线到HTTP(二)—— 以太网与交换机

    JohnLui:程序员,Swift Contributor,正在写《iOS 可视化编程与 Auto Layout》。 网络七层、四层模型 ? 四层模型是 TC...

    前端黑板报
  • 2019年终盘点【AI版】

    导语:小时候写作文,总爱用”时光如脱缰的野马”开头,渐渐长大才懂得,野马脱缰是多么迅疾多么可怕的事儿。年初烟花炮竹的烟尘刚刚散去,新年伊始的规划和绸缪犹在眼前,...

    用户6888863
  • Quick-Media 中文二维码支持

    Quick-Media 项目提供了一些列多媒体操作的开箱即用工具类,比如图片编辑合成,markdown/html/svg渲染,音频处理;当然还有本文重点说明的二...

    一灰灰blog
  • Quick-Media 中文二维码支持

    Quick-Media 项目提供了一些列多媒体操作的开箱即用工具类,比如图片编辑合成,markdown/html/svg渲染,音频处理;当然还有本文重点说明的二...

    一灰灰blog
  • 如何在 Android 中完成一个 APT 项目的开发?

    APT(Annotation Processing Tool)即注解处理器,是一种处理注解的工具。

    Android技术干货分享

扫码关注云+社区

领取腾讯云代金券