前往小程序,Get更优阅读体验!
立即前往
首页
学习
活动
专区
工具
TVP
发布
社区首页 >专栏 >在CentOS6.9中 搭建 Flume

在CentOS6.9中 搭建 Flume

作者头像
北漂的我
发布2019-05-28 12:40:39
4340
发布2019-05-28 12:40:39
举报
文章被收录于专栏:北漂的我

配置 flume 环境变量

代码语言:javascript
复制
export FLUME_HOME=/opt/apache-flume-1.7.0-bin
export PATH=$PATH:$JAVA_HOME/bin:$HADOOP_HOME/bin:$FLUME_HOME/bin:$HOME/bin

然后 记得 source ~/.bash_profile

根据需求,配置不同的 source/channel/sink,添加配置文件到 conf/中

  • flume_exec_hdfs.conf
代码语言:javascript
复制
logAgent.sources = logSource
logAgent.channels = fileChannel
logAgent.sinks = hdfsSink

logAgent.sources.logSource.type = exec
logAgent.sources.logSource.command = tail -F /aura/data/flume-search/logs
logAgent.sources.logSource.channels = fileChannel

logAgent.sinks.hdfsSink.type = hdfs
logAgent.sinks.hdfsSink.hdfs.path = hdfs://bigdata:9000/flume/record/%Y-%m-%d/%H%M
logAgent.sinks.hdfsSink.hdfs.rollCount= 10000
logAgent.sinks.hdfsSink.hdfs.rollSize= 0
logAgent.sinks.hdfsSink.hdfs.batchSize= 1000
logAgent.sinks.hdfsSink.hdfs.filePrefix= transaction_log
logAgent.sinks.hdfsSink.hdfs.rollInterval= 600
logAgent.sinks.hdfsSink.hdfs.roundUnit = minute
logAgent.sinks.hdfsSink.hdfs.fileType = DataStream
logAgent.sinks.hdfsSink.hdfs.useLocalTimeStamp = true
logAgent.sinks.hdfsSink.channel = fileChannel

logAgent.channels.fileChannel.type = memory
logAgent.channels.logSource.capacity=1000
logAgent.channels.logSource.transactionCapacity=100
  • flume_avro_hdfs.conf
代码语言:javascript
复制
logAgent.sources = logSource
logAgent.channels = fileChannel
logAgent.sinks = hdfsSink

logAgent.sources.logSource.type = avro
logAgent.sources.logSource.bind = 127.0.0.1
logAgent.sources.logSource.port = 44444
logAgent.sources.logSource.channels = fileChannel

logAgent.sinks.hdfsSink.type = hdfs
logAgent.sinks.hdfsSink.hdfs.path = hdfs://bigdata:9000/flume/record/%Y-%m-%d/%H%M
logAgent.sinks.hdfsSink.hdfs.rollCount= 10000
logAgent.sinks.hdfsSink.hdfs.rollSize= 0
logAgent.sinks.hdfsSink.hdfs.batchSize= 1000
logAgent.sinks.hdfsSink.hdfs.filePrefix= transaction_log
logAgent.sinks.hdfsSink.hdfs.rollInterval= 600
logAgent.sinks.hdfsSink.hdfs.roundUnit = minute
logAgent.sinks.hdfsSink.hdfs.fileType = DataStream
logAgent.sinks.hdfsSink.hdfs.useLocalTimeStamp = true
logAgent.sinks.hdfsSink.channel = fileChannel

logAgent.channels.fileChannel.type = memory
logAgent.channels.logSource.capacity=1000
logAgent.channels.logSource.transactionCapacity=100
  • flume_dir_hdfs.conf
代码语言:javascript
复制
logAgent.sources = logSource
logAgent.channels = fileChannel
logAgent.sinks = hdfsSink

logAgent.sources.logSource.type = spooldir
logAgent.sources.logSource.spoolDir =/aura/data/flume-search
logAgent.sources.logSource.channels = fileChannel

logAgent.sinks.hdfsSink.type = hdfs
logAgent.sinks.hdfsSink.hdfs.path = hdfs://bigdata:9000/flume/record/%Y-%m-%d/%H%M
logAgent.sinks.hdfsSink.hdfs.rollCount= 10000
logAgent.sinks.hdfsSink.hdfs.rollSize= 0
logAgent.sinks.hdfsSink.hdfs.batchSize= 1000
logAgent.sinks.hdfsSink.hdfs.filePrefix= transaction_log
logAgent.sinks.hdfsSink.hdfs.rollInterval= 600
logAgent.sinks.hdfsSink.hdfs.roundUnit = minute
logAgent.sinks.hdfsSink.hdfs.fileType = DataStream
logAgent.sinks.hdfsSink.hdfs.useLocalTimeStamp = true
logAgent.sinks.hdfsSink.channel = fileChannel

logAgent.channels.fileChannel.type = memory
logAgent.channels.logSource.capacity=1000
logAgent.channels.logSource.transactionCapacity=100
代码语言:javascript
复制
bin/flume-ng agent -n logAgent -c conf -f conf/flume_exec_hdfs.conf -Dflume.root.logger=INFO,console
本文参与 腾讯云自媒体同步曝光计划,分享自作者个人站点/博客。
如有侵权请联系 cloudcommunity@tencent.com 删除

本文分享自 作者个人站点/博客 前往查看

如有侵权,请联系 cloudcommunity@tencent.com 删除。

本文参与 腾讯云自媒体同步曝光计划  ,欢迎热爱写作的你一起参与!

评论
登录后参与评论
0 条评论
热度
最新
推荐阅读
相关产品与服务
大数据
全栈大数据产品,面向海量数据场景,帮助您 “智理无数,心中有数”!
领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档