前往小程序,Get更优阅读体验!
立即前往
首页
学习
活动
专区
工具
TVP
发布
社区首页 >专栏 >kafka Disks and Filesystem(磁盘和文件系统)

kafka Disks and Filesystem(磁盘和文件系统)

作者头像
sanmutongzi
发布2020-03-04 14:40:39
3550
发布2020-03-04 14:40:39
举报
文章被收录于专栏:stream processstream process

转载请注明来源地址:http://www.cnblogs.com/dongxiao-yang/p/5206631.html

We recommend using multiple drives to get good throughput and not sharing the same drives used for Kafka data with application logs or other OS filesystem activity to ensure good latency. You can either RAID these drives together into a single volume or format and mount each drive as its own directory. Since Kafka has replication the redundancy provided by RAID can also be provided at the application level. This choice has several tradeoffs.

我们推荐服务器使用多块硬盘:(1)实现高吞吐 (2)隔离kafka数据文件与应用的日志文件以及其他系统相关的磁盘消耗以保证低延迟。多块硬盘可以raid成一个卷或者每块硬盘单独显示一个盘符挂载。由于kakfa在应用级别已经可以提供raid所提供的数据冗余备份的功能,可以通过几个方面权衡选择的策略。

If you configure multiple data directories partitions will be assigned round-robin to data directories. Each partition will be entirely in one of the data directories. If data is not well balanced among partitions this can lead to load imbalance between disks.

如果配置为多块硬盘,分区将会轮询分布到硬盘文件下,每个分区将会完全落到一块单独磁盘上。如果数据里的分区并不是均匀分布的话会可能导致磁盘之间的负载不均衡。

RAID can potentially do better at balancing load between disks (although it doesn't always seem to) because it balances load at a lower level. The primary downside of RAID is that it is usually a big performance hit for write throughput and reduces the available disk space.

raid先天性的在硬盘间数据均衡上表现的更好(虽然并不总是如此),因为raid是在更底层的层面实现的数据均衡。但其主要缺点是raid通常在写吞吐上会有很高的消耗,并且会减少可用的磁盘空间。

Another potential benefit of RAID is the ability to tolerate disk failures. However our experience has been that rebuilding the RAID array is so I/O intensive that it effectively disables the server, so this does not provide much real availability improvement.

raid 的另一个潜在的好处是能够容忍磁盘故障。然而,我们的经验是,重建raid队列的动作是一个过于io密集的工作,显著地使服务器工作失能,因此这不提供很多实际的可用性改进。

本文参与 腾讯云自媒体分享计划,分享自作者个人站点/博客。
原始发表:2016-02-22 ,如有侵权请联系 cloudcommunity@tencent.com 删除

本文分享自 作者个人站点/博客 前往查看

如有侵权,请联系 cloudcommunity@tencent.com 删除。

本文参与 腾讯云自媒体分享计划  ,欢迎热爱写作的你一起参与!

评论
登录后参与评论
0 条评论
热度
最新
推荐阅读
相关产品与服务
对象存储
对象存储(Cloud Object Storage,COS)是由腾讯云推出的无目录层次结构、无数据格式限制,可容纳海量数据且支持 HTTP/HTTPS 协议访问的分布式存储服务。腾讯云 COS 的存储桶空间无容量上限,无需分区管理,适用于 CDN 数据分发、数据万象处理或大数据计算与分析的数据湖等多种场景。
领券
问题归档专栏文章快讯文章归档关键词归档开发者手册归档开发者手册 Section 归档