HBase权威指南 - [美] Lars George

HBase权威指南

[美] Lars George

出版时间

2013-10-01

ISBN

9787115318893

评分

★★★★★
书籍介绍

《hbase权威指南》探讨了如何通过使用与hbase高度集成的hadoop将hbase的可伸缩性变得简单;把大型数据集分布到相对廉价的商业服务器集群中;使用本地java客户端,或者通过提供了rest、avro和thrift应用编程接口的网关服务器来访问hbase;了解hbase架构的细节,包括存储格式、预写日志、后台进程等;在hbase中集成mapreduce框架;了解如何调节集群、设计模式、拷贝表、导入批量数据和删除节点等。

《hbase权威指南》适合使用hbase进行数据库开发的高级数据库研发人员阅读

lars george,hbase的committer,cloudera公司的解决方案架构师,主要为hadoop和hbase提供技术支持、咨询和培训工作。他多次在各种hadoop用户组会议和大型会议中发表演讲,如布鲁塞尔的fosdem会议。

AI导读
核心看点
  • 深入解析HBase架构细节与存储格式
  • 详解HBase与Hadoop的高度集成方案
  • 涵盖集群调节、数据导入及运维实战
适合谁读
  • 从事HBase数据库开发的高级研发人员
  • 需要处理海量数据的大数据工程师
  • 希望深入理解NoSQL原理的技术人员
读前提醒
  • 建议结合官方文档与BigTable论文阅读
  • 原理部分讲解较少,需配合实践理解
  • 适合作为案头参考书,而非纯入门读物
读者共识
  • 内容详实,图示清晰,避免源码堆积
  • Coprocessor部分说明详细,参考价值高
  • 经典权威指南,适合系统学习HBase

本导读基于书籍简介、目录、原文摘录、短评和书评生成,不等同于全文精读。

精彩摘录
  • "You should think about rows and columns not being arranged like the classic spreadsheet model, but rather use a tag metaphor, that is, information is available under a specific tag."
  • "Every column value, or cell, either is timestamped implicitly by the system or can be set explicitly by the user. This can be used, for example, to save multiple versions of a value as it changes over time. Different versions of a cell are stored in decreasing time-stamp order, allowing you to read "
  • "There is also the option to run client-supplied code in the address space of the server.The server-side framework to support this is called coprocessors."
  • "The data is stored in store files, called HFiles, which are persistent and ordered immut-able maps from keys to values. Internally, the files are sequences of blocks with a block index stored at the end."
  • "There are three major components to HBase: the client library, one master server, and many region servers."
  • "A more critical feature is the so-called delayed allocation, and it is recommended that you turn it off for Hadoop and HBase use. Delayed allocation keeps the data in memory and reserves the required number of blocks until the data is finally flushed to disk."
  • "HBase implicitly assumes that data is stored in a reliable manner by the filesystem. It has no added means to replicate data or even maintain copies of its own storage files. This functionality must be provided by the lower-level system."
  • "You could implement most of the filter functionality in your client code as well, but you would have to transfer much more data—something you need to avoid at scale."
作者简介
lars george,hbase的committer,cloudera公司的解决方案架构师,主要为hadoop和hbase提供技术支持、咨询和培训工作。他多次在各种hadoop用户组会议和大型会议中发表演讲,如布鲁塞尔的fosdem会议。
目录
《hbase权威指南》
第1章 简介 1
1.1 海量数据的黎明 1
1.2 关系数据库系统的问题 5
1.3 非关系型数据库系统not-only-sql(简称nosql) 7

显示全部
用户评论
2016年读。
了解hbase, 原理部分讲得有点少
大部分内容在hbase ref guide和big table论文都有说明了,对coprocessor说明得比较详细,当做参考书还不错。要在使用中踩踩坑才能更有经验啊
一贯的XXX Definitive Guide风格
http://vonzhou.com/hbase-definitive.html
范欣欣那本比这本好
入门很不错,讲解算是详细
下载
收藏