Annotated notes on the HBase default configuration file:

hbase-default.xml

hbase.tmp.dir

${java.io.tmpdir}/hbase-${user.name}

Temporary directory on the local filesystem.

Change this setting to point to a location more permanent

than '/tmp', the usual resolve for java.io.tmpdir, as the

'/tmp' directory is cleared on machine restart.

hbase.rootdir

${hbase.tmp.dir}/hbase

The directory shared by region servers and into

which HBase persists. The URL should be 'fully-qualified'

to include the filesystem scheme. For example, to specify the

HDFS directory '/hbase' where the HDFS instance's namenode is

running at namenode.example.org on port 9000, set this value to:

hdfs://namenode.example.org:9000/hbase. By default, we write

to whatever ${hbase.tmp.dir} is set to -- usually /tmp --

so change this configuration or else all data will be lost on

machine restart.
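
A minimal hbase-site.xml sketch of the HDFS example above; the namenode host and port are the illustrative values from this description, not defaults:

  <property>
    <name>hbase.rootdir</name>
    <value>hdfs://namenode.example.org:9000/hbase</value>
  </property>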

hbase.fs.tmp.dir

/user/${user.name}/hbase-staging

A staging directory in default file system (HDFS)

for keeping temporary data.

hbase.bulkload.staging.dir

${hbase.fs.tmp.dir}

A staging directory in default file system (HDFS)

for bulk loading.

hbase.cluster.distributed

false

The mode the cluster will be in. Possible values are

false for standalone mode and true for distributed mode. If

false, startup will run all HBase and ZooKeeper daemons together

in the one JVM.

hbase.zookeeper.quorum

localhost

Comma separated list of servers in the ZooKeeper ensemble (this config should have been named hbase.zookeeper.ensemble). For example, "host1.mydomain.com,host2.mydomain.com,host3.mydomain.com". By default this is set to localhost for local and pseudo-distributed modes of operation. For a fully-distributed setup, this should be set to a full list of ZooKeeper ensemble servers. If HBASE_MANAGES_ZK is set in hbase-env.sh, this is the list of servers which HBase will start/stop ZooKeeper on as part of cluster start/stop. Client-side, we will take this list of ensemble members and put it together with the hbase.zookeeper.clientPort config and pass it into the ZooKeeper constructor as the connectString parameter.
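
As a hedged illustration, a fully-distributed hbase-site.xml would combine the two properties above; the host names are the example hosts from this description:

  <property>
    <name>hbase.cluster.distributed</name>
    <value>true</value>
  </property>
  <property>
    <name>hbase.zookeeper.quorum</name>
    <value>host1.mydomain.com,host2.mydomain.com,host3.mydomain.com</value>
  </property>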

hbase.local.dir

${hbase.tmp.dir}/local/

Directory on the local filesystem to be used

as a local storage.

hbase.master.port

16000

The port the HBase Master should bind to.

hbase.master.info.port

16010

The port for the HBase Master web UI.

Set to -1 if you do not want a UI instance run.

hbase.master.info.bindAddress

0.0.0.0

The bind address for the HBase Master web UI

hbase.master.logcleaner.plugins

org.apache.hadoop.hbase.master.cleaner.TimeToLiveLogCleaner

A comma-separated list of BaseLogCleanerDelegate invoked by the LogsCleaner service. These WAL cleaners are called in order, so put the cleaner that prunes the most files in front. To implement your own BaseLogCleanerDelegate, just put it in HBase's classpath and add the fully qualified class name here. Always add the above default log cleaners in the list.

hbase.master.logcleaner.ttl

600000

Maximum time a WAL can stay in the .oldlogdir directory,

after which it will be cleaned by a Master thread.

hbase.master.hfilecleaner.plugins

org.apache.hadoop.hbase.master.cleaner.TimeToLiveHFileCleaner

A comma-separated list of BaseHFileCleanerDelegate invoked by the HFileCleaner service. These HFile cleaners are called in order, so put the cleaner that prunes the most files in front. To implement your own BaseHFileCleanerDelegate, just put it in HBase's classpath and add the fully qualified class name here. Always add the above default hfile cleaners in the list as they will be overwritten in hbase-site.xml.

hbase.master.catalog.timeout

600000

Timeout value for the Catalog Janitor from the master to META.

hbase.master.infoserver.redirect

true

Whether or not the Master listens to the Master web

UI port (hbase.master.info.port) and redirects requests to the web

UI server shared by the Master and RegionServer.

hbase.regionserver.port

16020

The port the HBase RegionServer binds to.

hbase.regionserver.info.port

16030

The port for the HBase RegionServer web UI

Set to -1 if you do not want the RegionServer UI to run.

hbase.regionserver.info.bindAddress

0.0.0.0

The address for the HBase RegionServer web UI

hbase.regionserver.info.port.auto

false

Whether or not the Master or RegionServer

UI should search for a port to bind to. Enables automatic port

search if hbase.regionserver.info.port is already in use.

Useful for testing, turned off by default.

hbase.regionserver.handler.count

30

Count of RPC Listener instances spun up on RegionServers.

Same property is used by the Master for count of master handlers.

hbase.ipc.server.callqueue.handler.factor

0.1

Factor to determine the number of call queues.

A value of 0 means a single queue shared between all the handlers.

A value of 1 means that each handler has its own queue.

hbase.ipc.server.callqueue.read.ratio

0

Split the call queues into read and write queues. The specified interval (which should be between 0.0 and 1.0) will be multiplied by the number of call queues. A value of 0 indicates to not split the call queues, meaning that both read and write requests will be pushed to the same set of queues. A value lower than 0.5 means that there will be fewer read queues than write queues. A value of 0.5 means there will be the same number of read and write queues. A value greater than 0.5 means that there will be more read queues than write queues. A value of 1.0 means that all the queues except one are used to dispatch read requests.

Example: given a total of 10 call queues, a read.ratio of 0 means that the 10 queues will contain both read and write requests; a read.ratio of 0.3 means that 3 queues will contain only read requests and 7 queues will contain only write requests; a read.ratio of 0.5 means that 5 queues will contain only read requests and 5 queues will contain only write requests; a read.ratio of 0.8 means that 8 queues will contain only read requests and 2 queues will contain only write requests; a read.ratio of 1 means that 9 queues will contain only read requests and 1 queue will contain only write requests.
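
A hedged hbase-site.xml sketch tying the queue-related properties together; the queue counts in the comments are just the arithmetic implied by the defaults above (ignoring rounding), not values stated by this document:

  <property>
    <name>hbase.regionserver.handler.count</name>
    <value>30</value>
  </property>
  <property>
    <!-- 30 handlers * 0.1 = 3 call queues -->
    <name>hbase.ipc.server.callqueue.handler.factor</name>
    <value>0.1</value>
  </property>
  <property>
    <!-- with read.ratio 0, all 3 queues serve both reads and writes -->
    <name>hbase.ipc.server.callqueue.read.ratio</name>
    <value>0</value>
  </property>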

hbase.ipc.server.callqueue.scan.ratio

0

Given the number of read call queues, calculated from the total number of call queues multiplied by the callqueue.read.ratio, the scan.ratio property will split the read call queues into small-read and long-read queues. A value lower than 0.5 means that there will be fewer long-read queues than short-read queues. A value of 0.5 means that there will be the same number of short-read and long-read queues. A value greater than 0.5 means that there will be more long-read queues than short-read queues. A value of 0 or 1 indicates to use the same set of queues for gets and scans.

Example: given a total of 8 read call queues, a scan.ratio of 0 or 1 means that the 8 queues will contain both long and short read requests; a scan.ratio of 0.3 means that 2 queues will contain only long-read requests and 6 queues will contain only short-read requests; a scan.ratio of 0.5 means that 4 queues will contain only long-read requests and 4 queues will contain only short-read requests; a scan.ratio of 0.8 means that 6 queues will contain only long-read requests and 2 queues will contain only short-read requests.

hbase.regionserver.msginterval

3000

Interval between messages from the RegionServer to Master

in milliseconds.

hbase.regionserver.logroll.period

3600000

Period at which we will roll the commit log regardless

of how many edits it has.

hbase.regionserver.logroll.errors.tolerated

2

The number of consecutive WAL close errors we will allow

before triggering a server abort. A setting of 0 will cause the

region server to abort if closing the current WAL writer fails during

log rolling. Even a small value (2 or 3) will allow a region server

to ride over transient HDFS errors.

hbase.regionserver.hlog.reader.impl

org.apache.hadoop.hbase.regionserver.wal.ProtobufLogReader

The WAL file reader implementation.

hbase.regionserver.hlog.writer.impl

org.apache.hadoop.hbase.regionserver.wal.ProtobufLogWriter

The WAL file writer implementation.

hbase.regionserver.global.memstore.size

Maximum size of all memstores in a region server before new updates are blocked and flushes are forced. Defaults to 40% of heap (0.4). Updates are blocked and flushes are forced until the size of all memstores in a region server hits hbase.regionserver.global.memstore.size.lower.limit. The default value in this configuration has been intentionally left empty in order to honor the old hbase.regionserver.global.memstore.upperLimit property if present.

hbase.regionserver.global.memstore.size.lower.limit

Maximum size of all memstores in a region server before flushes are forced. Defaults to 95% of hbase.regionserver.global.memstore.size (0.95). A 100% value for this value causes the minimum possible flushing to occur when updates are blocked due to memstore limiting. The default value in this configuration has been intentionally left empty in order to honor the old hbase.regionserver.global.memstore.lowerLimit property if present.
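
A hedged sketch of setting both limits explicitly in hbase-site.xml; 0.4 and 0.95 are the documented defaults, written out only for illustration:

  <property>
    <name>hbase.regionserver.global.memstore.size</name>
    <value>0.4</value>
  </property>
  <property>
    <name>hbase.regionserver.global.memstore.size.lower.limit</name>
    <value>0.95</value>
  </property>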

hbase.regionserver.optionalcacheflushinterval

3600000

Maximum amount of time an edit lives in memory before being automatically

flushed.

Default 1 hour. Set it to 0 to disable automatic flushing.

hbase.regionserver.catalog.timeout

600000

Timeout value for the Catalog Janitor from the regionserver to META.

hbase.regionserver.dns.interface

default

The name of the Network Interface from which a region server should report its IP address.

hbase.regionserver.dns.nameserver

default

The host name or IP address of the name server (DNS) which a region server should use to determine the host name used by the master for communication and display purposes.

hbase.regionserver.region.split.policy

org.apache.hadoop.hbase.regionserver.IncreasingToUpperBoundRegionSplitPolicy

A split policy determines when a region should be split. The various other split policies that are currently available are ConstantSizeRegionSplitPolicy, DisabledRegionSplitPolicy, DelimitedKeyPrefixRegionSplitPolicy, KeyPrefixRegionSplitPolicy, etc.

hbase.regionserver.regionSplitLimit

1000

Limit for the number of regions after which no more region splitting

should take place.

This is not a hard limit for the number of regions but acts as a guideline for the regionserver to stop splitting after a certain limit. Default is set to 1000.

zookeeper.session.timeout

90000

ZooKeeper session timeout in milliseconds. It is used in

two different ways.

First, this value is used in the ZK client that HBase uses to connect to

the ensemble.

It is also used by HBase when it starts a ZK server and it is passed as

the 'maxSessionTimeout'. See

http://hadoop.apache.org/zookeeper/docs/current/zookeeperProgrammers.html#ch_zkSessions.

For example, if an HBase region server connects to a ZK ensemble that's also managed by HBase, then the session timeout will be the one specified by this configuration. But a region server that connects to an ensemble managed with a different configuration will be subject to that ensemble's maxSessionTimeout. So,

even though HBase might propose using 90 seconds, the ensemble can have a

max timeout lower than this and

it will take precedence. The current default that ZK ships with is 40

seconds, which is lower than HBase's.

zookeeper.znode.parent

/hbase

Root ZNode for HBase in ZooKeeper. All of HBase's ZooKeeper files that are configured with a relative path will go under this node. By default, all of HBase's ZooKeeper file paths are configured with a relative path, so they will all go under this directory unless changed.

zookeeper.znode.rootserver

root-region-server

Path to ZNode holding root region location. This is written by the master and read by clients and region servers. If a relative path is given, the parent folder will be ${zookeeper.znode.parent}. By default, this means the root location is stored at /hbase/root-region-server.

zookeeper.znode.acl.parent

acl

Root ZNode for access control lists.

hbase.zookeeper.dns.interface

default

The name of the Network Interface from which a ZooKeeper server should report its IP address.

hbase.zookeeper.dns.nameserver

default

The host name or IP address of the name server (DNS) which a ZooKeeper server should use to determine the host name used by the master for communication and display purposes.

hbase.zookeeper.peerport

2888

Port used by ZooKeeper peers to talk to each other. See http://hadoop.apache.org/zookeeper/docs/r3.1.1/zookeeperStarted.html#sc_RunningReplicatedZooKeeper for more information.

hbase.zookeeper.leaderport

3888

Port used by ZooKeeper for leader election. See http://hadoop.apache.org/zookeeper/docs/r3.1.1/zookeeperStarted.html#sc_RunningReplicatedZooKeeper for more information.

hbase.zookeeper.useMulti

true

Instructs HBase to make use of ZooKeeper's multi-update functionality. This allows certain ZooKeeper operations to complete more quickly and prevents some issues with rare Replication failure scenarios (see the release note of HBASE-2611 for an example). IMPORTANT: only set this to true if all ZooKeeper servers in the cluster are on version 3.4+ and will not be downgraded. ZooKeeper versions before 3.4 do not support multi-update and will not fail gracefully if multi-update is invoked (see ZOOKEEPER-1495).

hbase.config.read.zookeeper.config

false

Set to true to allow HBaseConfiguration to read the

zoo.cfg file for ZooKeeper properties. Switching this to true

is not recommended, since the functionality of reading ZK

properties from a zoo.cfg file has been deprecated.

hbase.zookeeper.property.initLimit

10

Property from ZooKeeper's config zoo.cfg.

The number of ticks that the initial synchronization phase can take.

hbase.zookeeper.property.syncLimit

5

Property from ZooKeeper's config zoo.cfg. The number of ticks that can pass between sending a request and getting an acknowledgment.

hbase.zookeeper.property.dataDir

${hbase.tmp.dir}/zookeeper

Property from ZooKeeper's config zoo.cfg.

The directory where the snapshot is stored.

hbase.zookeeper.property.clientPort

2181

Property from ZooKeeper's config zoo.cfg.

The port at which the clients will connect.

hbase.zookeeper.property.maxClientCnxns

300

Property from ZooKeeper's config zoo.cfg. Limit on the number of concurrent connections (at the socket level) that a single client, identified by IP address, may make to a single member of the ZooKeeper ensemble. Set high to avoid zk connection issues running standalone and pseudo-distributed.

hbase.client.write.buffer

2097152

Default size of the HTable client write buffer in bytes.

A bigger buffer takes more memory -- on both the client and server

side since server instantiates the passed write buffer to process

it -- but a larger buffer size reduces the number of RPCs made.

For an estimate of server-side memory-used, evaluate

hbase.client.write.buffer * hbase.regionserver.handler.count
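
A hedged sketch of the sizing note above: server-side memory is roughly hbase.client.write.buffer * hbase.regionserver.handler.count, so with the defaults that is 2097152 bytes * 30 handlers ≈ 60 MB; the multiplication is just the estimate this description suggests:

  <property>
    <!-- 2 MB per client write buffer (the documented default) -->
    <name>hbase.client.write.buffer</name>
    <value>2097152</value>
  </property>
  <!-- estimated server-side memory: 2097152 bytes * 30 handlers ≈ 60 MB -->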

hbase.client.pause

100

General client pause value. Used mostly as value to wait

before running a retry of a failed get, region lookup, etc.

See hbase.client.retries.number for description of how we backoff from

this initial pause amount and how this pause works w/ retries.

hbase.client.retries.number

35

Maximum retries. Used as maximum for all retryable operations such as the getting of a cell's value, starting a row update, etc. Retry interval is a rough function based on hbase.client.pause. At first we retry at this interval but then with backoff, we pretty quickly reach retrying every ten seconds. See HConstants#RETRY_BACKOFF for how the backoff ramps up. Change this setting and hbase.client.pause to suit your workload.

hbase.client.max.total.tasks

100

The maximum number of concurrent tasks a single HTable instance will send to the cluster.

hbase.client.max.perserver.tasks

5

The maximum number of concurrent tasks a single HTable instance will send to a single region server.

hbase.client.max.perregion.tasks

1

The maximum number of concurrent connections the client will maintain to a single Region. That is, if there are already hbase.client.max.perregion.tasks writes in progress for this region, new puts won't be sent to this region until some writes finish.

hbase.client.scanner.caching

2147483647

Number of rows that we try to fetch when calling next on a scanner if it is not served from (local, client) memory. This configuration works together with hbase.client.scanner.max.result.size to try and use the network efficiently. The default value is Integer.MAX_VALUE, so that the network will fill the chunk size defined by hbase.client.scanner.max.result.size rather than be limited by a particular number of rows, since the size of rows varies table to table. If you know ahead of time that you will not require more than a certain number of rows from a scan, this configuration should be set to that row limit via Scan#setCaching. Higher caching values will enable faster scanners but will eat up more memory, and some calls of next may take longer and longer times when the cache is empty. Do not set this value such that the time between invocations is greater than the scanner timeout, i.e. hbase.client.scanner.timeout.period.
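
A hedged hbase-site.xml sketch capping a scan-heavy client: the row limit of 100 is purely an illustrative value, not a default from this document; the 2097152 max result size is the default documented further below:

  <property>
    <!-- illustrative per-next() row cap, not the default -->
    <name>hbase.client.scanner.caching</name>
    <value>100</value>
  </property>
  <property>
    <!-- 2 MB per next() call (the documented default) -->
    <name>hbase.client.scanner.max.result.size</name>
    <value>2097152</value>
  </property>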

hbase.client.keyvalue.maxsize

10485760

Specifies the combined maximum allowed size of a KeyValue instance. This is to set an upper boundary for a single entry saved in a storage file. Since they cannot be split, it helps avoid a situation where a region cannot be split any further because the data is too large. It seems wise to set this to a fraction of the maximum region size. Setting it to zero or less disables the check.

hbase.client.scanner.timeout.period

60000

Client scanner lease period in milliseconds.

hbase.client.localityCheck.threadPoolSize

2

hbase.bulkload.retries.number

10

Maximum retries. This is the maximum number of times atomic bulk loads are attempted in the face of splitting operations. 0 means never give up.

hbase.balancer.period

300000

Period at which the region balancer runs in the Master.

hbase.normalizer.period

1800000

Period at which the region normalizer runs in the Master.

hbase.regions.slop

0.2

Rebalance if any regionserver has average + (average *

slop) regions.

hbase.server.thread.wakefrequency

10000

Time to sleep in between searches for work (in

milliseconds).

Used as sleep interval by service threads such as log roller.

hbase.server.versionfile.writeattempts

3

How many times to retry attempting to write a version file before just aborting. Each attempt is separated by the hbase.server.thread.wakefrequency milliseconds.

hbase.hregion.memstore.flush.size

134217728

Memstore will be flushed to disk if size of the memstore

exceeds this number of bytes. Value is checked by a thread that runs

every hbase.server.thread.wakefrequency.

hbase.hregion.percolumnfamilyflush.size.lower.bound

16777216

If FlushLargeStoresPolicy is used, then every time that we hit the total memstore limit, we find out all the column families whose memstores exceed this value, and only flush them, while retaining the others whose memstores are lower than this limit. If none of the families have their memstore size more than this, all the memstores will be flushed (just as usual). This value should be less than half of the total memstore threshold (hbase.hregion.memstore.flush.size).

hbase.hregion.preclose.flush.size

5242880

If the memstores in a region are this size or larger when we go

to close, run a "pre-flush" to clear out memstores before we put up

the region closed flag and take the region offline. On close,

a flush is run under the close flag to empty memory. During

this time the region is offline and we are not taking on any writes.

If the memstore content is large, this flush could take a long time to

complete. The preflush is meant to clean out the bulk of the memstore

before putting up the close flag and taking the region offline so the

flush that runs under the close flag has little to do.

hbase.hregion.memstore.block.multiplier

4

Block updates if the memstore has hbase.hregion.memstore.block.multiplier times hbase.hregion.memstore.flush.size bytes. Useful for preventing runaway memstore during spikes in update traffic. Without an upper-bound, memstore fills such that when it flushes, the resultant flush files take a long time to compact or split, or worse, we OOME.
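
For reference, the blocking threshold implied by the defaults above is 4 * 134217728 bytes = 512 MB per region; a minimal sketch writing both values out:

  <property>
    <!-- 128 MB flush threshold (the documented default) -->
    <name>hbase.hregion.memstore.flush.size</name>
    <value>134217728</value>
  </property>
  <property>
    <!-- updates block at 4 * 128 MB = 512 MB of memstore in the region -->
    <name>hbase.hregion.memstore.block.multiplier</name>
    <value>4</value>
  </property>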

hbase.hregion.memstore.mslab.enabled

true

Enables the MemStore-Local Allocation Buffer,

a feature which works to prevent heap fragmentation under

heavy write loads. This can reduce the frequency of stop-the-world

GC pauses on large heaps.

hbase.hregion.max.filesize

10737418240

Maximum HStoreFile size. If any one of a column families' HStoreFiles has

grown to exceed this value, the hosting HRegion is split in two.

hbase.hregion.majorcompaction

604800000

The time (in milliseconds) between 'major' compactions of all HStoreFiles in a region. Default: set to 7 days. Major compactions tend to happen exactly when you need them least, so enable them such that they run at off-peak times for your deploy; or, since this setting is on a periodicity that is unlikely to match your loading, run the compactions via an external invocation out of a cron job or some such.

hbase.hregion.majorcompaction.jitter

0.50

Jitter outer bound for major compactions. On each regionserver, we multiply the hbase.hregion.majorcompaction interval by some random fraction that is inside the bounds of this maximum. We then add this + or - product to when the next major compaction is to run. The idea is that major compaction does not happen on every regionserver at exactly the same time. The smaller this number, the closer the compactions come together.

hbase.hstore.compactionThreshold

3

If more than this number of HStoreFiles in any one HStore

(one HStoreFile is written per flush of memstore) then a compaction

is run to rewrite all HStoreFiles files as one. Larger numbers

put off compaction but when it runs, it takes longer to complete.

hbase.hstore.flusher.count

2

The number of flush threads. With fewer threads, the memstore flushes will be queued. With more threads, the flushes will be executed in parallel, increasing the HDFS load. This can also lead to more compactions.

hbase.hstore.blockingStoreFiles

10

If more than this number of StoreFiles in any one Store

(one StoreFile is written per flush of MemStore) then updates are

blocked for this HRegion until a compaction is completed, or

until hbase.hstore.blockingWaitTime has been exceeded.

hbase.hstore.blockingWaitTime

90000

The time an HRegion will block updates for after hitting the StoreFile

limit defined by hbase.hstore.blockingStoreFiles.

After this time has elapsed, the HRegion will stop blocking updates even

if a compaction has not been completed.
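
A hedged sketch of the two blocking knobs together; both values are the documented defaults, written out only for illustration:

  <property>
    <name>hbase.hstore.blockingStoreFiles</name>
    <value>10</value>
  </property>
  <property>
    <!-- stop blocking updates after 90 s even if no compaction has completed -->
    <name>hbase.hstore.blockingWaitTime</name>
    <value>90000</value>
  </property>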

hbase.hstore.compaction.max

10

Max number of HStoreFiles to compact per 'minor'

compaction.

hbase.hstore.compaction.kv.max

10

How many KeyValues to read and then write in a batch when flushing or compacting. Do less if big KeyValues and problems with OOME. Do more if wide, small rows.

hbase.hstore.time.to.purge.deletes

0

The amount of time to delay purging of delete markers with future timestamps. If unset, or set to 0, all delete markers, including those with future timestamps, are purged during the next major compaction. Otherwise, a delete marker is kept until the major compaction which occurs after the marker's timestamp plus the value of this setting, in milliseconds.

hbase.storescanner.parallel.seek.enable

false

Enables StoreFileScanner parallel-seeking in StoreScanner,

a feature which can reduce response latency under special conditions.

hbase.storescanner.parallel.seek.threads

10

The default thread pool size if parallel-seeking feature enabled.

hfile.block.cache.size

0.4

Percentage of maximum heap (-Xmx setting) to allocate to

block cache

used by HFile/StoreFile. Default of 0.4 means allocate 40%.

Set to 0 to disable but it's not recommended; you need at least

enough cache to hold the storefile indices.

hfile.block.index.cacheonwrite

false

Allows non-root multi-level index blocks to be put into the block cache at the time the index is being written.

hfile.index.block.max.size

131072

When the size of a leaf-level, intermediate-level, or root-level index block in a multi-level block index grows to this size, the block is written out and a new block is started.

hbase.bucketcache.ioengine

Where to store the contents of the bucketcache. One of: heap, offheap, or file. If a file, set it to file:PATH_TO_FILE. See http://hbase.apache.org/book.html#offheap.blockcache for more information.

hbase.bucketcache.combinedcache.enabled

true

Whether or not the bucketcache is used in league with the LRU on-heap block cache. In this mode, indices and blooms are kept in the LRU blockcache and the data blocks are kept in the bucketcache.

hbase.bucketcache.size

A float that EITHER represents a percentage of total heap memory size to give to the cache (if < 1.0) OR the total capacity in megabytes of the BucketCache. Default: 0.0

hbase.bucketcache.sizes

A comma-separated list of sizes for buckets for the bucketcache. Can be multiple sizes. List block sizes in order from smallest to largest. The sizes you use will depend on your data access patterns. Must be a multiple of 1024, else you will run into 'java.io.IOException: Invalid HFile block magic' when you go to read from cache. If you specify no values here, then you pick up the default bucket sizes set in code (see BucketAllocator#DEFAULT_BUCKET_SIZES).
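
A hedged sketch of an off-heap bucketcache in hbase-site.xml; 'offheap' is one of the engines listed above, and the 8192 MB capacity is purely an illustrative value, not a default:

  <property>
    <name>hbase.bucketcache.ioengine</name>
    <value>offheap</value>
  </property>
  <property>
    <!-- illustrative capacity in MB (values >= 1.0 are interpreted as megabytes) -->
    <name>hbase.bucketcache.size</name>
    <value>8192</value>
  </property>
  <property>
    <!-- documented default: data blocks in the bucketcache, index/bloom blocks in the LRU cache -->
    <name>hbase.bucketcache.combinedcache.enabled</name>
    <value>true</value>
  </property>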

hfile.format.version

3

The HFile format version to use for new files. Version 3 adds support for tags in hfiles (see http://hbase.apache.org/book.html#hbase.tags). Distributed Log Replay requires that tags are enabled. Also see the configuration 'hbase.replication.rpc.codec'.

hfile.block.bloom.cacheonwrite

false

Enables cache-on-write for inline blocks of a compound

Bloom filter.

io.storefile.bloom.block.size

131072

The size in bytes of a single block ("chunk") of a

compound Bloom

filter. This size is approximate, because Bloom blocks can only be

inserted at data block boundaries, and the number of keys per data

block varies.

hbase.rs.cacheblocksonwrite

false

Whether an HFile block should be added to the block cache when the block is finished.

hbase.rpc.timeout

60000

This is for the RPC layer to define how long (in milliseconds) HBase client applications take for a remote call to time out. It uses pings to check connections but will eventually throw a TimeoutException.

hbase.client.operation.timeout

1200000

Operation timeout is a top-level restriction (in milliseconds) that makes sure a blocking operation in Table will not be blocked for more than this. In each operation, if an rpc request fails because of a timeout or other reason, it will retry until success or throw RetriesExhaustedException. But if the total blocking time reaches the operation timeout before retries are exhausted, it will break early and throw SocketTimeoutException.

hbase.cells.scanned.per.heartbeat.check

10000

The number of cells scanned in between heartbeat checks. Heartbeat checks occur during the processing of scans to determine whether or not the server should stop scanning in order to send back a heartbeat message to the client. Heartbeat messages are used to keep the client-server connection alive during long running scans. Small values mean that the heartbeat checks will occur more often and thus will provide a tighter bound on the execution time of the scan. Larger values mean that the heartbeat checks occur less frequently.

hbase.rpc.shortoperation.timeout

10000

This is another version of "hbase.rpc.timeout". For those

RPC operation

within cluster, we rely on this configuration to set a short timeout

limitation

for short operation. For example, short rpc timeout for region server's

trying

to report to active master can benefit quicker master failover process.

hbase.ipc.client.tcpnodelay

true

Set no delay on rpc socket connections. See

http://docs.oracle.com/javase/1.5.0/docs/api/java/net/Socket.html#getTcpNoDelay()

hbase.regionserver.hostname

This config is for experts: don't set its value unless

you really know what you are doing.

When set to a non-empty value, this represents the (external facing)

hostname for the underlying server.

See https://issues.apache.org/jira/browse/HBASE-12954 for details.

hbase.master.keytab.file

Full path to the kerberos keytab file to use for logging in the configured HMaster server principal.

hbase.master.kerberos.principal

Ex. "hbase/_HOST@EXAMPLE.COM". The kerberos principal

name

that should be used to run the HMaster process. The principal name should

be in the form: user/hostname@DOMAIN. If "_HOST" is used as the

hostname

portion, it will be replaced with the actual hostname of the running

instance.

hbase.regionserver.keytab.file

Full path to the kerberos keytab file to use for logging in the configured HRegionServer server principal.

hbase.regionserver.kerberos.principal

Ex. "hbase/_HOST@EXAMPLE.COM". The kerberos principal

name

that should be used to run the HRegionServer process. The principal name

should be in the form: user/hostname@DOMAIN. If "_HOST" is used as

the

hostname portion, it will be replaced with the actual hostname of the

running instance. An entry for this principal must exist in the file

specified in hbase.regionserver.keytab.file

hadoop.policy.file

hbase-policy.xml

The policy configuration file used by RPC servers to make

authorization decisions on client requests. Only used when HBase

security is enabled.

hbase.superuser

List of users or groups (comma-separated) who are allowed full privileges, regardless of stored ACLs, across the cluster. Only used when HBase security is enabled.

hbase.auth.key.update.interval

86400000

The update interval for the master key for authentication tokens in servers, in milliseconds. Only used when HBase security is enabled.

hbase.auth.token.max.lifetime

604800000

The maximum lifetime in milliseconds after which an

authentication token expires. Only used when HBase security is

enabled.

hbase.ipc.client.fallback-to-simple-auth-allowed

false

When a client is configured to attempt a secure connection, but attempts to connect to an insecure server, that server may instruct the client to switch to SASL SIMPLE (unsecure) authentication. This setting controls whether or not the client will accept this instruction from the server. When false (the default), the client will not allow the fallback to SIMPLE authentication, and will abort the connection.

hbase.ipc.server.fallback-to-simple-auth-allowed

false

When a server is configured to require secure connections, it will reject connection attempts from clients using SASL SIMPLE (unsecure) authentication. This setting allows secure servers to accept SASL SIMPLE connections from clients when the client requests. When false (the default), the server will not allow the fallback to SIMPLE authentication, and will reject the connection. WARNING: This setting should ONLY be used as a temporary measure while converting clients over to secure authentication. It MUST BE DISABLED for secure operation.

hbase.coprocessor.enabled

true

Enables or disables coprocessor loading. If 'false'

(disabled), any other coprocessor related configuration will be

ignored.

hbase.coprocessor.user.enabled

true

Enables or disables user (aka. table) coprocessor loading. If 'false' (disabled), any table coprocessor attributes in table descriptors will be ignored. If "hbase.coprocessor.enabled" is 'false', this setting has no effect.

hbase.coprocessor.region.classes

A comma-separated list of Coprocessors that are loaded by default on all tables. For any override coprocessor method, these classes will be called in order. After implementing your own Coprocessor, just put it in HBase's classpath and add the fully qualified class name here. A coprocessor can also be loaded on demand by setting HTableDescriptor.

hbase.rest.port

8080

The port for the HBase REST server.

hbase.rest.readonly

false

Defines the mode the REST server will be started in.

Possible values are:

false: All HTTP methods are permitted - GET/PUT/POST/DELETE.

true: Only the GET method is permitted.

hbase.rest.threads.max

100

The maximum number of threads of the REST server thread

pool.

Threads in the pool are reused to process REST requests. This

controls the maximum number of requests processed concurrently.

It may help to control the memory used by the REST server to

avoid OOM issues. If the thread pool is full, incoming requests

will be queued up and wait for some free threads.

hbase.rest.threads.min

2

The minimum number of threads of the REST server thread pool. The thread pool always has at least this number of threads so the REST server is ready to serve incoming requests.

hbase.rest.support.proxyuser

false

Enables running the REST server to support proxy-user

mode.

hbase.defaults.for.version

1.2.3

This defaults file was compiled for version ${project.version}. This variable is used to make sure that a user doesn't have an old version of hbase-default.xml on the classpath.

hbase.defaults.for.version.skip

false

Set to true to skip the 'hbase.defaults.for.version' check. Setting this to true can be useful in contexts other than the other side of a maven generation; i.e. running in an IDE. You'll want to set this boolean to true to avoid seeing the RuntimeException complaint: "hbase-default.xml file seems to be for an old version of HBase (\${hbase.version}), this version is X.X.X-SNAPSHOT"

hbase.coprocessor.master.classes

A comma-separated list of org.apache.hadoop.hbase.coprocessor.MasterObserver coprocessors that are loaded by default on the active HMaster process. For any implemented coprocessor methods, the listed classes will be called in order. After implementing your own MasterObserver, just put it in HBase's classpath and add the fully qualified class name here.

hbase.coprocessor.abortonerror

true

Set to true to cause the hosting server (master or regionserver) to abort if a coprocessor fails to load, fails to initialize, or throws an unexpected Throwable object. Setting this to false will allow the server to continue execution, but the system wide state of the coprocessor in question will become inconsistent as it will be properly executing in only a subset of servers, so this is most useful for debugging only.

hbase.online.schema.update.enable

true

Set true to enable online schema changes.

hbase.table.lock.enable

true

Set to true to enable locking the table in zookeeper for schema change operations. Table locking from the master prevents concurrent schema modifications from corrupting table state.

hbase.table.max.rowsize

1073741824

Maximum size of a single row in bytes (default is 1 GB) for Get'ting or Scan'ning without the in-row scan flag set. If the row size exceeds this limit, RowTooBigException is thrown to the client.

hbase.thrift.minWorkerThreads

16

The "core size" of the thread pool. New threads are

created on every

connection until this many threads are created.

hbase.thrift.maxWorkerThreads

1000

The maximum size of the thread pool. When the pending

request queue

overflows, new threads are created until their number reaches this number.

After that, the server starts dropping connections.

hbase.thrift.maxQueuedRequests

1000

The maximum number of pending Thrift connections waiting

in the queue. If

there are no idle threads in the pool, the server queues requests. Only

when the queue overflows, new threads are added, up to

hbase.thrift.maxQueuedRequests threads.

hbase.thrift.htablepool.size.max

1000

The upper bound for the table pool used in the Thrift gateway server. Since this is per table name, we assume a single table and so with 1000 default worker threads max this is set to a matching number. For other workloads this number can be adjusted as needed.

hbase.regionserver.thrift.framed

false

Use Thrift TFramedTransport on the server side. This is the recommended transport for thrift servers and requires a similar setting on the client side. Changing this to false will select the default transport, vulnerable to DoS when malformed requests are issued due to THRIFT-601.

hbase.regionserver.thrift.framed.max_frame_size_in_mb

2

Default frame size when using framed transport, in MB.

hbase.regionserver.thrift.compact

false

Use Thrift TCompactProtocol binary serialization

protocol.

hbase.rootdir.perms

700

FS permissions for the root directory in a secure (kerberos) setup. When the master starts, it creates the rootdir with these permissions, or sets the permissions if they do not match.

hbase.data.umask.enable

false

If true, file permissions should be assigned to the files written by the regionserver.

hbase.data.umask

000

File permissions that should be used to write data

files when hbase.data.umask.enable is true

hbase.metrics.showTableName

true

Whether to include the prefix "tbl.tablename" in

per-column family metrics.

If true, for each metric M, per-cf metrics will be reported for

tbl.T.cf.CF.M, if false,

per-cf metrics will be aggregated by column-family across tables, and

reported for cf.CF.M.

In both cases, the aggregated metric M across tables and cfs will be

reported.

hbase.metrics.exposeOperationTimes

true

Whether to report metrics about time taken performing an

operation on the region server. Get, Put, Delete, Increment, and

Append can all

have their times exposed through Hadoop metrics per CF and per region.

hbase.snapshot.enabled

true

Set to true to allow snapshots to be taken / restored /

cloned.

hbase.snapshot.restore.take.failsafe.snapshot

true

Set to true to take a snapshot before the restore operation. The snapshot taken will be used in case of failure, to restore the previous state. At the end of the restore operation this snapshot will be deleted.

hbase.snapshot.restore.failsafe.name

hbase-failsafe-{snapshot.name}-{restore.timestamp}

Name of the failsafe snapshot taken by the restore

operation.

You can use the {snapshot.name}, {table.name} and {restore.timestamp}

variables

to create a name based on what you are restoring.

hbase.server.compactchecker.interval.multiplier

1000

The number that determines how often we scan to see if compaction is necessary. Normally, compactions are done after some events (such as a memstore flush), but if a region didn't receive a lot of writes for some time, or due to different compaction policies, it may be necessary to check it periodically. The interval between checks is hbase.server.compactchecker.interval.multiplier multiplied by hbase.server.thread.wakefrequency.
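
For reference, with the documented defaults the check interval works out to 1000 * 10000 ms = 10,000 s (roughly 2.8 hours); a minimal sketch writing both values out:

  <property>
    <!-- 10 s base wake frequency (the documented default) -->
    <name>hbase.server.thread.wakefrequency</name>
    <value>10000</value>
  </property>
  <property>
    <!-- compaction check interval: 1000 * 10 s = 10,000 s ≈ 2.8 hours -->
    <name>hbase.server.compactchecker.interval.multiplier</name>
    <value>1000</value>
  </property>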

hbase.lease.recovery.timeout

900000

How long we wait on dfs lease recovery in total before

giving up.

hbase.lease.recovery.dfs.timeout

64000

How long between dfs recover lease invocations. Should be larger than the sum of the time it takes for the namenode to issue a block recovery command as part of datanode; dfs.heartbeat.interval and the time it takes for the primary datanode, performing block recovery, to timeout on a dead datanode; usually dfs.client.socket-timeout. See the end of HBASE-8389 for more.

hbase.column.max.version

1

New column family descriptors will use this value as the default number of versions to keep.

hbase.dfs.client.read.shortcircuit.buffer.size

131072

If the DFSClient configuration

dfs.client.read.shortcircuit.buffer.size is unset, we will

use what is configured here as the short circuit read default

direct byte buffer size. DFSClient native default is 1MB; HBase

keeps its HDFS files open so number of file blocks * 1MB soon

starts to add up and threaten OOME because of a shortage of

direct memory. So, we set it down from the default. Make

it > the default hbase block size set in the HColumnDescriptor

which is usually 64k.

hbase.regionserver.checksum.verify

true

If set to true (the default), HBase verifies the checksums for hfile blocks. HBase writes checksums inline with the data when it writes out hfiles. HDFS (as of this writing) writes checksums to a separate file than the data file, necessitating extra seeks. Setting this flag saves some on i/o. Checksum verification by HDFS will be internally disabled on hfile streams when this flag is set. If the hbase-checksum verification fails, we will switch back to using HDFS checksums (so do not disable HDFS checksums! And besides, this feature applies to hfiles only, not to WALs). If this parameter is set to false, then hbase will not verify any checksums; instead it will depend on checksum verification being done in the HDFS client.

hbase.hstore.bytes.per.checksum

16384

Number of bytes in a newly created checksum chunk for HBase-level

checksums in hfile blocks.

hbase.hstore.checksum.algorithm

CRC32C

Name of an algorithm that is used to compute checksums. Possible values

are NULL, CRC32, CRC32C.

hbase.client.scanner.max.result.size

2097152

Maximum number of bytes returned when calling a scanner's next method. Note that when a single row is larger than this limit, the row is still returned completely. The default value is 2MB, which is good for 1GbE networks. With faster and/or high latency networks this value should be increased.

hbase.server.scanner.max.result.size

104857600

Maximum number of bytes returned when calling a scanner's

next method.

Note that when a single row is larger than this limit the row is still

returned completely.

The default value is 100MB.

This is a safety setting to protect the server from OOM situations.

hbase.status.published

false

This setting activates the publication by the master of the status of the region server. When a region server dies and its recovery starts, the master will push this information to the client application, to let them cut the connection immediately instead of waiting for a timeout.

hbase.status.publisher.class

org.apache.hadoop.hbase.master.ClusterStatusPublisher$MulticastPublisher

Implementation of the status publication with a multicast message.

hbase.status.listener.class

org.apache.hadoop.hbase.client.ClusterStatusListener$MulticastListener

Implementation of the status listener with a multicast message.

hbase.status.multicast.address.ip

226.1.1.3

Multicast address to use for the status publication by multicast.

hbase.status.multicast.address.port

16100

Multicast port to use for the status publication by multicast.

hbase.dynamic.jars.dir

${hbase.rootdir}/lib

The directory from which the custom filter/co-processor jars can be loaded dynamically by the region server without the need to restart. However, an already loaded filter/co-processor class would not be un-loaded. See HBASE-1936 for more details.

hbase.security.authentication

simple

Controls whether or not secure authentication is enabled for HBase.

Possible values are 'simple' (no authentication), and 'kerberos'.

hbase.rest.filter.classes

org.apache.hadoop.hbase.rest.filter.GzipFilter

Servlet filters for REST service.

hbase.master.loadbalancer.class

org.apache.hadoop.hbase.master.balancer.StochasticLoadBalancer

Class used to execute the regions balancing when the period occurs.

See the class comment for more on how it works

http://hbase.apache.org/devapidocs/org/apache/hadoop/hbase/master/balancer/StochasticLoadBalancer.html

It replaces the DefaultLoadBalancer as the default (since renamed

as the SimpleLoadBalancer).

hbase.security.exec.permission.checks

false

If this setting is enabled and ACL based access control is active (the AccessController coprocessor is installed either as a system coprocessor or on a table as a table coprocessor), then you must grant all relevant users EXEC privilege if they require the ability to execute coprocessor endpoint calls. EXEC privilege, like any other permission, can be granted globally to a user, or to a user on a per table or per namespace basis. For more information on coprocessor endpoints, see the coprocessor section of the HBase online manual. For more information on granting or revoking permissions using the AccessController, see the security section of the HBase online manual.

hbase.procedure.regionserver.classes

A comma-separated list of org.apache.hadoop.hbase.procedure.RegionServerProcedureManager procedure managers that are loaded by default on the active HRegionServer process. The lifecycle methods (init/start/stop) will be called by the active HRegionServer process to perform the specific globally barriered procedure. After implementing your own RegionServerProcedureManager, just put it in HBase's classpath and add the fully qualified class name here.

hbase.procedure.master.classes

A comma-separated list of org.apache.hadoop.hbase.procedure.MasterProcedureManager procedure managers that are loaded by default on the active HMaster process. A procedure is identified by its signature, and users can use the signature and an instant name to trigger an execution of a globally barriered procedure. After implementing your own MasterProcedureManager, just put it in HBase's classpath and add the fully qualified class name here.

hbase.coordinated.state.manager.class

org.apache.hadoop.hbase.coordination.ZkCoordinatedStateManager

Fully qualified name of class implementing coordinated

state manager.

hbase.regionserver.storefile.refresh.period

0

The period (in milliseconds) for refreshing the store files for the secondary regions. 0 means this feature is disabled. Secondary regions see new files (from flushes and compactions) from the primary once the secondary region refreshes the list of files in the region (there is no notification mechanism). But too frequent refreshes might cause extra Namenode pressure. If the files cannot be refreshed for longer than the HFile TTL (hbase.master.hfilecleaner.ttl), the requests are rejected. Configuring the HFile TTL to a larger value is also recommended with this setting.

hbase.region.replica.replication.enabled

false

Whether asynchronous WAL replication to the secondary region replicas is enabled or not. If this is enabled, a replication peer named "region_replica_replication" will be created which will tail the logs and replicate the mutations to region replicas for tables that have region replication > 1. If this is enabled once, disabling this replication also requires disabling the replication peer using the shell or the ReplicationAdmin java class. Replication to secondary region replicas works over standard inter-cluster replication. So replication, if disabled explicitly, also has to be enabled by setting "hbase.replication" to true for this feature to work.
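
A hedged sketch of enabling the feature together with a storefile refresh period; the 30000 ms refresh is purely illustrative (the documented default for the refresh period is 0, i.e. disabled):

  <property>
    <name>hbase.region.replica.replication.enabled</name>
    <value>true</value>
  </property>
  <property>
    <!-- illustrative 30 s refresh; keep it well under hbase.master.hfilecleaner.ttl -->
    <name>hbase.regionserver.storefile.refresh.period</name>
    <value>30000</value>
  </property>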

hbase.http.filter.initializers

org.apache.hadoop.hbase.http.lib.StaticUserWebFilter

A comma separated list of class names. Each class in the list must extend org.apache.hadoop.hbase.http.FilterInitializer. The corresponding Filter will be initialized. Then, the Filter will be applied to all user facing jsp and servlet web pages. The ordering of the list defines the ordering of the filters. The default StaticUserWebFilter adds a user principal as defined by the hbase.http.staticuser.user property.

hbase.security.visibility.mutations.checkauths

false

This property, if enabled, will check whether the labels in the visibility expression are associated with the user issuing the mutation.

hbase.http.max.threads

10

The maximum number of threads that the HTTP Server will create in its

ThreadPool.

hbase.replication.rpc.codec

org.apache.hadoop.hbase.codec.KeyValueCodecWithTags

The codec that is to be used when replication is enabled so that the tags are also replicated. This is used along with HFileV3, which supports tags in them. If tags are not used or if the hfile version used is HFileV2, then KeyValueCodec can be used as the replication codec. Note that using KeyValueCodecWithTags for replication when there are no tags causes no harm.

hbase.replication.source.maxthreads

10

The maximum number of threads any replication source will use for shipping edits to the sinks in parallel. This also limits the number of chunks each replication batch is broken into. Larger values can improve the replication throughput between the master and slave clusters. The default of 10 will rarely need to be changed.

hbase.http.staticuser.user

dr.stack

The user name to filter as, on static web filters while rendering content. An example use is the HDFS web UI (user to be used for browsing files).

hbase.master.normalizer.class

org.apache.hadoop.hbase.master.normalizer.SimpleRegionNormalizer

Class used to execute the region normalization when the period occurs.

See the class comment for more on how it works

http://hbase.apache.org/devapidocs/org/apache/hadoop/hbase/master/normalizer/SimpleRegionNormalizer.html

hbase.regionserver.handler.abort.on.error.percent

0.5

The percent of region server RPC threads that must fail before the RS aborts. -1 disables aborting; 0 aborts if even a single handler has died; 0.x aborts only when this percent of handlers have died; 1 aborts only when all of the handlers have died.

hbase.snapshot.master.timeout.millis

300000

Timeout for master for the snapshot procedure execution

hbase.snapshot.region.timeout

300000

Timeout for regionservers to keep threads in snapshot request pool waiting

 
