Administering MapR-DB

MapR-DB is administered primarily from the command line (maprcli) or through the MapR Control System (MCS). Whether a MapR-DB table is used for binary files or JSON documents, the same types of commands are used, with slightly different parameter options. MapR-DB administration covers tables, columns and column families, and table regions.
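As a rough illustration of the command-line side, the Python sketch below builds argument lists for three maprcli table subcommands, one each for tables, column families, and table regions. The table path /apps/demo_table, the column family name cf1, and the helper function names are hypothetical, and the option names should be checked against the maprcli reference for your MapR release. The sketch only prints the commands; it does not run them.

```python
import shlex

MAPRCLI = "maprcli"  # assumed to be on the PATH of a cluster node


def table_create(path, table_type="json"):
    """Build the command that creates a table of the given type."""
    if table_type not in ("json", "binary"):
        raise ValueError("table_type must be 'json' or 'binary'")
    return [MAPRCLI, "table", "create", "-path", path, "-tabletype", table_type]


def cf_create(path, cf_name):
    """Build the command that adds a column family to an existing table."""
    return [MAPRCLI, "table", "cf", "create", "-path", path, "-cfname", cf_name]


def region_list(path):
    """Build the command that lists the regions of a table."""
    return [MAPRCLI, "table", "region", "list", "-path", path]


if __name__ == "__main__":
    table = "/apps/demo_table"  # hypothetical table path
    for cmd in (table_create(table), cf_create(table, "cf1"), region_list(table)):
        # Print a shell-safe rendering of each command instead of executing it.
        print(" ".join(shlex.quote(arg) for arg in cmd))
```

On a cluster node, each list could be passed to subprocess.run(cmd, check=True) instead of being printed.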

Why Use MapR-DB?

From an administrator's point of view, MapR-DB provides the following capabilities:
  • Minimal administration: Single namespace for files, tables and streams, flexible schema that allows built-in data management and protection, automatic splits and merges as data grows and shrinks, and easy bulk data loading.
  • Self-healing from hardware and software failures: Replicated state and data for instant recovery and automated re-replication of data.
  • Global low-latency replication: Multi-master (that is, active-active) replication, which is important for disaster recovery. Benefits include reduced risk of data loss, application failover, and faster data access.
  • High performance and low latency: Integrated system with fewer software layers, single hop to data, and no compactions with low I/O amplification.
  • Fine-grained security: Access permissions can be granted to tables (as well as files and streams) at a granular level using MapR Access Control Expressions (ACEs), which are designed for flexibility and ease of use.
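To give a feel for what an ACE looks like, the sketch below composes a boolean expression from user and group terms (u: and g: prefixes combined with the &, |, and ! operators) and builds a maprcli table cf edit command that would apply it as a read permission on a column family. The user names, group name, table path, and column family name are hypothetical, and the -readperm option should be verified against the maprcli reference for your release.

```python
def user(name):
    """ACE term matching a single user."""
    return f"u:{name}"


def group(name):
    """ACE term matching a group."""
    return f"g:{name}"


def _wrap(term):
    # Parenthesize compound terms so operator precedence stays explicit.
    return f"({term})" if ("|" in term or "&" in term) else term


def any_of(*terms):
    """Join terms with OR."""
    return "|".join(_wrap(t) for t in terms)


def all_of(*terms):
    """Join terms with AND."""
    return "&".join(_wrap(t) for t in terms)


def negate(term):
    """Negate a term."""
    return "!" + _wrap(term)


def cf_read_perm_cmd(path, cf_name, ace):
    """Build the command that sets the read permission on a column family."""
    return ["maprcli", "table", "cf", "edit",
            "-path", path, "-cfname", cf_name, "-readperm", ace]


if __name__ == "__main__":
    # Allow alice or anyone in the analysts group, but never mallory.
    ace = all_of(any_of(user("alice"), group("analysts")), negate(user("mallory")))
    print(ace)
    print(cf_read_perm_cmd("/apps/demo_table", "cf1", ace))
```

Building the expression from small helpers keeps the parentheses consistent, which matters once expressions mix AND and OR.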

How Do I Get Started?

The following graphic shows the basic steps (with hotspot links) for getting started.

  • Install MapR.
  • Creating a table for bulkload involves specifying the table type (JSON or binary) and setting the bulkload flag.
  • Bulkloading can be done as either a full or an incremental bulkload. Different utilities are used for the bulk load depending on what you are trying to accomplish.
  • Both full and incremental bulkloads can be performed for MapR-DB JSON tables.
  • Three command-line utilities are available for loading documents into JSON tables.
  • Administration of tables covers how to create, read, update, and delete tables, as well as other tasks such as managing permissions and auditing.
  • Administration of column families covers how to create, alter, and delete column families, set permissions on them, and set and display parameter values.
  • Administration of table regions covers the associated administrative tasks, including how to set autosplitting.
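The first few steps can be sketched as a short command plan for a full bulkload into a JSON table: create the table with the bulkload flag set, load the documents, and then clear the flag so the table can serve normal traffic. This is a minimal sketch under assumptions: the table and source paths are hypothetical, mapr importJSON is used as the loading utility, and the option names (-tabletype, -bulkload, -src, -dst) should be confirmed against the documentation for your release.

```python
def full_bulkload_plan(table_path, source_path):
    """Return the ordered commands for a full bulkload into a JSON table."""
    return [
        # 1. Create the JSON table with the bulkload flag set.
        ["maprcli", "table", "create", "-path", table_path,
         "-tabletype", "json", "-bulkload", "true"],
        # 2. Load the JSON documents in bulkload mode.
        ["mapr", "importJSON", "-bulkload", "true",
         "-src", source_path, "-dst", table_path],
        # 3. Clear the bulkload flag once the load has finished.
        ["maprcli", "table", "edit", "-path", table_path,
         "-bulkload", "false"],
    ]


if __name__ == "__main__":
    # Hypothetical table and source locations.
    for step in full_bulkload_plan("/apps/demo_table", "/data/docs.json"):
        print(" ".join(step))
```

Keeping the plan as data makes it easy to review the commands before running them in order on a cluster node.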

Useful Administrator Resources

Links to Resources and Descriptions

maprcli and REST API Syntax
Command-line reference for MapR operations. For MapR-DB, the most applicable commands are those associated with the maprcli table command. These commands include not only table CRUD operations but also table column family, table region, and table replication operations.

Utilities for MapR-DB JSON Tables
Utilities for managing JSON tables, including importing and exporting data to and from JSON tables. Particularly useful are:
  • The mapr importJSON utility, which imports JSON documents into a MapR-DB JSON table.
  • The mapr dbshell utility, which performs CRUD operations on JSON documents and tables.

Utilities for MapR-DB Binary Tables
Utilities for managing binary tables. Particularly useful is CopyTable, which copies data from one MapR-DB binary table to another.
Note: To import HFile or Result files into a MapR-DB binary table, the hbase command can be used. See Loading Data into Binary Tables.

Configuring Security
Information on security tasks for configuring MapR security, managing secure clusters, and administering auditing.

Hadoop and Big Data Security
MapR information on security and big data governance that identifies key advantages, including authentication, authorization, auditing, and encryption.

Provisioning Secure Access Controls in MapR-DB
MapR blog discussing MapR's boolean Access Control Expressions (ACEs), which provide granular permissions, including topics and examples of best practices.

HBase and MapR-DB: Designed for Distribution, Scale, and Speed
MapR blog describing the history of and need for NoSQL databases, including descriptions of the Apache HBase data model.