Planning and Initial Deployment

There are several considerations to take into account before migrating from Apache Hadoop to data-fabric Hadoop.

The first phase of migration is planning. In this phase, you identify the requirements and goals of the migration, anticipate potential issues, and define a migration strategy.

The requirements and goals of the migration depend on a number of factors:

  • Data migration: can you move your datasets individually, or must the data be moved all at once?
  • Downtime: can you tolerate downtime, or is it important to complete the migration with no interruption in service?
  • Customization: what custom patches or applications are running on the cluster?
  • Storage: is there enough space to store the data during the migration? (A rough size check is sketched below.)
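
One way to answer the storage question is to measure each dataset on the source cluster before scheduling the move. The sketch below uses the standard Hadoop FileSystem API to report a directory's logical size and the replicated space it consumes; the DatasetSizeCheck class name and the /user/data default path are illustrative only.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.ContentSummary;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    // Illustrative sketch: report how much space a dataset occupies so you can
    // confirm there is room to stage it during the migration.
    public class DatasetSizeCheck {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();   // picks up the client configuration on the classpath
            FileSystem fs = FileSystem.get(conf);

            // Path to measure; /user/data is a placeholder for your own dataset.
            Path dataset = new Path(args.length > 0 ? args[0] : "/user/data");

            ContentSummary summary = fs.getContentSummary(dataset);
            System.out.printf("Logical size:   %d bytes%n", summary.getLength());
            System.out.printf("Space consumed: %d bytes (includes replication)%n",
                    summary.getSpaceConsumed());
            System.out.printf("Files / dirs:   %d / %d%n",
                    summary.getFileCount(), summary.getDirectoryCount());
        }
    }

Running the same check against the target cluster after each dataset is copied also gives a quick sanity check before cutover.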

The data-fabric Hadoop distribution is 100% plug-and-play compatible with Apache Hadoop, so you do not need to make changes to your applications to run them on a data-fabric cluster. Data-fabric Hadoop automatically configures compression and memory settings, task heap sizes, and local volumes for shuffle data.
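
To make the compatibility point concrete, the familiar MapReduce WordCount example below is written entirely against stock org.apache.hadoop APIs. Per the paragraph above, a jar built from code like this is expected to run on the data-fabric cluster without source changes; the class is shown purely as an illustration, not as data-fabric-specific code.

    import java.io.IOException;
    import java.util.StringTokenizer;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.Path;
    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.Mapper;
    import org.apache.hadoop.mapreduce.Reducer;
    import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
    import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;

    // Standard WordCount: uses only Apache Hadoop interfaces, so the same jar
    // can be submitted on either cluster with the usual hadoop jar command.
    public class WordCount {

        public static class TokenizerMapper
                extends Mapper<Object, Text, Text, IntWritable> {
            private static final IntWritable ONE = new IntWritable(1);
            private final Text word = new Text();

            @Override
            public void map(Object key, Text value, Context context)
                    throws IOException, InterruptedException {
                StringTokenizer itr = new StringTokenizer(value.toString());
                while (itr.hasMoreTokens()) {
                    word.set(itr.nextToken());
                    context.write(word, ONE);
                }
            }
        }

        public static class IntSumReducer
                extends Reducer<Text, IntWritable, Text, IntWritable> {
            private final IntWritable result = new IntWritable();

            @Override
            public void reduce(Text key, Iterable<IntWritable> values, Context context)
                    throws IOException, InterruptedException {
                int sum = 0;
                for (IntWritable val : values) {
                    sum += val.get();
                }
                result.set(sum);
                context.write(key, result);
            }
        }

        public static void main(String[] args) throws Exception {
            Job job = Job.getInstance(new Configuration(), "word count");
            job.setJarByClass(WordCount.class);
            job.setMapperClass(TokenizerMapper.class);
            job.setCombinerClass(IntSumReducer.class);
            job.setReducerClass(IntSumReducer.class);
            job.setOutputKeyClass(Text.class);
            job.setOutputValueClass(IntWritable.class);
            FileInputFormat.addInputPath(job, new Path(args[0]));
            FileOutputFormat.setOutputPath(job, new Path(args[1]));
            System.exit(job.waitForCompletion(true) ? 0 : 1);
        }
    }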

Initial Deployment

The initial data-fabric deployment phase consists of installing, configuring, and testing the data-fabric cluster and any ecosystem components (such as Hive or Pig) on an initial set of nodes. Once you have the data-fabric cluster deployed, you will be able to begin migrating data and applications.
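
Part of testing the new cluster can be a simple write/read round trip through the FileSystem API before any real data is moved. The sketch below is one such check; the ClusterSmokeTest class name and the /tmp/migration-smoke-test.txt path are hypothetical, and the program assumes the data-fabric client configuration is on the classpath.

    import java.nio.charset.StandardCharsets;

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.fs.FSDataInputStream;
    import org.apache.hadoop.fs.FSDataOutputStream;
    import org.apache.hadoop.fs.FileSystem;
    import org.apache.hadoop.fs.Path;

    // Illustrative sketch: write a small file to the newly deployed cluster,
    // read it back, and clean up, to confirm basic client connectivity.
    public class ClusterSmokeTest {
        public static void main(String[] args) throws Exception {
            Configuration conf = new Configuration();
            FileSystem fs = FileSystem.get(conf);
            Path testFile = new Path("/tmp/migration-smoke-test.txt");  // hypothetical test location

            // Write a small marker file, overwriting any previous run.
            try (FSDataOutputStream out = fs.create(testFile, true)) {
                out.write("data-fabric smoke test".getBytes(StandardCharsets.UTF_8));
            }

            // Read it back and print the contents to confirm the round trip.
            try (FSDataInputStream in = fs.open(testFile)) {
                byte[] buffer = new byte[64];
                int n = in.read(buffer);
                System.out.println("Read back: " + new String(buffer, 0, n, StandardCharsets.UTF_8));
            }

            fs.delete(testFile, false);   // remove the test artifact
        }
    }

If the write, read-back, and delete all succeed, basic client connectivity and permissions on the new cluster are in place before migration traffic starts.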

To deploy the data-fabric cluster on the selected nodes, see the Installing Core and Ecosystem Components topic.