Performing a Rolling Upgrade on a CDH 4 Cluster
Minimum Required Role: Cluster Administrator (also provided by Full Administrator)
The rolling upgrade feature takes advantage of parcels and the HDFS high availability to enable you to upgrade your cluster software and restart the upgraded services without taking the entire cluster down. You must have HDFS high availability enabled to perform a rolling upgrade.
This page described how to perform a rolling upgrade between minor versions of CDH 4. For rolling upgrade between CDH 5 versions, see Performing a Rolling Upgrade on a CDH 5 Cluster.
You cannot perform a rolling upgrade from CDH 4 to CDH 5 because of incompatibilities between the two major versions. Instead, follow the instructions for a full upgrade at Upgrading from CDH 4 to CDH 5 Parcels.
- Download, distribute, and activate the parcel for the new software you want to install.
- Perform a rolling restart to restart the services in your cluster. You can do a rolling restart of individual services, or if you have high availability enabled, you can perform a restart of the entire cluster. Cloudera Manager will manually fail over your NameNode at the appropriate point in the process so that your cluster will not be without a functional NameNode.
The steps to perform a rolling upgrade of a cluster are as follows:
- Ensure High Availability is Enabled
- Download, Distribute, and Activate Parcels
- Upgrade the Hive Metastore Database
- Upgrade the Oozie ShareLib
- Upgrade Sqoop
- Upgrade the Sentry Database
- Upgrade Spark
- Restart the Cluster
- Remove CDH 4 Packages
- Restore Hue Database
Ensure High Availability is Enabled
To enable high availability, see HDFS High Availability for instructions. You do not need to enable automatic failover for rolling restart to work, though you can enable it if you want. Automatic failover does not affect the rolling restart operation. If you have JobTracker high availability configured, Cloudera Manager will fail over the JobTracker during the rolling restart, but this is not a requirement for performing a rolling upgrade.
Download, Distribute, and Activate Parcels
- In the Cloudera Manager Admin Console, click the Parcels indicator in the top navigation bar ( or ) to go to the Parcels page.
- In the parcels page, click Download for the version(s) you want to download. If the parcel you want is not shown here — for example, you want to upgrade to version of CDH that is not the most current version — you can make additional parcel repos available through the parcel settings page. If your Cloudera Manager server does not have Internet access, you can obtain the required parcel file(s) and put them into the local repository. See Creating and Using a Remote Parcel Repository for Cloudera Manager for more details.
- When the download has completed, click Distribute for the version you downloaded.
- When the parcel has been distributed and unpacked, the button will change to say Activate.
- Click Activate. You are asked if you want to restart the cluster. Do not restart the cluster at this time.
- Click Close.
Upgrade the Hive Metastore Database
- Go to the Hive service.
- Select Stop to confirm. and click
- Select Upgrade Hive Metastore Database Schema to confirm. and click
- If you have multiple instances of Hive, perform the upgrade on each metastore database.
Upgrade the Oozie ShareLib
- Go to the Oozie service.
- Select Start to confirm. and click
- Select Install Oozie ShareLib to confirm. and click
Upgrade Sqoop
- Go to the Sqoop service.
- Select Stop to confirm. and click
- Select Upgrade Sqoop to confirm. and click
Upgrade the Sentry Database
- CDH 5.1 to 5.2 or higher
- CDH 5.2 to 5.3 or higher
- CDH 5.4 to 5.5 or higher
- Go to the Sentry service.
- Select Stop to confirm. and click
- Select Upgrade Sentry Database Tables to confirm. and click
Upgrade Spark
- Go to the Spark service.
- Select Stop to confirm. and click
- Select Install Spark JAR to confirm. and click
- Select Create Spark History Log Dir to confirm. and click
Restart the Cluster
- On the tab, click to the right of the cluster name and click Rolling Restart to proceed with a rolling restart. Rolling restart is available only if high availability is enabled. Click Restart to perform a normal restart. Services that do not support rolling restart will undergo a normal restart, and will not be available during the restart process.
- For a rolling restart, a pop-up allows you to chose which services you want to restart, and presents caveats to be aware of for those services that can undergo a rolling restart.
Note: If you have just upgraded your Cloudera Manager deployment to 4.6, and are now doing a rolling upgrade of your cluster, you must ensure that MapReduce is restarted before the rest of your services, or the restart may fail. This is necessary to ensure the MapReduce configuration changes are propagated.
Further, if you are upgrading from CDH 4.1 with Impala to CDH 4.2 or 4.3, you must restart MapReduce before Impala restarts (by default Impala is restarted before MapReduce).
The workaround is to perform a restart of MapReduce alone as the first step, then perform a cluster restart of the remaining services.
- Click Confirm to start the rolling restart.
Remove CDH 4 Packages
If your previous installation of CDH was done using packages, remove those packages on all hosts on which you installed the parcels and refresh the symlinks so that clients will run the new software versions. Skip this step if your previous installation was using parcels.
- If Hue is configured to use SQLite as its database:
- Stop the Hue service.
- Back up the desktop.db to a temporary location before deleting the old Hue Common package. The location of the database can be found in the Hue service Configuration tab under Service > Database > Hue's Database Directory.
Important: Removing the Hue Common package will remove your Hue database; if you do not back it up you may lose all your Hue user account information. - Uninstall the CDH packages on each host:
- Not including Impala and Search
Operating System Command RHEL $ sudo yum remove hadoop hue-common bigtop-jsvc bigtop-tomcat
SLES $ sudo zypper remove hadoop hue-common bigtop-jsvc bigtop-tomcat
Ubuntu or Debian $ sudo apt-get purge hadoop hue-common bigtop-jsvc bigtop-tomcat
- Including Impala and Search
Operating System Command RHEL $ sudo yum remove hadoop hue-common impala-shell solr-server 'bigtop-*'
SLES $ sudo zypper remove hadoop hue-common impala-shell solr-server 'bigtop-*'
Ubuntu or Debian $ sudo apt-get purge hadoop hue-common impala-shell solr-server 'bigtop-*'
- Not including Impala and Search
- Restart all the Cloudera Manager Agents to force an update of the symlinks to point to the newly installed components on each
host:
$ sudo service cloudera-scm-agent restart
Restore Hue Database
If you removed CDH 4 packages, restore the Hue database back up.- Go to the Hue service.
- Select Stop to confirm. and click
- Copy the backup from the temporary location to the newly created Hue database directory: /opt/cloudera/parcels/CDH-4.x.0-x.cdh4.x.0.p0.xx/share/hue/desktop.
- Restart the Hue service.
<< Performing a Rolling Upgrade to CDH 5.1 | ©2016 Cloudera, Inc. All rights reserved | Upgrading to CDH Maintenance Releases >> |
Terms and Conditions Privacy Policy |