This is the documentation for Cloudera Enterprise 5.8.x. Documentation for other versions is available at Cloudera Documentation.

DSSD DataNode Metrics

In addition to the base metrics listed below, many aggregate metrics are available. If an entity type has parents defined, you can form every possible aggregate metric using the pattern base_metric_across_parents, where parents is the plural form of the parent type.

In addition, metrics for aggregate totals can be formed by adding the prefix total_ to the front of the metric name.

Use the type-ahead feature in the Cloudera Manager chart browser to find the exact aggregate metric name, in case the plural form does not end in "s".

For example, the following metric names may be valid for DSSD DataNode:

  • alerts_rate_across_clusters
  • total_alerts_rate_across_clusters
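
The naming pattern above can be sketched in a few lines of Python. This is only an illustration: the function name aggregate_metric_names is hypothetical, and the plural form of each parent is passed in explicitly because it may not end in "s". Only "clusters" is confirmed by the example above; check any other plural with the chart browser's type-ahead.

```python
def aggregate_metric_names(base_metric, parent_plurals):
    """Build candidate aggregate metric names for one base metric.

    parent_plurals holds the plural form of each parent type, supplied by
    the caller (an assumption of this sketch; the exact plural should be
    confirmed with the chart browser's type-ahead feature).
    """
    names = []
    for plural in parent_plurals:
        across = f"{base_metric}_across_{plural}"
        names.append(across)              # base_metric_across_parents
        names.append(f"total_{across}")   # same name with the total_ prefix
    return names


print(aggregate_metric_names("alerts_rate", ["clusters"]))
# ['alerts_rate_across_clusters', 'total_alerts_rate_across_clusters']
```

The generated names are candidates only; whether a given name exists still depends on the entity type having that parent defined.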

Some metrics, such as alerts_rate, apply to nearly every metric context. Others only apply to a certain service or role.

For more information about metrics, see Cloudera Manager Metrics and Metric Aggregation.

Metric Name Description Unit Parents CDH Version
alerts_rate The number of alerts. events per second cluster, hdfs, rack CDH 4, CDH 5
cgroup_cpu_system_rate CPU usage of the role's cgroup seconds per second cluster, hdfs, rack CDH 4, CDH 5
cgroup_cpu_user_rate User Space CPU usage of the role's cgroup seconds per second cluster, hdfs, rack CDH 4, CDH 5
cgroup_mem_page_cache Page cache usage of the role's cgroup bytes cluster, hdfs, rack CDH 4, CDH 5
cgroup_mem_rss Resident memory of the role's cgroup bytes cluster, hdfs, rack CDH 4, CDH 5
cgroup_mem_swap Swap usage of the role's cgroup bytes cluster, hdfs, rack CDH 4, CDH 5
cgroup_read_bytes_rate Bytes read from all disks by the role's cgroup bytes per second cluster, hdfs, rack CDH 4, CDH 5
cgroup_read_ios_rate Number of read I/O operations from all disks by the role's cgroup ios per second cluster, hdfs, rack CDH 4, CDH 5
cgroup_write_bytes_rate Bytes written to all disks by the role's cgroup bytes per second cluster, hdfs, rack CDH 4, CDH 5
cgroup_write_ios_rate Number of write I/O operations to all disks by the role's cgroup ios per second cluster, hdfs, rack CDH 4, CDH 5
cpu_system_rate Total System CPU seconds per second cluster, hdfs, rack CDH 4, CDH 5
cpu_user_rate Total CPU user time seconds per second cluster, hdfs, rack CDH 4, CDH 5
events_critical_rate The number of critical events. events per second cluster, hdfs, rack CDH 4, CDH 5
events_important_rate The number of important events. events per second cluster, hdfs, rack CDH 4, CDH 5
events_informational_rate The number of informational events. events per second cluster, hdfs, rack CDH 4, CDH 5
fd_max Maximum number of file descriptors file descriptors cluster, hdfs, rack CDH 4, CDH 5
fd_open Open file descriptors. file descriptors cluster, hdfs, rack CDH 4, CDH 5
health_bad_rate Percentage of Time with Bad Health seconds per second cluster, hdfs, rack CDH 4, CDH 5
health_concerning_rate Percentage of Time with Concerning Health seconds per second cluster, hdfs, rack CDH 4, CDH 5
health_disabled_rate Percentage of Time with Disabled Health seconds per second cluster, hdfs, rack CDH 4, CDH 5
health_good_rate Percentage of Time with Good Health seconds per second cluster, hdfs, rack CDH 4, CDH 5
health_unknown_rate Percentage of Time with Unknown Health seconds per second cluster, hdfs, rack CDH 4, CDH 5
mem_rss Resident memory used bytes cluster, hdfs, rack CDH 4, CDH 5
mem_swap Amount of swap memory used by this role's process. bytes cluster, hdfs, rack CDH 4, CDH 5
mem_virtual Virtual memory used bytes cluster, hdfs, rack CDH 4, CDH 5
oom_exits_rate The number of times the role's backing process was killed due to an OutOfMemory error. This counter is only incremented if the Cloudera Manager "Kill When Out of Memory" option is enabled. exits per second cluster, hdfs, rack CDH 4, CDH 5
read_bytes_rate The number of bytes read from the device bytes per second cluster, hdfs, rack CDH 4, CDH 5
unexpected_exits_rate The number of times the role's backing process exited unexpectedly. exits per second cluster, hdfs, rack CDH 4, CDH 5
uptime For a host, the amount of time since the host was booted. For a role, the uptime of the backing process. seconds cluster, hdfs, rack CDH 4, CDH 5
write_bytes_rate The number of bytes written to the device bytes per second cluster, hdfs, rack CDH 4, CDH 5
hdfs_block_checksum_op_avg_time Block Checksum Average Time ms cluster, hdfs, rack CDH 5
hdfs_block_checksum_op_rate Block Checksum Operations operations per second cluster, hdfs, rack CDH 5
hdfs_block_reports_avg_time Block Reports Average Time ms cluster, hdfs, rack CDH 5
hdfs_block_reports_rate Block Reports Operations operations per second cluster, hdfs, rack CDH 5
hdfs_block_verification_failures_rate Block Verification Failures failures per second cluster, hdfs, rack CDH 5
hdfs_blocks_cached_rate The total number of HDFS blocks cached over the lifetime of the process. blocks per second cluster, hdfs, rack CDH 5
hdfs_blocks_get_local_path_info_rate Blocks Get Local Path Info operations per second cluster, hdfs, rack CDH 5
hdfs_blocks_read_rate Blocks Read blocks per second cluster, hdfs, rack CDH 5
hdfs_blocks_removed_rate Blocks Removed blocks per second cluster, hdfs, rack CDH 5
hdfs_blocks_replicated_rate Blocks Replicated blocks per second cluster, hdfs, rack CDH 5
hdfs_blocks_total Blocks total blocks cluster, hdfs, rack CDH 5
hdfs_blocks_uncached_rate The total number of HDFS blocks uncached over the lifetime of the process. blocks per second cluster, hdfs, rack CDH 5
hdfs_blocks_verified_rate Blocks Verified blocks per second cluster, hdfs, rack CDH 5
hdfs_blocks_written_rate Blocks Written blocks per second cluster, hdfs, rack CDH 5
hdfs_bytes_read_rate Number of bytes read bytes per second cluster, hdfs, rack CDH 5
hdfs_bytes_written_rate Bytes Written bytes per second cluster, hdfs, rack CDH 5
hdfs_cache_reports_avg_time The average time to generate cache reports on the DataNode. ms cluster, hdfs, rack CDH 5
hdfs_cache_reports_rate The total number of generate cache reports operations on the DataNode. operations per second cluster, hdfs, rack CDH 5
hdfs_copy_block_op_avg_time Copy Block Average Time ms cluster, hdfs, rack CDH 5
hdfs_copy_block_op_rate Copy Block Operations operations per second cluster, hdfs, rack CDH 5
hdfs_datanode_namenode_connections_bad NameNode connections in a bad state connections cluster, hdfs, rack CDH 5
hdfs_datanode_namenode_connections_good NameNode connections in a good state connections cluster, hdfs, rack CDH 5
hdfs_datanode_namenode_connections_unknown NameNode connections in an unknown state connections cluster, hdfs, rack CDH 5
hdfs_delete_block_pool_avg_time Delete Block Pool Average Time ms cluster, hdfs, rack CDH 5
hdfs_delete_block_pool_rate Delete Block Pool Operations operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_failed_heartbeat_cumulative_duration_rate The cumulative time taken by all failed heartbeat registration operations. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_async_read_ops_rate The total number of flood object async read operations issued against stable storage. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_async_write_ops_rate The total number of flood object async write operations issued against stable storage. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_failed_read_cumulative_duration_rate The cumulative amount of time failed read operations on a flood object took against stable storage. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_failed_read_ops_rate The total number of times a read operation on a flood object was issued against stable storage and it failed. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_failed_write_cumulative_duration_rate The cumulative amount of time failed write operations on a flood object took against stable storage. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_failed_write_ops_rate The total number of times a write operation on a flood object was issued against stable storage and it failed. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_read_ops_rate The total number of flood object read operations issued against stable storage. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_successful_read_cumulative_duration_rate The cumulative amount of time successful read operations on a flood object took against stable storage. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_successful_read_ops_rate The total number of times a read operation on a flood object was issued against stable storage and it succeeded. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_successful_write_cumulative_duration_rate The cumulative amount of time successful write operations on a flood object took against stable storage. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_successful_write_ops_rate The total number of times a write operation on a flood object was issued against stable storage and it succeeded. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_flood_obj_write_ops_rate The total number of times a write operation on a flood object was issued against stable storage. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_flush_block_ops_rate The total number of times the flushBlock() operation was invoked. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_get_available_ops_rate The total number of times the getAvailable() operation was invoked. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_get_block_ops_rate The total number of times the getBlock() operation was invoked. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_get_block_pool_list_ops_rate The total number of times the getBlockPoolList() operation was invoked. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_get_size_ops_rate The total number of times the getSize() operation was invoked. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_get_storage_report_ops_rate The total number of times the getStorageReport() operation was invoked. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_get_used_ops_rate The total number of times the getUsed() operation was invoked. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_heartbeat_ops_rate The total number of heartbeat registration operations. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_failed_read_cumulative_duration_rate The cumulative amount of time failed read operations from page cache took against stable storage. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_failed_read_ops_rate The total number of times a read operation from page cache was issued against stable storage and it failed. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_failed_write_cumulative_duration_rate The cumulative amount of time failed write operations to page cache took against stable storage. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_failed_write_ops_rate The total number of times a write operation to page cache was issued against stable storage and it failed. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_page_evicted_rate The total number of pages that were evicted from the page cache. pages per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_read_hit_rate The total number of times a read operation was satisfied from the page cache. hits per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_read_issued_rate The total number of times a read operation was issued against stable storage. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_successful_read_cumulative_duration_rate The cumulative amount of time successful read operations from page cache took against stable storage. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_successful_read_ops_rate The total number of times a read operation from page cache was issued against stable storage and it succeeded. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_successful_write_cumulative_duration_rate The cumulative amount of time successful write operations to page cache took against stable storage. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_successful_write_ops_rate The total number of times a write operation to page cache was issued against stable storage and it succeeded. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_write_dirty_cached_rate The total number of dirtied pages that were cached at the time of the write operation. pages per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_write_dirty_init_rate The total number of dirtied pages that were not cached at the time of the write operation. The page is instantiated in the cache and written. pages per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_write_ops_rate The total number of times a write operation to page cache was issued against stable storage. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_page_cache_write_partial_page_rate The total number of partial writes into the page cache. The size of the write operation is less than the size of the page and therefore a read-modify-write is performed against stable storage. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_stat_ops_rate The total number of stat operations issued against stable storage. operations per second cluster, hdfs, rack CDH 5
hdfs_dssd_successful_heartbeat_cumulative_duration_rate The cumulative time taken by all successful heartbeat registration operations. nanos per second cluster, hdfs, rack CDH 5
hdfs_dssd_trim_ops_rate The total number of trim operations issued against stable storage. operations per second cluster, hdfs, rack CDH 5
hdfs_flush_nanos_avg_time Average Disk Flush Time nanos cluster, hdfs, rack CDH 5
hdfs_flush_nanos_rate Disk Flushes operations per second cluster, hdfs, rack CDH 5
hdfs_fsync_nanos_avg_time Average Disk Fsync Time nanos cluster, hdfs, rack CDH 5
hdfs_fsync_nanos_rate Disk Fsyncs operations per second cluster, hdfs, rack CDH 5
hdfs_fsync_rate Fsync Operations operations per second cluster, hdfs, rack CDH 5
hdfs_get_block_local_path_info_avg_time Get Block Local Path Info Average Time ms cluster, hdfs, rack CDH 5
hdfs_get_block_local_path_info_rate Get Block Local Path Info Operations operations per second cluster, hdfs, rack CDH 5
hdfs_get_hdfs_blocks_metadata_avg_time Get HDFS Blocks Metadata Average Time ms cluster, hdfs, rack CDH 5
hdfs_get_hdfs_blocks_metadata_rate Get HDFS Blocks Metadata Operations operations per second cluster, hdfs, rack CDH 5
hdfs_get_replica_visible_length_avg_time Get Replica Visible Length Average Time ms cluster, hdfs, rack CDH 5
hdfs_get_replica_visible_length_rate Get Replica Visible Length Operations operations per second cluster, hdfs, rack CDH 5
hdfs_heartbeats_avg_time Heartbeat Average Time ms cluster, hdfs, rack CDH 5
hdfs_heartbeats_rate Heartbeats operations per second cluster, hdfs, rack CDH 5
hdfs_init_replica_recovery_avg_time Init Replica Recovery Average Time ms cluster, hdfs, rack CDH 5
hdfs_init_replica_recovery_rate Init Replica Recovery Operations operations per second cluster, hdfs, rack CDH 5
hdfs_jvm_blocked_threads Blocked threads threads cluster, hdfs, rack CDH 5
hdfs_jvm_gc_rate Number of garbage collections garbage collections per second cluster, hdfs, rack CDH 5
hdfs_jvm_gc_time_ms_rate Total time spent garbage collecting. ms per second cluster, hdfs, rack CDH 5
hdfs_jvm_heap_committed_mb Total amount of committed heap memory. MB cluster, hdfs, rack CDH 5
hdfs_jvm_heap_used_mb Total amount of used heap memory. MB cluster, hdfs, rack CDH 5
hdfs_jvm_max_memory_mb Maximum allowed memory. MB cluster, hdfs, rack CDH 5
hdfs_jvm_new_threads New threads threads cluster, hdfs, rack CDH 5
hdfs_jvm_non_heap_committed_mb Total amount of committed non-heap memory. MB cluster, hdfs, rack CDH 5
hdfs_jvm_non_heap_used_mb Total amount of used non-heap memory. MB cluster, hdfs, rack CDH 5
hdfs_jvm_pause_time_rate The amount of extra time the JVM was paused above the requested sleep time. The JVM pause monitor sleeps for 500 milliseconds, and any extra time it waited above this is counted in the pause time. ms per second cluster, hdfs, rack CDH 5
hdfs_jvm_pauses_info_threshold_rate Number of JVM pauses longer than the info threshold but shorter than the warning threshold. By default, the info threshold is set to 1 second. To change it, use the configuration key JvmPauseMonitorService.info-threshold.ms. pauses per second cluster, hdfs, rack CDH 5
hdfs_jvm_pauses_warn_threshold_rate Number of JVM pauses longer than the warning threshold. By default, the warning threshold is set to 10 seconds. To change it, use the configuration key JvmPauseMonitorService.warn-threshold.ms. pauses per second cluster, hdfs, rack CDH 5
hdfs_jvm_runnable_threads Runnable threads threads cluster, hdfs, rack CDH 5
hdfs_jvm_terminated_threads Terminated threads threads cluster, hdfs, rack CDH 5
hdfs_jvm_timed_waiting_threads Timed waiting threads threads cluster, hdfs, rack CDH 5
hdfs_jvm_waiting_threads Waiting threads threads cluster, hdfs, rack CDH 5
hdfs_log_error_rate Logged Errors messages per second cluster, hdfs, rack CDH 5
hdfs_log_fatal_rate Logged Fatals messages per second cluster, hdfs, rack CDH 5
hdfs_log_info_rate Logged Infos messages per second cluster, hdfs, rack CDH 5
hdfs_log_warn_rate Logged Warnings messages per second cluster, hdfs, rack CDH 5
hdfs_login_failure_avg_time Average Failed Login Time ms cluster, hdfs, rack CDH 5
hdfs_login_failure_rate Login Failures operations per second cluster, hdfs, rack CDH 5
hdfs_login_success_avg_time Average Successful Login Time ms cluster, hdfs, rack CDH 5
hdfs_login_success_rate Login Successes operations per second cluster, hdfs, rack CDH 5
hdfs_metrics_dropped_pub_all Dropped Metrics Updates By All Sinks updates cluster, hdfs, rack CDH 5
hdfs_metrics_num_active_sinks Active Metrics Sinks Count sinks cluster, hdfs, rack CDH 5
hdfs_metrics_num_active_sources Active Metrics Sources Count sources cluster, hdfs, rack CDH 5
hdfs_metrics_num_all_sinks All Metrics Sinks Count sinks cluster, hdfs, rack CDH 5
hdfs_metrics_num_all_sources All Metrics Sources Count sources cluster, hdfs, rack CDH 5
hdfs_metrics_publish_avg_time Metrics Publish Average Time ms cluster, hdfs, rack CDH 5
hdfs_metrics_publish_rate Metrics Publish Operations operations per second cluster, hdfs, rack CDH 5
hdfs_metrics_snapshot_avg_time Metrics Snapshot Average Time ms cluster, hdfs, rack CDH 5
hdfs_metrics_snapshot_rate Metrics Snapshot Operations operations per second cluster, hdfs, rack CDH 5
hdfs_packet_ack_round_trip_time_nanos_avg_time Packet Ack Round Trip Average Time nanos cluster, hdfs, rack CDH 5
hdfs_packet_ack_round_trip_time_nanos_rate Packet Ack Round Trip Operations operations per second cluster, hdfs, rack CDH 5
hdfs_read_block_op_avg_time Read Block Average Time ms cluster, hdfs, rack CDH 5
hdfs_read_block_op_rate Read Block Operations operations per second cluster, hdfs, rack CDH 5
hdfs_reads_from_local_client_rate Reads From Local Clients operations per second cluster, hdfs, rack CDH 5
hdfs_reads_from_remote_client_rate Reads From Remote Clients operations per second cluster, hdfs, rack CDH 5
hdfs_refresh_namenodes_avg_time Refresh NameNodes Average Time ms cluster, hdfs, rack CDH 5
hdfs_refresh_namenodes_rate Refresh NameNodes Operations operations per second cluster, hdfs, rack CDH 5
hdfs_replace_block_op_avg_time Replace Block Operation Average Time ms cluster, hdfs, rack CDH 5
hdfs_replace_block_op_rate Replace Block Operations operations per second cluster, hdfs, rack CDH 5
hdfs_rpc_authentication_failures_rate RPC Authentication Failures operations per second cluster, hdfs, rack CDH 5
hdfs_rpc_authentication_successes_rate RPC Authentication Successes operations per second cluster, hdfs, rack CDH 5
hdfs_rpc_authorization_failures_rate RPC Authorization Failures operations per second cluster, hdfs, rack CDH 5
hdfs_rpc_authorization_successes_rate RPC Authorization Successes operations per second cluster, hdfs, rack CDH 5
hdfs_rpc_call_queue_length RPC Call Queue Length items cluster, hdfs, rack CDH 5
hdfs_rpc_num_open_connections Open RPC Connections connections cluster, hdfs, rack CDH 5
hdfs_rpc_processing_time_avg_time Average RPC Processing Time ms cluster, hdfs, rack CDH 5
hdfs_rpc_processing_time_rate RPCs Processed operations per second cluster, hdfs, rack CDH 5
hdfs_rpc_queue_time_avg_time Average RPC Queue Time ms cluster, hdfs, rack CDH 5
hdfs_rpc_queue_time_rate RPCs Queued operations per second cluster, hdfs, rack CDH 5
hdfs_rpc_received_bytes_rate RPC Received Bytes bytes per second cluster, hdfs, rack CDH 5
hdfs_rpc_sent_bytes_rate RPC Sent Bytes bytes per second cluster, hdfs, rack CDH 5
hdfs_send_data_packet_blocked_on_network_nanos_avg_time Send Data Packet Blocked On Network Average Time nanos cluster, hdfs, rack CDH 5
hdfs_send_data_packet_blocked_on_network_nanos_rate Send Data Packet Blocked On Network Operations operations per second cluster, hdfs, rack CDH 5
hdfs_send_data_packet_transfer_nanos_avg_time Send Data Packet Transfer Average Time nanos cluster, hdfs, rack CDH 5
hdfs_send_data_packet_transfer_nanos_rate Send Data Packet Transfer Operations operations per second cluster, hdfs, rack CDH 5
hdfs_update_replica_under_recovery_avg_time Update Replica Under Recovery Average Time ms cluster, hdfs, rack CDH 5
hdfs_update_replica_under_recovery_rate Update Replica Under Recovery Operations operations per second cluster, hdfs, rack CDH 5
hdfs_write_block_op_avg_time Write Block Average Time ms cluster, hdfs, rack CDH 5
hdfs_write_block_op_rate Write Block Operations operations per second cluster, hdfs, rack CDH 5
hdfs_writes_from_local_client_rate Writes From Local Clients operations per second cluster, hdfs, rack CDH 5
hdfs_writes_from_remote_client_rate Writes From Remote Clients operations per second cluster, hdfs, rack CDH 5
hdfs_xceivers Transceivers transceivers cluster, hdfs, rack CDH 5
web_metrics_collection_duration Web Server Responsiveness ms cluster, hdfs, rack CDH 5
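
The descriptions of hdfs_jvm_pause_time_rate, hdfs_jvm_pauses_info_threshold_rate, and hdfs_jvm_pauses_warn_threshold_rate in the table above can be read as the following rule. This is a minimal sketch, not the monitor's actual implementation: the function name classify_pause and the constant names are hypothetical, and the values are the defaults stated in the table (500 ms sleep, 1 second info threshold, 10 second warning threshold).

```python
SLEEP_MS = 500               # requested sleep time of the pause monitor
INFO_THRESHOLD_MS = 1_000    # default info threshold (1 second)
WARN_THRESHOLD_MS = 10_000   # default warning threshold (10 seconds)


def classify_pause(elapsed_ms):
    """Return (extra_ms, level) for one sleep cycle of the pause monitor.

    extra_ms is the time waited beyond the requested sleep; level is
    "warn", "info", or None when the pause is below both thresholds.
    """
    extra_ms = max(0, elapsed_ms - SLEEP_MS)
    if extra_ms > WARN_THRESHOLD_MS:
        level = "warn"   # would count toward the warn threshold metric
    elif extra_ms > INFO_THRESHOLD_MS:
        level = "info"   # would count toward the info threshold metric
    else:
        level = None     # extra time still adds to the pause time metric
    return extra_ms, level


print(classify_pause(2_000))   # (1500, 'info')
print(classify_pause(11_000))  # (10500, 'warn')
```

A cycle that takes 2,000 ms therefore contributes 1,500 ms of pause time and one info-level pause under these assumed defaults.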
Page generated July 8, 2016.