diff --git a/README.md b/README.md index f813534d..05b10f0f 100644 --- a/README.md +++ b/README.md @@ -115,191 +115,195 @@ Further Information ### Metrics -| Name | Type | Cardinality | Help | -|----------------------------------------------------------------------|------------|-------------|-----------------------------------------------------------------------------------------------------| -| elasticsearch_breakers_estimated_size_bytes | gauge | 4 | Estimated size in bytes of breaker | -| elasticsearch_breakers_limit_size_bytes | gauge | 4 | Limit size in bytes for breaker | -| elasticsearch_breakers_tripped | counter | 4 | tripped for breaker | -| elasticsearch_cluster_health_active_primary_shards | gauge | 1 | The number of primary shards in your cluster. This is an aggregate total across all indices. | -| elasticsearch_cluster_health_active_shards | gauge | 1 | Aggregate total of all shards across all indices, which includes replica shards. | -| elasticsearch_cluster_health_delayed_unassigned_shards | gauge | 1 | Shards delayed to reduce reallocation overhead | -| elasticsearch_cluster_health_initializing_shards | gauge | 1 | Count of shards that are being freshly created. | -| elasticsearch_cluster_health_number_of_data_nodes | gauge | 1 | Number of data nodes in the cluster. | -| elasticsearch_cluster_health_number_of_in_flight_fetch | gauge | 1 | The number of ongoing shard info requests. | -| elasticsearch_cluster_health_number_of_nodes | gauge | 1 | Number of nodes in the cluster. | -| elasticsearch_cluster_health_number_of_pending_tasks | gauge | 1 | Cluster level changes which have not yet been executed | -| elasticsearch_cluster_health_task_max_waiting_in_queue_millis | gauge | 1 | Max time in millis that a task is waiting in queue. | -| elasticsearch_cluster_health_relocating_shards | gauge | 1 | The number of shards that are currently moving from one node to another node. | -| elasticsearch_cluster_health_status | gauge | 3 | Whether all primary and replica shards are allocated. | -| elasticsearch_cluster_health_unassigned_shards | gauge | 1 | The number of shards that exist in the cluster state, but cannot be found in the cluster itself. | -| elasticsearch_clustersettings_stats_max_shards_per_node | gauge | 0 | Current maximum number of shards per node setting. | -| elasticsearch_clustersettings_allocation_threshold_enabled | gauge | 0 | Is disk allocation decider enabled. | -| elasticsearch_clustersettings_allocation_watermark_flood_stage_bytes | gauge | 0 | Flood stage watermark as in bytes. | -| elasticsearch_clustersettings_allocation_watermark_high_bytes | gauge | 0 | High watermark for disk usage in bytes. | -| elasticsearch_clustersettings_allocation_watermark_low_bytes | gauge | 0 | Low watermark for disk usage in bytes. | -| elasticsearch_clustersettings_allocation_watermark_flood_stage_ratio | gauge | 0 | Flood stage watermark as a ratio. | -| elasticsearch_clustersettings_allocation_watermark_high_ratio | gauge | 0 | High watermark for disk usage as a ratio. | -| elasticsearch_clustersettings_allocation_watermark_low_ratio | gauge | 0 | Low watermark for disk usage as a ratio. | -| elasticsearch_filesystem_data_available_bytes | gauge | 1 | Available space on block device in bytes | -| elasticsearch_filesystem_data_free_bytes | gauge | 1 | Free space on block device in bytes | -| elasticsearch_filesystem_data_size_bytes | gauge | 1 | Size of block device in bytes | -| elasticsearch_filesystem_io_stats_device_operations_count | gauge | 1 | Count of disk operations | -| elasticsearch_filesystem_io_stats_device_read_operations_count | gauge | 1 | Count of disk read operations | -| elasticsearch_filesystem_io_stats_device_write_operations_count | gauge | 1 | Count of disk write operations | -| elasticsearch_filesystem_io_stats_device_read_size_kilobytes_sum | gauge | 1 | Total kilobytes read from disk | -| elasticsearch_filesystem_io_stats_device_write_size_kilobytes_sum | gauge | 1 | Total kilobytes written to disk | -| elasticsearch_ilm_status | gauge | 1 | Current status of ILM. Status can be `STOPPED`, `RUNNING`, `STOPPING`. | -| elasticsearch_ilm_index_status | gauge | 4 | Status of ILM policy for index | -| elasticsearch_indices_active_queries | gauge | 1 | The number of currently active queries | -| elasticsearch_indices_docs | gauge | 1 | Count of documents on this node | -| elasticsearch_indices_docs_deleted | gauge | 1 | Count of deleted documents on this node | -| elasticsearch_indices_deleted_docs_primary | gauge | 1 | Count of deleted documents with only primary shards | -| elasticsearch_indices_docs_primary | gauge | 1 | Count of documents with only primary shards on all nodes | -| elasticsearch_indices_docs_total | gauge | | Count of documents with shards on all nodes | -| elasticsearch_indices_fielddata_evictions | counter | 1 | Evictions from field data | -| elasticsearch_indices_fielddata_memory_size_bytes | gauge | 1 | Field data cache memory usage in bytes | -| elasticsearch_indices_filter_cache_evictions | counter | 1 | Evictions from filter cache | -| elasticsearch_indices_filter_cache_memory_size_bytes | gauge | 1 | Filter cache memory usage in bytes | -| elasticsearch_indices_flush_time_seconds | counter | 1 | Cumulative flush time in seconds | -| elasticsearch_indices_flush_total | counter | 1 | Total flushes | -| elasticsearch_indices_get_exists_time_seconds | counter | 1 | Total time get exists in seconds | -| elasticsearch_indices_get_exists_total | counter | 1 | Total get exists operations | -| elasticsearch_indices_get_missing_time_seconds | counter | 1 | Total time of get missing in seconds | -| elasticsearch_indices_get_missing_total | counter | 1 | Total get missing | -| elasticsearch_indices_get_time_seconds | counter | 1 | Total get time in seconds | -| elasticsearch_indices_get_total | counter | 1 | Total get | -| elasticsearch_indices_indexing_delete_time_seconds_total | counter | 1 | Total time indexing delete in seconds | -| elasticsearch_indices_indexing_delete_total | counter | 1 | Total indexing deletes | -| elasticsearch_indices_index_current | gauge | 1 | The number of documents currently being indexed to an index | -| elasticsearch_indices_indexing_index_time_seconds_total | counter | 1 | Cumulative index time in seconds | -| elasticsearch_indices_indexing_index_total | counter | 1 | Total index calls | -| elasticsearch_indices_mappings_stats_fields | gauge | 1 | Count of fields currently mapped by index | -| elasticsearch_indices_mappings_stats_json_parse_failures_total | counter | 0 | Number of errors while parsing JSON | -| elasticsearch_indices_mappings_stats_scrapes_total | counter | 0 | Current total Elasticsearch Indices Mappings scrapes | -| elasticsearch_indices_mappings_stats_up | gauge | 0 | Was the last scrape of the Elasticsearch Indices Mappings endpoint successful | -| elasticsearch_indices_merges_docs_total | counter | 1 | Cumulative docs merged | -| elasticsearch_indices_merges_total | counter | 1 | Total merges | -| elasticsearch_indices_merges_total_size_bytes_total | counter | 1 | Total merge size in bytes | -| elasticsearch_indices_merges_total_time_seconds_total | counter | 1 | Total time spent merging in seconds | -| elasticsearch_indices_query_cache_cache_total | counter | 1 | Count of query cache | -| elasticsearch_indices_query_cache_cache_size | gauge | 1 | Size of query cache | -| elasticsearch_indices_query_cache_count | counter | 2 | Count of query cache hit/miss | -| elasticsearch_indices_query_cache_evictions | counter | 1 | Evictions from query cache | -| elasticsearch_indices_query_cache_memory_size_bytes | gauge | 1 | Query cache memory usage in bytes | -| elasticsearch_indices_query_cache_total | counter | 1 | Size of query cache total | -| elasticsearch_indices_refresh_time_seconds_total | counter | 1 | Total time spent refreshing in seconds | -| elasticsearch_indices_refresh_total | counter | 1 | Total refreshes | -| elasticsearch_indices_request_cache_count | counter | 2 | Count of request cache hit/miss | -| elasticsearch_indices_request_cache_evictions | counter | 1 | Evictions from request cache | -| elasticsearch_indices_request_cache_memory_size_bytes | gauge | 1 | Request cache memory usage in bytes | -| elasticsearch_indices_search_fetch_time_seconds | counter | 1 | Total search fetch time in seconds | -| elasticsearch_indices_search_fetch_total | counter | 1 | Total number of fetches | -| elasticsearch_indices_search_query_time_seconds | counter | 1 | Total search query time in seconds | -| elasticsearch_indices_search_query_total | counter | 1 | Total number of queries | -| elasticsearch_indices_segments_count | gauge | 1 | Count of index segments on this node | -| elasticsearch_indices_segments_memory_bytes | gauge | 1 | Current memory size of segments in bytes | -| elasticsearch_indices_settings_creation_timestamp_seconds | gauge | 1 | Timestamp of the index creation in seconds | -| elasticsearch_indices_settings_stats_read_only_indices | gauge | 1 | Count of indices that have read_only_allow_delete=true | -| elasticsearch_indices_settings_total_fields | gauge | | Index setting value for index.mapping.total_fields.limit (total allowable mapped fields in a index) | -| elasticsearch_indices_settings_replicas | gauge | | Index setting value for index.replicas | -| elasticsearch_indices_shards_docs | gauge | 3 | Count of documents on this shard | -| elasticsearch_indices_shards_docs_deleted | gauge | 3 | Count of deleted documents on each shard | -| elasticsearch_indices_store_size_bytes | gauge | 1 | Current size of stored index data in bytes | -| elasticsearch_indices_store_size_bytes_primary | gauge | | Current size of stored index data in bytes with only primary shards on all nodes | -| elasticsearch_indices_store_size_bytes_total | gauge | | Current size of stored index data in bytes with all shards on all nodes | -| elasticsearch_indices_store_throttle_time_seconds_total | counter | 1 | Throttle time for index store in seconds | -| elasticsearch_indices_translog_operations | counter | 1 | Total translog operations | -| elasticsearch_indices_translog_size_in_bytes | counter | 1 | Total translog size in bytes | -| elasticsearch_indices_warmer_time_seconds_total | counter | 1 | Total warmer time in seconds | -| elasticsearch_indices_warmer_total | counter | 1 | Total warmer count | -| elasticsearch_jvm_gc_collection_seconds_count | counter | 2 | Count of JVM GC runs | -| elasticsearch_jvm_gc_collection_seconds_sum | counter | 2 | GC run time in seconds | -| elasticsearch_jvm_memory_committed_bytes | gauge | 2 | JVM memory currently committed by area | -| elasticsearch_jvm_memory_max_bytes | gauge | 1 | JVM memory max | -| elasticsearch_jvm_memory_used_bytes | gauge | 2 | JVM memory currently used by area | -| elasticsearch_jvm_memory_pool_used_bytes | gauge | 3 | JVM memory currently used by pool | -| elasticsearch_jvm_memory_pool_max_bytes | counter | 3 | JVM memory max by pool | -| elasticsearch_jvm_memory_pool_peak_used_bytes | counter | 3 | JVM memory peak used by pool | -| elasticsearch_jvm_memory_pool_peak_max_bytes | counter | 3 | JVM memory peak max by pool | -| elasticsearch_os_cpu_percent | gauge | 1 | Percent CPU used by the OS | -| elasticsearch_os_load1 | gauge | 1 | Shortterm load average | -| elasticsearch_os_load5 | gauge | 1 | Midterm load average | -| elasticsearch_os_load15 | gauge | 1 | Longterm load average | -| elasticsearch_process_cpu_percent | gauge | 1 | Percent CPU used by process | -| elasticsearch_process_cpu_seconds_total | counter | 1 | Process CPU time in seconds | -| elasticsearch_process_mem_resident_size_bytes | gauge | 1 | Resident memory in use by process in bytes | -| elasticsearch_process_mem_share_size_bytes | gauge | 1 | Shared memory in use by process in bytes | -| elasticsearch_process_mem_virtual_size_bytes | gauge | 1 | Total virtual memory used in bytes | -| elasticsearch_process_open_files_count | gauge | 1 | Open file descriptors | -| elasticsearch_snapshot_stats_number_of_snapshots | gauge | 1 | Total number of snapshots | -| elasticsearch_snapshot_stats_oldest_snapshot_timestamp | gauge | 1 | Oldest snapshot timestamp | -| elasticsearch_snapshot_stats_snapshot_start_time_timestamp | gauge | 1 | Last snapshot start timestamp | -| elasticsearch_snapshot_stats_latest_snapshot_timestamp_seconds | gauge | 1 | Timestamp of the latest SUCCESS or PARTIAL snapshot | -| elasticsearch_snapshot_stats_snapshot_end_time_timestamp | gauge | 1 | Last snapshot end timestamp | -| elasticsearch_snapshot_stats_snapshot_number_of_failures | gauge | 1 | Last snapshot number of failures | -| elasticsearch_snapshot_stats_snapshot_number_of_indices | gauge | 1 | Last snapshot number of indices | -| elasticsearch_snapshot_stats_snapshot_failed_shards | gauge | 1 | Last snapshot failed shards | -| elasticsearch_snapshot_stats_snapshot_successful_shards | gauge | 1 | Last snapshot successful shards | -| elasticsearch_snapshot_stats_snapshot_total_shards | gauge | 1 | Last snapshot total shard | -| elasticsearch_thread_pool_active_count | gauge | 14 | Thread Pool threads active | -| elasticsearch_thread_pool_completed_count | counter | 14 | Thread Pool operations completed | -| elasticsearch_thread_pool_largest_count | gauge | 14 | Thread Pool largest threads count | -| elasticsearch_thread_pool_queue_count | gauge | 14 | Thread Pool operations queued | -| elasticsearch_thread_pool_rejected_count | counter | 14 | Thread Pool operations rejected | -| elasticsearch_thread_pool_threads_count | gauge | 14 | Thread Pool current threads count | -| elasticsearch_transport_rx_packets_total | counter | 1 | Count of packets received | -| elasticsearch_transport_rx_size_bytes_total | counter | 1 | Total number of bytes received | -| elasticsearch_transport_tx_packets_total | counter | 1 | Count of packets sent | -| elasticsearch_transport_tx_size_bytes_total | counter | 1 | Total number of bytes sent | -| elasticsearch_clusterinfo_last_retrieval_success_ts | gauge | 1 | Timestamp of the last successful cluster info retrieval | -| elasticsearch_clusterinfo_up | gauge | 1 | Up metric for the cluster info collector | -| elasticsearch_clusterinfo_version_info | gauge | 6 | Constant metric with ES version information as labels | -| elasticsearch_slm_stats_up | gauge | 0 | Up metric for SLM collector | -| elasticsearch_slm_stats_total_scrapes | counter | 0 | Number of scrapes for SLM collector | -| elasticsearch_slm_stats_json_parse_failures | counter | 0 | JSON parse failures for SLM collector | -| elasticsearch_slm_stats_retention_runs_total | counter | 0 | Total retention runs | -| elasticsearch_slm_stats_retention_failed_total | counter | 0 | Total failed retention runs | -| elasticsearch_slm_stats_retention_timed_out_total | counter | 0 | Total retention run timeouts | -| elasticsearch_slm_stats_retention_deletion_time_seconds | gauge | 0 | Retention run deletion time | -| elasticsearch_slm_stats_total_snapshots_taken_total | counter | 0 | Total snapshots taken | -| elasticsearch_slm_stats_total_snapshots_failed_total | counter | 0 | Total snapshots failed | -| elasticsearch_slm_stats_total_snapshots_deleted_total | counter | 0 | Total snapshots deleted | -| elasticsearch_slm_stats_total_snapshots_failed_total | counter | 0 | Total snapshots failed | -| elasticsearch_slm_stats_snapshots_taken_total | counter | 1 | Snapshots taken by policy | -| elasticsearch_slm_stats_snapshots_failed_total | counter | 1 | Snapshots failed by policy | -| elasticsearch_slm_stats_snapshots_deleted_total | counter | 1 | Snapshots deleted by policy | -| elasticsearch_slm_stats_snapshot_deletion_failures_total | counter | 1 | Snapshot deletion failures by policy | -| elasticsearch_slm_stats_operation_mode | gauge | 1 | SLM operation mode (Running, stopping, stopped) | -| elasticsearch_data_stream_stats_up | gauge | 0 | Up metric for Data Stream collection | -| elasticsearch_data_stream_stats_total_scrapes | counter | 0 | Total scrapes for Data Stream stats | -| elasticsearch_data_stream_stats_json_parse_failures | counter | 0 | Number of parsing failures for Data Stream stats | -| elasticsearch_data_stream_backing_indices_total | gauge | 1 | Number of backing indices for Data Stream | -| elasticsearch_data_stream_store_size_bytes | gauge | 1 | Current size of data stream backing indices in bytes | -| elasticsearch_health_report_creating_primaries | gauge | 1 | The number of creating primary shards | -| elasticsearch_health_report_creating_replicas | gauge | 1 | The number of creating replica shards | -| elasticsearch_health_report_data_stream_lifecycle_status | gauge | 2 | Data stream lifecycle status | -| elasticsearch_health_report_disk_status | gauge | 2 | disk status | -| elasticsearch_health_report_ilm_policies | gauge | 1 | The number of ILM Policies | -| elasticsearch_health_report_ilm_stagnating_indices | gauge | 1 | The number of stagnating indices | -| elasticsearch_health_report_ilm_status | gauge | 2 | ILM status | -| elasticsearch_health_report_initializing_primaries | gauge | 1 | The number of initializing primary shards | -| elasticsearch_health_report_initializing_replicas | gauge | 1 | The number of initializing replica shards | -| elasticsearch_health_report_master_is_stable_status | gauge | 2 | Master is stable status | -| elasticsearch_health_report_max_shards_in_cluster_data | gauge | 1 | The number of maximum shards in a cluster | -| elasticsearch_health_report_max_shards_in_cluster_frozen | gauge | 1 | The number of maximum frozen shards in a cluster | -| elasticsearch_health_report_repository_integrity_status | gauge | 2 | Repository integrity status | -| elasticsearch_health_report_restarting_primaries | gauge | 1 | The number of restarting primary shards | -| elasticsearch_health_report_restarting_replicas | gauge | 1 | The number of restarting replica shards | -| elasticsearch_health_report_shards_availabilty_status | gauge | 2 | Shards availabilty status | -| elasticsearch_health_report_shards_capacity_status | gauge | 2 | Shards capacity status | -| elasticsearch_health_report_slm_policies | gauge | 1 | The number of SLM policies | -| elasticsearch_health_report_slm_status | gauge | 2 | SLM status | -| elasticsearch_health_report_started_primaries | gauge | 1 | The number of started primary shards | -| elasticsearch_health_report_started_replicas | gauge | 1 | The number of started replica shards | -| elasticsearch_health_report_status | gauge | 2 | Overall cluster status | -| elasticsearch_health_report_total_repositories | gauge | 1 | The number snapshot repositories | -| elasticsearch_health_report_unassigned_primaries | gauge | 1 | The number of unassigned primary shards | -| elasticsearch_health_report_unassigned_replicas | gauge | 1 | The number of unassigned replica shards | +| Name | Type | Cardinality | Help | +|----------------------------------------------------------------------|-----------|-------------|-----------------------------------------------------------------------------------------------------| +| elasticsearch_breakers_estimated_size_bytes | gauge | 4 | Estimated size in bytes of breaker | +| elasticsearch_breakers_limit_size_bytes | gauge | 4 | Limit size in bytes for breaker | +| elasticsearch_breakers_tripped | counter | 4 | tripped for breaker | +| elasticsearch_cluster_health_active_primary_shards | gauge | 1 | The number of primary shards in your cluster. This is an aggregate total across all indices. | +| elasticsearch_cluster_health_active_shards | gauge | 1 | Aggregate total of all shards across all indices, which includes replica shards. | +| elasticsearch_cluster_health_delayed_unassigned_shards | gauge | 1 | Shards delayed to reduce reallocation overhead | +| elasticsearch_cluster_health_initializing_shards | gauge | 1 | Count of shards that are being freshly created. | +| elasticsearch_cluster_health_number_of_data_nodes | gauge | 1 | Number of data nodes in the cluster. | +| elasticsearch_cluster_health_number_of_in_flight_fetch | gauge | 1 | The number of ongoing shard info requests. | +| elasticsearch_cluster_health_number_of_nodes | gauge | 1 | Number of nodes in the cluster. | +| elasticsearch_cluster_health_number_of_pending_tasks | gauge | 1 | Cluster level changes which have not yet been executed | +| elasticsearch_cluster_health_task_max_waiting_in_queue_millis | gauge | 1 | Max time in millis that a task is waiting in queue. | +| elasticsearch_cluster_health_relocating_shards | gauge | 1 | The number of shards that are currently moving from one node to another node. | +| elasticsearch_cluster_health_status | gauge | 3 | Whether all primary and replica shards are allocated. | +| elasticsearch_cluster_health_unassigned_shards | gauge | 1 | The number of shards that exist in the cluster state, but cannot be found in the cluster itself. | +| elasticsearch_clustersettings_stats_max_shards_per_node | gauge | 0 | Current maximum number of shards per node setting. | +| elasticsearch_clustersettings_allocation_threshold_enabled | gauge | 0 | Is disk allocation decider enabled. | +| elasticsearch_clustersettings_allocation_watermark_flood_stage_bytes | gauge | 0 | Flood stage watermark as in bytes. | +| elasticsearch_clustersettings_allocation_watermark_high_bytes | gauge | 0 | High watermark for disk usage in bytes. | +| elasticsearch_clustersettings_allocation_watermark_low_bytes | gauge | 0 | Low watermark for disk usage in bytes. | +| elasticsearch_clustersettings_allocation_watermark_flood_stage_ratio | gauge | 0 | Flood stage watermark as a ratio. | +| elasticsearch_clustersettings_allocation_watermark_high_ratio | gauge | 0 | High watermark for disk usage as a ratio. | +| elasticsearch_clustersettings_allocation_watermark_low_ratio | gauge | 0 | Low watermark for disk usage as a ratio. | +| elasticsearch_filesystem_data_available_bytes | gauge | 1 | Available space on block device in bytes | +| elasticsearch_filesystem_data_free_bytes | gauge | 1 | Free space on block device in bytes | +| elasticsearch_filesystem_data_size_bytes | gauge | 1 | Size of block device in bytes | +| elasticsearch_filesystem_io_stats_device_operations_count | gauge | 1 | Count of disk operations | +| elasticsearch_filesystem_io_stats_device_read_operations_count | gauge | 1 | Count of disk read operations | +| elasticsearch_filesystem_io_stats_device_write_operations_count | gauge | 1 | Count of disk write operations | +| elasticsearch_filesystem_io_stats_device_read_size_kilobytes_sum | gauge | 1 | Total kilobytes read from disk | +| elasticsearch_filesystem_io_stats_device_write_size_kilobytes_sum | gauge | 1 | Total kilobytes written to disk | +| elasticsearch_ilm_status | gauge | 1 | Current status of ILM. Status can be `STOPPED`, `RUNNING`, `STOPPING`. | +| elasticsearch_ilm_index_status | gauge | 4 | Status of ILM policy for index | +| elasticsearch_indices_active_queries | gauge | 1 | The number of currently active queries | +| elasticsearch_indices_docs | gauge | 1 | Count of documents on this node | +| elasticsearch_indices_docs_deleted | gauge | 1 | Count of deleted documents on this node | +| elasticsearch_indices_deleted_docs_primary | gauge | 1 | Count of deleted documents with only primary shards | +| elasticsearch_indices_docs_primary | gauge | 1 | Count of documents with only primary shards on all nodes | +| elasticsearch_indices_docs_total | gauge | | Count of documents with shards on all nodes | +| elasticsearch_indices_fielddata_evictions | counter | 1 | Evictions from field data | +| elasticsearch_indices_fielddata_memory_size_bytes | gauge | 1 | Field data cache memory usage in bytes | +| elasticsearch_indices_filter_cache_evictions | counter | 1 | Evictions from filter cache | +| elasticsearch_indices_filter_cache_memory_size_bytes | gauge | 1 | Filter cache memory usage in bytes | +| elasticsearch_indices_flush_time_seconds | counter | 1 | Cumulative flush time in seconds | +| elasticsearch_indices_flush_total | counter | 1 | Total flushes | +| elasticsearch_indices_get_exists_time_seconds | counter | 1 | Total time get exists in seconds | +| elasticsearch_indices_get_exists_total | counter | 1 | Total get exists operations | +| elasticsearch_indices_get_missing_time_seconds | counter | 1 | Total time of get missing in seconds | +| elasticsearch_indices_get_missing_total | counter | 1 | Total get missing | +| elasticsearch_indices_get_time_seconds | counter | 1 | Total get time in seconds | +| elasticsearch_indices_get_total | counter | 1 | Total get | +| elasticsearch_indices_indexing_delete_time_seconds_total | counter | 1 | Total time indexing delete in seconds | +| elasticsearch_indices_indexing_delete_total | counter | 1 | Total indexing deletes | +| elasticsearch_indices_index_current | gauge | 1 | The number of documents currently being indexed to an index | +| elasticsearch_indices_indexing_index_time_seconds_total | counter | 1 | Cumulative index time in seconds | +| elasticsearch_indices_indexing_index_total | counter | 1 | Total index calls | +| elasticsearch_indices_mappings_stats_fields | gauge | 1 | Count of fields currently mapped by index | +| elasticsearch_indices_mappings_stats_json_parse_failures_total | counter | 0 | Number of errors while parsing JSON | +| elasticsearch_indices_mappings_stats_scrapes_total | counter | 0 | Current total Elasticsearch Indices Mappings scrapes | +| elasticsearch_indices_mappings_stats_up | gauge | 0 | Was the last scrape of the Elasticsearch Indices Mappings endpoint successful | +| elasticsearch_indices_merges_docs_total | counter | 1 | Cumulative docs merged | +| elasticsearch_indices_merges_total | counter | 1 | Total merges | +| elasticsearch_indices_merges_total_size_bytes_total | counter | 1 | Total merge size in bytes | +| elasticsearch_indices_merges_total_time_seconds_total | counter | 1 | Total time spent merging in seconds | +| elasticsearch_indices_query_cache_cache_total | counter | 1 | Count of query cache | +| elasticsearch_indices_query_cache_cache_size | gauge | 1 | Size of query cache | +| elasticsearch_indices_query_cache_count | counter | 2 | Count of query cache hit/miss | +| elasticsearch_indices_query_cache_evictions | counter | 1 | Evictions from query cache | +| elasticsearch_indices_query_cache_memory_size_bytes | gauge | 1 | Query cache memory usage in bytes | +| elasticsearch_indices_query_cache_total | counter | 1 | Size of query cache total | +| elasticsearch_indices_refresh_time_seconds_total | counter | 1 | Total time spent refreshing in seconds | +| elasticsearch_indices_refresh_total | counter | 1 | Total refreshes | +| elasticsearch_indices_request_cache_count | counter | 2 | Count of request cache hit/miss | +| elasticsearch_indices_request_cache_evictions | counter | 1 | Evictions from request cache | +| elasticsearch_indices_request_cache_memory_size_bytes | gauge | 1 | Request cache memory usage in bytes | +| elasticsearch_indices_search_fetch_time_seconds | counter | 1 | Total search fetch time in seconds | +| elasticsearch_indices_search_fetch_total | counter | 1 | Total number of fetches | +| elasticsearch_indices_search_query_time_seconds | counter | 1 | Total search query time in seconds | +| elasticsearch_indices_search_query_total | counter | 1 | Total number of queries | +| elasticsearch_indices_segments_count | gauge | 1 | Count of index segments on this node | +| elasticsearch_indices_segments_memory_bytes | gauge | 1 | Current memory size of segments in bytes | +| elasticsearch_indices_settings_creation_timestamp_seconds | gauge | 1 | Timestamp of the index creation in seconds | +| elasticsearch_indices_settings_stats_read_only_indices | gauge | 1 | Count of indices that have read_only_allow_delete=true | +| elasticsearch_indices_settings_total_fields | gauge | | Index setting value for index.mapping.total_fields.limit (total allowable mapped fields in a index) | +| elasticsearch_indices_settings_replicas | gauge | | Index setting value for index.replicas | +| elasticsearch_indices_shards_docs | gauge | 3 | Count of documents on this shard | +| elasticsearch_indices_shards_docs_deleted | gauge | 3 | Count of deleted documents on each shard | +| elasticsearch_indices_store_size_bytes | gauge | 1 | Current size of stored index data in bytes | +| elasticsearch_indices_store_size_bytes_primary | gauge | | Current size of stored index data in bytes with only primary shards on all nodes | +| elasticsearch_indices_store_size_bytes_total | gauge | | Current size of stored index data in bytes with all shards on all nodes | +| elasticsearch_indices_store_throttle_time_seconds_total | counter | 1 | Throttle time for index store in seconds | +| elasticsearch_indices_translog_operations | counter | 1 | Total translog operations | +| elasticsearch_indices_translog_size_in_bytes | counter | 1 | Total translog size in bytes | +| elasticsearch_indices_warmer_time_seconds_total | counter | 1 | Total warmer time in seconds | +| elasticsearch_indices_warmer_total | counter | 1 | Total warmer count | +| elasticsearch_jvm_gc_collection_seconds_count | counter | 2 | Count of JVM GC runs | +| elasticsearch_jvm_gc_collection_seconds_sum | counter | 2 | GC run time in seconds | +| elasticsearch_jvm_memory_committed_bytes | gauge | 2 | JVM memory currently committed by area | +| elasticsearch_jvm_memory_max_bytes | gauge | 1 | JVM memory max | +| elasticsearch_jvm_memory_used_bytes | gauge | 2 | JVM memory currently used by area | +| elasticsearch_jvm_memory_pool_used_bytes | gauge | 3 | JVM memory currently used by pool | +| elasticsearch_jvm_memory_pool_max_bytes | counter | 3 | JVM memory max by pool | +| elasticsearch_jvm_memory_pool_peak_used_bytes | counter | 3 | JVM memory peak used by pool | +| elasticsearch_jvm_memory_pool_peak_max_bytes | counter | 3 | JVM memory peak max by pool | +| elasticsearch_os_cpu_percent | gauge | 1 | Percent CPU used by the OS | +| elasticsearch_os_load1 | gauge | 1 | Shortterm load average | +| elasticsearch_os_load5 | gauge | 1 | Midterm load average | +| elasticsearch_os_load15 | gauge | 1 | Longterm load average | +| elasticsearch_process_cpu_percent | gauge | 1 | Percent CPU used by process | +| elasticsearch_process_cpu_seconds_total | counter | 1 | Process CPU time in seconds | +| elasticsearch_process_mem_resident_size_bytes | gauge | 1 | Resident memory in use by process in bytes | +| elasticsearch_process_mem_share_size_bytes | gauge | 1 | Shared memory in use by process in bytes | +| elasticsearch_process_mem_virtual_size_bytes | gauge | 1 | Total virtual memory used in bytes | +| elasticsearch_process_open_files_count | gauge | 1 | Open file descriptors | +| elasticsearch_snapshot_stats_number_of_snapshots | gauge | 1 | Total number of snapshots | +| elasticsearch_snapshot_stats_oldest_snapshot_timestamp | gauge | 1 | Oldest snapshot timestamp | +| elasticsearch_snapshot_stats_snapshot_start_time_timestamp | gauge | 1 | Last snapshot start timestamp | +| elasticsearch_snapshot_stats_latest_snapshot_timestamp_seconds | gauge | 1 | Timestamp of the latest SUCCESS or PARTIAL snapshot | +| elasticsearch_snapshot_stats_snapshot_end_time_timestamp | gauge | 1 | Last snapshot end timestamp | +| elasticsearch_snapshot_stats_snapshot_number_of_failures | gauge | 1 | Last snapshot number of failures | +| elasticsearch_snapshot_stats_snapshot_number_of_indices | gauge | 1 | Last snapshot number of indices | +| elasticsearch_snapshot_stats_snapshot_failed_shards | gauge | 1 | Last snapshot failed shards | +| elasticsearch_snapshot_stats_snapshot_successful_shards | gauge | 1 | Last snapshot successful shards | +| elasticsearch_snapshot_stats_snapshot_total_shards | gauge | 1 | Last snapshot total shard | +| elasticsearch_thread_pool_active_count | gauge | 14 | Thread Pool threads active | +| elasticsearch_thread_pool_completed_count | counter | 14 | Thread Pool operations completed | +| elasticsearch_thread_pool_largest_count | gauge | 14 | Thread Pool largest threads count | +| elasticsearch_thread_pool_queue_count | gauge | 14 | Thread Pool operations queued | +| elasticsearch_thread_pool_rejected_count | counter | 14 | Thread Pool operations rejected | +| elasticsearch_thread_pool_threads_count | gauge | 14 | Thread Pool current threads count | +| elasticsearch_transport_rx_packets_total | counter | 1 | Count of packets received | +| elasticsearch_transport_rx_size_bytes_total | counter | 1 | Total number of bytes received | +| elasticsearch_transport_tx_packets_total | counter | 1 | Count of packets sent | +| elasticsearch_transport_tx_size_bytes_total | counter | 1 | Total number of bytes sent | +| elasticsearch_clusterinfo_last_retrieval_success_ts | gauge | 1 | Timestamp of the last successful cluster info retrieval | +| elasticsearch_clusterinfo_up | gauge | 1 | Up metric for the cluster info collector | +| elasticsearch_clusterinfo_version_info | gauge | 6 | Constant metric with ES version information as labels | +| elasticsearch_slm_stats_up | gauge | 0 | Up metric for SLM collector | +| elasticsearch_slm_stats_total_scrapes | counter | 0 | Number of scrapes for SLM collector | +| elasticsearch_slm_stats_json_parse_failures | counter | 0 | JSON parse failures for SLM collector | +| elasticsearch_slm_stats_retention_runs_total | counter | 0 | Total retention runs | +| elasticsearch_slm_stats_retention_failed_total | counter | 0 | Total failed retention runs | +| elasticsearch_slm_stats_retention_timed_out_total | counter | 0 | Total retention run timeouts | +| elasticsearch_slm_stats_retention_deletion_time_seconds | gauge | 0 | Retention run deletion time | +| elasticsearch_slm_stats_total_snapshots_taken_total | counter | 0 | Total snapshots taken | +| elasticsearch_slm_stats_total_snapshots_failed_total | counter | 0 | Total snapshots failed | +| elasticsearch_slm_stats_total_snapshots_deleted_total | counter | 0 | Total snapshots deleted | +| elasticsearch_slm_stats_total_snapshots_failed_total | counter | 0 | Total snapshots failed | +| elasticsearch_slm_stats_snapshots_taken_total | counter | 1 | Snapshots taken by policy | +| elasticsearch_slm_stats_snapshots_failed_total | counter | 1 | Snapshots failed by policy | +| elasticsearch_slm_stats_snapshots_deleted_total | counter | 1 | Snapshots deleted by policy | +| elasticsearch_slm_stats_snapshot_deletion_failures_total | counter | 1 | Snapshot deletion failures by policy | +| elasticsearch_slm_stats_operation_mode | gauge | 1 | SLM operation mode (Running, stopping, stopped) | +| elasticsearch_data_stream_stats_up | gauge | 0 | Up metric for Data Stream collection | +| elasticsearch_data_stream_stats_total_scrapes | counter | 0 | Total scrapes for Data Stream stats | +| elasticsearch_data_stream_stats_json_parse_failures | counter | 0 | Number of parsing failures for Data Stream stats | +| elasticsearch_data_stream_backing_indices_total | gauge | 1 | Number of backing indices for Data Stream | +| elasticsearch_data_stream_store_size_bytes | gauge | 1 | Current size of data stream backing indices in bytes | +| elasticsearch_health_report_creating_primaries | gauge | 1 | The number of creating primary shards | +| elasticsearch_health_report_creating_replicas | gauge | 1 | The number of creating replica shards | +| elasticsearch_health_report_data_stream_lifecycle_status | gauge | 2 | Data stream lifecycle status | +| elasticsearch_health_report_disk_status | gauge | 2 | disk status | +| elasticsearch_health_report_ilm_policies | gauge | 1 | The number of ILM Policies | +| elasticsearch_health_report_ilm_stagnating_indices | gauge | 1 | The number of stagnating indices | +| elasticsearch_health_report_ilm_status | gauge | 2 | ILM status | +| elasticsearch_health_report_initializing_primaries | gauge | 1 | The number of initializing primary shards | +| elasticsearch_health_report_initializing_replicas | gauge | 1 | The number of initializing replica shards | +| elasticsearch_health_report_master_is_stable_status | gauge | 2 | Master is stable status | +| elasticsearch_health_report_max_shards_in_cluster_data | gauge | 1 | The number of maximum shards in a cluster | +| elasticsearch_health_report_max_shards_in_cluster_frozen | gauge | 1 | The number of maximum frozen shards in a cluster | +| elasticsearch_health_report_repository_integrity_status | gauge | 2 | Repository integrity status | +| elasticsearch_health_report_restarting_primaries | gauge | 1 | The number of restarting primary shards | +| elasticsearch_health_report_restarting_replicas | gauge | 1 | The number of restarting replica shards | +| elasticsearch_health_report_shards_availabilty_status | gauge | 2 | Shards availabilty status | +| elasticsearch_health_report_shards_capacity_status | gauge | 2 | Shards capacity status | +| elasticsearch_health_report_slm_policies | gauge | 1 | The number of SLM policies | +| elasticsearch_health_report_slm_status | gauge | 2 | SLM status | +| elasticsearch_health_report_started_primaries | gauge | 1 | The number of started primary shards | +| elasticsearch_health_report_started_replicas | gauge | 1 | The number of started replica shards | +| elasticsearch_health_report_status | gauge | 2 | Overall cluster status | +| elasticsearch_health_report_total_repositories | gauge | 1 | The number snapshot repositories | +| elasticsearch_health_report_unassigned_primaries | gauge | 1 | The number of unassigned primary shards | +| elasticsearch_health_report_unassigned_replicas | gauge | 1 | The number of unassigned replica shards | +| elasticsearch_ingest_total_count | gauge | 1 | Total number of ingest count since the last JVM restart | +| elasticsearch_ingest_total_time_seconds | gauge | 1 | The total ingest time in seconds since the last JVM restart | +| elasticsearch_ingest_total_failed | gauge | 1 | Total number of ingest failed docs since the last JVM restart | +| elasticsearch_ingest_total_current | gauge | 1 | Total number of current ingest docs since the last JVM restart | ### Alerts & Recording Rules diff --git a/collector/nodes.go b/collector/nodes.go index 2fb07693..46d90385 100644 --- a/collector/nodes.go +++ b/collector/nodes.go @@ -1521,6 +1521,54 @@ func NewNodes(logger *slog.Logger, client *http.Client, url *url.URL, all bool, }, Labels: defaultNodeLabelValues, }, + { + Type: prometheus.GaugeValue, + Desc: prometheus.NewDesc( + prometheus.BuildFQName(namespace, "ingest", "total_count"), + "Total number of ingest count since the last JVM restart", + defaultNodeLabels, nil, + ), + Value: func(node NodeStatsNodeResponse) float64 { + return float64(node.Ingest.Total.Count) + }, + Labels: defaultNodeLabelValues, + }, + { + Type: prometheus.GaugeValue, + Desc: prometheus.NewDesc( + prometheus.BuildFQName(namespace, "ingest", "total_time_seconds"), + "Total number of ingest time in seconds since the last JVM restart", + defaultNodeLabels, nil, + ), + Value: func(node NodeStatsNodeResponse) float64 { + return float64(node.Ingest.Total.TimeInMillis) * 0.001 + }, + Labels: defaultNodeLabelValues, + }, + { + Type: prometheus.GaugeValue, + Desc: prometheus.NewDesc( + prometheus.BuildFQName(namespace, "ingest", "total_current"), + "Total number of current ingest docs since the last JVM restart", + defaultNodeLabels, nil, + ), + Value: func(node NodeStatsNodeResponse) float64 { + return float64(node.Ingest.Total.Current) + }, + Labels: defaultNodeLabelValues, + }, + { + Type: prometheus.GaugeValue, + Desc: prometheus.NewDesc( + prometheus.BuildFQName(namespace, "ingest", "total_failed"), + "Total number of ingest failed docs since the last JVM restart", + defaultNodeLabels, nil, + ), + Value: func(node NodeStatsNodeResponse) float64 { + return float64(node.Ingest.Total.Failed) + }, + Labels: defaultNodeLabelValues, + }, }, gcCollectionMetrics: []*gcCollectionMetric{ { diff --git a/collector/nodes_response.go b/collector/nodes_response.go index 6ba2ad7b..c834022e 100644 --- a/collector/nodes_response.go +++ b/collector/nodes_response.go @@ -41,6 +41,7 @@ type NodeStatsNodeResponse struct { Transport NodeStatsTransportResponse `json:"transport"` Process NodeStatsProcessResponse `json:"process"` IndexingPressure map[string]NodeStatsIndexingPressureResponse `json:"indexing_pressure"` + Ingest NodeIngestResponse } // NodeStatsBreakersResponse is a representation of a statistics about the field data circuit breaker @@ -398,3 +399,16 @@ type ClusterHealthResponse struct { TimedOut bool `json:"timed_out"` UnassignedShards int64 `json:"unassigned_shards"` } + +// NodeIngestResponse is a representation of a node ingest stats +type NodeIngestResponse struct { + Total NodeIngestStatsResponse `json:"total"` +} + +// NodeIngestStatsResponse is a representation of a node ingest stats +type NodeIngestStatsResponse struct { + Count int64 `json:"count"` + TimeInMillis int64 `json:"time_in_millis"` + Current int64 `json:"current"` + Failed int64 `json:"failed"` +} diff --git a/collector/nodes_test.go b/collector/nodes_test.go index 9e731837..e7697710 100644 --- a/collector/nodes_test.go +++ b/collector/nodes_test.go @@ -292,6 +292,18 @@ func TestNodesStats(t *testing.T) { # HELP elasticsearch_indices_warmer_total Total warmer count # TYPE elasticsearch_indices_warmer_total counter elasticsearch_indices_warmer_total{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="127.0.0.1",name="bVrN1Hx"} 35 + # HELP elasticsearch_ingest_total_count Total number of ingest count since the last JVM restart + # TYPE elasticsearch_ingest_total_count gauge + elasticsearch_ingest_total_count{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="127.0.0.1",name="bVrN1Hx"} 0 + # HELP elasticsearch_ingest_total_current Total number of current ingest docs since the last JVM restart + # TYPE elasticsearch_ingest_total_current gauge + elasticsearch_ingest_total_current{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="127.0.0.1",name="bVrN1Hx"} 0 + # HELP elasticsearch_ingest_total_failed Total number of ingest failed docs since the last JVM restart + # TYPE elasticsearch_ingest_total_failed gauge + elasticsearch_ingest_total_failed{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="127.0.0.1",name="bVrN1Hx"} 0 + # HELP elasticsearch_ingest_total_time_seconds Total number of ingest time in seconds since the last JVM restart + # TYPE elasticsearch_ingest_total_time_seconds gauge + elasticsearch_ingest_total_time_seconds{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="127.0.0.1",name="bVrN1Hx"} 0 # HELP elasticsearch_jvm_buffer_pool_used_bytes JVM buffer currently used # TYPE elasticsearch_jvm_buffer_pool_used_bytes gauge elasticsearch_jvm_buffer_pool_used_bytes{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="127.0.0.1",name="bVrN1Hx",type="direct"} 2.52727869e+08 @@ -1274,6 +1286,18 @@ func TestNodesStats(t *testing.T) { # HELP elasticsearch_indices_warmer_total Total warmer count # TYPE elasticsearch_indices_warmer_total counter elasticsearch_indices_warmer_total{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="172.17.0.2",name="aaf5a8a0bceb"} 11 + # HELP elasticsearch_ingest_total_count Total number of ingest count since the last JVM restart + # TYPE elasticsearch_ingest_total_count gauge + elasticsearch_ingest_total_count{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="172.17.0.2",name="aaf5a8a0bceb"} 0 + # HELP elasticsearch_ingest_total_current Total number of current ingest docs since the last JVM restart + # TYPE elasticsearch_ingest_total_current gauge + elasticsearch_ingest_total_current{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="172.17.0.2",name="aaf5a8a0bceb"} 0 + # HELP elasticsearch_ingest_total_failed Total number of ingest failed docs since the last JVM restart + # TYPE elasticsearch_ingest_total_failed gauge + elasticsearch_ingest_total_failed{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="172.17.0.2",name="aaf5a8a0bceb"} 0 + # HELP elasticsearch_ingest_total_time_seconds Total number of ingest time in seconds since the last JVM restart + # TYPE elasticsearch_ingest_total_time_seconds gauge + elasticsearch_ingest_total_time_seconds{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="172.17.0.2",name="aaf5a8a0bceb"} 0 # HELP elasticsearch_jvm_buffer_pool_used_bytes JVM buffer currently used # TYPE elasticsearch_jvm_buffer_pool_used_bytes gauge elasticsearch_jvm_buffer_pool_used_bytes{cluster="elasticsearch",es_client_node="true",es_data_node="true",es_ingest_node="true",es_master_node="true",host="172.17.0.2",name="aaf5a8a0bceb",type="direct"} 8.811046e+06