1.7.1
版本发布时间: 2018-03-28 23:29:37
rcbops/rpc-maas最新发布版本:1.8.1(2019-01-03 23:00:23)
Release Notes
1.7.1
New Features
- Detailed logging was added to the maas_rally performance monitoring plugin.
- Automatic stale lock and resource cleanup was added to maas_rally. This makes the plugin more robust and resiliant to transitory environmental problems.
- A configurable quota factor was added to the maas_rally plugin. This allows resource cleanup and performance polling to run asynchronously.
- The maas_rally plugin will now generate an alarm event when too many consecutive intervals (default=3) required cleanup of stale resources.
- The maas_rally plugin will now generate an alarm event when too many consecutive intervals (default=3) were aborted waiting for immature locks.
- A rally_diag.sh script is now deployed to all utility containers. This script helps support to quickly identify resources (instances, images, etc) that were created by maas_rally.
Bug Fixes
-
Limit ceph_cluster_stats and ceph_mons_stats checks to groups['mons'][0] and ceph_rgw_stats to groups['rgws'][0] to prevent duplicate alarms on ceph clusters.
-
- Properly configure agent.plugin timeout value in plugin arguments.
- Add override to swift-recon checks and include a parser for timeout in swift-recon.py.
-
Added more meaningful process info in neutron_ovs_agent alarm exception message.
-
Added a new status_err_no_exit function call to allow plugins like neutron_ ovs_agent_check.py to run its cause and report correct metrics
-
Fixed an exotic KeyError premature exit of the rabbitmq_status.py _get_node_metrics check path. (See https://core.rackspace.com/ticket/180307-12728 for reference)
-
Using the new status_err_no_exit function call to allow plugins to run its cause and report correct metrics
-
Fixed an exotic CalledProcessError premature exit of the swift quarantine check path. (See https://core.rackspace.com/ticket/180307-05355 for reference)
-
Using the new status_err_no_exit function call to allow plugins to run its cause and report correct metrics
-
Fixed an exotic KeyError premature exit of the rabbitmq_status check path.
-
- Disable capacitive related checks: cinder_vg_check, ironic_capacity_check, and nova_cloud_stats_check.
- Disable alarms for CDM checks on all hosts except groups['shared-infra_hosts']. This includes cpu_check, disk_utilisation, and memory_check.
- Disable alarms for network_throughput across all hosts.
* Changes to galera_check: * Limit enablement to groups['galera_all'][0].* Remove alarm for aborted_clients. * Changes to rabbitmq_status: * Limit enablement to groups['rabbitmq_all'][0].* Modify metric msgs_excl_notifications to sum messages from consumed queues only. * Add metric msgs_without_consumers to sum messages from unconsumed queues only.* Fix bug in rabbitmq_qgrowth_excl_notifications alarm removing the division by check period. This is automatically handled by the rate() function. * Restructure rabbitmq_queues_without_consumers alarm with rabbitmq_msgs_without_consumers. This will alarm if unconsumed messages reaches the default threshold of 20000.* Remove default var for unused maas_rabbitmq_queues_without_consumers_limit. * Update maas_rabbitmq_queued_messages_excluding_notifications_threshold to 5000.* Add maas_rabbitmq_messages_without_consumers_threshold, defaulting to 20000.
- Update maas_swift_container_replication_avg_time_threshold from 50 to 300.
Other Notes
- The user configured in openrc_os_username (admin by default) will be granted the admin role on each project created for maas_rally scenarios. This facilitates listing swift containers in the rally_diag.sh script.