Yoga Series Release Notes¶
14.11.0-21¶
New Features¶
CentOS Stream 8 is not supported anymore. Try migration to Rocky Linux 9.
Upgrade Notes¶
CentOS Stream 8 is not supported anymore. Try migration to Rocky Linux 9.
Bug Fixes¶
Add conditionals for IPv6 sysctl settings that have IPV6 disabled in kernel. Changing sysctl settings related to IPv6 on those systems lead to errors. LP#1906306
Fixes the Python requests library issue when using custom CA by adding the REQUESTS_CA environment variable to the kolla-toolbox container. See LP#1967132
Updated configuration of service user tokens for all Nova and Cinder services to stop using admin role for service_token and use service role.
See LP#[2004555] and LP#[2049762] for more details.
Add
skip_kpartx yesto multipath.confdefaultssection to prevent kpartx scanning multipath devices and unlockmultipathd del mapoperation of os-brick for volume detaching oprtaions. LP#2078973 <https://launchpad.net/bugs/2078973>`__
Fixed an issue where the MariaDB Cluster recovery process would fail if the sequence number was not found in the logs. The recovery process now checks the complete log file for the sequence number and recovers the cluster. See LP#1821173 for details.
Removes the default /tmp/ mountpoint from the horizon container. This change is made to harden the container and prevent potential security issues. For more information, see the Bug Report: LP#2068126.
Add Keystone Service role. Keystone is creating service in bootstrap since Bobcat. Service role is needed for SLURP to work from Antelope. This role is also needed in Antelope and Zed for Cinder for proper service token support. LP#2049762
Fixes parsing of JSON output of inner modules called by
kolla-toolboxwhen data was returned on standard error. LP#2080544
14.11.0¶
New Features¶
Adds warning and support for automatic pruning of stale Kibana indices during OpenSearch migration by running
kolla-ansible opensearch-migrationwith--prune-kibana-indicesparameter.
The new command
kolla-ansible rabbitmq-reset-statehas been added. It force-resets the state of RabbitMQ. This is primarily designed to be used when enabling HA queues, see docs: https://docs.openstack.org/kolla-ansible/latest/reference/message-queues/rabbitmq.html#high-availability
Updates apache grok pattern to match the size of response in bytes, time taken to serve the request and user agent.
Masakari coordination backend can now be configured via masakari_coordination_backend variable. Coordination is optional and can now be set to either redis or etcd.
Set a log retention policy for OpenSearch via Index State Management (ISM). Documentation.
Adds the ability to configure rabbitmq via
rabbitmq_extra_configwhich can be overriden in globals.yml.
In the configuration template of the Senlin service the
cafileparameter is now set by default in theauthenticationsection. This way the use of self-signed certificates on the internal Keystone endpoint is also usable in the Senlin service.
Upgrade Notes¶
Added log retention in OpenSearch, previously handled by Elasticsearch Curator. By default the soft and hard retention periods are 30 and 60 days respectively. If you are upgrading from Elasticsearch, and have previously configured
elasticsearch_curator_soft_retention_period_daysorelasticsearch_curator_hard_retention_period_days, those variables will be used instead of the defaults. You should migrate your configuration to use the new variable names before the Caracal release.
Bug Fixes¶
Fixes 504 timeout when scraping openstack exporter. Ensures that HAProxy server timeout is the same as the scrape timeout for the openstack exporter backend. LP#2006051
Fixes non-persistent Neutron agent state data. LP2009884
Fixes glance-api failed to start privsep daemon when cinder_backend_ceph is set to true. See LP#2024541 for more details.
Fixes 2024554. Adds mariadb_port to the wsrep sync status check. This is so none standard ports can be used for mariadb deployments. LP#2024554
Fixes glance image import LP#2048525.
Fixes an issue where Prometheus would fail to scrape the OpenStack exporter when using internal TLS with an FQDN. LP#2008208
Fixes Docker health check for the
sahara_enginecontainer. LP#2046268
Fixes an issue where Fluentd was parsing Horizon WSGI application logs incorrectly. Horizon error logs are now written to
horizon-error.loginstead ofhorizon.log. See LP#1898174
Added log retention in OpenSearch, previously handled by Elasticsearch Curator, now using Index State Management (ISM) OpenSearch bundled plugin. LP#2047037.
Fixes an issue where Prometheus scraping of Etcd metrics would fail if Etcd TLS is enabled. LP#2036950
14.10.0¶
New Features¶
Added capability to specify custom kernel modules for Neutron: neutron_modules_default: Lists default modules. neutron_modules_extra: For custom modules and parameters.
Added a neutron check for ML2/OVS and ML2/OVN presence at the start of deploy phase. It will fail if neutron_plugin_agent is set to
ovnand use of ML2/OVS container detected. In case where neutron_plugin_agent is set toopenvswitchthe check will fail when it detects ML2/OVN container or any of the OVN specific volumes.
Upgrade Notes¶
Default keystone user role has been changed from deprecated role
_member_tomemberrole.
Now
ironic_tftpservice does not bind on 0.0.0.0, by default it uses ip address of theapi_interface. To revert to the old behaviour, please setironic_tftp_interface_address: 0.0.0.0inglobals.yml.
Configure Nova libvirt.num_pcie_ports to 16 by default. Nova currently sets ‘num_pcie_ports’ to “0” (defaults to libvirt’s “1”), which is not sufficient for hotplug use with ‘q35’ machine type.
Influxdb variable
infuxdb_internal_endpointhas been fixed toinfluxdb_internal_endpoint. Operators might need to review the relevant variable.
Changes default value of nova libvirt driver setting
skip_cpu_compare_on_destto true. With the libvirt driver, during live migration, skip comparing guest CPU with the destination host. When using QEMU >= 2.9 and libvirt >= 4.4.0, libvirt will do the correct thing with respect to checking CPU compatibility on the destination host during live migration.
Security Issues¶
Restrict the access to the http Openstack services exposed /server-status by default through the HAProxy on the public endpoint. Fixes issue for Ubuntu/Debian installations. RockyLinux/CentOS not affected. LP#1996913
Bug Fixes¶
Fixes issues with OVN NB/SB DB deployment, where first node needs to be rebootstrapped. LP#1875223
Set correct permissions for opensearch-dashboard data location LP#2020152 https://bugs.launchpad.net/kolla-ansible/+bug/2020152
enable_keystone_federationandkeystone_enable_federation_openidhave not been explicitly handled as bool in various templates in the keystone role so far. LP#2036390
Fixes an issue when Kolla is setting the producer tasks to None, and this disables all designate producer tasks. LP#1879557
Fixes
ironic_tftpwhich binds to all ip addresses on the system. Addedironic_tftp_interface,ironic_tftp_address_familyandironic_tftp_interface_addressparameters to set the address for theironic_tftpservice. LP#2024664
Fixes an OpenSearch migration process by adding precheck for Elasticsearch indexes in too low version for OpenSearch 2.x.
Fixes an issue where a Docker health check wasn’t configured for the OpenSearch Dashboards container. See bug 2028362.
Fixes an issue where ‘q35’ libvirt machine type VM could not hotplug more than one PCIe device at a time.
14.9.0¶
New Features¶
Since CVE-2022-29404 is fixed the default value for the LimitRequestBody directive in the Apache HTTP Server has been changed from 0 (unlimited) to 1073741824 (1 GiB). This limits the size of images (for example) uploaded in Horizon. Now this limit can be configured via
horizon_httpd_limitrequestbody. LP#2012588
Adds support for deploying OpenSearch and OpenSearch dashboards. These services directly replace ElasticSearch and Kibana which are now end-of-life. Support for sending logs to a remote ElasticSearch (or OpenSearch) cluster is maintained.
Adds support for migrating from Elasticsearch to OpenSearch by running
kolla-ansible opensearch-migrationcommand.
etcd is now exposed internally via HAProxy on
etcd_client_port.
Added two new flags to alter behaviour in RabbitMQ: * rabbitmq_message_ttl_ms, which lets you set a TTL on messages. * rabbitmq_queue_expiry_ms, which lets you set an expiry time on queues. See https://www.rabbitmq.com/ttl.html for more information on both.
The config option rabbitmq_ha_replica_count is added, to allow for changing the replication factor of mirrored queues in RabbitMQ. While the flag is unset, the queues are mirrored across all nodes using “ha-mode”:”all”. Note that this only has an effect if the flag ` om_enable_rabbitmq_high_availability` is set to True, as otherwise queues are not mirrored.
The config option rabbitmq_ha_promote_on_shutdown has been added, which allows changing the RabbitMQ definition ha-promote-on-shutdown. By default ha-promote-on-shutdown is “when-synced”. We recommend changing this to be “always”. This basically means we don’t mind losing some messages, instead we give priority to rabbitmq availability. This is most relevant when restarting rabbitmq, such as when upgrading. Note that setting the value of this flag, even to the default value of “when-synced”, will cause RabbitMQ to be restarted on the next deploy. For more details please see: https://www.rabbitmq.com/ha.html#cluster-shutdown
Services using etcd3gw via tooz now use etcd via haproxy. This removes a single point of failure, where we hardcoded the first etcd host for backend_url.
Upgrade Notes¶
Default tags of
neutron_tls_proxyandglance_tls_proxyhave been changed tohaproxy_tag, as both services are usinghaproxycontainer image. Any custom tag overrides for those services should be altered before upgrade.
Security Issues¶
The kolla-genpwd, kolla-mergepwd, kolla-readpwd and kolla-writepwd commands now creates or updates passwords.yml with correct permissions. Also they display warning message about incorrect permissions.
Bug Fixes¶
The precheck for RabbitMQ failed incorrectly when
kolla_externally_managed_certwas set totrue. LP#1999081
Fixes create sasl account before config file is ready. LP#2015589
The flags
--db-nb-pidand--db-sb-pidhave been corected to be--db-nb-pidfileand--db-sb-pidfilerespectively. See here for reference: https://github.com/ovn-org/ovn/blob/6c6a7ad1c64a21923dc9b5bea7069fd88bcdd6a8/utilities/ovn-ctl#L1045 LP#2018436
Configuration of service user tokens for all Nova and Cinder services is now done automatically, to ensure security of block-storage volume data.
See LP#[2004555] for more details.
Adds configuration necessary for application credential access rules to properly function. LP#1965111
Fixes deployment when using Ansible check mode. LP#2002661
Fixes the incorrect endpoint URLs and service type information for the Cyborg service in the Keystone. LP#2020080
Set the etcd internal hostname and cacert for tls internal enabled deployments. This allows services to work with etcd when coordination is enabled for TLS interal deployments. Without this fix, the coordination backend fails to connect to etcd and the service itself crashes.
fix missing [taskflow] section in masakari.conf.j2 LP#1966536
Fixes opensearch migration process. Including case when elasticsearch is located in regular folder instead of docker volume. Furthermore it now has checks if there is data to migrate.
When upgrading or deploying RabbitMQ, the policy ha-all is cleared if om_enable_rabbitmq_high_availability is set to false.
14.8.0¶
New Features¶
Adds the flag
om_enable_rabbitmq_high_availablity. Setting this totruewill enable both durable queues and classic mirrored queues in RabbitMQ. Note that classic queue mirroring and transient (aka non-durable) queues are deprecated and subject to removal in RabbitMQ version 4.0 (date of release unknown). Changes the pattern used in classic mirroring to exclude some queue types. This pattern is^(?!(amq\\.)|(.*_fanout_)|(reply_)).*.
Adds a new flag,
openvswitch_ovs_vsctl_wrapper_enabledwhich will install a wrapper script to/usr/bin/ovs-vsctlto docker exec into the openvswitchd container.
Adds
ovn-monitor-allvariable. A boolean value that tells if ovn-controller should unconditionally monitor all records in OVS databases. Settingovn-monitor-allvariable to ‘true’ will remove some CPU load from OVN SouthBound DB but will effect with more updates comming to ovn-controller. Might be helpfull in large deployments with many compute hosts.
For ovn added ovn-openflow-probe-interval variable, it sets the inactivity probe interval of the OpenFlow connection to the OpenvSwitch integration bridge, in seconds. If the value is zero, it disables the connection keepalive feature. The default value is 60 seconds.
Upgrade Notes¶
ovnrole has been split intoovn-controllerandovn-dbroles,ovn_extra_volumeswill still be used, but users will need to adapt their config to useovn_db_extra_volumesorovn_controller_extra_volumeswhen upgrading to Zed.
For ovn the default value of openflow-probe-interval was changed to 60 seconds. Use the ovn-openflow-probe-interval variable to override.
Bug Fixes¶
Fixes
kolla_dockermodule which did not take into account the common_options parameter, so there were always module’s default values. LP#2003079
The value of
[oslo_messaging_rabbit] heartbeat_in_pthreadis explicitly set to eithertruefor wsgi applications, orfalseotherwise.
Fix issue with octavia config generation when using
octavia_auto_configureand thegenconfigcommand. Note that access to the OpenStack API is necessary for Octavia auto configuration to work, even when generating config. See LP#1987299 for more details.
Fixes OVN deployment order - as recommended in OVN docs. LP#1979329
Fixes an issue where some prechecks would fail or not run when running in check mode. LP#2002657
14.7.0¶
Bug Fixes¶
Fixes an issue with
ironic-inspectorusing the wrong option to configure the interface used to communicate with the Ironic API. LP#1995246
14.6.0¶
Upgrade Notes¶
image_upload_use_cinder_backend = Trueis no longer set on the Cinder’s default Ceph RBD backend, the common upstream default is now used (Falsecurrently). See also LP#1991516
Bug Fixes¶
image_upload_use_cinder_backend = Trueis no longer set on the Cinder’s default Ceph RBD backend. Related ERRORs and WARNINGs in Cinder and Glance logs are prevented. LP#1991516
Fixes Keystone OIDC failing to validate JWT because of missing key on Azure auth-oidc endpoint. Adds new variable containing JWKS uri that delivers missing keys. LP#1990375
Fixes Ironic API healthchecks when backend TLS encryption is enabled. LP#1990819
Removes the
dhcp-sequential-ipconfiguration option fromironic_dnsmasqto avoid a race condition offering the same IP address to multiple hosts being inspected at the same time.
Fixes an issue with
ironic-neutron-agentusing the wrong option to configure the interface used to communicate with the Ironic API. LP#1990675
14.5.0¶
New Features¶
Adds support for deploying
prometheus-msteams, which can be used to forward Prometheus Alertmanager notifications to Microsoft Teams. It is enabled by settingenable_prometheus_msteamstotrue.
Bug Fixes¶
Fixes an issue with AlertManager external Web URL being unconfigurable. A new variable
prometheus_alertmanager_external_urlhas been introduced that users can use to set web.external-url to public.
Under circumstances of extended disruption to the Fluentd-ElasticSearch central logging pipeline, it is possible to generate a sufficient buffer of unsent log data that takes longer than the default Fluentd request timeout (default 5 seconds) to transfer the buffer. The default request timeout value is raised to
60s, and made configurable using new parameterfluentd_elasticsearch_request_timeout. LP#1983031
14.4.0¶
Security Issues¶
Kolla Ansible used to run Ironic’s tftpd as an (unprivileged) root user. Now, it will explicitly use the nobody user.
Bug Fixes¶
Fixes 1982777. Set multipathd user_friendly_names to “no” to make os-brick able to resize volumes online. Adds ability to override multipathd config. LP#1982777
Fixed bug #1987982 This bug caused the database log_bin_trust_function_creators variable not to be set back to “OFF” after a keystone upgrade.
If
ironic_enabled_notification_topicsis set totrue,ironic_notification_levelis set toinfoin order to ensure that Ironic actually sends out notifications.See bug 1969826 for details.
14.3.0¶
New Features¶
Adds variables to configure whether monitoring services should be exposed externally:
enable_grafana_externalenable_kibana_externalenable_prometheus_alertmanager_external
Bug Fixes¶
Fixes an issue where Ironic Inspector could be configured without authentication in a multi-region environment in a region without a local Keystone service.
14.2.0¶
New Features¶
Adds support for configuring the Openstack Compute API microversion used by the OpenStack exporter for Prometheus using the
prometheus_openstack_exporter_compute_api_versionvariable. The default value is2.1to keep metrics unchanged when using recent exporter releases.
Bug Fixes¶
Fixes the issue of exponential growth of /run/openvswitch mounts when kolla-toolbox container is restarted. LP#1979295
Fixes an issue with recovering multi-node MariaDB Galera cluster.
Increases
prometheus_openstack_exporter_timeoutto 45 seconds to reduce the odds of scrape failures on deployments with large number of OpenStack resources. LP#1976629
14.1.0¶
New Features¶
New switches added to control deployment of the Masakari monitors. The deployment of each type of monitors can be controlled individually via
enable_masakari_instancemonitorandenable_masakari_hostmonitor. By default, both are set totruewhen the deployment of the Masakari is enabled viaenable_masakari.
Bug Fixes¶
Fixes an issue with Masakari instance monitor when libvirt SASL is enabled. libvirt SASL was enabled by default in a recent change to Kolla Ansible. LP#1965754
The prometheus openstack exporters are now behind haproxy, providing a unique time series in the prometheus database. Also ensures that only one exporter queries the openstack APIs at any given time interval. With the previous behavior each openstack exporter was scraped at the same time. This caused each exporter to query the openstack APIs simultaneously introducing unneccesary load and duplicate time series in the prometheus database due to the instance label being unique for each exporter. LP#1972818
14.0.0¶
New Features¶
Adds support for the VMware NSX Policy plugin
Adds support for openEuler 20.03-LTS-SP2 as a host OS distribution.
Deploys and configures a prometheus-libvirt-exporter image as part of the Prometheus monitoring stack.
Adds support for the VMware FCD as Cinder volumes.
Adds a
tls_connectmodule to the Prometheus blackbox exporter. This can be used to test connectivity of TLS servers.
Adds the ability to use Prometheus as the metrics database for Ceilometer.
Adapts Ceilometer configurations so metrics can be pushed to a Prometheus Pushgateway. LP#1964135
Adds new variables to be used by the common role,
cron_logrotate_log_minsizeandcron_logrotate_log_maxsize. They allow to configure global logrotate’s minsize and maxsize options.
Allow to disable Designate Sink service (and notifications to/from it) by setting
designate_enable_notifications_sinktono.
Introduce
nova_enable_external_metadatathat defaults tonoto control if external facing metadata haproxy frontend should be configured.
With this release, kolla-ansible no longer creates admin endpoints for any service other than Keystone. Make sure that you only reference public or internal endpoints in your applications and configurations.
Allows the use of variables in ceph configuration and keyring files. This includes but is not limited to ansible lookup expressions. LP#1959565
Implements the HAProxy Admin Socket. Allows operators to set the flag
haproxy_socket_level_admin(default: “no”) which addslevel adminto socket that gets created at/var/lib/kolla/haproxy/haproxy.sockinside the HAProxy container. This allows operators to interact with HAProxy, including but not limited to disabling backend servers for controlled maintenance operations. bug 1960215.
horizondeployment now supports custom themes.
Implements container healthchecks for ironic-neutron-agent service. See blueprint
Implements container healthchecks for neutron-bgp-dragent service. See blueprint
Implements container healthchecks for solum services. See blueprint
Implements container healthchecks for storm services. See blueprint
Implements container healthchecks for zookeeper services. See blueprint
Support for both PXE and iPXE enabled in Ironic at the same time.
Adds support for running a libvirt daemon on the host, rather than in a container. This is done by setting
enable_nova_libvirt_containertofalse. Currently this is only supported for fresh deployments without an existingnova_libvirtcontainer.
Adds support for libvirt SASL authentication. It is enabled by default. LP#1964013
Adds support to the
kolla-ansible certificatescommand for generating certificates for libvirt TLS, whenlibvirt_tlsistrue. The same certificate and key are used for the libvirt client and server.The certificates use the same root CA as the other generated certificates, and are written to
{{ node_custom_config }}/nova/nova-libvirt/, ready to be picked up by nova-libvirt and nova-compute.
Adds a new variable to be used by the common role,
cron_logrotate_schedule. This allows to configure how often the cron runs for logrotate.
Adds an SSH key for Neutron server which can be used for passwordless public key authentication in external systems (e.g. for
networking-generic-switchmanaged switches).
Adds a
kolla-ansible nova-libvirt-cleanupcommand, which may be used to clean up thenova_libvirtcontainer. This may be useful if switching to a host libvirt daemon.
Keystone OIDC integration now uses memcached for the caching backend if
enable_memcachedisTrue. This can be disabled by settingkeystone_oidc_enable_memcachedtoFalse.
Adds functionality to enable HW offload in OpenvSwitch using
openvswitch_hw_offloadvariable.
Adds variables to define extra command-line parameters to be passed to Prometheus exporters:
prometheus_blackbox_exporter_cmdline_extrasprometheus_elasticsearch_exporter_cmdline_extrasprometheus_haproxy_exporter_cmdline_extrasprometheus_memcached_exporter_cmdline_extrasprometheus_mysqld_exporter_cmdline_extrasprometheus_node_exporter_cmdline_extrasprometheus_openstack_exporter_cmdline_extras
Add
enable_prometheus_etcd_integrationconfiguration parameter which can be used to configure Prometheus to scrape etcd metrics endpoints. The default value ofenable_prometheus_etcd_integrationis set to the combined values ofenable_prometheusandenable_etcd.
Adds “manila_cephfs_filesystem_name” variable to support multi-fs Ceph Pacific+ deloyments.
Adds support for Rocky Linux 8 as Host OS.
Adds support for configuring a Vendordata file for Nova. This allows users to pass through arbitrary data to instances.
Known Issues¶
Existing fluentd log rotation failed to delete old haproxy, swift, glance-tls-proxy and neutron-tls-proxy logs. These will not be deleted by the new logrotate config and will have to be removed manually.
Upgrade Notes¶
Minimum supported Ansible version is now
4(ansible-core 2.11) and maximum supported is5(ansible-core 2.12).
Restores upstream default value for
max_allowed_request_size_in_bytesinbarbican.conf. It was set to 1000000 bytes instead of the upstream default of 25000 bytes.
RabbitMQ’s Prometheus plugin is no longer enabled by default if Prometheus is not deployed. If external Prometheus is used, you need to turn on
rabbitmq_enable_prometheus_pluginto get old behaviour.
External Nova metadata service is now disabled by default. It can be enabled by setting
nova_enable_external_metadatatoyes.
With this release, kolla-ansible no longer creates admin endpoints for any service other than Keystone. Existing endpoints will not be removed automatically, if you want to clean up your existing cloud, you can use a command like:
openstack endpoint list --interface admin -f value | \ awk '!/keystone/ {print $1}' | xargs openstack endpoint delete
enable_host_ntpvariable is dropped per the deprecation process.
Support for deploying
vmtphas been dropped per the mailing list notice. Thevmtpproject is no longer buildable, is outside of the OpenStack namespace and looks plain abandoned. See the mailing list notice
fluentd_binaryandfluentd_versionvariables are no longer in use as Kolla Ansible supports a single fluentd version across all supported Kolla image flavours.
Starting with Yoga, Ironic has changed the default PXE from plain PXE to iPXE. Kolla Ansible follows this upstream decision but allows users to revert to the previous default of plain PXE. For details, please refer to Kolla Ansible’s documentation.
The bootloader used to boot Ironic nodes in UEFI boot mode during inspection when iPXE is enabled has been changed from
ipxe.efitosnponly.efi. This is in line with the default UEFI iPXE bootloader used in Ironic since the Xena release. The bootloader may be changed viaironic_dnsmasq_uefi_ipxe_boot_file.
Ironic volumes related to PXE (TFTP) and iPXE & direct deploy (HTTP) are refactored to share a common parent path at
/var/lib/ironic. This is done to support both PXE and iPXE at the same time. Operators doing advanced customisations might need to review the relevant defaults section.
Upgrades of Ironic will now wait for nodes in
waitstates to change their state. This is to improve the user experience by avoiding breaking processes being waited on. This can be disabled by settingironic_upgrade_skip_wait_checktoyes.
Ironic containers related to PXE (TFTP) and iPXE & direct deploy (HTTP) are renamed to better reflect their role:
ironic_pxeis nowironic_tftp, whileironic_ipxeis nowironic_http. Operators doing advanced customisations might need to review the relevant defaults section. Additionally, their respective host groups have changed analogously:ironic-pxeis nowironic-tftp, andironic-ipxeis nowironic-http.
ironic.confnow sets[pxe]\kernel_append_paramsinstead of[pxe]\pxe_append_paramswhich has been deprecated. Please override the new config option if you are overriding the old one.
The addition of libvirt SASL authentication requires a new password in
passwords.yml,libvirt_sasl_password. This may be generated using the existingkolla-genpwdandkolla-mergepwdtooling.
The addition of libvirt SASL authentication requires both the
nova_libvirtandnova_computecontainers to be updated simultaneously, using new images with the necessary Cyrus SASL dependencies, as well as configuration containing the SASL credentials.
If both Designate and Neutron are enabled, Neutron now uses the
subnet_dns_publish_fixed_ipinstead of the simplerdnsextension in order to support more features in the DNS integration. Override via theneutron_extension_driversconfig option if this is not suitable for your deployment.
It is no longer possible to override the removal of the Monasca Log Metrics service and it will be removed automatically if it hasn’t already been removed in the Wallaby release. It is up to the operator to remove any associated docker volumes.
The policy for classic transient mirrored queues in RabbitMQ has been removed from the RabbitMQ configuration. The policy will be removed automatically during upgrade of the RabbitMQ service.
The
wsrep-notify.shscript has been removed (following deprecation in Wallaby).
update the default value of node_custom_config to {{ node_config }}/config, when specified using –configdir
Deprecation Notes¶
enable_ironic_ipxeis deprecated in favour ofironic_dnsmasq_serve_ipxewhich reflects the effect better.enable_ironic_ipxewill be removed in Zed.
enable_ironic_pxe_uefiis deprecated and will be removed in Zed. This variable is not documented and results in a broken PXE setup for Ironic Inspector. The recommended way to support EFI/UEFI deployments in Ironic Inspector is to stay with the recommended default of iPXE in Ironic Inspector (see docs onironic_dnsmasq_serve_ipxe).
The
storage_interfacevariable is deprecated and will be removed in the next release as it was causing confusion. The variable only sets the default forswift_storage_interfacewhich we now recommend to set directly instead.
Security Issues¶
Explicitly removes the
net.ipv4.ip_forwardsysctl from/etc/sysctl.confon hosts with Neutron L3 Agent. In the absence of another source for this sysctl, it should revert to the default of 0 after the next reboot. This is a follow up to a previous change which stopped setting the sysctl, but leaves existing systems with the original value of 1 set.A deployer looking to more aggressively change the value may set
neutron_l3_agent_host_ipv4_ip_forwardto 0 using a Yoga release of Kolla Ansible. This option will be removed in future. Any deployments still relying on the previous value may setneutron_l3_agent_host_ipv4_ip_forwardto 1. LP#1945453
Fixes an issue where the default configuration of libvirt did not use authentication for the API exposed over TCP on the internal API network. This allowed anyone with access to the internal API network read-write access to libvirt. While the internal API network is typically trusted, other services on this network generally at least require authentication.
SASL authentication is now enabled for libvirt by default. Kolla Ansible supports libvirt TLS since the Train release, and this is recommended to provide a higher level of security. LP#1964013
Adds mitigation for the Apache Log4j2 Remote Code Execution (RCE) Vulnerability in Elasticsearch - CVE-2021-44228.
Bug Fixes¶
Fixes an issue with an OIDC authentication flow requiring unnecessary action from the user. Redirecting to the target IdP page now happens automatically. LP#930055
Removes custom value of
max_allowed_secret_in_bytesinbarbican.conf. The default maximum size in Barbican was doubled to avoid issues with some certificates. LP #1957795
Fixes deploy Zun with Cinder Ceph support. Adds support for zun to access cinder volumes when external ceph is configured for cinder. LP#1848934
Fixed the deployment failure of outward_rabbitmq by resolving port conflicts by customizing RabbitMQ’s
prometheus.tcp.port. LP #1885106
Use Volume V3 API in OpenStack exporter. Volume V2 API has been removed since OpenStack Wallaby. LP#1938194
Fixes generation of
cyborg.conf. LP#1941704
Adds the
nodeparameter when using therabbitmq_userAnsible module. LP#1946506
Fixes an issue with multinode MariaDB deployments which could fail the playbook execution on WSREP check due to the new behaviour of Galera 4. LP#1947485.
Fixes an issue with single node MariaDB deployments with HAProxy disabled. See bug 1947534 for details.
Fixes the generation of
wsrep_cluster_addressingalera.cnfwhen--limitis used while deploying MariaDB nodes. LP#1947589
Fixes the copy job for grafana custom home dashboard file. The copy job for the grafana home dashboard file needs to run priviliged, otherwise permission denied error occurs. LP#1947710
Fixes an error in placement role which prevents to deploy the placement service when custom policy file is used. LP#1948835
Fixes missing current Ansible version in the error message. LP#1948979
Fix octavia role doesn’t set the amphora network’s gateway_ip LP#1949260
Fixes Octavia’s “Connection refused” errors by adding
ovn_sb_connectiontooctavia.conf. LP#195011
Ironic API and Ironic Inspector API use separate policy files. Ironic role was updated to be able to handle both policies separately. LP#1952948
Only run
configure ovn in ovsdbtask on ovn-controller hosts The task will fail on hosts (like controller nodes) without tunnel interface LP#1953367
Continue to run all actions if one action failed in Elasticsearch curator. LP#1954720
Fixes Placement no logrotate configuration LP#1954723
Fixes Nova resize failing when
migration_interfaceis customised. LP#1956976
Fixes unable to connect to zun console when
kolla_enable_tls_externalis true. Access to console of any zun container fails whenkolla_enable_tls_externalis true. This fix sets the protocol for wsproxybase_urlinzun.confaccording to the value ofkolla_enable_tls_externalLP#1957117
Fixes
Register Identity Providers in OpenStacktask which was missing an = in the openstack command causing the task to fail to register an IDP with Keystone. LP#1959022
Fixes Glance with Cinder iSCSI backend failing due to lack of lock_path setting. LP#1959663
Fixes logrotate config missing for openvswitch and prometheus services. LP#1961795
Fixes an issue with Ironic’s PXE components not getting updated on upgrade. LP#1963752
Adds Fluentd configurations to allow matching OpenvSwitch logs. LP#1965815
Fixes an issue where the Nova API logs were written to files ending with -wsgi.log which affected the processing of these logs in the Fluentd pipeline. LP#1950185
Fixes configuration of the Prometheus HTTP API URL when using the Prometheus collector in CloudKitty. LP#1961615
Fixes an issue with Prometheus scraping when targets’ Ansible inventory hostnames (
inventory_hostname) do not resolve to reachable IP addresses. Reverts to the previous behaviour of using IP addresses to communicate with targets. The side effect of this is that targets instances will again be labelled using IP addresses rather than hostnames. LP#1955563
Fix the apache’s wsgi configuration for the aodh service in Debuntu binary flavours. LP#1953059
Fixes the baremetal role to avoid an error “Unable to remove “libvirtd”. Now the symlink /etc/apparmor.d/disable/usr.sbin.libvirtd is created by the role. LP#1960302
Existing fluentd log rotation failed to delete old haproxy, swift, glance-tls-proxy and neutron-tls-proxy logs. Standardise rotation and deletion of logs using logrotate.
Fixes an issue with setting up OIDC based Keystone federation against IDP that has a different response type than id_token. This can now be set using a new variable
keystone_federation_oidc_response_type. LP#1959781
adds back the option to configure the rabbitmq clustering interface via kolla LP#1900160 <https://bugs.launchpad.net/kolla-ansible/+bug/1900160>
On slower nodes, the initial grafana startup could experience a timeout failure when the migrations for setting up the database took longer than expected. This has been fixed by increasing the default timeout. The timeout settings can be changed via new parameters
grafana_start_first_node_delayandgrafana_start_first_node_retriesfor thegrafanarole. LP#1769962
Fixes an issue seen when using Jinja2 3.1.0.
Fixes the configuration option setting the type of endpoint used by Neutron to send requests to Placement. LP#1960503
Fixes a configuration issue with Node Exporter causing all file system metrics of a host to be identical. LP#1961438
Fixes an issue where a failure of any Nova compute service to register itself would cause only the host querying the nova API to fail. Now, only hosts that fail to register will fail the Kolla Ansible run. Alternatively, to fail all hosts in a cell when any compute service fails to register, set
nova_compute_registration_fataltotrue. LP#1940119
Fixes an issue where RabbitMQ was configured to mirror classic transient queues for all services. According to the RabbitMQ documentation this is not a supported configuration, and contributed to numerous bug reports. LP#1954925
Removes “fix_cephfs_owner.yaml” which related to pre-wallaby Manila’s use of subfolders. Post-wallaby Manila now uses cephfs volumes instead, as such this file is no longer required. LP#1938285 LP#1935784
Removes use of “cephfs_enable_snapshots” in Manila config as this option was removed from Manila in the Wallaby release.
Fixes an issue with Cinder upgrade where Cinder services would remain pinned to the previous release’s RPC & object versions. LP#1954932
Other Notes¶
The container
ironic-dnsmasqnow creates thednsmasq.logjust as the containerneutron-dhcp-agent. For both log files verbosity can be increased globally viaopenstack_logging_debugor per service viaironic_logging_debugorneutron_logging_debugvariables.