As part of its goal of further pushing back scaling limits within a given cluster, the Large Scale SIG collects scaling stories from OpenStack users.
There is a size/load limit for single clusters past which things in OpenStack start to break, and we need to start using multiple clusters or cells to scale out. The SIG is interested in hearing:
what broke first for you, is it RabbitMQ or something else
what were the first symptoms
what size/load did it start to break
things you did to fix it
This will be a great help to document expected limits, and identify where improvements should be focused.