The Data Processing service (sahara) provides a platform for provisioning and managing clusters of instances that run processing frameworks such as Hadoop and Spark. Through the OpenStack Dashboard or the REST API, users can upload and execute framework applications, which may access data in object storage or from external providers. The data processing controller uses the Orchestration service (heat) to create these clusters, which may exist either as long-running groups that grow and shrink on request or as transient groups created for a single workload.
- Introduction to Data processing
- Configuration and hardening