Link Search Menu Expand Document

Aggregate Descriptor

You are building an application that will need to run in the cloud that is composed of many parts. You may be using a Container Orchestrator to schedule running your containers. You want to be able to deploy your complex application into your cloud environment without having to constantly change the way in which deployment is carried out as small details (such as code versions) change.

How do you define a reusable architecture independent of particular application artifacts?

Useful applications are almost never composed of a single component. At a minimum, most applications require a database, possibly require a messaging system of some sort, and may require other aspects, such as load balancers or other front-end proxy or security components. On the cloud, it’s important to describe some of the deployment metadata that allows the application to take advantage of the unique features of cloud computing, such as Autoscaling. In the autoscaling case, there is a need to specify the upper and lower bounds of scaling in order to ensure appropriate resliance while not allowing the application to become too expensive to run.

The problem comes in as you start thinking about how you can bring all of these things together. There are two possible, but inadequate solutions:

  1. You could do all of the deployment and configuration manually
  2. You could write deployment scripts that automate the process

The problem with the first option is that it is both error prone and difficult to repeat. No one wants to have to use a GUI or type parameters in multiple times to deploy an application every time there is a change. Likewise, the problem with the second is that idempotency becomes an issue. You can write shell scripts, or scripts in other languages to deploy application components, but you have to include checks in every routine to decide what to do if the component is already deployed, or you can risk either disaster in losing existing configuraiton or data, or out-of-control costs as you end up with multiple deployments of everything. The answer has to lie somewhere in-between.

Therefore,

Define an Aggregate Descriptor of services, runtimes, and the connections between them. This Aggregate Descriptor is executed by a Deployment Engine that reads the descriptor and creates the necessary containers and other aspects of the runtime system. The resulting combination of Descriptor and Engine should respect idempotency and not repeat deploying what is already at the same level and version.

Examples of implementations of this pattern include Terraform templates and HELM charts.