As organizations deploy Amazon Managed Streaming for Apache Kafka (Amazon MSK) clusters throughout a number of use circumstances, the handbook administration of matter configurations might be difficult. This will result in a number of points:
- Inefficiency – Guide configuration is time-consuming and error-prone, particularly for giant deployments. Sustaining consistency throughout a number of configurations might be troublesome. To keep away from this, Kafka directors typically set the create.subjects.allow property on brokers, which ends up in cluster operation inefficiency.
- Human error – Guide configuration will increase the chance of errors that may disrupt knowledge circulation and influence purposes counting on Amazon MSK.
- Scalability challenges – Scaling an Amazon MSK setting with handbook configuration is cumbersome. Including new subjects or modifying present ones requires handbook intervention, hindering agility.
These challenges spotlight the necessity for a extra automated and strong strategy to MSK matter configuration administration.
On this publish, we handle this drawback through the use of Terraform to optimize the configuration of MSK subjects. This resolution helps each provisioned and serverless MSK clusters.
Resolution overview
Clients need a greater method to handle the overhead of subjects and their configurations. Manually dealing with matter configurations might be cumbersome and error-prone, making it troublesome to maintain monitor of modifications and updates.
To handle these challenges, you need to use Terraform, an infrastructure as code (IaC) device by HashiCorp. Terraform means that you can handle and provision infrastructure declaratively. It makes use of human-readable configuration information written in HashiCorp Configuration Language (HCL) to outline the specified state of infrastructure assets. These assets can span digital machines, networks, databases, and an enormous array of cloud provider-specific choices.
Terraform provides a compelling resolution to the challenges of handbook Kafka matter configuration. Terraform means that you can outline and handle your Kafka subjects by means of code. This strategy supplies a number of key advantages:
- Automation – Terraform automates the creation, modification, and deletion of MSK subjects.
- Consistency and repeatability – Terraform configurations present constant matter constructions and settings throughout your whole Amazon MSK setting. This simplifies administration and reduces the probability of configuration drift.
- Scalability – Terraform lets you provision and handle massive numbers of MSK subjects, facilitating the expansion of your Amazon MSK setting.
- Model management – Terraform configurations are saved in model management methods, permitting you to trace modifications, roll again if wanted, and collaborate successfully in your Amazon MSK infrastructure.
By utilizing Terraform for MSK matter configuration administration, you’ll be able to streamline your operations, reduce errors, and have a strong and scalable Amazon MSK setting.
On this publish, we offer a complete information for utilizing Terraform to handle Amazon MSK configurations. We discover the method of putting in Terraform on Amazon Elastic Compute Cloud (Amazon EC2), defining and decentralizing matter configurations, and deploying and updating configurations in an automatic method.
Conditions
Earlier than continuing with the answer, be sure to have the next assets and entry:
- To simplify the setup, use the offered AWS CloudFormation template. This template will create the mandatory Amazon MSK provisioned cluster and required assets for this publish. You’ll be able to create an MSK Serverless cluster utilizing the Amazon MSK console and use it on this resolution. That is pattern template, not manufacturing prepared, and AWS Id and Entry Administration (IAM) insurance policies must be applied utilizing finest practices and the precept of least privilege. For extra particulars, see Get began with AWS managed insurance policies and transfer towards least-privilege permissions. An EC2 occasion will probably be created as part of this template. The MSK cluster and EC2 occasion will probably be created on a single digital personal cloud (VPC); nevertheless, you’ll be able to set up Terraform in a distinct account or on completely different VPC. For extra particulars, see Join Kafka shopper purposes securely to your Amazon MSK cluster from completely different VPCs and AWS accounts.
- For this publish, we use the newest Terraform model (1.10.x) and Terraform plugins – Mongey/Kafka supplier. In Terraform, plugins are binary executables accountable for implementing useful resource sorts and suppliers. The plugins are put in routinely once we initialize a Terraform configuration utilizing the terraform init
- You want entry to an AWS account with enough permissions to create and handle assets, together with IAM roles and MSK clusters. For extra data, see IAM entry management.
By ensuring you might have these conditions in place, you can be able to streamline your matter configurations with Terraform.
Set up Terraform in your shopper machine
When your cluster and shopper machine are prepared, SSH to your shopper machine (Amazon EC2) and set up Terraform.
- Run the next instructions to put in Terraform:
- Run the next command to verify the set up:

This means that Terraform set up is profitable and you’re able to automate your MSK matter configuration.
Provision an MSK matter utilizing Terraform
To provision the MSK matter, full the next steps:
- Create a brand new file known as
major.tfand replica the next code into this file, changing the BOOTSTRAP_SERVERS and AWS_REGION data with the main points to your cluster. For directions on retrieving thebootstrap_serversdata for IAM authentication out of your MSK cluster, see Getting the bootstrap brokers for an Amazon MSK cluster. This script is widespread for Amazon MSK provisioned and MSK Serverless. - Add IAM bootstrap servers endpoints in a comma separated checklist format:
- Run the command
terraform initto initialize Terraform and obtain the required suppliers.

The terraform init command initializes a working listing containing Terraform configuration information(major.tf). That is the primary command that must be run after writing a brand new Terraform configuration.
- Run the command
terraform planto evaluate the run plan.

This command exhibits the modifications that Terraform will make to the infrastructure primarily based on the offered configuration. This step is non-obligatory however is usually used as a preview of the modifications Terraform will make.
- If the plan seems appropriate, run the command
terraform applyto use the configuration. - When prompted for affirmation earlier than continuing, enter
sure.

The terraform apply command runs the actions proposed in a Terraform plan. Terraform will create the sampleTopic matter in your MSK cluster.
- After the
terraform applycommand is full, confirm the infrastructure has been created with the assistance of the kafka-topics.sh utility:

You should use the kafka-toipcs.sh device with the --list choice to retrieve a listing of subjects related together with your MSK cluster. For extra data, discuss with the createtopic documentation.
Replace the MSK matter configuration utilizing Terraform
To replace the MSK matter configuration, let’s assume we wish to change the variety of partitions from 50 to 10 on our matter. We have to carry out the next steps:
- Confirm the variety of partitions on the subject utilizing the
--describecommand:

This command will present 50 partitions on the sampleTopic matter.
- Modify the Terraform file
major.tfand alter the worth of the partitions parameter to 10: - Run the command
terraform planto evaluate the run plan.

- If the plan exhibits the modifications, run the command
terraform applyto use the configuration. - When prompted for affirmation earlier than continuing, enter
sure.

Terraform will drop and recreate the sampleTopic matter with the modified configuration.
- Confirm the modified variety of partitions on the subject, advert rerun the
--describecommand:

Now, this command will present 10 partitions on the sampleTopic matter.
Delete the MSK matter utilizing Terraform
Once you not want the infrastructure, you’ll be able to take away all assets created by your Terraform file.
- Run the command
terraform destroyto take away the subject. - When prompted for affirmation earlier than continuing, enter
sure.

Terraform will delete the sampleTopic matter out of your MSK cluster.
- To confirm, rerun the
--listcommand:

Now, this command won’t present the sampleTopic matter.
Conclusion
On this publish, we addressed the widespread challenges related to handbook MSK matter configuration administration and introduced a strong Terraform-based resolution. Utilizing Terraform for automated matter provisioning and configuration streamlines your processes, fosters scalability, and enhances flexibility. Moreover, it facilitates automated deployments and centralized administration.
We encourage you to discover Terraform as a method to optimize Amazon MSK configurations and unlock additional efficiencies inside your streaming knowledge pipelines.
In regards to the creator
Vijay Kardile is a Sr. Technical Account Supervisor with Enterprise Help, India. With over 20 years of expertise in IT Consulting and Engineering, he makes a speciality of Analytics companies, significantly Amazon EMR and Amazon MSK. He has empowered quite a few enterprise shoppers by facilitating their adoption of varied AWS companies and providing skilled steerage on attaining operational excellence.
