Weekly newsletter with curated articles about DevOps.
This week we bring a presentation about Incident Management at Netflix Velocity, followed by articles on Prometheus alerts and on monitoring Celery workers with Prometheus, troubleshooting PostgreSQL RDS write spikes, monitoring the age of your AWS credentials, setting up Azure Firewall to analyse outgoing traffic, 7 Principles of Infrastructure as Code, and the Production Guideline from Google Cloud Platform.
Prometheus is an incredible system, but it has many details to get right, and one easy mistake is putting values in the labels field when you define your alerts. Labels are part of an alert's identity, so a label that changes whenever the value changes makes Prometheus treat each change as a new alert. This post shows the problems this causes and how to avoid them.
In this post, Ramiro Alvarez Fernandez shares an example that helps explain why he is so excited about Terraform, AWS Lambda, AWS CloudWatch and Go. He walks through his recipe for building a notification system in AWS with these tools. The system is designed to notify us of any outdated user credentials by sending an alert through Slack.
When it comes to cloud automation, there are a lot of choices out there. Denis Biondic shows some practices that have worked well for him over the last few years, using Azure as the example cloud platform (though the principles should apply to almost all forms of Infrastructure as Code).