Understanding of product reliability

Why we need establish a good product observability? How the monitoring impact on product reliability. We will explore the significance of the four golden signals in measuring the system’s performance and reliability. If you’ve ever worked with on-premises environments, you know that you can physically touch the servers. If an application becomes unresponsive, someone can … Read more

System Reliability: implementing ‘golden metrics’

Before we start lets think first what is a system reliability means. In simple words, this is the probability of a product performing its intended function under stated conditions without failure for a given period of time. It means, among other things, continuous monitoring of the state of the system. Why this is so important … Read more

Building redundant EKS monitoring and alerting stack

As Kubernetes containers are actually Linux processes, we can use our favorite tools to monitor and log cluster performance. In Kubernetes, application monitoring does not depend on a single monitoring solution. Each organization is unique in form of requirements to monitoring sensitivity and log ingestion, analysis and persistence. This is very important building our own … Read more