Cost Planning in Cloud

Cost planning is an important phase of your design, and it starts with capacity planning. Capacity planning ensures that what you have matches what you need before your project kicks off. It helps you deliver work on time, on budget, and within scope. Capacity planning is a continuous, iterative cycle. I recommend …

Designing Reliable Systems

We have already talked a bit about designing reliable systems. Today, we’ll go over how to design services to meet requirements for availability, durability, and scalability. We will also discuss how to implement fault-tolerant systems by avoiding single points of failure, correlated failures, and cascading failures. We will see how to avoid overload failures by using …

Designing SMART key performance indicators (KPI)

With a set of requirements in place, we will now move on to consider how to measure whether the technical and business requirements have been met. To manage a service well, it is important to understand which behaviors matter, and how to measure and evaluate these behaviors. These must always be considered in the context …

Understanding of product reliability

Why do we need to establish good product observability? How does monitoring affect product reliability? We will explore the significance of the four golden signals in measuring a system’s performance and reliability. If you’ve ever worked with on-premises environments, you know that you can physically touch the servers. If an application becomes unresponsive, someone can …

How to reduce mean time to resolution (MTTR)

To consistently address issues raised in your ITSM, you must focus on monitoring, reporting, and reviewing how quickly you respond. Mean Time to Identify (MTTI) and Mean Time to Resolution (MTTR) are key indicators that provide visibility into performance and point to improvements. What is MTTR? MTTI is defined as the average time it takes …