Effective Alert Routing, On-Call and Incident Response
SLOs and Error
Incident Analytics and
TRY IT FOR FREE
SCHEDULE A DEMO
Our Product Roadmap is now public. Check it out
🎉 We Are Hiring! 🎉
We Are Hiring!
Effective Alert Routing, On-Call
and Incident Response
Post Incident Review
SLOs and Error Budgets
Incident Analytics and Reliability
Mobile Incident Management
Squadcast way to resolve Incidents
TRY SQUADCAST for Free
schedule a demo
Subscribe to our latest updates
Enter your Email Id
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Understanding the landscape of AWS compute
In the second part of our "SLOs for AWS-based infrastructure" blog , Gigi Sayfan dives deeper into understanding the landscape of AWS compute by using the lens of Kubernetes to compare and contrast & covers in detail setting of SLOs for ECS, EKS, Fargate, and Lambda based services.
July 10, 2020
SLOs for AWS-based infrastructure
In our latest two-part series blog post, Gigi Sayfan, author of “Mastering Kubernetes”, discusses managing complex infrastructure on AWS with an eye towards SLOs (service level objectives). Though there are many ways to discuss the management of infrastructure, in this two-part series, he covers SLOs for AWS, Observability on AWS, Quotas Limits, and Optimizing cost on AWS and in the second part, he uses the lens of Kubernetes to compare and contrast compute infrastructure on AWS with Kubernetes.
July 8, 2020
Kubernetes Operators for Automated SRE
It can be quite challenging for an SRE team to maintain the well-being of a large-scale Kubernetes based system with hundreds or thousands of services. In this blog post, Gigi Sayfan, author of “Mastering Kubernetes”, outlines the SRE challenge and how we can achieve the ultimate goal of automated SRE with Kubernetes operators
May 27, 2020
Using observability tools to set SLOs for Kubernetes Applications
You deployed a service to your Kubernetes cluster. How do you it is working as expected? In this blog, Gigi Sayfan, author of “Mastering Kubernetes” talks about Kubernetes observability tools like Prometheus, Grafana and Jaeger, how to utilize them to set proper SLOs and make sure the service meets its objectives.
April 16, 2020
The Age of Service Mesh
There has been some hype around service meshes for a while now. But what are they and why is it needed? In this article, Gigi Sayfan, a Principal Software Architect and author of “Mastering Kubernetes” explores the what, why of service mesh and how it works with Kubernetes
November 28, 2019
Intent-based Capacity Planning and Autoscaling with Kubernetes
Intent-based Capacity Planning is Google's approach to declare reliability intent for a service and then solve for the most efficient resource allocation plan dynamically. Learn how you can start using this approach to effectively manage the reliability of your services running on your Kubernetes cluster.
July 24, 2019
IT Incident Management
Submit a Ticket
Copyright © Squadcast Inc. 2017-2021