< !--End Global site tag(gtag.js) - Google Analytics-- > < !--Start GA4-- > < !--Google tag(gtag.js) -- > < !--script async src = "https://www.googletagmanager.com/gtag/js?id=G-QHCFP81EHY" >

🚀 Squadcast’s new and improved analytics are here - offering instant visibility into your Incident Response and Alert Noise!

Prakya Vasudevan

Product Manager

A product manager focused on creating impactful solutions for engineering teams.

On-call On-boarding Checklist

On-call On-boarding Checklist

Best Practices in Incident Management

Best Practices in Incident Management

Configure an Intuitive Service Dashboard & Reduce Response Time

Configure an Intuitive Service Dashboard & Reduce Response Time

What you should know about Squadcast + Grafana Integration

What you should know about Squadcast + Grafana Integration

Incident Response in the time of Remote Work

Incident Response in the time of Remote Work

Must Read DevOps & SRE Books for all Engineers

Must Read DevOps & SRE Books for all Engineers

Top Monitoring Tools for DevOps Engineers and SREs

Top Monitoring Tools for DevOps Engineers and SREs

Hrushikesh shares his journey into SRE and his thoughts on the future of this space

Hrushikesh shares his journey into SRE and his thoughts on the future of this space

Better Incident Response: Incident Classification & Setting Severities with Tags

Better Incident Response: Incident Classification & Setting Severities with Tags

February 20, 2020

Scheduling IT and Engineering on-call rotations just got easier

Scheduling IT and Engineering on-call rotations just got easier

February 13, 2020

Things to do to make on-call less stressful

Things to do to make on-call less stressful

January 30, 2020

Hiteshwar shares his thoughts on being an SRE

Hiteshwar shares his thoughts on being an SRE

January 24, 2020

Arild Jensen from Upwork shares his thoughts on being an SRE

Arild Jensen from Upwork shares his thoughts on being an SRE

January 17, 2020

What you can show on your status page

What you can show on your status page

January 14, 2020

Using a Status Page in your Incident response process

Using a Status Page in your Incident response process

January 10, 2020

Reducing On-call Alert Fatigue with Deduplication

Reducing On-call Alert Fatigue with Deduplication

January 8, 2020

Squadcast's Year in Review, 2019

Squadcast's Year in Review, 2019

December 31, 2019

How to avoid on-call burnout

How to avoid on-call burnout

December 20, 2019

Get the latest scoop on Reliability insights. Delivered straight to your inbox.

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

If you wish to unsubscribe, we won't hold it against you. Privacy policy.

Engage with the Squadcast community for effective Incident Response strategies.

Prakya Vasudevan

A product manager focused on creating impactful solutions for engineering teams.

Squadcast way to resolve Incidents

TRY SQUADCAST for Free schedule a demo

Subscribe to our latest updates

Thank you! Your submission has been received!

Oops! Something went wrong while submitting the form.

On-call On-boarding Checklist

A humane on-call is the mark of good engineering culture. Access our free on-call onboarding checklist that can proactively help your on-call team & improve your overall on-call experience.

Best Practices in Incident Management

Discover best practices for effective incident management - don't let an incident affect your business's overall efficiency. Let us show you how to handle incidents with minimal disruption and maximum efficiency. Read more.

Configure an Intuitive Service Dashboard & Reduce Response Time

Leverage Multiple Alert Sources in Squadcast to reflect your actual system infrastructure on your Service Dashboard

What you should know about Squadcast + Grafana Integration

At Squadcast, we use Grafana and absolutely love it! This blog post talks about how you can use your Grafana data to set off alert triggers in Squadcast. Turbocharge your observability data in Grafana by making it actionable.

Incident Response in the time of Remote Work

The unexpected and sudden shift to remote working introduces a new set of problems within the incident response space. And while each organization needs to take its own unique circumstances into account, this post outlines the best practices and steps that can be taken in the right direction in keeping operations both productive and proactive.

Must Read DevOps & SRE Books for all Engineers

Here's a curated list of “Must Read” books specific to the Incident Management space, suggested by folks from the SRE and DevOps community to help you understand what changed their perspective of software engineering as a role.

Top Monitoring Tools for DevOps Engineers and SREs

Incident Monitoring softwares help you go from reactive to proactive to meet your observability & system reliability needs. Devops Incident monitoring tools like Prometheus, Solarwinds-Pingdom, Zabbix, Solarwinds - Server and Application Monitor (SAM), Datadog, New Relic

Hrushikesh shares his journey into SRE and his thoughts on the future of this space

Hrushikesh is passionate about making a complex design with simple and reliable solutions. He is technology and platform agnostic and doesn’t believe in limiting himself to just a few. He started his career in 2006 with a Media company where he was responsible for introducing new technologies along with driving a team to deliver quickly. He does not limit his role to just development and operations and loves exploring everything in the tech space. He believes that SRE principles will revolutionize the way classic operations-driven organizations think.

Better Incident Response: Incident Classification & Setting Severities with Tags

In this blog, learn how you can reduce MTTR by implementing incident classification by attaching incident severity levels to effectively route incidents to the right on-call team.

February 20, 2020

Scheduling IT and Engineering on-call rotations just got easier

Introducing UI improvements to the on-call schedules and rotations feature on Squadcast.

February 13, 2020

Things to do to make on-call less stressful

Doing on-call management in a way that’s better, less stressful and actually works to improve your incident response processes, uptime & reliability

January 30, 2020

Hiteshwar shares his thoughts on being an SRE

Hiteshwar is an SRE based out of Mumbai, India. His area of specialization is in distributed systems. He works on Kubernetes, running his own custom clusters, maintaining them and creating tools to manage and monitor them. He is an active speaker in meetups and developer groups and also teaches DevOps and SRE practices at learning centers.

January 24, 2020

Arild Jensen from Upwork shares his thoughts on being an SRE

Arild Jensen, SRE Manager at Upwork, talks about his journey into SRE and some best practices he picked up along the way including implementing a blameless culture, code review and making decisions based on hard data.

January 17, 2020

What you can show on your status page

When something goes down, the first thing a customer does is check if there is something wrong with their systems or if it is an issue with one of their service providers. So it’s important to make sure that your hosted status page has all the information that is needed where they don’t feel the need to raise an issue or create a ticket, adding to your support costs.

January 14, 2020

Using a Status Page in your Incident response process

Hosted Status pages and self hosted status pages can be used in different forms for internal or external communication which aligns all teams towards a culture of transparency, both with your customers and outside stakeholders as well as your colleagues and peers.

January 10, 2020

Reducing On-call Alert Fatigue with Deduplication

Alert noise is a very common on call complaint leading to fatigue and on call burnout. This article is an attempt at helping folks address this problem.

January 8, 2020

Squadcast's Year in Review, 2019

It’s the end of a decade and this year has been nothing short of great with accelerated product adoption, team growing 2x in size, a platform full of features and a heart full of happiness!

December 31, 2019

How to avoid on-call burnout

Incident management is stressful and can lead to on call burnout. Even more so, during the holidays. This is a checklist of things to watch out for to make sure your on-call team remains calm if an incident were to occur.

December 20, 2019

Danny Mican on his experience as an SRE at Auth0

Danny Mican, an SRE from Auth0 shares his thoughts on SRE and being SLO driven to deliver outstanding customer experiences. Danny currently manages the reliability of systems that authenticate over 2.5 billion logins per month and is expected to have 99.9% (3 Nines) availability.

December 2, 2019

Pavlos Ratis shares his experience on being an SRE

Pavlos Ratis, a Munch-based SRE, talks about his experience of embracing SRE culture and how it has been beneficial for both himself and his team. Learn how the right SRE culture can help your organization grow and succeed.

November 13, 2019

Managing technical risk effectively with Error Budgets

Error budgeting is a key concept in SRE. Learn how to effectively manage technical risks with error budgets in this blog post by Squadcast. Find out how to create an effective strategy and identify cost-effective solutions to technical risks."

October 14, 2019

Mark Henderson from Stack Overflow shares his experience on being an SRE

The Journey of an SRE: A Conversation with Mark Henderson from Stack Overflow - SRE Onboarding, DevOps and more...

No items found.