Are you headed to the Gartner IT Infrastructure, Operations, and Cloud Strategies (IOCS) Conference next week? Gartner IOCS 2025 runs from Dec 9th-11th in Las Vegas, and features 338 different sessions for IT leaders.

It’d be impossible to attend all of them, so we put together an unofficial reliability track of talks we think are worth checking out.

If you’re going, be sure to stop by and visit us at Booth #445! We’d love to discuss any best practices you learned and how we can help you make them a reality in your organization.

Reliability talks to check out

Ask the Expert: How Can I Prepare for the Next "CrowdStrike-Like" Incident?

Tuesday, December 09, 2025 / 11:00 AM - 11:45 AM PST

About: The CrowdStrike incident in July 2024 affected 8.5 million Windows devices globally. Post recovery, many organizations questioned if this could have been avoided. In this session, attendees can ask Gartner expert Eric Grenier how to mitigate the risk of software updates compromising critical infrastructure.

Why we care: Every infrastructure has critical dependencies that are central to how your system operates, but they shouldn’t be a surprise to you. Companies should identify critical dependencies ahead of time, know what happens if they fail, and have a deliberate plan for when they do.

Organize for Resilience With an Optimal SRE Team Topology

Wednesday, December 10, 2025 / 12:00 PM - 12:30 PM PST

About: Organizations investing in SRE practices often struggle to realize the full potential due to suboptimal team structures and engagement models. This session will help establish adaptive team topologies, which will guide I&O leaders to structure their SRE function to maximize SRE value.

Why we care: The right tools are essential for improving reliability, but you need the right processes and team structure to back them up. We’ve written about the key parts of successful reliability programs before, and if you’re using an SRE model, knowing how to organize your teams contributes significantly!

AI, Autonomy, and Architects - The Future of Site Reliability Engineering

Wednesday, December 10, 2025 / 03:45 PM - 04:15 PM PST

About: As GenAI and agentic AI evolve and impact on infrastructure and operational domains, what is the future of Reliability and practices such as Site Reliability Engineering in an AI agent world and how can infrastructure and operations leaders be ready for this future.

Why we care: Whether it’s generating code, helping diagnose issues, or simply having to keep new systems reliable, AI has had a profound impact on Site Reliability Engineering. Your organization should be aware of how AI is impacting its systems and processes so you can plan accordingly.

Proactive SaaS Resilience: Planning for the Unexpected

Thursday, December 11, 2025 / 01:45 PM - 02:15 PM PST

About: Explore proactive approaches to SaaS resilience, acknowledging that traditional disaster recovery methods are largely ineffective in SaaS environments. This session offers innovative guidance on creating contingency plans, assessing provider resilience and implementing governance practices to ensure continuity and mitigate risks effectively.

Why we care: We are all about proactive reliability and resilience efforts, especially when it comes to disaster recovery. SaaS environments bring their own unique challenges, and it’s important for teams to be fully informed about their system resilience so they can plan accordingly.

——

Looking forward to seeing you in Vegas! Want to set up time to chat? Drop us a line and our team will reach out!

No items found.
Start your free trial

Gremlin's automated reliability platform empowers you to find and fix availability risks before they impact your users. Start finding hidden risks in your systems with a free 30 day trial.

sTART YOUR TRIAL
Ready to learn more?

See Gremlin in action with our fully interactive, self-guided product tours.

Take the tour
Gavin Cahill
Gavin Cahill
Sr. Content Manager