Episodes

  • The Art and Science of Site Reliability Engineering with Liz Fong-Jones
    Oct 9 2024

    In this exciting episode of Cloud Dialogues, we are joined by Liz Fong-Jones, Field CTO at Honeycomb and former Google SRE, to explore the fascinating world of Site Reliability Engineering (SRE)—a game-changer for scaling and automating large systems.

    What We Covered:

    1. Meet Liz Fong-Jones: Liz brings over a decade of SRE experience from her time at Google and Honeycomb, helping companies revolutionize how they manage reliability and automation.

    2. The Origin Story: SRE actually predates the cloud! Born at Google in the early 2000s, SRE started as a way to automate manual system administration tasks and has since evolved into its own discipline, running parallel to DevOps.

    3. SRE at Its Core: - Minimize repetitive work (aka "toil") by automating everything you can. - Use Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure and maintain reliability.

    4. Different SRE Models: There are different ways to implement SRE: - Tools-based within platform teams - Consultative SREs parachuting in to help teams - Embedded SREs integrated within every team

    5. The SRE Mindset: Curiosity and empathy are essential for SREs. Teams need a culture of psychological safety where concerns can be raised without fear.

    6. The Magic of SLOs and SLIs: SLOs set reliability targets (like aiming for 99.5% uptime), while SLIs measure performance against those targets. Together, they ensure your systems are running smoothly.

    7. FinOps Meets SRE: Liz explains how SREs can help balance reliability, performance, and costs using SLOs to allocate resources more efficiently.

    8. Disaster Testing: Want proof SREs are ready for anything? Honeycomb regularly tests its disaster recovery by taking down an entire availability zone—on purpose!

    9. Pro Tips for Executives: Thinking about implementing SRE at your company? Liz suggests starting with your biggest challenges, offering executive support, and setting clear, achievable SLOs.

    10. Why Observability Matters: Observability is the backbone of SRE. Having real-time, actionable data is key for setting and managing effective SLOs.

    Plus, Liz gives covers off on her favorite ARM processors (for cost and environmental savings) and shares insights from her book Observability Engineering.

    This episode is a deep dive into SRE, filled with actionable insights and strategies for leaders looking to supercharge their reliability game. You won’t want to miss it!

    Show More Show Less
    33 mins
  • Unlocking AI Potential through Data Strategy with Allison Howells
    Sep 23 2024

    In this episode of Cloud Dialogues, Matt & Georgia dive deep into the world of data strategy with Allison Howells, a seasoned expert with over 20 years of experience across industries like financial services, insurance, and consulting.

    Exploring the evolution of data platforms and the strategic decisions that help organizations harness the power of their data.

    Here are some of the key topics the team breaks down:

    1. Data Platforms: Then & Now – How data platforms have transformed over the past 10-15 years. 2. Aligning Data with Business – Why syncing your data strategy with your business goals is a game-changer. 3. Centralized vs. Decentralized Platforms – The pros and cons of each approach. 4. Data Governance & Quality – Why these are the cornerstones for driving successful AI and machine learning projects. 5. Data Context & Origin – The critical role of understanding where your data comes from and how it’s used. 6. Cultural Impact of Decentralized Platforms – How decentralization can foster innovation and shift organizational mindsets.

    Top Takeaways: - Data Strategy = Business Strategy: A successful data strategy is always grounded in solving real business problems. - Decentralized Data Platforms: These can enhance accountability, fuel innovation, and empower teams through data democratization. - Data Context is Key: Knowing the "where" and "why" behind your data ensures it’s used effectively. - Governance & Quality Matter: No matter your data platform, strong governance and data quality are essential. - Data Literacy is a Must: Every department needs to level up their data skills to thrive in today’s data-driven world.

    Curious about how to get your data initiatives off the ground and deliver business value from day one?

    Allison shares actionable steps for executives to build a winning data strategy that drives results. To get the full executive playbook, tune in to the episode!

    Show More Show Less
    35 mins
  • Crafting Affective Cloud Operating Models
    Aug 27 2024

    In this exciting episode of "Cloud Dialogues," Georgia and Matt return from their well-deserved breaks, ready to dive deep into the world of cloud operating models. They unpack the challenges organisations face when transitioning to cloud environments and share insights on how to navigate this complex landscape.

    Here's what you can expect:

    1. Tackling Operating Model Challenges: Dive into the common pitfalls of cloud transformation, emphasizing how misaligned operating model changes often lead to trouble.

    2. DevOps and Product-Centric Thinking: The shift from traditional IT operations to a DevOps and product-focused approach, stressing the need for a cultural shift within organisations.

    3. Platform Engineering Essentials: The importance of customising cloud platforms to meet the unique needs of DevOps teams, all while creating an exceptional developer experience.

    4. The Role of Cloud Centers of Excellence (COE): Discover how COEs bridge the gap between platform teams and developers, ensuring smooth and efficient cloud usage.

    5. Evolution of Change Management: Learn how change management is transforming in cloud environments, moving from rigid approval processes to more flexible, enabling approaches.

    6. FinOps and Accountability: The critical role of product owners in balancing cloud costs and value, making accountability a key focus.

    7. Navigating Data Management: Challenges of managing data across organisations, proposing a federated model with a central data catalog.

    8. AI and Platform Teams: Should organisations build their own AI capabilities or rely on existing SaaS solutions? The hosts offer their take on this hot topic.

    9. Mastering Containerisation: They shed light on the complexities of managing containerised environments like Kubernetes, with practical insights.

    10. Strengthening Security Operations: The importance of well-resourced security teams in understanding and mitigating risks within intricate cloud setups.

    11. Transforming Operating Models: Finally, the hosts advocate for small, incremental changes over sweeping transformations, ensuring smoother transitions.

    To wrap things up, Georgia and Matt hint at an upcoming discussion on Site Reliability Engineering (SRE)—stay tuned!

    Show More Show Less
    30 mins
  • DevOps Unleashed: AI, Innovation, and the Future of Software Development with Patrick Debois
    Jul 17 2024

    In this exciting episode of the Cloud Dialogues Podcast, we dive into the world of DevOps with none other than Patrick Dubois, the man who coined the term "DevOps" back in 2009. Here’s a breakdown of the key points we covered:

    1. The Birth of DevOps:

    - Patrick takes us back to the origin story of DevOps, which he accidentally coined during the planning of DevOps Days in 2009.

    - DevOps isn't just about automation; it’s a holistic approach involving collaboration, feedback loops, and business strategies.

    2. AI Meets DevOps:

    - Discover how AI is revolutionizing the software development lifecycle, from brainstorming ideas to deploying and maintaining applications.

    - Highlights include tools like GitHub Copilot for coding, AI-driven UX/UI designs, and AI-assisted testing and monitoring.

    3. Navigating Challenges:

    - Organizations face the challenge of balancing innovative AI solutions with practical implementation.

    - The expertise of seasoned engineers is crucial to evaluate AI-generated code and solutions.

    - With AI integration, security and risk management are more important than ever.

    4. The Future of Testing and Quality Assurance:

    - AI can lead to larger codebases and longer review times, emphasizing the need for automated testing with human oversight.

    - New concepts like "evals" (evaluations) are emerging to assess AI-driven applications.

    5. Impact on Organizations:

    - Addressing concerns about job displacement due to AI automation.

    - A shift from coding roles to managing systems and reviewing outputs.

    - Faster onboarding processes and increased productivity could be on the horizon.

    6. Strategies for AI Implementation:

    - Start with pilot teams, often involving data science experts, before scaling up.

    - Eventually, move to platform teams while ensuring strong governance, licensing, and data management.

    7. Looking Ahead:

    - Organizations need comprehensive strategies for AI integration in their development pipelines.

    - AI promises better situational awareness and faster decision-making for executives.

    The episode wraps up with Patrick offering a strategic roadmap for executives on implementing AI in DevOps, emphasizing the dynamic nature of best practices in this rapidly evolving field.

    Show More Show Less
    36 mins
  • Executive Blueprint: Driving Cloud Transformation with Design Thinking
    Jun 24 2024

    In this exciting episode, Matt & Georgia dive into how organizations can harness the power of design thinking to unlock the full potential of cloud.

    Many organizations either struggle to reap the benefits of cloud migrations or are unsure how to leverage the cloud strategically.

    Our hosts coin a design thinking approach: starting with mission-critical processes and gaining a deep understanding of user journeys and interactions.

    They introduce a dynamic framework to guide executives through this transformation:

    a. Empathize with users and pinpoint core problems and opportunities

    b. Define current value streams and map out user journeys

    c. Ideate and reimagine processes without existing constraints

    d. Develop comprehensive architectural frameworks and designs that deliver these reimagined journeys

    e. Validate solutions with stakeholders and users

    f. Continuously iterate and refine the process

    Georgia emphasises the need for involving a diverse group of stakeholders, balancing customer needs with the organization’s realities, and maintaining continuous feedback and communication with users and customers throughout the journey.

    A key takeaway for leaders is to support and celebrate the lessons learned from failures during these transformation efforts. Not all systems need a complete overhaul; some might be better suited for SaaS solutions.

    Additionally, the importance of allocating dedicated resources specifically for transformation projects, separate from the usual business operations.

    Please share your questions and experiences in the comments!

    Show More Show Less
    29 mins
  • Navigating Cloud Choices: Breaking Down "Vendor Lock-In"
    Jun 3 2024

    In this episode, hosts Matt and Georgia delve into the topic of "vendor lock-in" with cloud services and technologies, featuring insights from Chris Munns of Amazon Web Services.

    Understanding Lock-In: 🔒

    - Common apprehension about "lock-in" often avoids managed cloud services like AWS Lambda.

    - Lock-in is unavoidable with any tech decision, as choosing one path excludes others.

    - Focus should be on the tangible costs of switching technologies rather than hypothetical lock-in.

    Premature Optimization: ⏳

    - Fears of lock-in often make people make premature optimizations.

    - Technologies and requirements evolve, necessitating rewrites and refactoring over time.

    - Startups prioritize speed-to-market, while enterprises may face regulatory constraints.

    Practical Realities and Trade-offs: ⚖️

    - Assess the real-world implications and trade-offs of each technology choice.

    - Managed services often provide quicker time-to-value.

    - Avoid making choices based on vague lock-in fears.

    - Understand the real concerns behind lock-in statements and make risk-based decisions.

    Tune in for a thought-provoking discussion that challenges conventional wisdom and offers executives a fresh perspective on navigating the cloud technology landscape. 🌥️

    Show More Show Less
    33 mins
  • Cloud Strategy for Executives 101: From Plans to Profits
    May 14 2024

    Join Matt and Georgia as they dive into the world of cloud strategy with special guest, cloud strategy superstar Michael Ewald!

    Listen as we unpack the essentials of crafting a killer cloud strategy that not only rocks your business world but also keeps your stakeholders smiling.

    Your hosts + Michael kick things off by emphasizing the importance of syncing your cloud strategy with your business master plan.

    It's like making sure your favorite song is playing in the background while you conquer the dance floor – total harmony! Now, let's sprinkle in some key takeaways:

    1. Lift-and-Shift Blunders: They caution against the dreaded "lift and shift" approach without a sprinkle of modernization. It's like trying to fit a square peg in a round hole – costly and ineffective.

    2. Incentivize Cloud Champions: Aligning leadership incentives and KPIs with your cloud strategy is like fueling your rocket ship for intergalactic success.

    3. Pick Cloud Services Wisely: Think of choosing cloud services like picking toppings for your perfect pizza – it's got to match your taste! So, select services that vibe with your organization's culture and goals.

    4. Keep it Simple, Silly: Going multi-cloud might sound fancy, but unless it's absolutely necessary, it's like juggling flaming torches – risky business!

    5. Operational Overhaul: Don't just focus on the tech stuff; plan for those operating model changes like a seasoned conductor leading an orchestra. It's all about that harmonious symphony.

    6. Track Progress Like a Pro: Continuous measurement is the name of the game. It's like having a Fitbit for your cloud strategy – keeping you on track and feeling accomplished!

    7. Learn from cloud tenured execs: Reach out to those who've braved the cloud journey before you. It's like having a cheat code in the game of cloud domination – why reinvent the wheel?

    To sum it up, crafting a top-notch cloud strategy isn't just about flinging your data into the digital sky and hoping for the best. It's about strategic thinking, alignment, and a sprinkle of magic that turns your cloud dreams into business reality. So, grab your brainstorming hats and get ready to conquer the cloud-o-sphere!

    Show More Show Less
    34 mins
  • AWS London Summit 2024 Wrapped
    Apr 29 2024

    Get ready to dive into Cloud Dialogues' coverage of the electrifying AWS London Summit!

    Imagine a venue bursting at the seams with 18,000 tech enthusiasts (though we've since learnt it was 23k), creating an atmosphere akin to a bustling nightclub.

    Our hosts take you on a whirlwind tour of the event's highlights:

    - Picture this: keynote speeches featuring industry giants like Zilch, Adobe, TUI Airlines, and even the globe-trotting experts at Lonely Planet. Themes ranged from AI marvels, sustainability efforts and cost management, all while championing digital skills training.

    - Lonely Planet's futuristic use of Anthropic's AI to craft dreamy itineraries and slash costs left jaws on the floor. Meanwhile, Zilch revealed their secret weapon: AI and ML wizardry for predicting buyer intent and cutting credit costs.

    - Project Seba, promising custom silicon for machine learning that's about to shake up the tech world.

    - TUI Airlines, with a whopping 10 billion AWS Lambda calls per month, soared through the pandemic with AWS by their side, slashing costs like superheroes.

    - Technical hiccups sent shockwaves through the summit, with the keynote theater bursting at the seams and attendees scrambling to catch a glitchy live stream.

    - Matt & Georgia steer the conversation toward responsible tech adoption, urging companies to tread lightly on the environmental and financial frontiers of AI. Amazon's Q is now equipped with "guardrails" to keep those generated outputs in check.

    - Amidst the AI frenzy: while the potential is sky-high, companies mustn't lose sight of the problems they aim to solve. An inside-out approach, starting with AI for internal productivity gains before launching into the stratosphere.

    - A session on AWS' sovereign cloud offerings that felt a tad too much like a sales pitch? Our hosts raise an eyebrow, craving more technical meat and less sizzle.

    So buckle up, tech adventurers, as we journey through the highs, the lows, and the unexpected twists of the AWS London Summit!

    Show More Show Less
    21 mins