2022 Observability Theater Presentations
Introducing the Datadog Observability Theater!
Observability Theater sessions will happen on both Day 1 and Day 2 of the event. These 20-minute how-to sessions are designed to give you and your team actionable insights that you can apply right away on a variety of topics from monitoring to security, APM, containers, AI Ops and cloud cost management. Get an in-depth look at Datadog products and hear how users like yourself are applying best practices in their day-to-day work.
Day 1 | SLOs
Five things we learned from our SLO journey at Datadog
Tuesday, October 18th, 1:00PM - 1:20PM
Kai Xin Tai, Product Manager, Datadog
Paul Reinlein, Senior Software Engineer, Datadog
At Datadog, our engineering teams are focused on building highly reliable systems that meet the everyday observability needs of our customers. Service Level Objectives (SLOs) have been instrumental in helping us balance between maintaining platform stability and being able to improve our product offering.
When we are meeting our SLOs, we’re free to experiment with new code versions, features, and user experiences. But when services regularly exceed their error budgets, we’re more vigilant of introducing additional risk. We instead prioritize reliability work, such as addressing incident remediation items and optimizing our infrastructure and processes for scalability and resilience.
In this session, you’ll learn some SLO best practices for identifying customer-facing issues in your production environment. You’ll see how you can implement processes for iterating on SLO definitions and why you should incorporate burn rate alerts into your alerting and incident response strategy. See real-world examples of SLOs that have been created for Datadog’s most popular products and how to implement them within your organization.
Day 1 | DBM
How to troubleshoot databases without the skills of a DBA
Tuesday, October 18th, 1:30PM - 1:50PM
Meaghan Vella, Product Marketing Manager, Datadog
Jason Manson-Hing, Product Manager, Datadog
When a database fails, applications can break in unexpected ways and data can be lost. The cause of failures could lie in the underlying infrastructure, the database schema, or even the queries themselves. Without unified access to critical information about your databases and the queries they run, it can be difficult for disparate teams to efficiently collaborate and resolve the issue.
In this session, you will learn how Datadog Database Monitoring (DBM) enables DBAs, SREs, and application developers to identify performance issues at a glance and quickly improve database efficiency. We will demonstrate how DBM lets you find the root cause of any issue by providing deep visibility into wait events, active connections, query performance, and explain plans, allowing you to reduce database costs, increase efficiency and uptime, and streamline collaboration between all stakeholders.
Day 1 | Datadog Incidents, Datadog Inventory SQL, Cloud Workload Security
Monitoring and reacting to emerging vulnerabilities
Tuesday, October 18th, 2:00PM - 2:20PM
Andy Giron, Senior Security Researcher, Datadog
Fred Baguelin, Senior Security Researcher, Datadog
Vulnerabilities are bad news, and bad news never comes at a good time. Over the last 10 years, we have seen the emergence of numerous high-profile vulnerabilities that have interrupted the operations of many companies. These vulnerabilities disrupt product development due to their high impact on company infrastructure and the threat they carry of a malicious actor using them to gain a foothold into organizations.
In this talk, you will learn how Datadog Security Labs monitors, qualifies, reacts to, and patches these “emerging vulnerabilities.” By using a combination of open source tooling, social media, and the Datadog platform, we will follow the lifecycle of an emerging vulnerability, and identify how to create a playbook for responding to these incidents while assuring your leadership and customers that you are monitoring and defending in an efficient manner.
Day 1 | Watchdog AI
Extending Your Team's Capabilities with Watchdog AI
Tuesday, October 18th, 2:30PM - 2:50PM
Omri Sass, Group Product Manager, Datadog
Terry Teng, Senior Product Marketing Manager, Datadog
Day 1 | Datadog Continuous Profiler
Make your production code lean and cost effective with Continuous Profiler
Tuesday, October 18th, 3:00PM - 3:20PM
Eyal Engel, Team Lead, Product Marketing, Datadog
Code profiling allows you to measure code performance by surfacing the most time- and resource-consuming lines of code. However, code profilers are often viewed as complex tools that are used only by experienced engineers and limited to non-production environments. Worse still, when profiling bottlenecks are discovered and addressed, developers are often missing critical observability context (like logs and traces) to better understand overall performance impact within systems.
In this session, you will learn how you can address these challenges with Datadog Continuous Profiler, which provide low overhead, always-on code profiling on every production host and a simple way to instrument additional metrics, traces, and logs on-demand from the Datadog UI. You will leave this session understanding how these two products fit into your observability journey with Datadog, allowing anyone on your team to troubleshoot and resolve bottlenecks quickly, reduce CPU and memory consumption, and lower cloud costs.
Day 1 | Service Catalog
Simplify service governance, observability, and troubleshooting with the Service Catalog
Tuesday, October 18th, 3:30PM - 3:50PM
Joshua Zangari, Manager, Observability, Indeed
Brooke Chen, Product Manager, Datadog
Microservices let you simplify code deployments, improve application reliability, and give teams autonomy to work independently. But adopting a microservices architecture at scale can bring increased operational complexity that leads to gaps in your team members’ knowledge about how your services work, what dependencies they have, and which teams own them.
In this session, you will learn how the Datadog Service Catalog helps consolidate dispersed knowledge of organization’s services and streamline communication between SREs, application developers, and service owners. Hear from the team indeed how they’re using Service Catalog to not only provide a multi-lens view of ownership, performance, and reliability across their fleet of microservices, but also triage incidents more effectively and promote observability best practices across their teams.
Day 1 | Integrations
Datadog integrations: Monitoring data analytics platform for SREs and data engineers
Tuesday, October 18th, 4:00PM - 4:20PM
Shri Subramanian, Senior Product Manager, Datadog
Karl Kalash, Product Marketing Manager, Datadog
A modern data analytics stack is made up of a suite of tools for gathering, storing, transforming, and analyzing data from disparate sources. Each of these layers plays a key role in your organization’s goals of getting better insights from vast amounts of data and of proactively uncovering new opportunities for growth. But these layers are often managed by different teams, and a problem that occurs in one layer can propagate into other areas of the stack.
In this session you will learn about two of the most common use cases for building a data analytics platform, along with what teams are involved in building it and what different technologies are needed to create a functioning system. We will also look at the challenges that engineers face when it comes to running a data analytics stack and how Datadog can help SREs and data engineers get a centralized view into every layer and technology of their distributed data analytics system for faster troubleshooting and performance optimization.
Day 1 | Log Management
Get more from your logs with advanced queries and augmented insights
Tuesday, October 18th, 4:30PM - 4:50PM
Tori Teng, Product Manager, Datadog
Hai Bui, DevOps Technical Lead, Tyme Group
Cobus Frey, Lead, Global Cloud Platforms and Enterprise Services, Tyme Group
When you’re investigating the root cause of an incident, querying large volumes of logs to get the insights you need can be extremely challenging, especially when the issue is time sensitive and critical to the business. SREs, developers, and security detection engineers (SDEs) all need an easy way to investigate log events.
In this session, see how Tyme Group easily drills down into their analytics data with only a few clicks. Moreover, you will learn how Datadog automatically surfaces patterns that can help you find explanations for log anomalies. Do this all within Datadog Log Management to prevent potential threats and reduce incidents.
By the end of this session, you will have the tools you need to get started expertly querying and gaining insights within Log Explorer.
Day 1 | Online Archives, Sensitive Data Scanner, Audit Trail, RBAC
Rethink your enterprise governance and compliance strategy
Tuesday, October 18th, 5:00PM - 5:20PM
Anshum Garg, Product Manager II, Datadog
Ross Banfield, VP of Monitoring, Citizens Bank
In large organizations, there is often a central observability team that is responsible for providing observability services to application teams. Managing log data in this environment can be a complex challenge thanks to data governance requirements and the dynamic nature of developer needs.
In this session, you will see how the observability team at Citizens Bank uses Datadog Log Management to provide autonomy and tooling to their developers that fulfills their individual requirements while also adhering to organizational governance requirements. You will learn best practices for maintaining continuous and robust observability across your systems even as you set up enterprise governance policies.
By the end of the session, you will be able to use the capabilities of Datadog Log Management to take tangible actions for ensuring full data governance at scale.
Day 1 | OpenTelemetry with Datadog for APM and Infra Monitoring
OpenTelemetry with Datadog: Monitor the Three Pillars of Observability
Tuesday, October 18th, 5:30PM - 5:50PM
Prashant Jain, Product Manager, Datadog
Lutao Xie, Product Manager II, Datadog
OpenTelemetry (OTel) is a Cloud Native Computing Foundation (CNCF) initiative that provides open, vendor-neutral standards and tools for instrumenting services and applications. Many organizations use OpenTelemetry’s collection of APIs, SDKs, and tools to collect and export observability data from their environment to their preferred backend.
As part of Datadog’s ongoing commitment to OpenTelemetry (OTel), we offer multiple solutions to ensure that OpenTelemetry users have the flexibility to easily send their metrics, traces, and logs to Datadog.
In this session, we will walk through a live product demo that will show how you can use OTel and Datadog to get full visibility into your infrastructure and applications. We will also explore different data ingestion paths you can use to derive maximum value out of your OTel usage with Datadog.
Day 2 | RUM, Session Replay
How to monitor and improve end user experience in real time
Wednesday, October 19th, 9:00AM - 9:20AM
Shabih Syed, Director, Product Marketing, Datadog
Amina Bouabdallah, Senior Product Manager, Datadog
Amazing digital experiences can help brands delight their customers and help differentiate from competitors—and ultimately generate more sales. But to deliver these experiences, brands need to gain insights from every user interaction, which is a serious challenge for teams that use legacy, tag-based analytics tools.
In this session, we will look at how Datadog Real User Monitoring (RUM) and Session Replay provide an end-to-end view of your frontend browser and mobile applications. You will learn how RUM and Session Replay give you a real-time perspective of your users, so that you can proactively troubleshoot, review, analyze, and improve user experiences at scale. Together, these solutions minimize your blindspots across user journeys so that you can deliver perfect digital experiences.
Day 2 | APM, RUM
Automagic correlations: faster remediation with unified end-to-end visibility into user requests
Wednesday, October 19th, 9:30AM - 9:50AM
Kevin Logan, Site Reliability Engineer, ClickUp
Charlie Wang, Product Marketing Manager, Datadog
In addition to building modern applications with a multi-tier, service-oriented architecture, software engineers are releasing code faster than ever. That pace of innovation comes with an increased risk of introducing performance regressions, and problems that affect end users can occur at any level in the technology stack. To catch such problems before they cascade and become critical, frontend and backend teams need a single source of truth so they are not blindsided in pivotal moments and can avoid context switching and redundant manual work.
In this session, frontend and backend developers will learn how to achieve end-to-end visibility into the user experience and application performance using Datadog Real User Monitoring (RUM) and Application Performance Monitoring (APM). Learn how frontend and backend teams can have a shared view to quickly understand where a problem has originated and who is responsible for fixing the issue.
Hear how the team at ClickUp is applying these tools and services in their org. You will leave this session with a better understanding of how to monitor and troubleshoot your entire technology stack in a single view to quickly detect, triage, and resolve customer-facing problems.
Day 2 | Security
Wednesday, October 19th, 10:00AM - 11:30AM
Day 2 | Security
Secure what you monitor: How Devops can reduce risk in the age of Cloud
Wednesday, October 19th, 11:30AM - 11:50AM
Alisa Ho, Product Marketing Manager, Datadog
Hugo Delaby, Product Manager II, Datadog
Ninety-nine percent of cloud security failures could be traced by misconfigurations. This is heavily driven by the complexity of cloud configuration options and the agile culture of rapid deployments. There are many layers of configuration options that could create exposure: IAM configurations define who can modify and run cloud workloads; network configurations define which resources a workload can communicate with; storage policies control the encryption—the list goes on. This complexity increases the likelihood of mistakes, which weakens security posture and can open doors for bad actors.
In this session, you will learn how Datadog’s Cloud Security Suite helps continuously surface security weaknesses resulting from misconfigurations across all cloud assets, including cloud accounts and containers. This helps teams easily mitigate risks while ensuring compliance with industry standards.
Day 2 | NPM, NDM
How to build end-to-end visibility into the cloud and on-prem networks
Wednesday, October 19th, 12:00PM - 12:20PM
Natalie Altman, Senior Product Manager, Datadog
Modern networks are complex. As teams migrate to the cloud, communication across services can break down at any point along the way. Maintaining a modern healthy network requires engineers to have visibility into everything from the physical on-prem network infrastructure and cloud services that services communicate across, and into each conversation and request along the way.
In this session you will learn how Datadog’s suite of network monitoring tools – Network Device Monitoring and Network Performance Monitoring – allow network and application engineers, and SRE teams to work together to isolate the source of any issue faster than ever.
Day 2 | Container Monitoring
Trends from Over 1.5 Billion Containers in the 2022 Container Report
Wednesday, October 19th, 12:30PM - 12:50PM
John Kendall, Senior Product Manager, Datadog
Containers enable organizations to accelerate delivery cycles and rapidly scale their operations to meet the demands of today’s fast-paced market. More and more organizations are migrating their workloads to containers, and the container ecosystem is expanding and evolving to accommodate these increasingly dynamic environments.
Join us to learn more about how container usage has risen and led to the evolution of adjacent concerns such as container security, development agility, scalability, and more. We’ll explore how organizations’ usage of containers is exploding in production and all of the side effects that shift inevitably brings.
Hear insights and trends that we have gathered from the latest real-world usage data of over 1.5 billion containers, run by tens of thousands of Datadog customers, to understand how they are scaling and supporting their container environments.
Day 2 | Powerpacks
Break down knowledge silos with Powerpacks
Wednesday, October 19th, 1:00PM - 1:20PM
Stephanie Niu, Product Manager II, Datadog
As your organization scales and your systems evolve, maintaining observability best practices can become a challenge. Without legible dashboards and consistent runbooks, teams can waste time finding the right data during precious minutes on-call. And while certain administrative users or experts may have the knowledge to get an organization up and running, they can easily become the bottleneck as a team grows.
Datadog is equipped to help large and growing organizations scale up their monitoring as fast as they scale up their organization. In this theater session, we will demonstrate how platform administration and subject-matter experts can leverage Datadog’s Powerpacks to manage observability knowledge for thousands of users with minimal effort.
Day 2 | Cloud Cost Management
How to simplify monitoring and management of your cloud costs
Wednesday, October 19th, 1:30PM - 1:50PM
Kayla Taylor, Senior Product Manager, Datadog
Brendan McFarland, CloudOps Engineer, Datadog
Martin Amps, Principal Engineer, Stitchfix
As companies increasingly shift to the cloud, CloudOps teams are tasked with optimizing the organization’s cloud spend. To do so, engineering and operations teams need a central place where they can view both spend and infrastructure performance data at once. However, with modern environments becoming so complex. Getting full visibility into spend and performance is increasingly difficult.
In this session, you will learn how you can give your teams the ability to visualize cloud cost trends right alongside infrastructure performance data with Datadog Cloud Cost Management. With Cloud Cost Management, you can easily understand how you can spend more effectively; quickly detect and address cloud cost changes using a powerful, granular query syntax; and enable your team to correlate what’s happening across your entire infrastructure so they can take action more quickly.
Day 2 | Event Management, Incident Management, & Coscreen
From diagnosis to resolution: How Datadog collaboration tools enable faster incident response
Wednesday, October 19th, 2:00PM - 2:20PM
Tanja Garcia, Product Marketing Manager, Datadog
Till Pieper, Director, Product Management, Datadog
Ever-growing IT complexity means that vulnerabilities are an increasing threat. Dev, Ops, and Security teams need to continuously detect vulnerable services and understand the potential impact so they can prioritize and remediate. The faster each step can be completed, the smaller the potential impact of an attack will be. Siloed teams and tools, however, prevent people involved from creating a shared understanding of the problem quickly enough, resulting in longer resolution time.
In this session, you will learn how you can apply Datadog’s collaboration-centric approach and suite of products to help your teams accelerate problem detection and incident resolution.
Day 2 | Workflow Automation
Workflow automation, made easy
Wednesday, October 19th, 2:30PM - 2:50PM
Tanja Garcia, Product Marketing Manager, Datadog
Devon Lawler, Product Manager, Datadog
Many teams and organizations still rely heavily on manual workflows with significant human involvement, making repeatable tasks time-consuming and error-prone. Workflow automation provides a streamlined, cost-efficient approach for dev, ops, support, and business teams to complete their work faster and more easily.
But there are several big challenges for companies adopting effective workflow automation, including the plethora of tools required to customize workflows to fit their needs, the complexity that comes with setting up and configuring these tools, and the lack of integration that prevents important context from being shared.
In this session, you will learn how you can use Datadog to easily orchestrate and automate any manual process, regardless of use case or complexity. Using an incident response use case, we will illustrate how you can use simple building blocks to create effective automated workflows that enable your response teams to save time, reduce overhead, and accelerate incident response and resolution.
Day 2 | Continuous Testing, Synthetic Monitoring
Ship every release confidently with continuous testing and synthetic monitoring in your pipeline
Wednesday, October 19th, 3:00PM - 3:20PM
Ariana Ling, Product Marketing Manager, Datadog
Beth Glenfield, Product Manager II, Datadog
With the sheer amount of information that is presented to people on a daily basis, your application needs to capture their attention immediately. An underwhelming or broken digital experience only makes that harder. That’s why it’s critical to create software that delivers the best possible experience for your fragmented web and mobile audiences no matter where they are in the world and what device they are using.
See how you can use Datadog Synthetic Monitoring to proactively and automatically simulate key business workflows and user journeys on your applications, regardless of the device type. Datadog lets you test at every stage of your development cycle, from your backend right through to your frontend.
In this session, you will learn how to use Synthetic Monitoring to perform robust testing of your API endpoints to help you keep a finger on the pulse of your application uptime. We will demonstrate how you can set up our self-healing browser tests with ease. You will also see how these tests integrate seamlessly with CI/CD pipelines and Datadog APM to help you ensure a flawless end-user experience by catching issues earlier in the development cycle, reducing MTTR.
Day 2 | CI visibility
Accelerate pre-production velocity using CI Visibility
Wednesday, October 18th, 3:30PM - 3:50PM
Neha Julka, Product Marketing Manager, Datadog
Bryan Lee, Senior Product Manager, Datadog
Modern agile development methods and advances in CI/CD automation have enabled organizations to build and ship releases faster and regularly deliver new value to customers. CI/CD tools help teams automate their development, deployment, and testing of new releases. But preventing development outages due to slow builds or faulty tests requires granular visibility into the performance of pre-production environments.
In this session, you will learn how to use Datadog CI Visibility to attain comprehensive visibility into all of your pipelines, stages, and jobs to easily locate where slowdowns or failures might be occurring. You can also monitor your tests to surface flaky ones and visualize performance over time to spot regressions. We will demonstrate how you can fill the pre-production observability gap to ensure smooth, reliable builds and catch issues before they reach customers.
Day 2 | Serverless
Monitor your serverless workloads across any cloud
Wednesday, October 19th, 4:00PM - 4:20PM
Danny Driscoll, Product Manager II, Datadog
Neha Julka, Product Marketing Manager, Datadog
Since 2014, AWS Lambda has helped catalyze the serverless movement by enabling teams to deploy and run code while eliminating the need to provision and manage underlying infrastructure. The current serverless ecosystem has grown more mature and supports a diverse array of serverless technologies and workloads allowing users to build secure, reliable, and highly performant applications.
In this session, you will learn how, for organizations operating in the cloud, serverless compute has become an essential part of their technology stacks. We’ll look at the various serverless offerings available within AWS, Azure, and Google Cloud, and how you can use Datadog Serverless Monitoring to achieve end-to-end visibility into the health and performance of your serverless applications.
Day 2 | Log Management, Log Forwarding
Building a modern log processing strategy at enterprise scale
Wednesday, October 19th, 4:30PM - 4:50PM
Avi Verma, Senior Product Manager, Datadog
Grace Gui, Product Marketing Manager, Datadog
Many large enterprises with multiple teams rely on various log management tools for different uses related to observability, security, and compliance. These can be cumbersome to manage, and teams often need a centralized solution to process and transform their data and route it correctly to communicate between systems.
In this session, we will walk you through how Jack Henry & Associates is using Datadog Log Management as a central, managed solution for processing, enriching, and forwarding logs. You will learn how to set up scalable and powerful pipelines, processors, and remappers using out-of-the-box rules and easy-to-use interface. Understand how to automatically protect sensitive data before sending to other destinations via machine learning capabilities and use Log Forwarding to select logs and send them to custom destinations including storage archives and third-party platforms.
By the end of the session, you will be able to better manage your log processing at scale through these simple steps within the Datadog platform.
Day 2 | Application Security Management
How to stay ahead of attackers with Datadog Application Security Management
Wednesday, October 19th, 5:00PM - 5:20PM
Karishma Asthana, Product Marketing Manager, Datadog
Arnaud Breton, Senior Product Manager, Datadog
Daniel Blazquez, Senior Product Manager, Datadog
With the increased requirement for agility, developers push thousands of lines of code and spin up new ephemeral cloud instances every day. This speed coupled with the highly complex nature of modern day software architecture (i.e. distributed systems, greater scale, reliance on open source, etc) leads to blind spots which allow attackers the time to carry out large-scale attacks. On top of this, adding separate security tools to your environment slows down investigation and collaboration for security incidents, thus, adding even more time for attackers.
In this session, we will demonstrate how Datadog leverages its powerful observability platform to provide deep visibility and high-fidelity security insights with Application Security Management (ASM). We will also touch on new, exciting features which will illustrate how security observability makes it possible to address the entire lifecycle of a security incident, all from a single platform.
By the end of this session, you will understand how to quickly get up and running in 1-click with ASM and how the product drastically speeds up the assessment of at-risk applications.
Day 2 | Cloud Workload Security
Auditing a world without boundaries - How Datadog helps tackle modern compliance challenges
Wednesday, October 19th, 5:30PM - 5:50PM
Rishi Ilangomaran, Product Manager, Datadog
Kendra Ash, Security Engineer II, Datadog
Achieving compliance in a cloud environment is challenging. With the dynamic and ephemeral nature of containers, traditional tools are no longer sufficient. Teams need a modern solution to alleviate the pains of compliance in the cloud age.
Learn how you can use Cloud Workload Security and the rest of the Datadog ecosystem to manage compliance in your containerized environments.. Workload security, network security, container monitoring, and other requirements can be met within Datadog - all simplified by the pivots between products, and unified tagging.