
Why Application Performance Management (APM) is critical for modern applications and production systems.
The Four Golden Signals of monitoring and how they help you detect performance issues early.
The benefits of a well-designed monitoring system, including faster troubleshooting, better reliability, and improved user experience.
What Datadog is and how it fits into modern observability and monitoring stacks.
Why Datadog is needed in real production environments and what problems it solves.
Key benefits of using Datadog, including faster troubleshooting, better visibility, and improved system reliability.
How Datadog compares to traditional monitoring tools and why teams choose it for modern applications.
What a host, agent, and tags are in Datadog and how they relate to each other.
How the Datadog Agent works and why it is required for collecting metrics and traces.
Why tags are important for filtering, grouping, and analyzing monitoring data.
How these basic concepts are used in real monitoring scenarios throughout the course.
You will learn how integrate Datadog with Pager Duty and receive an alert
You will understand what is SLA, SLO, SLI and Error Budget
1. You will be able to create SLO
2. To make Date Correlation on the SLO
3. Set up Alerts based on the SLO
This course will help you to:
Understand basic and advanced concepts of Application Performance Management (APM) and Datadog tool usage.
Build APM for your application from scratch using Datadog.
Visualize the entire request path and quickly identify where bottlenecks or errors occur.
Track application errors and slow queries with just a few clicks.
Install and configure the Datadog Agent and query essential system and application metrics.
Analyze Datadog APM services, trace searches, and code profiling, including .NET Core API monitoring with SQL service layer visibility.
Create dashboards using different widgets, including:
Time series
Query value
Top lists
Tables and pie charts
We will build dashboards together for:
Latency (time series & query value)
Error rate (time series & query value)
Top API endpoints
Host resource usage
Service map visualization
Applying formulas for advanced insights
Create monitors and alerts for hosts, services, endpoints, IIS, and Watchdog:
Request latency
Error rate
Specific API endpoints
SQL query duration
Host data reporting
Watchdog monitors
Understand SLA, SLO, SLI, and Error Budgets by creating Datadog SLOs for success rate tracking.
Integrate monitors with Slack and PagerDuty for real-time incident notifications.
Use Synthetic Monitoring with API and browser tests, including
GET/POST requests with Bearer token authorization.
Manage logs in Datadog, including saving and viewing logs using NLog.
Use Datadog Notebooks to:
Investigate incidents
Collaborate with team members using comments
Create and reuse custom templates
Monitor key application aspects such as availability, reliability, scalability, and performance duration.
Set up and monitor a Node.js application with MongoDB, including:
Service configuration
Custom metrics
Distributed tracing
Learn the latest Datadog updates and features used in modern production environments.