This project demonstrates a DevOps-driven self-healing cloud infrastructure capable of:
- Detecting EC2 instance failures or performance issues
- Automatically recovering the instance via Ansible playbooks
- Sending real-time alerts to Slack
- Provisioning and managing infrastructure using Terraform and GitHub Actions CI/CD pipelines
- Monitoring and alerting through Prometheus
💡 Built to simulate a real-world, automated cloud monitoring environment using AWS Free Tier resources.
| Category | Tools Used |
|---|---|
| Cloud | AWS (EC2, IAM, S3) |
| IaC | Terraform |
| Configuration Management | Ansible |
| Monitoring | Prometheus, Alertmanager |
| Automation | GitHub Actions |
| Notifications | Slack Webhook |
| Languages | Python, YAML, HCL, Shell Script |
| Dashboard | Node.js + Express |
This project is licensed under the MIT License.