Useful Links
Site Reliability Engineering (SRE)
SRE Book - Google's book Site Reliability Engineering: How Google Runs Production Systems gives a comprehensive overview of the principles and practices of SRE, with insights and case studies from Google's own experience.
Book of SRE - The Book of SRE is a collection of practical advice, wisdom, and best practices from leading experts in the field of Site Reliability Engineering, aiming to help readers implement and improve their own SRE practices.
SRE Weekly - SRE Weekly is a curated newsletter that provides the latest news, articles, and resources related to Site Reliability Engineering, helping professionals stay updated on industry trends and best practices.
DevOps
The DevOps Handbook - The DevOps Handbook is a practical guide that offers concrete steps for adopting DevOps principles and practices in your organization, with insights and case studies from leading companies.
DORA - The DevOps Research and Assessment (DORA) group provides data-driven research and insights on DevOps practices, helping organizations improve their software delivery performance and overall reliability.
DevOpsDays - DevOpsDays is a global series of conferences that brings practitioners together to discuss and learn about the latest trends, best practices, and challenges in DevOps.
Monitoring and Observability
Prometheus - Prometheus is an open-source monitoring and alerting toolkit designed for reliability and scalability. It stores metrics as time series, provides a query language (PromQL), and integrates with exporters and visualization tools such as Grafana.
Grafana - Grafana is a popular open-source visualization and analytics platform that supports a wide range of data sources, including Prometheus, InfluxDB, and Elasticsearch, enabling users to create and share interactive, customizable dashboards.
Honeycomb - Honeycomb is an observability platform that helps software teams understand, debug, and improve the performance of their systems, offering powerful tools for distributed tracing, event-driven analytics, and real-time visualization.
Continuous Integration and Deployment (CI/CD)
Jenkins - Jenkins is an open-source automation server that enables developers to automate various aspects of their software development process, including building, testing, and deploying applications.
CircleCI - CircleCI is a cloud-based continuous integration and continuous delivery (CI/CD) platform that helps development teams automate their pipelines for building, testing, and deploying code.
GitLab CI/CD - GitLab CI/