Found 38,161 repositories(showing 30)
bregman-arie
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
NationalSecurityAgency
Ghidra is a software reverse engineering (SRE) framework
awesome-foss
A curated list of amazingly awesome open-source sysadmin resources.
milanm
DevOps Roadmap for 2026. with learning resources
dastergon
A curated list of Site Reliability and Production Engineering resources.
kubeshark
eBPF-powered network observability for Kubernetes. Indexes L4/L7 traffic with full K8s context, decrypts TLS without keys. Queryable by AI agents via MCP and humans via dashboard.
upgundecha
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
bregman-arie
DevOps resources - Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP
runatlantis
Terraform Pull Request Automation
Site Reliability Engineer Interview Preparation Guide
isno
⭐ 【出版书籍】京东购买链接 https://item.jd.com/14531549.html 深入讲解内核网络、Kubernetes、ServiceMesh、容器等云原生相关技术。经历实践检验的 DevOps、SRE指南。
At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.
k8sgpt-ai
Giving Kubernetes Superpowers to everyone
coroot
Coroot is an open-source observability and APM tool with AI-powered Root Cause Analysis. It combines metrics, logs, traces, continuous profiling, and SLO-based alerting with predefined dashboards and inspections.
StackStorm
StackStorm (aka "IFTTT for Ops") is event-driven automation for auto-remediation, incident responses, troubleshooting, deployments, and more for DevOps and SREs. Includes rules engine, workflow, 160 integration packs with 6000+ actions (see https://exchange.stackstorm.org) and ChatOps. Installer at https://docs.stackstorm.com/install/index.html
hjacobs
Compilation of public failure/horror stories related to Kubernetes
rundeck
Enable Self-Service Operations: Give specific users access to your existing tools, services, and scripts
litmuschaos
Litmus helps SREs and developers practice chaos engineering in a Cloud-native way. Chaos experiments are published at the ChaosHub (https://hub.litmuschaos.io). Community notes is at https://hackmd.io/a4Zu_sH4TZGeih-xCimi3Q
david-gpu
Image super-resolution through deep learning
antonputra
DevOps Tutorials
wmariuss
A curated list of awesome DevOps platforms, tools, practices and resources
jonmosco
Kubernetes prompt info for bash and zsh
leandromoreira
CDN Up and Running - Building a CDN from Scratch to Learn about CDN, Nginx, Lua, Prometheus, Grafana, Load balancing, and Containers.
chaterm
Open source AI terminal for cloud and infrastructure management, enabling you to deploy, troubleshoot, and automate services using natural language and intelligent agents.
bregman-arie
A checklist of anyone practicing Site Reliability Engineering
DevOpsHiveHQ
A FREE pragmatic DevOps learning to kickstart your DevOps career and knowledge in the Cloud Native era following the Agile MVP style! ⭐ (2026 plans for DevOps, Cloud, Platform, SRE, SWE)
HolmesGPT
SRE Agent - CNCF Sandbox Project
anzhihe
Learning Shell,Python,Golang,System,Network
chaostoolkit
Chaos Engineering Toolkit & Orchestration for Developers
alibaba
Cloud Native DataOps & AIOps Platform | 云原生数智运维平台