Sr. Site Reliability Engineer
Crest Data • Ahmedabad, Gujarat • Posted May 24, 2026
About the Role
Job Summary: Experienced Systems Administrator with a strong foundation in Linux, infrastructure management, and incident response, skilled in monitoring, troubleshooting, and maintaining reliable systems across virtualized and cloud-based environments.

Job Responsibilities
- Manage and optimize Linux systems with a focus on performance, reliability, and troubleshooting.
- Handle network issues including latency, packet drops, and connectivity.
- Work on cloud platforms (AWS/GCP/Azure) for deployment and scaling.
- Deploy and manage applications using Docker and Kubernetes (cluster troubleshooting and scaling).
- Build and maintain monitoring systems using Prometheus, Grafana, and ELK.
- Create dashboards, alerts, and PromQL queries.
- Automate tasks using Python/Bash scripting.
- Manage CI/CD pipelines (Jenkins/GitLab CI).
- Handle P1/P2 incidents, lead bridges, and perform RCA.

Job Location: Ahmedabad & Pune

Key Skills
- Strong Linux fundamentals.
- Good understanding of networking (TCP/IP, DNS, HTTP/HTTPS, load b...