Sr Site Reliability Engineer

Detalles de la oferta

As SRE you are responsible for overall system operation and we use a breadth of tools and approaches to solve a broad set of problems. Practices such as limiting time spent on operational work, blameless postmortems, proactive identification, and prevention of potential outages. _

**What you'll do**:

- Manage system(s) uptime across cloud-native (AWS, GCP) and hybrid architectures.
- Build infrastructure as code (IAC) patterns that meet security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and programming with cloud SDK).
- Build automated tooling to deploy service request to push a change into production
- Solve problems and triage complex distributed architecture service maps.
- Build runbooks that are comprehensive and detailed to manage detect, remediate and restore services.
- Lead availability blameless postmortem and own the call to action to remediate recurrences.
- Effectively communicate to technical peers and team members in both written and verbal formats

**What experience you need**:

- BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent job experience required
- 3+ years of experience developing and/or administering software in public cloud
- 3+ years of experience in languages such as **Python, Bash, Java, Go JavaScript and/or node.js**
- 3+ years of experience of cross-functional knowledge with systems, storage, networking, security and databases
- 3+ years of **hands-on **experience **of system administration skills, including **automation and orchestration of Linux/Windows using **Terraform, Chef, Ansible and/or containers (Docker, Kubernetes, etc.)**
- 3+ years of experience working with continuous integration and continuous delivery tooling and practices
- **Any Cloud Certification either in GCP, AWS or Azure**

**What could set you apart**:

- You have expertise designing, analyzing and troubleshooting large-scale distributed systems.
- You take a system problem-solving approach, coupled with strong communication skills and a sense of ownership and drive
- You have experience managing Infrastructure as code via tools such as Terraform or CloudFormation
- You are passionate for automation with a desire to eliminate toil whenever possible
- You've built software or maintained systems in a highly secure, regulated or compliant industry
- You thrive in and have experience and passion for working within a DevOps culture and as part of a team

LI-DU1
LI-Hybrid


Salario Nominal: A convenir

Fuente: Whatjobs_Ppc

Requisitos

Helpdesk Support For Low Code/No Code Applications

**In this Role, Your Responsibilities Will Be**: - Fix and resolve issues related to the low code/no code platforms. - Document all Helpdesk interactions an...


Emerson - San José

Publicado a month ago

Technical Support Representative - Work At Home

**Why ClearSource?** ClearSource is a people-driven company focused on delivering exceptional customer experiences every day! We truly believe in our Core Va...


Clearsource - San José

Publicado a month ago

Servicenow Administrator

Job Description - About the Role: Fragomen, an AmLaw 100 Firm and the leading global immigration services provider, is seeking an experienced ServiceNow Admi...


Fragomen - San José

Publicado a month ago

Sales Operations Support Analyst Hybrid/Remote

Splunk is here to build a safer and more resilient digital world. The world's leading enterprises use our unified security and observability platform to keep...


Splunk - San José

Publicado a month ago

Built at: 2024-12-12T08:49:49.198Z