Job Title: Site Reliability Engineer (AWS, GitHub)
Location: Remote, but MUST reside in US and preferably Southern California
Relocation: Client is open to candidates who need to relocate, but will only pay for the cost of interviewing in person
Duration: 3 Months+ (With Possible Extension or Conversion to FTE/full time employee)
Citizenship: MUST be US Citizen or Permanent Resident Green Card
Hourly Pay: DOE – (client willing to pay market rate for the right person)
Pay close attention to the details and questions below
NO OUTSIDE VENDORS PLEASE
Onboarding Process:
Job Position Details
We are looking for a skilled SRE Reliability Engineer to join our Site Reliability Engineering (SRE) team. The primary focus of this role is to ensure the reliability, availability, and performance of our systems and services. You will work closely with software engineers, DevOps teams, and other SREs to build and maintain resilient systems that meet our service level objectives (SLOs). Your expertise will help us identify potential reliability risks, automate processes, and improve our incident response capabilities.
Mandatory skill sets needed:
Key Responsibilities:
Reliability Engineering:
Monitoring and Alerting:
Performance and Reliability Analysis:
Automation and Tooling:
Collaboration:
Experience:
o 5+ years of experience in Site Reliability Engineering, DevOps, or a related role with a focus on system reliability and performance.
o Strong background in monitoring, alerting, and incident management tools and practices.
o Experience with cloud platforms (AWS, Azure, GCP) and container orchestration tools (e.g., Kubernetes, Docker).
Skills:
o Proficiency in scripting and automation languages (e.g., Python, Bash, Go).
o Strong understanding of networking, system performance, and reliability principles.
o Knowledge of service level management, including SLOs, SLIs, and SLAs.
o Familiarity with containerization and orchestration tools (e.g., Docker, Kubernetes).
Soft Skills:
o Excellent problem-solving and analytical skills, with a proactive approach to identifying and addressing system vulnerabilities.
o Strong communication skills, with the ability to work effectively with cross-functional teams.
o A commitment to continuous learning and staying current with the latest industry trends and technologies.
SCREENING QUESTIONS
Connect with us on Linked In: www.LinkedIn.com/in/DianeKrehbiel
CGEMJP00267343 – Site Reliability Engineer – Remote – US Citizen or Permanent Resident Green Card