Senior Site Reliability Engineer (remote)

EBlock

Location: Burlington, Vermont

Type: Full Time

Education: Bachelor's Degree

Experience: 3 - 5 Years

E INC is the parent company of EBlock and EDealer, unifying our approach to products, services, and strategies under one Vision and one Mission: to create the best digital auction and retailing platform in the world by connecting the automotive wholesale and retail experiences. Our brands and their technologies make it easy for a vehicle to move between buyers and sellers throughout its entire ownership lifecycle. Learn more at https://e.inc/

Why work with us!

E INC is a growing company that is continuing to evaluate our ongoing commitment and investment in our team members with:

  • Competitive pay
  • Medical, Dental & Vision
  • 401k/RSP with Match
  • Paid time off
  • Flexible working environment
  • Continuous Learning
  • And an amazing culture to top it all off!

The Site Reliability Engineer (SRE) is responsible for ensuring all production systems are running smoothly and meeting their Service Level Objectives (SLOs). SREs bridge between development and operations by applying software engineering to system administration, and applying automation. This individual will setup and automate highly available, scalable infrastructure, monitor system performance, respond to emergencies, identify downtime along with underlying causes, integrate security measures in the environment. Ultimately, this individual will work in the change management process and capacity planning, ensuring Service Level Agreements (SLAs) are met by reducing latency, improving performance and efficiency of software applications.

What you will be doing

 

  • Part of on-call rotation responding to availability incidents and provide support for production engineers with customer incidents.
  • Ensuring availability, performance, security and scalability of AWS production systems.
  • Automation, deployment, management, and maintenance of AWS cloud-based systems.
  • Establish, manage the creation, release, logging, monitoring, metrics and configuration of production systems.
  • Help optimize our backend, mobile and web, preventing incidents from happening
  • Develop and maintain infrastructure documentation and technical diagramming of interconnected systems and networks.
  • Handle incidents, troubleshooting systems, performing root cause analysis of outages, and resolve problems across various application domains and platforms.
  • Define, automate, adhere and validate disaster recovery, data manipulation, compliance, security controls, risk treatment and change management policies.
  • Set and establish best practices for infrastructure consumption by software engineers.
  • Work solo or collectively with peer engineers, collaborating cross-functionally and sharing knowledge.
  • Run our infrastructure with automation orchestration and configuration management tools (such as Terraform, Ansible and Kubernetes).

What we would like to see

  • Bachelor’s degree or higher degree in computer science or related field.
  • Good experience with Java and Node technology stacks.
  • Deep and extensive knowledge of building and maintaining AWS infrastructure; strong understanding of how to secure AWS environments and meet compliance requirements.
  • Automation experience using orchestration tools (such as Terraform, AWS CloudFormation) and configuration management tools (such as Ansible, AWS OpsWorks).
  • Experience with GIT version control, containerized environments (such as Docker, AWS Fargate, ECS, EKS) and serverless (AWS Lambda).
  • Solid foundation of networking, security, scripting and Linux administration.
  • In-depth experience implementing Service Level Objectives (SLOs).
  • Good self-management skills, ability to track and prioritize competing tasks.

© 2022 Vermont Technology Alliance

Site by Scout Digital