Senior Site Reliability Engineer Job at Algo Capital Group, New York, NY

V0NBSHN1V3k4b21SVWw5ZVRuQ1J5SXR2NVE9PQ==
  • Algo Capital Group
  • New York, NY

Job Description

Senior Site Reliability Engineer

Our client is a top tier High Frequency Trading firm based in NYC with a strong engineering culture and ML infrastructure based in NYC looking is looking to hire a Senior Site Reliability Engineer to their infrastructure team, this team is responsible for developing and maintaining the corporate productivity stack for the entire firm, both on-prem and in the cloud. You will ensure the availability and reliability of systems within this stack and grow the engineering practice in alignment with the firm's larger engineering organization.

This role requires a deep Linux operating system and application administration skill set, proficiency in Python, and solid experience with configuration management/IaC. Successful candidates should also have exceptional organizational, communication, and project management skills, as well as the ability to troubleshoot complex technical issues.

Responsibilities

  • Manage on-premise containerized web services
  • Automate and troubleshoot a broad range of technical infrastructure
  • Design and operate secure, reliable systems
  • Develop and implement monitoring solutions to ensure high system uptime and reliability; utilize tools to detect and resolve issues proactively
  • Document system architecture, processes, and best practices
  • Break down complexity, iterate, and communicate progress to a wide variety of leads and stakeholders
  • Assist with the administration of DHCP and DNS for both on-premise and external systems and applications

Qualifications

  • Years of experience in site reliability engineering or related disciplines
  • Strong proficiency with Python
  • Experience managing and monitoring containerized infrastructure
  • Experience working with CI/CD tools such as Jenkins, GitHub Actions, or ArgoCD
  • Expert experience with IaC and configuration management tools such as Terraform, SaltStack, Chef, Puppet, or Ansible
  • Experience building and operating systems on cloud platforms (e.g. AWS, Azure, GCP)
  • OpenLDAP or other directory services management expertise
  • Atlassian Data Center administration experience (on-prem)
  • Web development experience

This position offers a top of the market compensation package with excellent benefits and career growth opportunities.

Job Tags

Similar Jobs

WELLS

CERTIFIED WELDER Job at WELLS

 ...operations, and project management. They bring innovative solutions that yield outstanding results. In the design and construction industry, we focus on honesty and hard work to build trust that lasts beyond individual projects. Our team is consistently working to... 

Sunbird Software, Inc.

Web Developer Job at Sunbird Software, Inc.

 ...Job Description Job Description POSITION SUMMARY We are looking for a Software Developer who enjoys keeping current in technology to help us develop a web application for data center management. We develop in a collaborative environment where our workday involves... 

Amatheon Animal Health

Pharmaceutical Sales Representative Job at Amatheon Animal Health

 ...Amatheon Animal Health is a wholesaler of brand and generic pharmaceuticals to veterinarians nationwide. The Amatheon Sales Team is the most knowledgeable and effective consultative sales force in the industry. Our strong manufacturer relationships and dedicated sales... 

R.D. Offutt Company

Farm Manager Job at R.D. Offutt Company

 ...Job Summary This position is responsible for managing the field operations of the potato farm including, but not limited to, representing the organization in the community, executing the annual farm operating plan while collaborating with various cross-functional... 

Morgan Advanced Materials

Maintenance Manager Job at Morgan Advanced Materials

 ...Responsibilities Summary: Under the direction of the Hayward Engineering Manager, oversees all functions and operations related to upkeep, repair and improvement of the facility, installation, maintenance and repair of the equipment and security for the facility and...