Remote Job
CI Site Reliability Engineer
Square
- 5000 - 6000 USD per Month
- Full Time
- Posted : 3 weeks ago
Since we opened our doors in 2009, the world of commerce has evolved immensely, and so has Square. After enabling anyone to take payments and never miss a sale, we saw sellers stymied by disparate, outmoded products and tools that wouldn’t work together.
To solve this problem, we expanded into software and built integrated solutions to help sellers sell online, manage inventory, book appointments, engage loyal buyers, and hire and pay staff. Across it all, we’ve embedded financial services tools at the point of sale, so merchants can access a business loan and manage their cash flow in one place. Afterpay furthers our goal to provide omnichannel tools that unlock meaningful value and growth, enabling sellers to capture the next generation shopper, increase order sizes, and compete at a larger scale.
Today, we are a partner to sellers of all sizes – large, enterprise-scale businesses with complex operations, sellers just starting, as well as merchants who began selling with Square and have grown larger over time. As our sellers grow, so do our solutions. There is a massive opportunity in front of us. We’re building a significant, meaningful, and lasting business, and we are helping sellers worldwide do the same.
Job Description
The Continuous Integration Infrastructure (CII) team at Square is an ensemble of hybrid systems and automation engineers. The CII team is pivotal to the design and management of our expansive continuous infrastructure. Our CII Site Reliability Engineers (SREs) are dedicated to meticulously building systems with a dual focus on boosting developer productivity and enhancing system performance and reliability.
CI Site Reliability Engineers have a major impact across Square, and especially in Hardware, through building and operating our on-prem data centers and AWS infrastructure, and optimizing mission-critical internal applications. We’re looking for engineers who want to be a part of developing, maintaining, and scaling this infrastructure via software tooling and automation.
Responsibilities
- Build scalable infrastructure to manage CI systems (both on-prem and AWS) and applications
- Minimize risk of reliability related failure outcomes as pertaining to durability, availability, performance
- Drive improvement projects and handle incidents in an effective and organized manner
- Collaborate across multiple teams including IT Support, Production Platform Engineering, Hardware Engineering, and Devices Software Engineering
- Build automation tools to detect and remediate system health, and prevent those issues from reoccurring
- Build automation to help with capacity planning to scale our on-prem and CI AWS infrastructure
- Perform periodic on-call duty to handle availability and efficiency of the continuous integration infrastructure
Qualifications
Requirements:
- BS or higher in Computer Science or equivalent technical experience
- 5+ years of industry experience architecting, developing, and troubleshooting large scale infrastructure
- 2+ years of experience in at least one of the following programming languages: Python, Go, Ruby
- Strong system/network debugging skills
- Strong knowledge of TCP/IP networking, network, and application-level security
- Experience with management/automation tools such as Chef/Ansible/SALT
- Consistent track record of coordinating or leading small cross-team technical projects
Nice to have:
Experience with CI systems such as Jenkins
Skills
- Python
- Go
- Ruby
- debugging
- TCP/IP
- Jenkins