Portfolio Jobs

Discover opportunities across our Portfolio
companies
Jobs

Site Reliability Engineer (SRE), Cloud Efficiency Engineer

Chainlink

Chainlink

Software Engineering
United States · Mexico City, Mexico · Remote · United Kingdom · Toronto, ON, Canada · Remote · Buenos Aires, Argentina · Remote · São Paulo, SP, Brazil · Remote · Remote
Posted 6+ months ago

About Us

Chainlink Labs is the primary contributing developer of Chainlink, the decentralized computing platform powering the verifiable web. Chainlink is the industry-standard platform for providing access to real-world data, offchain computation, and secure cross-chain interoperability across any blockchain. Chainlink Labs helps power verifiable applications for banking, DeFi, global trade, and gaming by collaborating with some of the world’s largest financial institutions, notably Swift, DTCC, and ANZ. Chainlink Labs also works with top Web3 teams, including Aave, Compound, GMX, Maker, and Synthetix. Chainlink Labs was ranked in Newsweek’s 100 Most Loved Workplaces 2023 in both the United States and United Kingdom.

The Engineering Team

At Chainlink Labs, our engineering team pushes the scale and capabilities of decentralized applications across the industry. The Chainlink Network holds >70% market share in the oracle space, solving real-world problems by enabling smart contracts to securely interact with off-chain data/computation.

We value talented and driven craftsmen who work collaboratively to tackle complex challenges, deliver product impact, and grow as builders. Join us and shape the future of blockchain technology and decentralized finance.

All roles with Chainlink Labs are globally remote based. We encourage you to apply regardless of your location.

The Infrastructure Platform team enables Chainlink development and empowers engineers to continue building and supporting crucial products and services that have a profound impact in the blockchain industry. Chainlink has enabled $19+ trillion TVE (total value enabled) as an undisputed leader in the oracle space. Reliability is vital to the success of our company. As a SRE Cloud Efficiency Engineer, you will help us accelerate and enable other engineering teams by optimizing cloud costs, improving resource efficiency, and ensuring financial accountability.

This job would be perfect for someone who has a strong SRE/DevOps background, is passionate about cloud resource optimization, and has experience in managing and attributing cloud spend. The entire engineering team is expanding, and you will have plenty of opportunities to build, learn, and grow.

We are distributed across time zones and continents, and we embrace remote work. Our on-call rotation uses the follow-the-sun pattern: you will be on-call some of the time, but your shifts will be during your day and our team is large.

We all have different backgrounds and are determined to help you succeed no matter where you are or who you are. If you think you would do a great job at Chainlink, we are looking forward to speaking with you, even if you don't match 100% of the job requirements: those describe people we've usually had a great time working with, but they're not a tick-box exercise.

Your Impact

  • Optimize and manage cloud infrastructure costs, focusing on AWS and GCP.

  • Shape the resilience, efficiency, and scalability of our services

  • Partner with finance and engineering teams to align cloud spend with budget.

  • Collaborate with engineering teams to ensure proper tagging and cost attribution.

  • Identify and implement strategies to improve cloud resource utilization and efficiency.

  • Innovate and enhance the infrastructure platform team’s product offerings to increase self service, improve cost optimization, and reduce toil

  • Provide insights and recommendations on cloud cost trends and potential savings.

  • Provide technical leadership and mentoring to your team and others

  • Champion best practices in reliability, security, and cloud infrastructure to help cultivate a culture of high operational standards

Requirements

  • At least 3 years of relevant professional experience. You probably have worked on a devops, infrastructure, SRE, and/or platform team before

  • Have led large cross-team initiatives and can demonstrate a successful track record with quantifiable metrics that impact the business

  • Strong communication skills. You can give and receive constructive feedback, and you do not shy away from planning meetings and code reviews

  • Strong foundational knowledge of cloud platforms, particularly AWS and GCP.

  • Expert knowledge in cloud cost optimization, monitoring, and reporting.

  • Ability to develop scripts and tools for cost management and automation.

  • Expert knowledge in all aspects of designing, deploying, and supporting large real-time systems

  • Practical experience in shell scripting and demonstrable skills in at least one higher-level language

  • Experience with distributed systems and container orchestration.

  • Excellent communication, presentation and project management skills to drive initiatives across cross-functional teams.

  • Familiar with most tools from our stack (see below)

Desired Qualifications

  • Experience working remotely in a distributed team.

  • Ability to scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity

  • AWS and/or GCP certifications

  • Experience with FinOps principles and methodologies and cloud cost management tools (e.g., AWS Cost Explorer, CloudZero).

  • Experience working cross-functionally with Finance to optimize cloud cost forecasting and budget adherence.

  • Experience with setting team priorities (OKRs) and aligning business processes required to get a product/service from ideation to production (PRD, RFC, etc)

  • A strong desire to grow and challenge yourself. We would expect you to constantly find ways to improve and automate services to reduce toil

Our Stack

We adhere to the GitOps approach to infrastructure and state management. Self service and automation through our internal developer platform is paramount. Some of the tools and services we use daily or almost daily include: AWS, Terraform/Terragrunt, Kubernetes, ArgoCD, GitHub Actions and Grafana.

We expect you to be comfortable with many of these tools or have a strong understanding of the fundamental concepts the tools are applied to.

All roles with Chainlink Labs are global and remote-based. Unless otherwise stated, we ask that you try to overlap some working hours with Eastern Standard Time (EST).

All roles with Chainlink Labs are global and remote-based. Unless otherwise stated, we ask that you try to overlap some working hours with Eastern Standard Time (EST).

Commitment to Equal Opportunity

Chainlink Labs is an equal opportunity employer. All qualified applicants will receive equal consideration for employment in compliance with applicable laws, regulations, or ordinances. If you need assistance or accommodation due to a disability or special need when applying for a role or in our recruitment process, please contact us via this form.

Global Data Privacy Notice for Job Candidates and Applicants

Information collected and processed as part of your Chainlink Labs Careers profile, and any job applications you choose to submit is subject to our Privacy Policy. By submitting your application, you are agreeing to our use and processing of your data as required.