About the company
Fuel is building a high-performance blockchain operating system that provides high throughput without sacrificing decentralization or security. Our project is a full-stack solution that lets anyone deploy a Fuel chain using any configuration they would like (Rollup on Ethereum, Sovereign chain, Appchain, etc.). Currently, our focus is to launch an Ethereum rollup to help scale Ethereum beyond its current capacity.
Job Summary
Responsibilities:
📍Be able to develop, integrate, and proving SRE Best Practices, Tooling, and Architecture into our current devops practice to achieve 99.99% uptime of our applications and infrastructure 📍Efficiently handle live production incidents, debug/troubleshoot application and infrastructure issues, follow and implement SRE best practices. 📍Must be able to debug applications and infrastructure with Linux, Kubernetes, AWS and other devops experience. 📍Develop, integrate, educate, and document on oncall strategies using tooling like prometheus, AWS, PagerDuty, Grafana, Elasticsearch, and others to improve application and infrastructure performance and monitoring 📍Improve logging of our applications and infrastructure to AWS and Elasticsearch 📍Develop and improve prometheus based alerts using PromQL 📍Develop and improve Grafana dashboards for our applications and infrastructure 📍Work closely with software engineers and QAs to ensure the system is responding properly to non-functional requirements such as performance, security, and availability. 📍This role include on-call rotations, with the expectation of being available for potential emergencies for one weekend each month.
The following expertise and/or willingness to learn such technologies will be needed:
📍Amazon Cloud Service (AWS) 📍AWS Elastic Kubernetes Service (EKS) 📍AWS Cloudwatch 📍Linux/Unix 📍Github 📍Prometheus & PromQL 📍Grafana & ElasticSearch 📍PagerDuty 📍SRE Best Practices & Architecture 📍Oncall Best Practices, Engineering, and Education 📍DevSecOps Best Practices, Auditing, and Tooling Experience