About the company
Figment is the world's leading provider of blockchain infrastructure. We provide the most comprehensive staking solution for our more than 200 institutional clients, including exchanges, wallets, foundations, custodians, and large token holders, to earn rewards on their crypto assets. These clients rely on Figment's institutional staking service, including rewards optimization, rapid API development, rewards reporting, partner integrations, governance, and slashing protection. Figment is backed by industry experts, financial institutions, and our global team across twenty-three countries. This all leads to our mission to support the adoption, growth, and long-term success of the Web3 ecosystem. We are a growth-stage technology company looking for people who are builders and doers: people who are comfortable plotting their course through ambiguity and uncertainty to drive impact, and who are excited to work in new ways and empower a generative company culture.
Job Summary
Responsibilities/Duties
- Lead the planning, testing, implementation, and management of projects to deliver scalable, maintainable, secure, and stable deployments across a number of different use cases
- Identify common areas where automation can significantly increase our ability to manage all our infrastructure
- Identify areas of systemic risk, and then evaluate, propose, and implement solutions
- Manage and monitor cloud and physical servers across multiple global hosting services and data centers
- Automate blockchain software build, install, and management processes
- Perform Linux server administration, hardening, intrusion detection, and vulnerability scanning
- Debug complex distributed blockchains to help run them smoothly
- Identify tasks and processes that get stuck or move slowly and support the team in pushing them through
- Explain broad initiatives and vision to Engineering Leadership and Product
- Demonstrate technical expertise in multiple domains and act as a mentor to others on the team
Qualifications
- In-depth experience supporting Linux servers in large environments with over 500 servers, preferably in highly secure and highly available 24/7 environments. Deep troubleshooting expertise to quickly identify server, network, and application issues, and a methodical approach to implementing, testing, and measuring changes to remediate issues. Experience troubleshooting complex application flows that span multiple clusters of systems. Knowledge of common OS and application-level settings to optimize system performance
- Experience with common system administration scripting languages such as Bash and Python
- In-depth experience using Ansible (or similar tools) to manage environments with over 500 servers. Experience designing complex static and dynamic inventories, variable structures, and playbooks. Experience writing custom Ansible roles from scratch using advanced features of Ansible such as Jinja2 templating, complex logic flows, dynamic role/task includes, and handlers. Strong focus on ensuring idempotency and reusability and following best practices where they make sense