The Technical Infrastructure SRE team is responsible for managing the whole infrastructure and applications. Our mission is to ensure all production systems can support our fast growing world-wide user base as well as keep the entire systems stable, efficient and cost effective. We manage deployments, system capacity, traffic scheduling, fault tolerance, disaster recovery, emergency response, automations, operation platforms development, etc.
Be responsible for the basic engineering construction of byte infrastructure products & components, focusing on infrastructure O&M architecture optimization, automated O&M platform research and development, data and intelligent O&M. Through the methodology of software engineering and digital intelligence, O&M, around the O&M requirements of infrastructure products & components, built a layered and systematic O&M platform to solve the problem of ultra-large-scale cluster O&M management. (Goals) To provide stable, efficient, and low-cost serverless infrastructure facilities for Mid-Platform & Business.We aim to be the leading SRE team across the industry。
- Grow and lead a team of engineers committed to building and operating scalable and reliable AML Platform systems.
- Be both technically hands-on and people manager.
-Provide technical leadership and guidance to both your team members and your project peers.
- Communicate cross-functionally across various teams, organizations and internal and external stakeholders to drive engineering efforts.
- Lead the team's innovation efforts, bring in new ideas and technologies.