Salary: 60K-100K
Age: 21-35 years
Job Requirements:
1. More than 5 years of relevant work experience; bachelor's degree or higher in Computer Science or a related field
2. Proficient in Linux, with an in-depth understanding of the operating system
3. Expertise in at least one scripting language and one statically typed language; experience in large-scale system design preferred
4. Familiar with the TCP/IP and HTTP protocols; in-depth troubleshooting experience preferred
5. Familiar with container and orchestration technologies; experience operating Kubernetes (K8s) in production preferred
6. In-depth knowledge of database principles and common database engines preferred
7. Deep understanding of distributed systems and familiarity with open-source components commonly used in internet services (Nginx, Redis, Kafka, MySQL, HBase, ZooKeeper, Hadoop, etc.)
8. Experience with big data operations and development or machine learning algorithms preferred
9. Experience with continuous integration/continuous deployment and large-scale cluster management is a plus
10. Strong sense of responsibility, proactive, passionate about learning, and team-oriented
11. Bonus: familiarity with common vulnerabilities in operations tools and with Linux server optimization
Responsibilities:
1. Responsible for production monitoring and alerting for the department's core systems and applications to ensure system stability
2. Participate in the management, analysis, fault localization, handling, and follow-up improvement of unexpected production incidents
3. Conduct system resource statistics, performance evaluations, and capacity planning
4. Drive the adoption of DevOps within the department, focusing on improving operations capabilities (CI/CD, application deployment, monitoring, alerting, emergency response plans, intelligent operations, etc.)
5. Promote standardized, automated, and intelligent operations (AIOps)