Job Description
About Company
We are looking for a Cloud Operation Team Leader as part of the Customer Success & Support Department.
Your role & responsibilities
- Provide leadership for delivery of 24/7 service operations and KPI compliance.
- Develop, implement and maintain processes to document day to day support activities/checklist, dashboard of work and communication of timelines and issues.
- Lead the Cloud Operation Engineers in the team to respond and successfully close incidents through conduct problem determination, and work with various internal teams to resolve issues on a timely basis to meet SLA.
- Develop/review scripts for running tasks automatically using different scripting languages.
- Monitor/build/customize the dashboard using different monitoring tools.
- Effectively manage emergent situations, and resolve or escalate incidents in accordance with an SLA.
- Plan/participate in patching, mass upgrades and other planned maintenance activities for both infrastructure level (required) and application level (preferred).
- Provide lead for system administration duties for cloud-based server infrastructure.
- Develop and implement performance tuning steps for application, system, and network configurations.
- Establish metrics, key performance indicators, and service level agreements to continually improve the performance of IT cloud operations.
- Ensure overall service stability and sustainability.
Your skills & qualifications
Must have:
- Bachelor’s Degree in Computer Science, Computer or Electronics Engineering, Information.
- Technology or related disciplines or relevant work experience.
- Minimum 1 year of Team Lead role.
- Minimum 3 years of working experience (with hand-on skills) with Cloud infrastructure and services, and resources administration (i.e. AWS, Azure).
- Minimum 1 year of working experience in docker deployment, orchestration services (i.e. Kubernetes) and Linux Operating System (Ubuntu, CentOS).
- Experience in monitoring tools (i.e. Grafana, DataDog, Cloud Log Analytics).
- Ability to write/review scripting languages such as python, bash shell, and sql.
- Excellent problem detection and determination skills in multiple functional cloud infrastructure and application on cloud environment.
- Strong customer service orientation and an inherent sense of urgency and attention to details for resolving issues.
- Proven experience in creating, championing and maintaining processes, procedures and policies.
- Experience in managing cloud production environment and implement preventative actions/measures to avoid business impacting incidents.
- Good understanding of the interdependent relationship between cloud infrastructure, information security and cloud applications/services they enable as well as the criticality of maintaining strong connections between the respective teams within IT.
Nice to have:
- A strong self-starter and able to work with minimal supervision.
- Ability to work in a dynamic, fast-moving and growing environment.
- Critical thinker and problem-solving skills.
- Team player with great interpersonal and communication skills.
- ITIL V3 Intermediate Certifications or relevant cloud ops certification preferred.
Note: Interested candidates are kindly invited to send your application in English
Benefits for you
- Dynamic, young and friendly environment with enjoyable staffs activities
- Macbook Pro laptop for working
- Base salary package
- Annual leaves with 14 days at the beginning and insurance types following by the Labour Code
- Flexi benefits and leaves as per organization’s policy (birthday leave, personal leave, medical leave, and monthly work from home)
- Performance based reward and recognition
- Healthcare package, company trip, and quarterly team building
- Gifts on Public Holidays
- Working time: 8h30-17h30 Monday to Friday