Site Reliability Engineer

Aug 26, 2021
Hồ Chí Minh (TP)
GFT Technologies SE


  • Supporting the engineering team in building highly fault-tolerant, scalable applications. 

  • Developing tools to ensure our services scale and remain highly available. We aim to automate our ops tasks, adopting open source tools or developing bespoke ones as required.

  • Being part of the 24x7 on-call rota, helping support and maintain production systems 

  • Providing day-to-day development support and monitoring production server and network environments by developing and deploying logging and monitoring tools.

  • Developing applications to increase code quality throughout our codebase. 

  • Supporting disaster recovery, backup, redundancy and capacity planning activities. 



  • A strong background in Linux/Unix administration, e.g. Ubuntu, Debian

  • A strong background in at least one of Go, Python or Java 

  • A strong background in one of the following: database administration, Kafka, observability tools (such as Prometheus or Zipkin) or infrastructure automation. 

  • Experience with AWS, Azure or GCP is essential 

  • Experience with, or knowledge of, container orchestration tools, e.g. Kubernetes


  • Experience in supporting production systems 

  • Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible 
