Head Of Infrastructure

Storm2
Storm2
( )
asiaremotejobs.com  Remote (Asia | APAC Time Zone Permitted)

Job Type : Full-Time
Experience : 3 to 5 years
Education : Bachelor Degree

Job Detail

Head of Infrastructure

Location: Singapore (Remote)

Permanent role – Full time

Competitive market rate salary + Benefits + Flexibility


My client is looking for a head of infrastructure to join their rapidly growing team! This is an incredible chance to get on a front row seat in a fast-expanding and booming Fintech. You will oversee the infrastructure (hosted on AWS Cloud), databases, network, SRE, security and incident management. You will also be working closely with the founders and play a critical role in the success of the organization. This company is a well-funded B2C FinTech Start-up that just secured their Series B funding and are ready to expand like crazy! Their products give consumers access to payments within their B2C platform.


You will promote DevOps methodologies, build out expertise in Site Reliability Engineering and focus on automations. You will own the work that you implement/build - optimizing performance and improving the reliability and availability of our systems. You will be working closely with the Engineering team and provide insight and visibility on production systems performance based on Data driven approach.


Key Responsibilities

  • Lead and shape the Infrastructure, Network, Security & SRE Team
  • Identify weaknesses, needs, risks - build the roadmap to solve them
  • Manage, monitor, upgrade and maintain the critical infrastructure – with high availability
  • Identify risks (security, infrastructure, network) – address the issues before it becomes an actual problem
  • Organize the team and define processes to achieve 24/7 level 3 support
  • Work closely with of the Engineering team to design and architect the platform and services
  • Identify parts of the system that are lacking in scalability - provide resolution and measures
  • Propose solutions within the infrastructure team to reduce the workload by automation (Terraform, Ansible, …)
  • Run blameless root cause analyses

 

SRE Specifics: 

  • Build out the monitoring and alerting solution (Prometheus, Grafana, Alert manager,…) for all our production platforms, systems, APIs
  • Identify the SLI (Service Level Indicators) that will align the team to meet the availability and latency objectives (Service Level Objectives)
  • Solve issues and optimize performance across the entire stack: hardware, software, application, and network
  • Define relevant KPI and metrics to assess the performance of the platform and systems
  • Represent the SRE organization in design reviews and operational readiness exercises for new and existing services


We would love to hear from you if this is something you could be interested in. Please click on the “Easy Apply” button at the top of this page and follow the instructions to send us your application. If you do have any extra requirements to support your application, then please just add a note along with your CV to let us know.

27 total views, 2 today
Apply this position
LinkedIn-SG - 1 year ago