Job Description
• The Senior Platform Engineer will be responsible for ensuring the stability and responsiveness of the platform via implementation of optimisation, monitoring and alerting mechanisms reducing production incidents and reducing mean time to recover of both the public and private cloud environments.
• The role is responsible for building platforms while optimising capacity and infrastructure ensuring ample infrastructure headroom, tuning of services, appropriate auto-scaling and cost monitoring preventing bill shock.
• This role will be working closely with DEVOPS Management, QA, and Systems Engineers to understand the solution requirements in order to support and operate accordingly.
• This role must be flexible to constant business and technology change, with the ability to interact, engineer, and communicate collaboratively with a wide range of stakeholders.
• The role is responsible for building, enforcing and optimising the CICD pipeline rules and governance in order to increase quality and faster time to market.
• The role is responsible for the migration to new technologies and obsolescence of existing systems and tools.
• The role is responsible for ensuring that the technologies choices are current and promotes software engineering.
• The role is responsible for KPI management via building platforms that supports and ensures mandates of uptime, latency, resilience and security are met and are consistently above mandated levels.
• The role is responsible in ensuring that cluster level monitoring and alerting are up to date and has a wide coverage to identify issues faster and prevent outages.
• The role is responsible for creating platform self-health and healing mechanisms.
• The role trend analysis identifying recurring issues and root causes while implementing preventative measures or mechanisms to identify issues faster.
• The role is responsible for creating processes and support mechanisms for API Consumers and Tenants of the platform further improving time to market.
• The role has security management as a key deliverable which includes and not limited to vulnerability management, obsolescence, auditing, risk management and access control.
• This role will be required to create DevOps R&D initiatives, promoting automation and AI, reducing cost and increasing KPI success rate and SLA mandates adherence.
Knowledge and Skills:
- Public and private cloud IaaS and PaaS deployment models and technologies, e.g. AWS, OpenStack and Azure.
- Micro-service architecture, virtualization and infrastructure technologies, e.g. AWS Services e.g. EKS, Kubenetes, Windows, Linux, VMWare, Xen, KVM and Docker.
- Infrastructure management and configuration management tools, e.g. Kube admin and Putty.
- Automated software and infrastructure deployment and configuration management.
- Automation of Software Engineering and QA activities via CICD and Platform CSI initiatives.
- Platform setup and support experience using APM, Monitoring and Alerting tools e.g. e.g. Dynatrace, App Dynamics, Grafana.
- Production-readiness assessment of modules and supporting systems.
- Strong understanding of network protocols and client-server communication.
- Database technologies – RDBMS e.g. Oracle, MySQL, PostGreSQL, Microsoft SQL Server
- Ability to analyse and interpret complex problems or processes, identify and understand issues and manage solutions.
- Strong creativity, problem solving skills and ability to apply original thinking to produce new ideas and innovative solutions to operational activities.
- Strong relationship building, persuasion and collaboration skills that enables the coordination of activities between technical teams and the creation of new relationships with new acquaintances quickly and confidently.
- Strong communication and influencing skills, with the ability to apply, work with and train delivery roles in adopting latest CICD and Software engineering principles.
- Highly effective planning and prioritisation skills.
- Willingness to track, assess, and incorporate practice and technology developments into day-to-day working
- Ability to rapidly acquire new knowledge and learn new skills converting skills learnt in actionable gains.
- Good understanding of the portfolio, wider organizational goals, and desired product business outcomes.
- Systems analysis and infrastructure management in scaled environments.
- Infrastructure and Network understanding of enterprise wide implementation
- Cost and capacity management experience of both public and private cloud. Cloud health experience is beneficial.
Experience working with platform security tools e.g. Qualys, SonarQube, Nexus SonaType
.
More Information
- Job Application Details INTERESTED CANDIDATES SHOULD APPLY ON THE LINK BELOW: https://oldmutual.wd3.myworkdayjobs.com/Old_Mutual_Careers Old Mutual Limited is pro-vaccination and encourages its workforce to be fully vaccinated against Covid-19.
- This job has expired!
New Job Alert
Never miss a chance!
Let us know your job expectations, so we can find you jobs better!
Get Daily Job Updates in your email
Like our Facebook Page
Search Jobs Namibia
Top Companies
Job Location
- Namibia (89)
- Windhoek, Namibia (62)
- Windhoek (30)
- Ondangwa (15)
- Walvis Bay, Namibia (13)