Platform Support Engineer
Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely accelerate their deployment and usage of AI. Saviynt is recognized as the leader in identity security, with solutions that protect and empower the world’s leading brands, Fortune 500 companies and government institutions. For more information, please visit www.saviynt.com.
Due to the nature of the UK Government projects this role supports, this position is classified as a Reserved Post. In accordance with the Civil Service Nationality Rules, paragraph 1 of Schedule 23 and paragraph 5 of Schedule 22 of the Equality Act 2010, we can only accept applications from persons with UK residency (at least five years).
Successful candidates must undergo National Security Vetting (NSV). This role requires Security Check SC level clearance as a minimum. Any offer of employment is strictly conditional upon the candidate successfully obtaining and maintaining this clearance.
In line with the Immigration, Asylum and Nationality Act 2006, all shortlisted candidates will be required to provide original documentation verifying their Right to Work in the UK and their British Citizenship during the initial interview stage. We conduct thorough Baseline Personnel Security Standard (BPSS) checks as a precursor to all higher-level clearances.
WHAT YOU WILL BE DOING:
- Strong pod-level troubleshooting skills in AKS/EKS (not just restarting pods).
- Analyze application and DB (RDS, MySQL) performance issues.Deeply investigate and analyze application performance issues (Java, Grails, Hibernate), identifying root causes and implementing solutions.
- Oversee the monitoring of our SaaS applications and underlying infrastructure (Kubernetes on AWS and Azure, VPN connections, customer applications, Elastic Search, MySQL) for alerts and performance issues.
- Strong understanding of basic computing concepts like DNS, IP addressing, Networking, and LDAP.
- Effectively participate and contribute in on-call escalations with a strong operational mindset and provide technical guidance during critical incidents.
- Proactively communicate with customers on technical issues when required.
- Ability to guide junior engineers when needed technically.
- Manage the full lifecycle of alerts, incidents, and service requests reported through FreshService, ensuring timely and accurate logging, prioritization, resolution, and escalation.
- Develop, implement, and maintain operational procedures, runbooks, and knowledge base articles to standardize incident resolution and service request fulfillment.
- Drive continuous improvement initiatives to optimize operational efficiency, reduce incident rates, and improve service request turnaround times.
- Collaborate with backend engineering and development teams to troubleshoot complex issues, identify root causes, and implement preventative measures.
- Ensure adherence to defined SLAs (Service Level Agreements) and KPIs (Key Performance Indicators) for operational performance. Maintain operational documentation, including system diagrams, contact lists, and escalation paths.
- Ensure compliance with relevant security and compliance policies.
- Plan and coordinate scheduled maintenance activities with minimal impact to service availability.