Responsibilities
responsibilities include but are not limited to:
* maintain synchronization between production & dr environments for payments products, ensuring changes/configurations implemented in production are simultaneously implemented in the dr environment, and that replication is in a healthy state on a real-time basis.
* sop creation as per industry best standards for week-long true failovers, ensuring customers can run their business seamlessly from the dr environment for a week.
* ensure documentation (deployment diagrams, internal and client-facing runbooks) is timely updated, maintained, and peer-reviewed.
* when production/uat issues reported by the client and/or implementation team are addressed within acceptable timelines, ensure the same changes are implemented in dr simultaneously.
* ability to debug system logs and produce technical solutions associated with data flow, processes involved, or performance aspects.
* work closely with the implementation team to understand the process flow and solution challenges, and work towards implementing & supporting a seamless sync between production & dr infrastructure.
* work closely with the implementation & product teams to explore & reuse tools/processes for effective maintenance of the dr environment.
* ensure automation is introduced wherever possible to reduce system stability time post-failover to dr.
* focus on acquiring adequate understanding of the technical & functional aspects/flows of the implemented products within a short span of time.
* regular internal health checkups and internal tests are performed prior to client-facing dr tests to maintain high-level resiliency.
* ensure new techniques and solutions are highlighted, voiced out, and further implemented to maintain the highest possible resiliency.
* effectively participate in dr planning, work on rca’s, provide dr updates, and identify risks with mitigation plans.
* learn, collaborate & contribute across teams. Be flexible.
the required skills, knowledge & experience
must have:
* willingness to work in 24 x 7 shifts and provide on-call support on rotation including weekends.
* 5+ years’ experience as a devops, application operations/support, or similar role.
* experience in application deployment and configuration across multiple platforms such as linux and windows on azure.
* 3+ years experience in configuring web servers like jboss, weblogic, ibm websphere, etc.
* 3+ years experience in integration using jms/mq, web services, rest api.
* 3+ years’ experience of working on the linux platform and proficient in scripting languages such as linux shell script & ansible.
* familiarity with general monitoring principles, as well as tools such as site 24 x 7, grafana, etc.
* experience in different automation concepts (ex. Ci/cd, infra as code) and devops tools (ex. Azure devops, jenkins, docker, terraform, chef, etc.).
* strong communication skills, both written and verbal, should be able to coordinate with cross-functional teams.
good to have:
* experience in cloud technologies such as azure and aws.
* exposure to payments domain and finastra payments products i.e., gpp or p2g is an added advantage.
* 2+ years experience with kubernetes, red hat openshift platform, azure kubernetes services.
* 3+ years exposure to oracle database and effective usage of sql.
* familiarity with atlassian tools such as jira and confluence.
* knowledge of java and supporting java applications.
* technical consulting and implementation experience in client-facing projects is an added advantage.
#j-18808-ljbffr