*location: guadalajara /aguascalientes / remote*
*job summary*:
the primary objective of the cloud site reliability engineer (sre) will be to provide architectural guidance, internal and external cloud computing, strategic provisioning, governance, security, and availability in coordination with network and devops engineers to create, maintain, and support the public and private cloud server and network systems infrastructures that meet the technical demands of the company.
*reporting relationships*:
- *this job reports to: manager, cloud sre systems.
reporting directly (or indirectly) to this position are the following job titles: not applicable.
- duties & responsibilities: *
*working closely with information systems, information technology and engineering teams to identify, implement, orchestrate, and automate cloud-based platforms throughout the company.
building and designing web services in the cloud, along with implementing the set-up of geographically redundant services.
identifying and implementing system improvements by evaluating system performance; upgrading, installing, tuning, and configuring the system
managing cloud environments in accordance with company security guidelines.
deploying and debugging projects as needed in accordance with best practices throughout the development lifecycle.
educating teams on the implementation of new cloud-based initiatives, providing associated training as required.
employing problem-solving skills, performance monitoring, and other pre-emptive strategies to identify and resolve issues before they manifest into performance or availability issues.
using your knowledge of apis to design restful services, and integrate them with existing data providers, using json or xml as needed.
developing standard practices and procedures for site reliability engineer (sre) team.
assist in development of masimo cloud processes and procedures.
researching and recommending new solutions to business and management problems; working with vendors and business units on department and company projects to accomplish masimo company goals.
testing, evaluating, and installing new software and systems.
managing business continuity in cloud-based environments, including server, file backup and recovery; preparing and testing disaster recovery procedures
undertaking routine preventative measures and implementing, maintaining, and monitoring network security and intrusion detection
preparing technical and user documentation and training materials.
ensure highest 24 hour/7-day week critical system availability.
assist in maintaining company compliance with internal policies and regulatory standards
staying current with industry trends, making recommendations as needed to help the company excel.
performing other duties or special projects as assigned
*minimum & preferred qualifications and experience*:
* *minimum qualifications: *
three to five years of experience in a site reliability engineer (sre), devops role or related position.
one to two industry recognized certifications (aws, azure, kubernetes, etc.)
customer service oriented
knowledge of regulatory frameworks and their impact on design considerations (hipaa, pci, iso27xxx, itar, etc.)
proficient in networking and security
knowledge of networking and internet protocols, including tcp/ip, dns, smtp, http, and distributed networks.
experience with any of the following node.js, python, go, powershell.
experience with performance management of database engines (dynamodb, mongodb, elasticsearch, kafka) exceptional organizational skills
good communication skills, both verbal and written; experience with documentation of processes excellent problem-solving ability, technical and analytical skills
ability to work independently, and with moderate supervision
*preferred qualifications*:
experience developing software using languages such as java, python.
experience deploying infrastructure as a code using tools such as terraform and cloud formation.
experience with ansible, jenkins, git.
experience with kubernetes, enterprise kubernetes management tools such as rancher.
experience working with linux, docker, and microsoft azure.
aws cloud security certification, and/or openstack administrator certification a plus.
experience with mobile operating system platforms.
multiple cloud platform, microsoft, cisco certifications are a definite plus.
*education*:
*bachelor's degree preferred, preferably in computer science, engineering, or a related field.
*why become an improver*
*competitive salary*
you will get a very competitive salary that will match with your skills and experience.
*career development*
develop your skills to the full potential and advance your career with platforms and resources at your disposal.
*challenging projects*
be a crucial part of innovative international projects where you'll have the chance to work and participate with your clients both remotely and on-site.
*work-life balance*
we p