Svitla Systems Inc. is looking for a Senior Datadog Administrator (with Azure) for a full-time position (40 hours per week) in Costa Rica or Mexico.
Our client is a leading expert network connecting business and government professionals with industry experts to support informed decision-making. They provide a research enablement platform powered by real-time data, innovative technology, and specialized expertise. Through calls, conferences, surveys, and workshops, the platform enables clients to gain insights across industries such as healthcare, finance, consumer goods, energy, technology, and legal. Since 2003, the company has partnered with top consulting firms, hedge funds, and Fortune-ranked companies, helping them turn insights into action.
You will join the CloudOps team and help drive the optimization and performance of the infrastructure monitoring and observability practices. You will be responsible for managing, maintaining, and optimizing Datadog for comprehensive monitoring and observability across our Azure infrastructure, Kubernetes environments, and application services. By leveraging Datadog’s tools for monitoring, alerting, and automated remediation, you will play a key role in ensuring the high availability, reliability, and performance of cloud-based systems.
Requirements
* 5+ years of experience as a Site Reliability Engineer or DevOps engineer.
* 3+ years of commercial experience with Azure.
* Strong experience with monitoring and observability tools.
* Experience with Kubernetes.
* 3+ years of experience with Datadog, including its integration features (alerts, monitoring dashboards, and automated remediation).
* 2+ years of experience in cloud cost management (FinOps).
* Proficiency in scripting with languages such as Bash, PowerShell, Python, or similar.
* Strong troubleshooting and debugging capabilities in an Agile software development environment.
* Upper-intermediate level of English.
Nice to have
* Knowledge of infrastructure as code using tools like Terraform, ARM templates, or Azure CLI is a huge plus.
* Azure Solutions Architect Expert certification or equivalent.
* Azure Security Engineer certification (Associate level).
* Familiarity with Ansible for automation and configuration management.
* Advanced knowledge of Kubernetes and container orchestration best practices.
* Experience in CI/CD pipelines and integrating Datadog with DevOps processes.
Responsibilities
* Datadog implementation & management: Take full ownership of Datadog for monitoring infrastructure, services, and applications across multiple environments (production, development, test). Ensure optimal configurations for observability and alerting.
* Performance & health monitoring: Monitor infrastructure and application performance using Datadog, identify potential issues, and create automated remediation workflows to resolve them.
* Cost management: Optimize and monitor Azure cloud costs using Datadog and other cloud tools, tracking and improving resource usage and cost-efficiency.
* Automation & remediation: Leverage Datadog’s alerting system and integrations to automate the remediation of common infrastructure and application issues.
* Kubernetes & cloud infrastructure: Collaborate with CloudOps and engineering teams to monitor and optimize Kubernetes environments, ensuring containers, pods, and services are running efficiently.
* Collaboration: Work closely with engineering, AppOps, and CloudOps teams to address complex infrastructure challenges, ensuring smooth deployments and high availability.
* Security & compliance: Ensure security and compliance best practices are followed for monitoring and logging, participating in security audits and incident response activities as required.
* Infrastructure as code: Support the automation and deployment of infrastructure using tools like Terraform and Azure Resource Manager (ARM).
* FinOps: Contribute to FinOps activities by tracking resource usage and optimizing cloud costs, providing data-driven insights into cost-saving opportunities.
* Best practices & optimization: Continuously review and improve monitoring configurations, workflows, and processes for maximum efficiency, performance, and security.
We offer
* US and EU projects based on advanced technologies.
* Competitive compensation based on skills and experience.
* Remote-friendly culture and no micromanagement.
* Christmas bonus in the amount of 15 days' salary.
* Bonuses for article writing, public talks, and other activities.
* Personalized learning program tailored to your interests and skill development.
* Free webinars, meetups, and conferences organized by Svitla.
* Fun corporate celebrations and activities.
* Awesome team, friendly and supportive community!
About Svitla
Svitla Systems is a global digital solutions company headquartered in California, with business and development offices throughout the US, Latin America, Europe, and Asia. Svitla is an outspoken advocate of workplace flexibility, best known for its well-established remote culture, individual approach to its teammates’ professional and personal growth, and trustworthy environment.
Since 2003, Svitla has served a wide range of clients, from innovative start-ups in California to large corporations such as Ingenico, Amplience, InvoiceASAP, and Global Citizen. At Svitla, developers work directly with clients’ teams, building lasting and successful partnerships through seamless integration with on-site processes.
Svitla Systems’ global mission is to build a business that contributes to the well-being of our partners, personnel, and their families, improves our communities, and makes a lasting difference in the world. Join us!
If you are interested in our vacancy, please send your CV.
We will be happy to see you in our friendly team :)