*this position is based only in spgg, nuevo leon, mexico, on a hybrid wfh scheme (3 days at home and 2 at the office).*
*it is not a remote role*
we are seeking an azure cosmos dba sql to join our team.
*responsibilities*:
- analyse the impact of technology choices, and communicate and influence at an enterprise level.
- persist to completion, especially in the face of overwhelming odds and setbacks.
- push self for results; push others for results through team spirit.
- provide azure cosmos thought leadership across the organization.
- ensure understanding of issues and present a clear rationale.
- speak to mutual needs and win-win solutions.
- use two-way communication to influence outcomes and ongoing results.
*required*:
- at least 5 years of experience in development of data solutions using azure cloud platform.
- solid understanding of spark architecture and experience with performance tuning big data workloads in spark.
- experience building complex data transformations on both structured and semi-structured data (xml/json) using pyspark & sql, and refactoring traditional ml models to run on the spark framework.
- familiarity with the azure databricks environment and deploying spark code on databricks clusters.
- experience building complex orchestrations using tools like adf, airflow, etc.
- good understanding of massively parallel processing (mpp) systems, with strong sql skills.
- proficient with source control using git.
- good understanding of agile, devops and ci-cd automated deployment (e.g., azure devops, jenkins).
- familiarity with cognitive search/elasticsearch and their use cases, and with building integrations to load data into search services.
- an azure data certification (dp-200/201/203) or a databricks developer certification will be an advantage.
*good understanding of*:
- nosql and its use cases
- modelling nosql schemas & containers
- building integrations to read from and write to the azure cosmos db sql api
- writing functions/triggers/stored procedures in the cosmos db sql api
*solid programming skills in python and proficiency using libraries like*:
- pandas
- numpy
- scipy
- matplotlib
- scrapy
- beautiful soup