*who we are*
a financial technology company dedicated to supporting the small and medium-sized companies in mexico, developing and offering solutions to solve their main problems, and seeking to be the best ally of entrepreneurs with dreams and ambitions to create value, consolidate their well-being and contribute to the community, the country and the planet.
*responsibilities*
- collaborate with data engineers to develop automated orchestration of data pipelines in order to provide the proper data to the data scientists in the respective stage (training and production)
- collaborate with data scientists to develop automated orchestration of model pipelines to solve konfío business use case objectives.
- deploy fully containerized docker/kubernetes data processing, and machine learning model pipelines into aws and google cloud.
- document detailed designs and code for data quality frameworks that can measure and maintain data completeness, data integrity and data validity between interfacing systems.
- ensure all solutions comply with the highest levels of security, privacy, and data governance requirements set by the respective teams.
- train and mentor junior team members (engineers and data scientists).
- familiar with automated machine learning (automl) concepts would be an asset.
*experience required*
- 3+ years of production programming experience in spark and python.
- production systems integration experience.