职位描述:
Summary:
We are looking for a Senior Software Engineer (MLOps) to help us ensure that machine learning models are not only successfully deployed but also maintained and monitored for optimal performance. You will lead all the processes to design, build and manage reproducible, testable, and evolvable ML-powered software.
Description:
This role requires a blend of skills in software engineering, machine learning, and operations to ensure the smooth functioning of ML systems in production environments. In this role you will:
– Lead the team to design and implement automation for model training, testing, validation, and deployment
– Collaborate with machine learning engineers to ensure efficient deployment and scaling of ML models
– Implement monitoring and alerting systems to track model performance, system health, and data drift
– Optimize compute resources for cost and performance efficiency
– Manage model versions to ensure traceability and reproducibility
职位要求:
Minimum Qualifications:
6+ years of experience in the design and implement of Large-scale ML Systems or Distributed Systems
Experience with model pipeline and registry tools, detecting and preventing model drift, automating model monitoring, and ensuring model accuracy
Proficiency in programming languages such as Python, Java, or Golang
Effective communication skills in written and spoken English
Bachelor or above in Software Engineering, Computer Science, Machine Learning, or a related field
Preferred Qualifications:
Experience in machine learning frameworks such as TensorFlow, PyTorch, AutoGluon, XGBoost or Scikit-learn
Experienced in DevOps Tools such as Docker, Jenkins, Ansible, Grafana, Prometheus, Elastic or Kubernetes
Familiar with CI/CD deployment practices
Experience with SQL and database systems such as PostgreSQL
Experience with building ETL pipeline in data warehouse such as Snowflake
Experience with inference optimization
招聘部门:
Apple Machine Learning and AI
工作地点:
Shanghai, Shanghai, China
面试建议:
Apple的Senior Software Engineer (MLOps)职位是一个结合了软件工程、机器学习和运维的高端技术岗位。这个职位不仅要求候选人具备扎实的编程能力,还需要对机器学习模型的整个生命周期有深入的理解和实践经验。特别值得注意的是,Apple对模型部署后的监控和优化有极高的要求,这意味着候选人需要展示出在实时系统中处理数据漂移、性能下降等问题的能力。 为了准备这个面试,建议你重点准备以下几个方面的内容:首先,确保你对大规模机器学习系统的设计和实施有清晰的理解,能够详细描述你过去项目中的具体贡献。其次,准备好展示你在模型部署和监控方面的经验,特别是如何利用工具如Prometheus或Grafana来跟踪模型性能。此外,面试官可能会询问你如何优化计算资源,因此要准备好讨论你在成本效益分析方面的经验。最后,不要忽视沟通能力的展示,Apple非常注重团队协作,因此清晰表达你的想法和解决问题的能力同样重要。
在线咨询
提示:由 AI 生成回答,可能存在错误,请注意甄别。