职位描述:
Summary:
The AIML Production Engineering China team is looking for an exceptional Infrastructure Systems Engineer with experience in Machine Learning (ML) infrastructure services and applications to work in local and global projects for platforms of computation, data retention, data processing pipelines and result delivery. The Systems Engineer is expected to be qualified as a technical leader, with the potential to design and to build architecture through cross-organizational collaboration. This role has a high impact and is essential to creating the highest quality user experience that Apple internal and external customers expect and love.
Description:
The Infrastructure Systems Engineer will do the following tasks, through collaboration with team members in China and around the world.
– Analyze the requirements, demands, constraints and challenges of machine learning in local or global environments, design or re-design platform architecture to improve its scalability and agility, and to enable new, high-impact use cases
– Develop and implement the above design, bringing it to an internal product, with observability to support efficient system management
– Design and/or enhance automation of operations for infrastructure and platforms, including tools and processes of monitoring, logging and alerting, to improve scalability in both system construction and runtime operations
– Support Dev and Eng efforts through provisioning operational solutions, co-design ML application architecture and drive the coordination among local and global, internal and cross-functional groups to achieve the result of success
– Create performance profile for platforms and services, defining service level objectives (SLO) and driving the measurement, monitoring and evaluation over these objectives
– Lead constant evaluation on system performance and reliability, discover potential faults, drive RCA and fixes
职位要求:
Minimum Qualifications:
Master or PhD degree in Computer Science, Electrical Engineering or equivalent
5+ years of Systems or AIML production-service experience, commensurate with running cutting-edge hybrid cloud services in China and the rest of the world
Solid understanding of system architecture and large-scale service or computational platform operations
Demonstrated understanding of system management, covering aspects of configuration and usage accounting
Proficiency in coding with scripting and programming languages, including Bash, Python, Golang and Java – while having the ability to select the proper language as a tool to solve a certain problem
Experience in large-scale service and job deployment, using an orchestration framework (Kubernetes) and cloud services for large-scale projects
Experience in observability of system behaviors (e.g. Prometheus, Grafana)
Preferred Qualifications:
Self-motivated and proactive, with demonstrated creative and critical thinking capabilities
Ability to identify problems in depth, distinguishing purposes vs. measures without confusion
Strong sense of thoroughness, driving details, delivering running code, and contributing to the collective understanding of the organization
Sense of speed and prioritization, driving what matters with constrained resources while delivering high-quality results
Good communication with internal and external teams, in English and in Chinese
Demonstrated understanding of computing, storage, and networking in public cloud infrastructures, including provisioning, setup, monitoring, security operations, performance tuning, and troubleshooting.
Knowledge of ML as well as experience in developing real ML jobs
Experience of designing and implementing systems to support ML applications
Experience in device management and mobile app development
Knowledge of data governance and compliance
Apple is an Equal Opportunity Employer that is committed to inclusion and diversity. We also take affirmative action to offer employment and advancement opportunities to all applicants, including minorities, women, protected veterans, and individuals with disabilities. Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation or that of other applicants.
招聘部门:
Apple Machine Learning and AI
工作地点:
Shanghai, Shanghai, China
面试建议:
Apple的AIML基础设施系统工程师职位是一个技术领导角色,需要你在机器学习基础设施领域具备深厚的专业知识和实践经验。这个职位最特别的地方在于它要求你不仅能设计架构,还要能将其转化为内部产品,同时具备全球视野和本地落地的能力。面试官会特别关注你在混合云环境中的实战经验,以及你如何解决大规模机器学习平台的可扩展性和可靠性问题。 为了准备这个面试,你需要重点准备几个方面。首先,确保你能清晰阐述你在机器学习基础设施项目中的具体贡献,特别是那些涉及架构设计和性能优化的案例。其次,准备好展示你对Kubernetes等编排工具的深入理解,以及你如何使用可观测性工具来监控系统行为。第三,由于这个职位需要频繁的跨团队协作,你要准备好讨论你过去如何协调不同团队达成技术目标的经验。最后,别忘了准备一些关于数据治理和合规性的见解,这在跨国企业的机器学习项目中尤为重要。面试中可能会遇到一些情景题,测试你在资源受限情况下如何权衡取舍,因此提前思考一些相关案例会很有帮助。
在线咨询
提示:由 AI 生成回答,可能存在错误,请注意甄别。