Are you excited with the amazing potential of foundation models, LLMs and multimodal LLMs?We are looking for someone who thrives on collaboration and wants to push the boundaries of what is possible today! The Video Computer Vision org is a centralized applied research and engineering organization responsible for developing real-time on-device Computer Vision and Machine Perception technologies across Apple products. We balance research and product to deliver Apple quality, state-of-the-art experiences, innovating through the full stack, and partnering with HW, SW and ML teams to influence the sensor and silicon roadmap that brings our vision to life.

Description

We are seeking a highly motivated and skilled Applied Research Engineer to join our team. The ideal candidate will have a strong background in developing and exploring multimodal large language models that integrate various types of data such as text, image, video, and audio. You will work on cutting-edge research projects to advance our AI and computer vision capabilities, contributing to both foundational research and practical applications• Conduct research and development on multimodal large language models, focusing on exploring and utilizing diverse data modalities• Design, implement, and evaluate algorithms and models to enhance the performance and capabilities of our AI systems• Collaborate with cross-functional teams, including researchers, data scientists, software engineers, to translate research into practical applications• Stay up-to-date with the latest advancements in AI, machine learning, and computer vision, and apply this knowledge to drive innovation within the company

Minimum Qualifications

  • Experience in developing, training/tuning multimodal LLMs
  • Programming skills in Python and C++
  • Bachelors Degree and a minimum of 3 years relevant industry experience.

Key Qualifications

Preferred Qualifications

  • Expertise in one or more of: computer vision, NLP, multimodal fusion, Generative AI.
  • Experience with at least one deep learning framework such as JAX, PyTorch, or similar.
  • Publication record in relevant venues.
  • PhD in Computer Science, Electrical Engineering, or a related field with a focus on AI, machine learning, or computer vision.

Education & Experience

Additional Requirements

Pay & Benefits

  • At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $143,100 and $264,200, and your base pay will depend on your skills, qualifications, experience, and location.

    Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.

    Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

  • Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.