You are viewing a preview of this job. Log in or register to view more details about this job.

(#6334297003) Intern, Machine Learning Engineer - VLMs

What You’ll Learn

Project Overview: The primary objective of this project is to develop novel vision-language models for text generation. The project aims to leverage the power of multimodal and large vision-language models to effectively combine visual and textual information, resulting in text documents that meet the legal and technical requirements.
Skills You’ll Learn:
- Overall, this project will provide an excellent opportunity for the intern to gain practical experience in machine learning research, problem-solving, collaboration, and, all while working on a challenging and meaningful project.
- The intern will gain hands-on experience in developing machine learning models, specifically vision-language models. They will learn about various techniques in natural language processing, computer vision, and multimodal modeling, and how to apply them to solve real-world problems.
- The intern will work with large text and image datasets and learn how to preprocess and analyze them. They will also learn how to evaluate the performance of machine learning models using appropriate metrics and analyze the results to identify areas for improvement.
- The intern will be exposed to complex problems in the field of text generation and learn how to approach them systematically. They will need to think critically, analyze data, and experiment with different approaches to find the most effective solution.
- At the end of the project, the intern will need to prepare a comprehensive report detailing their research, findings, and recommendations. They will also need to present their work to their supervisor and other team members, which will help them develop skills in effective communication and presentation.

What You’ll Do

The Intellectual Property (IP) Group is seeking a highly-motivated and exceptional Research Intern to assist in developing innovative systems for natural language generation. The intern will collaborate with a team on research, experimentation, analysis, and implementation of large vision language models and multimodal models for text generation. The intern will contribute to designing novel algorithms and publishing research findings in leading machine learning conferences.

Location: Hybrid, working onsite at our San Jose office/headquarters 3 days per week with the flexibility to work remotely the remainder of your time

Reports to: Sr. Manager, Semi. Device Development

Conduct scientific research to design and develop innovative machine learning algorithms in the field of natural language processing for language modeling and multimodal text generation.
Collaborate with other researchers in developing machine learning algorithms, providing input and feedback to identify best possible solutions.
Communicate with the domain experts to gain a deep understanding of the problem space, ensuring that the research is aligned with the needs of the industry and stakeholders.
Utilize various tools and resources, including machine learning frameworks such as TensorFlow, PyTorch, and Keras, and models from the HuggingFace to fine-tune and train the models.
Prepare presentations and technical reports to showcase the experimental results and findings, effectively communicating the research outcomes to both technical and non-technical audiences.
Summarize and publish research findings in top-tier conference papers and/or patent submissions, contributing to the body of knowledge in the field.
Demonstrate the ability to think creatively and work effectively both as part of a team and as an individual contributor, effectively managing time and resources to meet project deadlines.
Show a strong willingness to learn new algorithms, application areas, and tools, continuously expanding their knowledge and skills in the field of machine learning.
Complete other responsibilities as assigned, ensuring that the overall goals of the project are met and any additional tasks are carried out to the highest standard.
This internship provides an excellent opportunity to gain practical experience in machine learning research, develop valuable skills, and make a meaningful impact in the field.

What You Bring

Preferably pursuing a Ph.D. in machine learning/Ph.D. student in Computer Science with a strong background in machine learning, specifically deep learning, neural networks, and neural language models
Must have at least one academic quarter or semester remaining
Demonstrated strong research skills through a publication record
Background in natural language processing, computer vision, and text generation algorithms
Experience with machine learning frameworks such as TensorFlow, Keras, or PyTorch
Strong background in machine learning, with a focus on deep learning and transformer networks
Familiarity with natural language processing, text generation, and transformer-based models
Knowledge of computer vision and multimodal models
Strong analytical and problem-solving skills, with attention to detail
Proficient programming skills in Python
Excellent communication and teamwork abilities
You’re inclusive, adapting your style to the situation and diverse global norms of our people
An avid learner, you approach challenges with curiosity and resilience, seeking data to help build understanding
You’re collaborative, building relationships, humbly offering support and openly welcoming approaches
Innovative and creative, you proactively explore new ideas and adapt quickly to change

#LI-AD1