(#6334297003) Intern, Machine Learning Engineer - VLMs
What You’ll Learn
- Project Overview: The primary objective of this project is to develop novel vision-language models for text generation. The project aims to leverage the power of multimodal and large vision-language models to effectively combine visual and textual information, resulting in text documents that meet the legal and technical requirements.
- Skills You’ll Learn:
- Overall, this project will provide an excellent opportunity for the intern to gain practical experience in machine learning research, problem-solving, collaboration, and, all while working on a challenging and meaningful project.
- The intern will gain hands-on experience in developing machine learning models, specifically vision-language models. They will learn about various techniques in natural language processing, computer vision, and multimodal modeling, and how to apply them to solve real-world problems.
- The intern will work with large text and image datasets and learn how to preprocess and analyze them. They will also learn how to evaluate the performance of machine learning models using appropriate metrics and analyze the results to identify areas for improvement.
- The intern will be exposed to complex problems in the field of text generation and learn how to approach them systematically. They will need to think critically, analyze data, and experiment with different approaches to find the most effective solution.
- At the end of the project, the intern will need to prepare a comprehensive report detailing their research, findings, and recommendations. They will also need to present their work to their supervisor and other team members, which will help them develop skills in effective communication and presentation.
What You’ll Do
The Intellectual Property (IP) Group is seeking a highly-motivated and exceptional Research Intern to assist in developing innovative systems for natural language generation. The intern will collaborate with a team on research, experimentation, analysis, and implementation of large vision language models and multimodal models for text generation. The intern will contribute to designing novel algorithms and publishing research findings in leading machine learning conferences.
Location: Hybrid, working onsite at our San Jose office/headquarters 3 days per week with the flexibility to work remotely the remainder of your time
Reports to: Sr. Manager, Semi. Device Development
- Conduct scientific research to design and develop innovative machine learning algorithms in the field of natural language processing for language modeling and multimodal text generation.
- Collaborate with other researchers in developing machine learning algorithms, providing input and feedback to identify best possible solutions.
- Communicate with the domain experts to gain a deep understanding of the problem space, ensuring that the research is aligned with the needs of the industry and stakeholders.
- Utilize various tools and resources, including machine learning frameworks such as TensorFlow, PyTorch, and Keras, and models from the HuggingFace to fine-tune and train the models.
- Prepare presentations and technical reports to showcase the experimental results and findings, effectively communicating the research outcomes to both technical and non-technical audiences.
- Summarize and publish research findings in top-tier conference papers and/or patent submissions, contributing to the body of knowledge in the field.
- Demonstrate the ability to think creatively and work effectively both as part of a team and as an individual contributor, effectively managing time and resources to meet project deadlines.
- Show a strong willingness to learn new algorithms, application areas, and tools, continuously expanding their knowledge and skills in the field of machine learning.
- Complete other responsibilities as assigned, ensuring that the overall goals of the project are met and any additional tasks are carried out to the highest standard.
- This internship provides an excellent opportunity to gain practical experience in machine learning research, develop valuable skills, and make a meaningful impact in the field.
What You Bring
- Preferably pursuing a Ph.D. in machine learning/Ph.D. student in Computer Science with a strong background in machine learning, specifically deep learning, neural networks, and neural language models
- Must have at least one academic quarter or semester remaining
- Demonstrated strong research skills through a publication record
- Background in natural language processing, computer vision, and text generation algorithms
- Experience with machine learning frameworks such as TensorFlow, Keras, or PyTorch
- Strong background in machine learning, with a focus on deep learning and transformer networks
- Familiarity with natural language processing, text generation, and transformer-based models
- Knowledge of computer vision and multimodal models
- Strong analytical and problem-solving skills, with attention to detail
- Proficient programming skills in Python
- Excellent communication and teamwork abilities
- You’re inclusive, adapting your style to the situation and diverse global norms of our people
- An avid learner, you approach challenges with curiosity and resilience, seeking data to help build understanding
- You’re collaborative, building relationships, humbly offering support and openly welcoming approaches
- Innovative and creative, you proactively explore new ideas and adapt quickly to change
#LI-AD1