
AI, Machine Learning, and Data Scientist (Artificial Intelligence for Business Intelligence, AI4BI)

Overview of Role
Taiwan Semiconductor Manufacturing Company (TSMC) is seeking applications from skilled junior AI, machine learning, and data scientists for its Artificial Intelligence for Business Intelligence (AI4BI) Center. Applicants should have earned a Ph.D. within the past 1-2 years in computer science, information systems, information science, statistics, or a related AI or machine learning field. The AI4BI Center is an international team that develops advanced AI-enabled analytics techniques for business intelligence applications highly relevant to TSMC. This role is based in our San Jose office in a hybrid working environment (working three days in the office). Individuals in this position will be responsible for designing and implementing machine learning, deep learning, text mining, Large Language Model (LLM), and/or network science-based approaches to extract insights from structured (e.g., transactional) and unstructured (e.g., 10-K reports, social media posts, news) data sources for various BI applications. The role requires close interaction and coordination with international teams of senior data scientists, other junior data scientists, data engineers, and business stakeholders to gather requirements, oversee development, evaluate the outputs of technical solutions, produce internal reports, and co-author publications at top-tier AI venues.

Responsibilities
• Create and manage advanced machine learning, deep learning, and LLM algorithms to develop data-driven solutions for complex business problems
• Analyze and interpret complex data sets to provide actionable insights that improve decision-making for business stakeholders
• Work with cross-functional and international teams to identify and prioritize data-driven opportunities, including the development and deployment of predictive models to drive business outcomes
• Work in teams of senior data scientists, other junior data scientists, and data engineers to develop and maintain efficient procedures for collecting, storing, and analyzing data
• Consistently assess and refine model performance to ensure precision and dependability
• Communicate insights and recommendations via internal reports to technical and non-technical stakeholders
• Design and implement the latest AI, machine learning and data science tools and techniques to enhance existing processes based on feedback and guidance from senior data scientists

 

Minimum Qualifications
• Ph.D. earned within the past 1-2 years in computer science, information systems, information science, statistics, or a related AI or machine learning field
• 4-5 years of AI/ML experience (including experience gained during Ph.D. studies)
• Proficient in Python, GitHub, and Markdown for data wrangling, pre-processing, and extraction from structured and unstructured data sources
• Strong conceptual and practical knowledge and experience of classical machine learning algorithms and learning paradigms, including supervised learning and unsupervised learning
• Strong skills in fundamental deep learning implementation, including data encodings, processing units, and learning paradigms
• Proficiency in adapting machine learning or deep learning algorithms based on unique dataset characteristics and business requirements
• Strong practical network science skills and experience
• Hands-on experience with text analytics for named entity recognition, sentiment analysis, and topic modeling
• Hands-on experience with time series modeling and forecasting with statistical and/or deep learning-based approaches
• Experience fine-tuning LLM-based approaches with strategies such as low-rank adaptation, few-shot learning, and others
• Experience with prompt engineering on LLMs with techniques such as reinforcement learning, prompt tuning, and others
• Knowledge about deploying models into user interfaces using technologies such as Streamlit, Dash, Tableau, or PowerBI
• Must be willing to travel to Taiwan for at least 3 months each year for training, team building, and project coordination

Preferred Qualifications
• Knowledge of processing text in financial, accounting, and market analysis reports
• Multilingual text analysis using packages such as Stanza, Polyglot, or TextFlint
• Knowledge of graph embedding techniques such as graph convolutional networks and graph attention networks using packages such as StellarGraph, PyG, or Deep Graph Library
• Understanding of neural information retrieval approaches, including deep structured semantic models, entity resolution techniques, and retrieval-augmented generation (RAG)
• Familiarity and practical experience with learning paradigms such as adversarial learning, knowledge distillation, and self-supervised learning
• Publications in leading AI conferences (e.g., NeurIPS, ICLR, ICML, KDD) and/or journals (e.g., IEEE TKDE, ACM TOIS)