Multimodal Generative AI Research Intern

Bosch Group

Pittsburgh, PA
Internship
Paid
  • Responsibilities

    Job Description

    At Bosch Research in Pittsburgh, we are seeking a forward-thinking research intern to explore innovative applications of generative AI for signal-based understanding and generation. This role offers the opportunity to develop cutting-edge AI models that transform raw signal data into meaningful outputs, enabling semantic understanding of complex environments and activities. The work will involve leveraging state-of-the-art techniques in computer vision (CV), natural language processing (NLP), and deep learning for multimodal data.

    The internship role will focus on the following responsibilities:

    • Design and implement generative AI models for signal-based data, combining advanced techniques in computer vision and natural language processing.
    • Evaluate the performance of developed models on diverse benchmarks and real-world datasets.
    • Document findings and contribute to high-impact publications in leading AI conferences and journals.
  • Qualifications

    Required Qualifications:

    • Currently enrolled as a PhD student in Computer Science, Electrical Engineering, Systems Engineering, or a related field.
    • Strong foundation in deep learning with expertise in generative AI, computer vision, and/or natural language processing.
    • Proven track record of research contributions, including publications in top AI conferences (e.g., ACL, EMNLP, NeurIPS, CVPR, ECCV, ICCV).
    • 2+ years of programming experience in Python, with proficiency in frameworks like PyTorch or TensorFlow.
    • Excellent communication and collaboration skills.

    Desired Qualifications:

    • Experience in processing and interpreting signal-based data (e.g., time-series sensor data, Wi-Fi CSI, or other non-visual modalities).
    • Deep understanding of various generative model architectures, including diffusion models, flow-based models, and consistency models, along with their practical applications.
    • Proven experience in fine-tuning Vision-Language Models (VLMs) and Large Language Models (LLMs) for diverse use cases and domain-specific tasks.
    • Knowledge of multimodal learning techniques for combining diverse data sources.
    • Familiarity with applying generative AI models (e.g., diffusion models) to novel domains.
    • Demonstrated ability to innovate and drive impactful research in emerging AI fields.

    Additional Information

    By choice, we are committed to a diverse workforce - EOE/Protected Veteran/Disabled.

    BOSCH is a proud supporter of STEM (Science, Technology, Engineering & Mathematics):

    • FIRST Robotics (For Inspiration and Recognition of Science and Technology)
    • AWIM (A World In Motion)

    The U.S. base salary range for this intern position is $30.00-$58.00 per hour. Within the range, individual pay is determined based on several factors, including, but not limited to, type of degree, work experience and job knowledge, complexity of the role, type of position, and job location. Your Hiring Manager can share more details about the specific salary range for this position during the interview process.