VLM-based Scene Understanding Research – Intern

Bosch Group

VLM-based Scene Understanding Research – Intern

Sunnyvale, CA
Internship
Paid
  • Responsibilities

    Job Description

    • Build up an advanced cloud-based system for sparse semantic scene understanding using VLMs.
    • Architect a cloud-retrieval pipeline consisting of scene representation storage, localization and data retrieval, as well VLM-based map creation.
    • Implement the system using off-the-shelf building blocks for localization, mapping and communication, extending them where needed.
    • Summarize research findings in high-quality paper and/or patent submissions.
  • Qualifications

    Qualifications

    Basic Qualifications

    • Ph.D. student or highly qualified M.S. student in Computer Science, Machine Learning, Robotics, or related fields (Must be a current student or recent graduate – less than 1 year)
    • Hands-on experience in setting up and running computer vision technologies, such as YOLO, SAM, etc.
    • Hands-on experience in setting up and running VLMs or LLMs
    • Experience with localization and mapping algorithms, such as SLAM or place recognition
    • Knowledgeable on the state-of-the-art in VLM/LLM ideas and software
    • Solid C++ and Python programming skills
    • Hands-on experience working with robotic middlewares such as ROS2 or FogROS2

    Preferred Qualifications

    • Publication record in top venues (ICRA, IROS, RSS, CoRL, NeurIPs etc.)
    • Experience in embedded systems or distributed systems is a plus
    • Experience in cloud computation and Cloud computing platforms (i.e. AWS, Azure, Google Cloud) is a plus
    • Able to work independently, has strong research and problem-solving skills
    • Good communication and teamwork skills

    Additional Information

    By choice, we are committed to a diverse workforce - EOE/Protected Veteran/Disabled.

    BOSCH is a proud supporter of STEM (Science, Technology, Engineering & Mathematics)

    • FIRST Robotics (For Inspiration and Recognition of Science and Technology)
    • AWIM (A World In Motion)

    The U.S. base salary range for this intern position is $41.00-$68.00 hourly. Within the range, individual pay is determined based on several factors, including, but not limited to, type of degree, work experience and job knowledge, complexity of the role, type of position, job location, etc. Your Hiring Manager can share more details about the specific salary range for this position during the interview process.

    For more information on our culture and benefits, please visit:

    Culture and Benefits | Bosch in the USA