Data Engineer

Manifest Solutions

Data Engineer

Columbus, OH
Full Time
Paid
  • Responsibilities

    Manifest Solutions is currently seeking a Python Data Engineer for a hybrid position in Columbus, OH.

    • Data Engineer is responsible for developing Life Sciences content curation and delivery system for the purpose of building life sciences databases that empower proprietary life sciences search technologies. This role encompasses developing and deploying scientific software solutions in the life sciences information space to support transformational initiatives, delivering both short and long-term results to the business.

    • Develops data transformation and integration pipelines and infrastructure foundations of life sciences content in support of scientific databases and data curation.

    • Combines strong software development and data engineering skills with a working knowledge of basic biology/chemistry/physics to develop sophisticated informatics solutions that drive efficiencies in content curation and workflow process.

    • Applies data transformation and other data-engineering software development capabilities to contribute to the building of new scientific information management systems supporting scientific database building activities.

    Education/Experience

    • 4-year degree in computer science, engineering, informatics, or equivalent experience

    • Minimum of 4 years of software development experience

    Competencies/Technologies

    • Proficiency in Python

    • Proficiency in other programming languages such as JavaScript/TypeScript/Java

    • Proficiency in Linux/Unix environments

    • Experience building applications for public cloud environments (AWS preferred)

    • Experiencewith databases technologies (NoSQL, relational, property graph, RDF/triple store)

    • Experience with data engineering tools and techniques is highly desired

    • Experience with AWS DevOps tools (git, Cloud Development Kit, CDK Pipeline) is highly desired

    • Experience building applications using AWS Serverless technologies such as Lambda, SQS, Fargate, S3 is highly desired

    • Experience working with XML and XPath is highly desired

    • Experience with MarkLogic/Xquery is a plus

    • Experience with Apache Airflow is a plus

    • Experience building containerized applications (Docker, Kubernetes) is a plus

    • Strong communication, organizational savvy, interpersonal skills

    • Self-motivated with the ability to work with minimal supervision

    • Innovates and continuously improves; focuses on areas of highest potential