Principal Site Reliability Engineer (Cortex Cloud Security Posture Management)

Palo Alto Networks

Santa Clara, CA
Full Time
Paid
  • Responsibilities

    Job Description

    This role requires a US Citizen or Green Card holder.

    Your Career

    The Cortex team builds and delivers the industry’s most advanced SecOps platform, consisting of XSIAM, XSOAR, and XPANSE. As a member of the Cortex DevOps team, your role involves operating and maintaining a large-scale GCP environment, including the design, implementation, and continuous enhancement of our comprehensive observability systems. To meet the opportunities that such a role provides, you will have a deep knowledge of modern observability and monitoring tools and practices, having managed high cardinality metrics, implemented tracing, and operationalized large scale logging solutions. As part of this role, you will collaborate closely with our engineering teams to develop innovative solutions that provide clear and actionable insights into our systems’ performance and health.

    Your Impact

    As a Principal SRE with the Cortex Cloud Security Posture Management team, you will:

    • Cloud Expertise - Utilize your expertise in monitoring cloud platforms, particularly GCP, to optimize our infrastructure leveraging cloud-native technologies
    • Incident Management - Leverage incident management processes to ensure efficient resolution of system issues and minimal impact on services
    • Automation - Automate complex monitoring and alerting tasks by building tools for cloud operations, such as automated remediation of known issues and auto-scaling
    • CI/CD - Develop and maintain application deployment tools such as Terraform and Helm
    • Continuously Improve - Stay up-to-date with cutting-edge technologies, evaluate their potential impact on our operations, and implement them when appropriate
    • On-Call - Work with our DevOps team to provide follow-the-sun operational coverage for our production SaaS product
    • Collaborate - Work with our Engineering team to influence the operability of the product and ensure the reliability and availability of our services
  • Qualifications

    Your Experience

    • Incident and Alerts Management - Clear understanding of incident and alerts management in Site Reliability Engineering
    • DevOps/SRE Expertise - 5+ years of experience as a DevOps/SRE engineer with a passion for technology and a strong motivation for high reliability at the service level
    • Cloud Proficiency - High proficiency in either Google Cloud Platform or Amazon Web Services
    • Kubernetes and Docker - High proficiency with Kubernetes and Docker for containerization and container orchestration
    • Scripting and Automation - High proficiency in Python programming and Linux shell commands; experience with Terraform for infrastructure as code
    • Security - Strong grasp of security concepts and best practices
    • Observability - Experience with observability and incident response tools
    • Communication Skills - Effective communication and interpersonal skills, with the ability to work and coordinate between multiple teams
    • Troubleshooting - Ability to effectively troubleshoot and address emerging and complex problems
    • Independence - Ability to operate independently, make decisions, take action, and take responsibility

    Additional Information

    The Team

    Our engineering team is at the core of our products – connected directly to the mission of preventing cyberattacks. We are constantly innovating – challenging the way we, and the industry, think about cybersecurity. Our engineers don’t shy away from building products to solve problems no one has pursued before.

    We define the industry, instead of waiting for directions. We need individuals who feel comfortable in ambiguity, excited by the prospect of a challenge, and empowered by the unknown risks facing our everyday lives that are only enabled by a secure digital environment.

    Compensation Disclosure

    The compensation offered for this position will depend on qualifications, experience, and work location. For candidates who receive an offer at the posted level, the starting base salary (for non-sales roles) or base salary + commission target (for sales/commissioned roles) is expected to be between $147,000 and $225,500 per year. The offered compensation may also include restricted stock units and a bonus. A description of our employee benefits may be found here.

    Our Commitment

    We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

    We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at accommodations@paloaltonetworks.com.

    Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

    All your information will be kept confidential according to EEO guidelines.

    Is role eligible for Immigration Sponsorship? No. Please note that we will not sponsor applicants for work visas for this position.