[Remote] Senior Platform Engineer
Note: The job is a remote job and is open to candidates in USA. Pondurance is a company dedicated to making the world a safer place through innovative security solutions. As a Senior Platform Engineer, you will enhance AI-driven backend systems and maintain core MDR data pipelines, combining platform engineering with backend development.
Responsibilities
- Own and evolve CI/CD pipelines using GitHub Actions, Terraform, and Nomad, including deployment of containerized services and AWS serverless components (e.g., Lambda)
- Develop and enhance our growing AI platform
- Own self-service deployment mechanisms for our platform frontend using GitHub Actions and Nomad
- Collaborate with Front-End and Operations teams to define and deliver the roadmap for the AI platform
- Manage configuration using Salt and HashiCorp tools
- Oversee Docker-based systems to ensure reliable deployment and operation of applications
- Maintain uptime and functionality of primary pipeline
- Implement monitoring, alerting, and dataflow solutions using Datadog to proactively detect and resolve issues
- Document processes, procedures, and troubleshooting steps to improve team efficiency and knowledge sharing
- Provide secondary support to client-facing support teams on client device issues
- Participate in on-call rotations, root cause analysis (RCA), and continuous improvement efforts
Skills
- Strong experience developing production systems in Python (services, APIs, or data pipelines)
- Experience working with CI/CD pipelines and infrastructure-as-code tools (e.g., GitHub Actions, Terraform)
- Ability to work effectively in Linux-based environments and troubleshoot production systems
- Familiarity with serverless architectures and AWS services such as Lambda (or equivalent)
- Strong communication and documentation skills, especially in cross-functional settings
- Ability to take ownership of work, prioritize effectively, and operate independently
- Experience working on production systems with a focus on reliability and minimizing client impact
- Demonstrated ability to work across both development (e.g., Python) and platform/infrastructure domains
- Own and evolve CI/CD pipelines using GitHub Actions, Terraform, and Nomad, including deployment of containerized services and AWS serverless components (e.g., Lambda)
- Develop and enhance our growing AI platform
- Own self-service deployment mechanisms for our platform frontend using GitHub Actions and Nomad
- Collaborate with Front-End and Operations teams to define and deliver the roadmap for the AI platform
- Manage configuration using Salt and HashiCorp tools
- Oversee Docker-based systems to ensure reliable deployment and operation of applications
- Maintain uptime and functionality of primary pipeline
- Implement monitoring, alerting, and dataflow solutions using Datadog to proactively detect and resolve issues
- Document processes, procedures, and troubleshooting steps to improve team efficiency and knowledge sharing
- Provide secondary support to client-facing support teams on client device issues
- Participate in on-call rotations, root cause analysis (RCA), and continuous improvement efforts
- Familiarity with AI-assisted and agentic development workflows to improve engineering velocity and code quality
- Experience with container orchestration (Nomad, Kubernetes, Docker Compose)
- Experience with monitoring and observability tools such as Datadog
- Familiarity with configuration management tools (e.g., Salt, HashiCorp stack)
- Experience working in environments with on-call responsibilities and production support
- Familiarity with tools like Jira, Slack, and Google Workspace
Benefits
- Medical, dental, vision, disability, FSA, HSA, life and AD&D insurance, 401(k) Plan.
- Time off: PTO, sick, holiday, & parental leave details are available.
- We provide competitive compensation packages based on the market and your overall credentials.
Company Overview