Sr. DevOps Engineer
Eclypsium
Software Engineering
Portland, OR, USA
We are looking for an experienced Senior DevOps Engineer to join our team. The ideal candidate works well in a fast-paced environment, operates gracefully under stress, manages multiple assignments effectively, is self-driven and proactive, and has strong interpersonal and communication skills.
As a Senior DevOps Engineer, you will play a key role in designing, building, and maintaining the infrastructure and processes that empower our development teams to deliver high-quality software quickly and reliably. You will be responsible for implementing and optimizing CI/CD pipelines, managing cloud-based infrastructure, and championing DevOps best practices throughout the organization. This role requires a strong technical background in DevOps practices, cloud technologies, automation tools, and the ability to mentor and guide other team members.
Role & Responsibilities
- CI/CD Pipeline Optimization: Design, implement, and continuously improve CI/CD pipelines to streamline the software delivery process, ensuring rapid and reliable deployments.
- Infrastructure Management: Manage and maintain our cloud-based infrastructure on Google Cloud Platform (GCP), ensuring high availability, performance, security, and cost-effectiveness.
- Automation Expertise: Automate repetitive tasks and processes to improve efficiency, reduce manual errors, and ensure consistency across environments.
- Monitoring and Observability: Implement and maintain comprehensive monitoring and alerting systems to proactively identify and resolve issues, ensuring the health and performance of our systems.
- Mentorship and Collaboration: Share your expertise and mentor other team members on DevOps best practices, tools, and techniques. Collaborate with development, QA, and SRE teams to troubleshoot and resolve issues, fostering a culture of collaboration and continuous learning.
- Security and Compliance: Ensure our infrastructure and processes adhere to industry best practices and security standards, protecting our systems and data from potential threats.
- Incident Response: Participate in incident response and post-incident review processes to minimize downtime, identify root causes, and implement corrective actions.
Minimum qualifications
Experience:
- 5+ years of experience in DevOps or a related field.
- Proven track record of designing, building, and maintaining CI/CD pipelines, infrastructure as code, and cloud-based infrastructure.
- Deep understanding of Google Cloud Platform (GCP) or other major cloud providers.
- Hands-on experience with containerization (e.g., Docker) and orchestration (e.g., Kubernetes).
Skills:
- Strong programming or scripting skills in Python, Bash, or other relevant languages.
- Expertise in configuration management tools (e.g., Ansible, Chef, Puppet).
- Proficiency in using version control systems (e.g., Git).
- Solid understanding of networking and security concepts.
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills, with the ability to mentor and guide others.
Education:
- Bachelor's degree in Computer Science, Engineering, or a related field.
Bonus Points:
- Experience with cybersecurity tools and technologies.
- Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana).
- Knowledge of SRE principles and practices.
- Contributions to open source projects related to DevOps.
- Certifications in relevant cloud technologies or DevOps practices.
Required Technical Skills
Cloud Infrastructure
- Expert: Google Cloud Platform (GCP)
  - Compute (Compute Engine, Kubernetes Engine, Cloud Functions, App Engine)
  - Storage (Cloud Storage, Persistent Disk, Filestore, Bigtable)
  - Networking (VPC, Load Balancing, Cloud DNS, Cloud CDN, Cloud Interconnect)
  - Databases (Cloud SQL, Cloud Spanner, Firestore)
  - Security (Identity and Access Management, Cloud Armor, Security Command Center, Secret Manager)
  - Monitoring (Cloud Monitoring, Cloud Logging, Cloud Trace)
- Advanced:
  - Experience with infrastructure optimization and cost management on GCP
  - Knowledge of GCP best practices and architectural patterns
- Bonus: Experience with other cloud providers (AWS, Azure) or hybrid cloud environments.
Infrastructure as Code (IaC)
- Expert: Terraform
- Proficient: Ansible or similar configuration management tools (Chef, Puppet)
- Bonus: Experience with other IaC tools (e.g., CloudFormation, Pulumi) or policy-as-code frameworks (e.g., Open Policy Agent)
CI/CD
- Expert: Jenkins, CircleCI, GitLab CI/CD, or similar tools
- Advanced: Experience designing and implementing complex CI/CD pipelines with multiple stages, environments, and deployment strategies
- Bonus: Experience with Tekton, Argo CD, or other Kubernetes-native CI/CD solutions
Containerization and Orchestration
- Expert: Docker, Kubernetes
- Advanced: Experience with Kubernetes networking, storage, security, and cluster management
- Bonus: Experience with Helm (Kubernetes package manager), Istio (service mesh), or Knative (serverless platform)
Programming and Scripting
- Proficient: Python, Bash, or other scripting languages (e.g., Ruby, Perl)
- Bonus: Experience with Go (Golang) or other compiled languages (e.g., Java, C++)
Monitoring and Observability
- Proficient: Prometheus, Grafana, ELK Stack (Elasticsearch, Logstash, Kibana), or similar monitoring and logging tools
- Advanced: Experience designing and implementing monitoring and alerting strategies for distributed systems
- Bonus: Experience with distributed tracing (e.g., Jaeger, Zipkin) or other observability tools (e.g., OpenTelemetry)
Security
- Proficient: Security best practices for cloud infrastructure, container security, network security, access control, and secrets management
- Advanced: Experience implementing and maintaining security policies and conducting security audits and reviews
- Bonus: Experience with security scanning tools (e.g., Trivy, Clair), penetration testing, or security certifications (e.g., CISSP)
Additional Skills (Highly Desirable)
- Experience with Temporal.io
- Knowledge of chaos engineering principles and practices
- Familiarity with GitOps workflows
- Experience with serverless technologies (e.g., Cloud Functions, AWS Lambda)
- Understanding of cost optimization techniques for cloud infrastructure
About Eclypsium
Eclypsium is a supply chain security platform that builds trust in every device by identifying, verifying, and fortifying software, firmware, and hardware throughout enterprise infrastructure. Eclypsium’s SaaS platform does this by integrating the bill of materials from suppliers and continuously monitoring to independently assess the risk of each critical asset from chip to cloud, throughout the life cycle, and across enterprise ecosystems. Protecting Fortune 100 enterprises and federal agencies, Eclypsium has been named a Gartner Cool Vendor in Security Operations and Threat Intelligence, a TAG Cyber Distinguished Vendor, one of the World’s 10 Most Innovative Security Companies by Fast Company, a CNBC Upstart 100, a CB Insights Cyber Defender, and an RSAC Innovation Sandbox finalist. For more information, visit eclypsium.com.
Benefits
Eclypsium is headquartered in Portland, OR, with distributed remote employees and global teams in Argentina and Singapore. We offer competitive compensation and benefits packages and are committed to the well-being of our employees and their families.
Benefits & Perks include:
- Competitive compensation & startup equity
- Comprehensive medical, dental, and vision coverage
- Life insurance, plus short-term and long-term disability coverage
- Flexible time off
- Employee assistance program
- Employer-sponsored 401(k) plan
- Paid parental leave
- Paid sabbatical
- Home office support for remote employees
- Regular events and celebrations
Equal Opportunity
Eclypsium is an equal opportunity employer. We believe in the importance of diverse teams and value candidates of all backgrounds. We do not discriminate on the basis of age, ancestry, citizenship, color, ethnicity, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or invisible disability status, political affiliation, veteran status, race, religion, or sexual orientation.