Job Description
Our Company
Changing the world through digital experiences is what Adobe’s all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences! We’re passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact with customers across every screen.
We’re on a mission to hire the very best and are committed to creating exceptional employee experiences where everyone is respected and has access to equal opportunity. We realize that new ideas can come from everywhere in the organization, and we know the next big idea could be yours!
The Opportunity
You will be a senior member of the Site Reliability Engineering team of the Digital Experience. We are looking for a Senior Site Reliability Engineer with a strong systems, networking, and scripting background who is passionate about developing software to help scale monitoring, alerting, provisioning and configuration management. We are a multi-cloud environment (Azure/AWS), are security-focused and are helping customers succeed. This individual should be self-motivated and have a drive for quality.
What you'll Do
- Engage with product and engineering to drive and improve the whole lifecycle of operational readiness - from inception and design, through deployment, operation and refinement proactively.
- Assist with the planning and implementation of migrating our applications from our private cloud to a public cloud.
- Write software layers, scripts, deployment frameworks, tracers, monitors, self-healing/auto remediation tools and automate the processes.
- Build and maintain software modules for use and re-use in cloud and on-premise systems automation.
- Maintain business continuity by identifying and driving opportunities to make systems highly resilient and human-free.
- Assist our software engineering team to ensure accurate monitoring and metrics are being built into applications before going to production.
- Maintain up-to-date documentation on deployments, processes, and standard operating procedures/run-books.
What you need to succeed
- Bachelor’s Degree in Computer Science or equivalent and 5 years of relevant work experience
- Advanced Experience with Linux, Internet Protocols, and Large-Scale Operations
- Programming (Python and Bash are our preferred scripting/shell languages) and automation skills.
- Troubleshooting and system engineering exposure in Linux production environments. Experience with Linux, Internet Protocols, and Large-Scale Operations.
- Experience with AWS and/or Azure stack – particularly in the areas of networking (VPCs, security groups), VMs (EC2), databases (RDS), load balancing (ELB, ALB)
- Excellent information management practices, such as thorough documentation, usage of wikis, and other collaboration tools
- Ability to scope project work, estimate effort and then break down work into sub-tasks
- Excellent written and verbal communication skills, demonstrating the ability to effectively convey technical information to both technical and non-technical audiences
Bonus skills
- Experience with relational databases such as MySQL, Postgres, and document stores such as MongoDB.
- Experience deploying applications in containers using Docker and Kubernetes.
- Strong intuition about system design, robustness, and scalability.
Job ID: 34409