Staff Software Engineer, Infrastructure
**Description**
Slack is your Digital HQ – a place where work flows between your people, systems, partners, and customers. From Fortune 100 companies to corner markets, millions of people around the world use Slack to connect their teams, unify their systems, and drive their business forward.
Slack breaks down communication silos inside and beyond your organization by bringing teams and tools together around common goals, projects and processes in channels and in Slack Connect. It removes the limits of physical walls, giving people the flexibility to do their best work where, when and how they prefer with features like huddles and clips. And it empowers everyone to automate common tasks with apps and workflows. In this digital-first era, Slack’s mission is to make people’s work lives simpler, more pleasant, and more productive.
A taste of our scale and reach:
* 77% of the Fortune 100 use Slack
* 150+ countries have daily active users in Slack
* Slack delivers 300k+ messages per second
* To date, 1.79 trillion messages have been sent on Slack
* 2.65 billion actions are taken in Slack each day
* Slack has 200k+ paid customers
**About The Team**
The Webapp Infrastructure (WIN) pillar provides the tools that let hundreds of developers work in a multimillion-line codebase with safety and productivity at the forefront. WIN handles maintenance and upgrades of the Hack programming language, static analysis tooling, and widely used libraries in the codebase, as well as tuning and debugging the HHVM runtime and other services it depends on. With two teams, Runtime, Async Services, and Core Libraries (RASCL) and Webapp Infra Reliability Engineering (WIRE), the Webapp Infrastructure pillar supports the middle layers of the stack, above our compute infrastructure and below the product code. **This role is open for the WIRE team.**
The WIRE team develops, runs and scales core components of Slack’s Webapp Infrastructure and Product. We own, maintain, and improve the systems that power Slack’s API servers, asynchronous job processing, caching, and rate limiting. We continuously seek to improve the visibility, speed, and safety of Slack’s distributed application architecture! Part of the team’s charter is also to drive high-priority efforts for reliability, infrastructure upgrades, migrations, capacity planning, operational efficiency and simplification.
We know we’ve done our job correctly when *none of our users think about us.* In other words, Slack just works seamlessly!
On this team, you will combine your software and systems engineering expertise to run large-scale, distributed, fault-tolerant services. We welcome new perspectives and strategies to address evolving challenges to reliability. We collaborate with many Infrastructure and Product engineering teams at Slack to continuously improve shared technology and processes.
**What You Will Be Doing**
* Directly support multiple components of Webapp’s infrastructure, including monitoring and visibility automation, and other infrastructure tooling.
* Define and build solutions to improve the reliability and resilience of our services.
* Write code to automate maintenance and reduce the need for manual intervention.
* Help define SLAs/SLOs for Webapp infrastructure, manage code deployments, fixes and software updates, and automate our operational processes.
* Take on operational responsibility in addition to being a software developer. You will participate in the team’s on-call rotation, assist with triaging and addressing production issues, and respond to incidents.
* Review code and get your code reviewed; mentor and be mentored by other engineers. Teamwork is what makes the dream work.
**What You Should Have**
* Familiarity and experience with software development, including traditional operations and/or infrastructure tooling.
* Experience managing critical production infrastructure, maintaining reliability and uptime, and having a customer-first view of operational safety.
* Experience with functional or imperative programming languages such as Ruby and Go.
* Experience with Chef, Terraform, cloud infrastructure (ideally AWS), IAM, Docker, Linux, and observability tools such as Logstash, Kibana, Prometheus, and Grafana.
* Strong collaboration skills: collaborating is core to how we operate and this excites you! To us, this means working with other teams on cross-functional projects as well as day-to-day collaboration.
* Familiarity with operational metrics, experience with incident management, and strong debugging skills.
* Bachelor’s degree in Computer Science, Engineering or related field, or equivalent training or work experience.
**Bonus Points**
* Experience as a Site Reliability Engineer (SRE), or as a platform or infrastructure engineer building and managing reliability mechanisms on distributed infrastructure.
* Comfort with deploying, operating and debugging distributed systems on Linux at scale.
* Experience with AWS infrastructure at scale.
* Experience with HHVM, mcrouter, and memcached.
* Ability to dig deep across multiple layers of the stack, from networking and virtualization to configuration management and packaging.
* Experience working within highly regulated environments where an understanding of FedRAMP/NIST frameworks was essential.
Core Infrastructure is a diverse and inclusive team that treats its colleagues exceptionally well. We are happy to help you learn what you need to know, and we encourage and support each other’s growth, so you are not expected to have expertise across all of these areas. The team looks for people who are curious, inventive, and work to be a little better every single day. In our work together, we aim to be smart, humble, hardworking and, above all, collaborative. Come join us!
For roles in San Francisco and Los Angeles: Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records.