Serve as a primary point responsible for the overall health, performance, and capacity of one or more of our Internet-facing services.
Gain deep knowledge of our complex applications.
Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and constant growth.
Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment.
Work closely with development teams to ensure that platforms are designed with'operability' in mind.
Requirements / skills:
Experience trouble-shooting that span systems, network, and code.
Python experience, specifically for systems automation.
Strong interpersonal communication skills (including listening, speaking, and writing) and ability to work well in a diverse, team-focused environment with other SREs, Engineers, Product Managers, etc.
Knowledge of most of these: data structures, relational and non-relational databases, networking, filesystems, web architecture, and related topics.