Site Reliability Engineer
Filevine
Software Engineering
United States
Posted on Apr 5, 2025
Qualifications
- Curiosity, a willingness to learn, a passion for continual improvement, and unbridled enthusiasm for making things better every day without needing to be directed to do so
- Proficiency in all of the skills expected of our SRE IIs
- A bachelor's degree in computer science, information systems, or a related field; comparable certifications; or equivalent direct work experience
- A minimum of 8 years of experience in hands-on technical roles
- A minimum of 2 years of Site Reliability Engineering experience
- Experience building autonomous systems that manage software operational details without human intervention
Preferred Qualifications
- M.S. in computer science, information systems, or a related field; comparable certifications; or equivalent direct work experience
- 2-6 years of Site Reliability Engineering experience
- Experience developing, deploying, and maintaining internet-scale applications
- Experience incorporating Artificial Intelligence or Machine Learning into internet-scale applications
Responsibilities
- Provide leadership, mentoring, and excellent judgement by being responsible for:
  - Developing autonomous systems that manage the details necessary to build, deploy, test, and operate all Filevine Inc. products
  - Being the voice of Reliability on your team throughout the SDLC
  - Collecting, monitoring, aggregating, dashboarding, and alerting on software and server events
  - Improving the CI/CD pipeline
  - Developing playbooks, tools, and scripts to streamline processes and shorten problem resolution time
  - Identifying and fixing gaps in the availability of systems
  - Improving and defending the security of software and systems
  - Documenting and diagramming processes, procedures, and best practices
  - Finding, learning, improving, or creating new tools that are reliable, usable, and helpful to enable other engineers to perform their work more efficiently
- Work within your assigned team to complete assigned duties while mentoring, training, and reviewing the work of more junior engineers
- Work either individually or in conjunction with other engineers to complete assignments
- Be part of an on-call rotation with other team members to provide 24/7/365 production reliability support
- Be part of an on-call rotation with other team members to provide escalated emergency support for the services your team owns
- Communicate frequently, clearly, and effectively with various technical and management audiences