Job Summary
As the largest online distributor of restaurant supplies and equipment, WebstaurantStore hosts an expansive catalogue with over 430,000 products that are delivered through fast, dependable shipping. Unlike most in the e-commerce arena, almost all of our technological design, development, and systems management is done in-house, allowing us to create custom solutions within an ever-changing market.
This consistent, organic growth is why we need a Mid-Level or Senior Site Reliability Engineer with a focus on on-prem Kubernetes (K8s) who is looking to further their career. Successful SREs at our company have typically started their careers as developers or systems engineers who sought a wider variety of work.
Remote Work Qualifications
- Access to a reliable and secure high-speed internet connection. Cable or fiber internet connections (at least 75 Mbps download/10 Mbps upload) are preferred, as satellite connections often cannot support the technologies used to perform day-to-day tasks.
- Access to a home router and modem.
- A dedicated home office space that is noise- and distraction-free. The space should have a strong wireless connection or a wired Ethernet connection (a wired connection is preferred, if possible).
- A valid, physical address (apartment, suite, etc.). PO Boxes are not supported, as a physical address is required for you to receive your computer equipment.
- The desire and ability to work and communicate with other team members via chat, webcam, etc.
- Legal residence in one of the following states: AK, AL, AR, AZ, CT, DE, FL, GA, IA, ID, IN, KS, KY, LA, MD, ME, MI, MN, MO, MS, NC, ND, NH, NM, NV, OH, OK, PA, SC, SD, TN, TX, UT, VA, VT, WI, WV, and WY.
We only accept W-2 candidates; H-1B sponsorship is not available.
Responsibilities
- Manage and maintain on-premises containerized environments.
- Deploy resources through automated continuous delivery workflows.
- Standardize and oversee service deployments.
- Troubleshoot and resolve issues with running services and infrastructure components.
- Securely handle and rotate sensitive information.
- Manage and optimize persistent storage solutions.
- Configure and maintain network entry points for applications.
- Enhance service-to-service communication and resilience.
- Monitor systems and build effective alerts and dashboards.
- Develop scripts and tools to streamline operations and incident response.
- Respond to production incidents and coordinate follow-up actions.
- Maintain version control practices for infrastructure and application changes.
- Participate in the on-call rotation. (The effort we put into reliability keeps the on-call volume low.)
Physical Requirements
- Work is performed while sitting/standing and interfacing with a personal computer.
- Requires the ability to communicate effectively using speech, vision, and hearing.
- Requires the regular use of hands for simple grasping and fine manipulations.
- Requires occasional bending, squatting, crawling, climbing, and reaching.
- Requires the ability to occasionally lift, carry, push, or pull medium weights, up to 50 lbs.
Qualifications
Experience
3+ years in a professional systems engineering, related development, or SRE role with experience in:
- Kubernetes, Helm Charts, Kustomize, etc. (Required)
- CI/CD platforms (Argo CD, GitLab CI/CD, Flux)
- Troubleshooting pods, nodes, and deployments.
- Secrets management (Sealed Secrets, HashiCorp Vault)
- Persistent storage (Rook/Ceph, NFS)
- Ingress controllers (HAProxy, NGINX, Traefik)
- Service mesh (Istio, Consul)
- Configuration management (Ansible, Terraform)
- Observability tools (OpenTelemetry/OTEL)
- Programming/scripting (Python, Golang)
- Production incident response in a Linux environment.
- Version control (Git)
Note: If your expertise primarily lies in dedicated DevOps (without a focus on system reliability) or Cloud Engineering, we encourage you to explore other opportunities within our organization that may be a better fit.
Education
This role does not require a degree. We value relevant skills, experience, and alignment with our core values above all else.
Desired Traits & Skills
Hungry:
- Passionate about their work.
- Goes above and beyond to get the job done.
- Looks for opportunities in all aspects of the company.
- Process improvement focus.
Humble:
- Easily admits to mistakes.
- Acknowledges their weaknesses.
- Displays grounded confidence, not arrogance.
Smart:
- Shows empathy.
- Attentive listener.
- Adjusts their own behavior to fit the nature of a conversation.
- Understands what others are thinking.
One Part of the Bigger Picture
WebstaurantStore’s parent company, Clark Associates, has made the Central Penn Business Journal’s list of “Top 50 Fastest Growing Companies” in Pennsylvania for 9 years in a row. The base of Clark’s success comes from four key directives: hiring great people, creating value for customers, investing in employees, and investing in their communities. These pillars drive each of Clark Associates’ multi-million-dollar businesses forward, including WebstaurantStore and other industry-leading names like 11400, Clark Food Service Equipment, The Restaurant Store, and Clark National Accounts.
Are You Ready?
Entrepreneurial spirit is the driving force behind WebstaurantStore’s work environment. Making things better for our customers is our goal every single day. Achieving that goal means taking risks, accepting failure, and learning from our mistakes. If that sounds like a mission you’re ready to be a part of, we’d love to discuss this role with you further, and we’re excited to meet you!
Never heard of us? That’s okay! We love sharing our stories.