Overview
We are building our TradeOps team from scratch and are looking for a hands-on SRE Lead. You will start as an individual contributor and gradually grow into a leadership role, shaping the team and ensuring high availability of our trading systems globally.
Key Responsibilities
- Ensure reliability, availability, and performance of trading infrastructure.
- Monitor trading activities and proactively identify issues.
- Incident management: rapid escalation, troubleshooting, and mitigation.
- Participate in on-call rotations covering multiple regions.
- Debug and troubleshoot production issues in C++ and Python.
- Develop observability systems, metrics, and analytics for trading.
- Identify areas for platform improvement and implement solutions.
- Collaborate with global teams and support cross-regional trading coverage.
- Stay informed on financial and technical news relevant to trading.
Requirements
- Bachelor’s degree in a quantitative discipline (CS, Engineering, Physics, Math, etc.).
- 5+ years of experience in Site Reliability Engineering.
- Strong programming skills in Python and/or Go.
- Solid experience with Unix/Linux systems.
- Hands-on experience with Docker, Kubernetes, Grafana, and Linux-based infrastructure.
- Strong problem-solving skills in complex technical environments.
- Excellent communication skills with internal and external stakeholders.
- Willingness to learn new domains and take ownership.
What We Offer
- Competitive salary and comprehensive benefits.
- Generous bonus structure.
- Cutting-edge hardware and software for production.
- High ownership over critical initiatives that impact the business directly.
- Opportunity to work with global trading systems and exchanges.
- Flexible work environment with minimal bureaucracy.
- Support for professional growth: tuition reimbursement, conferences, and training.