Accountabilities:
- Ensure the performance, reliability, and availability of mission-critical distributed systems supporting high-volume, low-latency workloads.
- Proactively identify and resolve bottlenecks across compute, network, storage, and application layers.
- Design, build, and maintain scalable infrastructure using Infrastructure-as-Code and automation practices.
- Manage cloud-native environments and containerized workloads, ensuring resilience and efficiency at scale.
- Participate in rotational on-call duties, handling incident response, root cause analysis, and preventive improvements.
- Monitor system health and performance using observability tools and metrics-driven approaches.
- Collaborate with engineering, trading, and security teams to align infrastructure with business and application needs.
- Implement and enforce security best practices, including IAM, secure CI/CD, and production hardening.
- Optimize system performance through tuning, profiling, and network-level improvements.
Requirements:
- 5+ years of experience in DevOps or SRE roles supporting production-grade, high-performance systems.
- Strong Linux administration skills and hands-on experience with AWS cloud environments.
- Proven experience with Kubernetes and containerized infrastructure at scale.
- Proficiency in scripting and automation using Python and Bash; additional experience with Go, JavaScript, or TypeScript is a plus.
- Strong experience with Infrastructure-as-Code tools such as Terraform and Ansible.
- Solid background in observability tools such as Prometheus, Grafana, or similar monitoring stacks.
- Experience in incident management, production support, and root cause analysis.
- Understanding of cloud networking, distributed systems, and performance engineering principles.
- Knowledge of security best practices, including IAM, SIEM, and secure deployment pipelines.
- Interest or experience in financial systems or digital asset environments is highly desirable.
Benefits:
- Competitive compensation package.
- Fully remote or flexible working arrangements across global regions.
- Opportunity to work in a high-impact, high-performance engineering environment.
- Exposure to complex systems at the intersection of cloud infrastructure and financial technology.
- Collaborative global team spanning multiple regions and disciplines.
- Strong emphasis on autonomy, ownership, and technical growth.
- Opportunity to work on systems where performance and reliability are business-critical.
🇧🇷 Essa vaga exige inglês. Você está pronto?
A DevSpeak Academy prepara desenvolvedores brasileiros para conquistar vagas internacionais. Domine o inglês técnico com professores que entendem o mundo dev.
Conheça a DevSpeak AcademyCandidaturas encerradasVer outras vagas
