Location: São Paulo, State of São Paulo, Brazil. Format: Hybrid
Responsibilities
- Ensure the availability, performance, and reliability of the LOS through proactive monitoring and incident response.
- Partner with product and engineering teams to define and maintain SLOs/SLAs, introduce error budgets, and drive accountability.
- Collaborate on architectural improvements aimed at increasing resilience, scalability, and observability.
- Lead incident analysis and postmortems, and implement preventive actions.
- Design, build, and operate infrastructure as code (IaC) using Terraform.
- Improve observability tooling and practices using Datadog, enhancing alerting, tracing, and system dashboards.
- Participate in on-call rotations and respond to production incidents.
- Automate operational processes and promote a DevOps culture across squads.
Requirements
- 5+ years of experience in Site Reliability Engineering or DevOps roles.
- Proven experience managing and improving production systems in a cloud-native environment (preferably AWS).
- Strong experience with observability tools and practices.
- Experience defining and driving adoption of SLIs, SLOs, and SLAs.
- Experience in operating event-driven systems and distributed architectures.
- Solid understanding of Terraform and infrastructure as code best practices.
- Strong debugging and troubleshooting skills across the stack.
- Comfortable writing and reviewing production-grade code (preferably in Java).
- Excellent written and verbal communication in English.
- A pragmatic and collaborative mindset, with a passion for system reliability and operational excellence.
- Bachelor's degree in computer science or similar fields preferred.
Perks
- Generous salaries
- Monthly lunches
- Robust employee recognition and talent development program
- Healthy work-life balance
- Opportunity for career growth
- Commitment to diversity and inclusion
Salary range
Generous salaries
Working hours
Hybrid
About the company
First Help Financial (FHF) is a fast-growing and culturally diverse company in the U.S. We provide auto loans to the underserved and care for our customers and partners with exceptional service. Through flexible financing options and trilingual support, we offer consumers an easier way to finance their first car. We lend to and support our portfolio, which has consistently grown 30%+ each year over the last nine years. Here you will find hard-working colleagues who come from over 20 countries. We hold ourselves to the highest standards of professionalism but also enjoy our work. Our culture and benefits are geared towards making you successful in life and comfortable at work.
Tech Stack
- Languages: Java (Spring Boot).
- Cloud: AWS (Lambda, Kinesis, S3, EC2).
- IaC & CI/CD: Terraform, AWS CodePipeline.
- Databases: MongoDB Atlas.
- Observability: Datadog.
- Event-driven architecture: Kinesis Streams, Lambdas.
- Version control & Collaboration: GitHub, Slack, Confluence, Jira.