CRED Hiring Site Reliability Engineers in Bangalore

Last Updated 2025-02-26
credsresite reliability engineerbangalorebengalurufull-timefintech
thumbnail

Join CRED as a Site Reliability Engineer in Bangalore

CRED, a leading fintech company based in Bangalore, is seeking an experienced Site Reliability Engineer to join our dynamic team. This full-time role offers a competitive salary and the opportunity to work with cutting-edge technologies in a fast-paced environment.


Role: Site Reliability Engineer


Location: Bangalore, India


Type: Full-Time


Salary: Competitive, commensurate with experience


Experience: Candidates should have prior experience in SRE/DevOps, focusing on distributed cloud-native systems design, observability, container orchestration, maintenance, and troubleshooting.


Key Responsibilities

As a Site Reliability Engineer at CRED, you will:


- Collaborate with cross-functional teams to ensure production scalability and stability.


- Manage and troubleshoot large-scale data engineering infrastructures using technologies such as Spark/EMR, Flink, Apache Pinot, Kafka, Airflow, and Databricks.


- Utilize observability tools like Loki, Victoriametrics, and Datadog to monitor system performance.


- Implement best practices for managing self-hosted platforms on Kubernetes, ensuring high availability and robust CI/CD systems.


- Conduct post-incident reviews, perform root cause analysis, and resolve system issues to maintain service quality.


Qualifications

We are looking for candidates who have:


- Hands-on experience with public cloud platforms, preferably AWS.


- Proficiency in Kubernetes/EKS for operating large-scale production systems with stringent SLOs and SLAs.


- Strong programming and scripting skills in Shell, Python, or GoLang.


- Expertise in Linux infrastructure management and systems administration.


- Experience with infrastructure as code and configuration management tools like Terraform, Helm, and Ansible.


- Knowledge of CI/CD processes and tools such as Jenkins, ArgoCD, and GitHub Actions.


- Familiarity with big data systems and pub/sub solutions like Kafka.


- Proficiency with system observability tools including ELK/EFK, Prometheus, Grafana, and Datadog.


- Excellent interpersonal and communication skills.


If you are passionate about ensuring system reliability and have the required experience, we encourage you to apply for this exciting opportunity at CRED.


Apply Now: Click here to apply!

Join our Telegram Channel to get instant job updates daily!