Yukerja Summary
The SRE Engineer role at PT. Dollar Information Consultan Indonesia is sourced from JobStreet (category: Teknologi & IT). Note the work location (Jakarta) before applying. Yukerja.com is not the employer; applications are handled on the official source site.
Job Description
1. Experience operating large-scale GPU cluster data centers, such as those with thousands or tens of thousands of NVIDIA GPUs.
2. Ability to identify and resolve common issues in AI infrastructure computing centers—including GPU, storage, and network-related problems—with sound technical judgment.
3. Demonstrated ability to track known issues end-to-end, improve problem-resolution efficiency, and standardize incident-handling processes.
4. Experience ensuring data center stability through proactive measures, such as configuring monitoring dashboards and performing daily inspections.
5. Experience deploying and implementing common monitoring tools, including Prometheus and Grafana.
Job Requirements
1. Bachelor’s degree or higher in computer science or a related field, with at least three years of experience in data center operations and maintenance.
2. Proficiency in core network protocols, including BGP, OSPF, IS-IS, VXLAN, and EVPN.
3. In-depth understanding of and hands-on experience with high-performance intelligent computing networks—such as InfiniBand, RoCEv2, and lossless Ethernet.
4. Preferred: Experience with SDN controllers (e.g., ONOS or OpenDaylight) or with P4 data-plane programming.
5. Fluent in English (CET-6 level or equivalent); the ability to use English as a working language is strongly preferred.