Systems Reliability Engineer

Chiyoda-ku, Tokyo · Information Technology
EIRE Systems is looking to hire a Systems Reliability Engineer to work on-site at one of our major global financial services clients. In this role, you will be responsible for improving the availability, scalability, and performance of essential services within the client’s complex, fast-paced technology environment. You will collaborate closely with engineering and development teams to ensure the smooth operation and resilience of key systems, focusing on troubleshooting, automation, and proactive risk management.

About the Job:
As a Systems Reliability Engineer (SRE), your role will focus on applying software engineering principles to enhance system service availability, observability, scalability, performance, and resilience. You’ll be embedded with the client's infrastructure teams as part of a larger initiative to optimize their technology environment.
 
Your responsibilities will include, but are not limited to:
  • Collaborating closely with engineering and development teams to design, build, and maintain systems.
  • Troubleshooting issues across the entire technology stack: hardware, software, applications, and networks.
  • Identifying and driving opportunities to improve automation for platforms, including creating automation for deployment, management, and visibility of services.
  • Proactively identifying and addressing systems reliability risks.
  • Working alongside global and regional team members in a follow-the-sun model.
  • Representing the Reliability & Production Engineering (RPE) organization in design reviews and operational readiness exercises for new and existing services.
Successful candidates typically possess some or all of the following skills and experience:
  • Conceptual awareness of SQL and databases.
  • Proven ability to troubleshoot and debug issues to identify root causes, with strong analytical and problem-solving skills.
  • Hands-on experience with enterprise tools such as AppDynamics, Grafana, Splunk, or Dynatrace.
  • Experience with automation/configuration/release management tools such as Ansible or GitHub.
  • Automation experience using scripting languages like Python, Bash, Perl, or Ruby. Familiarity with at least one programming language is a plus.
  • Awareness of modern software and systems architectures, including load balancing, databases, queuing, caching, distributed systems failure modes, microservices, and cloud technologies.
  • Practical experience managing large-scale systems is an advantage.
  • Fluency in both English and Japanese is required.
Are you looking for a challenging and rewarding work environment that offers you the opportunity to grow and learn? We'd love to have a chat with you.

※ Applicants should be eligible to work full time in Japan.


 
