System Reliability Analyst 3197389

System Reliability Analyst 3197389

Date: Feb 01, 2022
Location : Montréal (Qc)

As a financial service provider, our client helps individuals, families, institutions and governments raise, manage and distribute the capital they need to achieve their goals.
To support these activities and achieve the goals, the firm processes an average of 40 million trades and moves assets valuing trillions of dollars in multiple countries and currencies across the globe every single day.
Reliability and Production Engineering (RPE) teams are focused on improving the service availability, scalability, performance, and efficiency of the systems that facilitate the volume and variety of transactions at this immense scale.
System Reliability Analysts make powerful contributions to this effort by applying the principles of System Reliability Engineering (SRE) to systems covering multiple business lines, including Investment Banking, Prime Brokerage, Sales & Trading (Equities, Fixed Income), as well as roles in core Finance, Operations, and Compliance technology groups.

Why join our teams?

We are looking for energetic and talented System Reliability Analysts to grow our SRE capability in multiple squads servicing business lines and technology functions across the firm. These roles in RPE offer a front-row seat into the fast-paced world of international finance. You will interact daily with colleagues and businesspeople all over the world. And don’t worry if your background isn’t in finance technology. We value the perspective of talented and creative individuals with experience in other industries and will train you on financial concepts and terminologies in order to help you grow professionally and succeed in your role.
We are passionate and creative professionals. If you have the same passion for technology and innovative solutions as we do, you will enjoy your role in our teams. We have a place for all motivated and talented individuals!

Of course, you also benefit from all the advantages the bank offers to its employees:

  • Competitive compensation and benefits package
  • Full health insurance coverage
  • Professional training, certifications
  • Mobility opportunities
  • A variety of corporate activities and opportunities to give back to our community
  • Flexibility in working hours and working from home

Your role and responsibilities :

As a System Reliability Analyst, your responsibilities will include, but not be limited to:
  • Working closely with engineering/development teams to design, build, optimize, and maintain systems.
  • Troubleshooting issues across the entire technology stack: hardware, software, application, and network.
  • Aggressively targeting toil and operational risk, and deploying solutions to reduce these.
  • Broadening infrastructure and application observability.
  • Proactively identifying and addressing active or potential risks to system reliability.
  • Advocating for reliability priorities in application design reviews and operational readiness exercises for new and existing services.


What skills and experience do I need?

You should apply if you have at least a Bachelor’s degree in Computer Science or other technical discipline(s), plus hands-on experience with any combination of the following:
  • 3-5+ years practical experience in production systems support or application development
  • Hands on experience managing systems in a large scale distributed Unix/Linux environment is essential.
  • Automation-related experience is required, using scripting languages such as Python, bash, Perl, and/or Ruby. Higher-level compiled languages such as C++, C#, JAVA, Scala, and Go are a big plus.
  • Deep knowledge of and hands-on experience applying the principles of System/Site Reliability Engineering (SRE).
  • Practical experience designing and instrumenting SLO/SLI dashboards is particularly valuable.
  • Hands on experience on enterprise tools such as AppDynamics, Grafana, Splunk, Dynatrace
  • Experience with Puppet, Ansible, Chef, GitHub or any automation/configuration/release management tools
  • Awareness of, and ability to reason through modern software and systems architectures, including load-balancing, databases, queueing, caching, distributed systems failure modes, micro services, Cloud, etc.
  • Working ability to interact with message transport platforms and protocols (MQ, CPS, XML, FIX) and distributed database technologies (DB2, Sybase, Mongo, GreenPlum, Postgres, KDB).
  • Autosys scheduling and batch processing concepts.
  • Deep understanding of infrastructure and operating system concepts such as processes, memory allocation, and networking, with an understanding of how applications are affected by the above, and ability to debug and troubleshoot accordingly.

I am convinced! What else does it take to come and work at Morgan Stanley?

  • Problem-solving mindset
  • Creativity
  • Integrity
  • Grit, drive, and a deep sense of ownership
  • Clear and energetic communication style
  • Interest in distributed systems and working with high-scale services
  • Desire to work in a fast-moving environment
  • A fearless approach to change
  • Enjoy solving hard problems
  • Ability to collaborate and work as a cohesive team to create something amazing!


Apply for this job

Our advisory for this position

Charlotte Teulet

HR Advisor and Talent Finder

need you.
Apply today