Skip to main content

Staff Software Engineer · AI/ML Systems · Platform Engineering

Building AI‑powered
systems at
Canada's largest bank.

From LLM ensemble labelling and knowledge distillation to sub-50ms intent classification — nine years of continuous progression at RBC, from intern to Staff Engineer.

Years at RBC
9+
Roles Progressed
6
Intents Classified
14K+
CPU Inference Latency
50ms

About

Nine years at Canada's largest bank. From intern to Staff. Always building.

Nearly a decade ago, I joined RBC as an intern with a Statistics degree and a curiosity about what production software really looks like at scale. That curiosity led me through six distinct engineering disciplines — ChatOps, SRE, security platforms, developer experience — before arriving at what I find most compelling: building AI systems that work at the intersection of research and production.

Today, I lead AI engineering for an end-to-end call center platform at Canada's largest bank. My current work spans LLM ensemble labelling pipelines, knowledge distillation of large embedding models, and intent classification systems routing 14,000+ banking intents with under 50 ms latency on CPU-only infrastructure. The challenge is bridging the gap between what's possible in an AI research lab and what can actually be deployed reliably at enterprise scale.

My foundation is a Statistics degree from the University of Toronto — which means I think in distributions, not just decisions. I care about why models behave the way they do, not just whether they hit a benchmark. That mathematical rigour shapes how I approach everything from embedding fine-tuning to production observability.

Outside work, I maintain a homelab, experiment with network engineering, and prototype ideas. I'm interested in the long game of technology: systems that age gracefully, teams that grow sustainably, and engineering that compounds.

  • LLM Ensemble Labelling
  • Knowledge Distillation
  • <50ms CPU Inference
  • Canada's #1 Bank
K-M Samiul Haque — Staff Software Engineer
Samiul Haque Staff Software Engineer · RBC

Professional Experience

7 roles

Staff Software Engineer — AI Advice Centre

Royal Bank of Canada Toronto, ON

AI / MLBackend
  • Architect end-to-end AI call center platform integrating SIP telephony, real-time speech-to-text transcription, and LLM summarization pipelines processing thousands of daily banking interactions.
  • Engineered LLM ensemble labelling pipeline using GPT-4o mini, GPT-4.1 mini, and GPT-5 mini with consensus voting and tie-breaking escalation to larger thinking models; generated high-quality training labels at scale for fine-tuning.
  • Implemented knowledge distillation workflow: used LLM ensemble as teacher to fine-tune Arctic Embed 2.0 Large as domain-specific student model for banking intent semantics.
  • Built continuous training MLOps pipeline on S3 + Apache Airflow; automated dataset ingestion, model retraining, evaluation, and deployment to OpenShift.
  • Benchmarked SVM (RBF kernel), logistic regression, SetFit (head and full fine-tune), centroid, and fine-tuned Arctic Embed classifiers for production intent routing.
  • Led BART intent classification system using Jina v3 / multilingual-e5 with ONNX Runtime; reduced inference latency from 8 s → 50 ms on CPU-only OpenShift 4.
  • Built Spring Boot microservices with k-NN routing of 14,000+ banking intents across 110 workflow categories; achieved 95% semantic naming accuracy via LLM-assisted clustering.
Stack Python · Java · Spring Boot · ONNX Runtime · Arctic Embed 2.0 · Jina v3 · multilingual-e5 · SetFit · SVM · GPT-4/4o/5 · Claude · S3 · Apache Airflow · OpenShift 4 · AWS Transcribe/Bedrock

Staff Software Engineer — Security Platforms

Royal Bank of Canada Toronto, ON

SecurityBackend
  • Built centralized vulnerability triage platform spanning Canada, US, and UK subsidiaries; reduced security assessment time by 60% across multi-subsidiary operations.
  • Designed data ingestion pipelines integrating Snyk, NexusIQ, Kenna, Tenable, and Aqua security feeds into unified Snowflake data warehouse.
  • Delivered full-stack platform using React, FastAPI, Snowflake, Redis, and SQL Server deployed on OpenShift.
Stack React · FastAPI · Python · Snowflake · Redis · SQL Server · OpenShift · Snyk · NexusIQ · Kenna · Tenable · Aqua · Recorded Future

Lead Software Engineer — SRE & Chaos Engineering

Royal Bank of Canada Toronto, ON

SREDevOps
  • Automated large-scale chaos experiments on VMs and Kubernetes namespaces across AKS, OpenShift, and VMware using Tanium and Gremlin APIs; reduced experiment setup time by 90%.
  • Architected SRE data enrichment pipelines integrating PagerDuty, ServiceNow, and ELK Stack; reduced mean time to detect (MTTD) by 30%.
  • Built observability data pipelines using Python, Pandas, Oracle DB, SQL Server, and PostgreSQL to feed critical internal monitoring dashboards.
Stack Python · Go · Tanium API · Gremlin API · Elasticsearch · PagerDuty · ServiceNow · Pandas · PostgreSQL · SQL Server · Oracle DB · OpenShift · AKS

Senior Software Developer — Developer Experience & OSPO

Royal Bank of Canada Toronto, ON

BackendDevOps
  • Architected mTLS API gateway connecting Slack applications to internal banking services using Golang, HashiCorp Vault, GitHub API, and Nginx on OpenShift; enabled enterprise ChatOps saving hundreds of engineer-hours weekly.
  • Built bulk employee onboarding service reducing provisioning time from hours to minutes using Golang, Gin, PingFederate SSO, and ServiceNow on OpenShift.
  • Developed metrics pipeline tracking open-source/innersource artifact reuse across the organization; delivered executive dashboards using Logstash, Kibana, Redis, NexusIQ, and Artifactory.
  • Created innersource SDKs in Python/Golang reducing Slackbot onboarding from 2 weeks to 2 days for 15+ teams.
Stack Go · Gin · HashiCorp Vault · Nginx · Elasticsearch · Logstash · Kibana · Redis · NexusIQ · Artifactory · MariaDB · Java · Python · OpenShift

Software Developer — Developer Experience

Royal Bank of Canada Toronto, ON

Backend
  • Built internal Stack Overflow-style Q&A platform with gamification; increased answered-question rate by 25% and connected developers across 10+ lines of business.
  • Architected production-ready WebSocket Slackbot pub-sub framework scaling horizontally without message duplication using Golang and RabbitMQ.
Stack Java · Spring Boot · Elasticsearch · MariaDB · PingFederate · PCF · Golang · RabbitMQ

Software Developer Intern → SRE Intern — 3 terms

Royal Bank of Canada Toronto, ON

AI / MLDevOps
  • Implemented ChatOps into DevOps pipeline via scalable Slackbot saving 20+ engineer-hours per microservice per week using Python, MongoDB, Redis, RASA NLU, Docker, OpenShift, and Kubernetes.
  • Built ML-based SRE pipeline improving application resilience and reducing toil/recovery times using Elasticsearch, Logstash, Kafka, and IBM CDP.
  • Developed automation test portal reducing test execution time by 80%, saving $100k+ annually.
  • Built intelligent war room using Raspberry Pi, Python, Snips, PagerDuty, Moogsoft, and ServiceNow to surface solution vectors from similar past incidents.
Stack Python · Java · MongoDB · Redis · RASA NLU · Docker · Kubernetes · OpenShift · IBM UDeploy · Logstash · Kafka · Ansible · Dynatrace · ServiceNow · Moogsoft · Elasticsearch

Full-Stack Developer — Contract

University of Toronto Remote

Backend
  • Developed web application for the History Department to track course enrollment trends; built with Django, Bootstrap, Redis, jQuery, Nginx, Chart.js, and PostgreSQL.
Stack Django · Bootstrap · Redis · jQuery · Nginx · Chart.js · PostgreSQL
"The best engineers don't just ship code — they design systems that make everyone around them more effective."
— How I think about engineering leadership

Featured Projects

7 projects

Production Systems Internal · Sanitized

  • <50ms CPU inference
  • 14K+ intents
  • 95% naming accuracy

BART Intent Router

Knowledge Distillation Pipeline

LLM ensemble labelling pipeline (GPT-4o mini + GPT-4.1 mini + GPT-5 mini with consensus voting) used as teacher to fine-tune Arctic Embed 2.0 Large as a domain-specific student model. Routes 14,000+ banking intents across 110 categories with sub-50ms latency on CPU-only OpenShift 4.

PythonONNX RuntimeArctic Embed 2.0LLM EnsembleApache AirflowOpenShift
  • 1000s of daily interactions
  • Real-time SIP
  • Multi-model LLM

AI Call Center Platform

Real-time SIP + STT + LLM

End-to-end AI-powered platform integrating SIP telephony, real-time speech-to-text transcription, and LLM summarization pipelines. Processes thousands of daily banking interactions with automated intent classification and agent-assist workflows.

JavaSpring BootAWS TranscribeAWS BedrockGPT-4ClaudeOpenShift
  • 60% faster assessments
  • 3 global subsidiaries
  • 5 security feeds

Vulnerability Triage Platform

Multi-Subsidiary Security Data Pipeline

Centralized vulnerability management platform spanning Canada, US, and UK subsidiaries. Unified data ingestion from Snyk, NexusIQ, Kenna, Tenable, and Aqua into a Snowflake warehouse with real-time triage dashboards.

ReactFastAPISnowflakeRedisPythonOpenShift
  • 90% setup reduction
  • AKS + OpenShift + VMware
  • 100s of VMs/pods

Chaos Engineering Platform

Automated Resilience Testing at Scale

Automated large-scale chaos experiments across AKS, OpenShift, and VMware clusters using Tanium and Gremlin APIs. Enables self-service chaos scheduling for VM and Kubernetes workloads across multi-cloud infrastructure.

PythonGoTanium APIGremlinKubernetesAKS

Open Source & Hackathons

MirrML

ML Clothing Style Classifier — UofTHacksIV

Flask + Clarifai Image Recognition API; clothing style classifier (business/casual/evening) via neural network trained on scraped image data. Matches users with friends with similar style profiles.

LendR

NFC Micro-Lending App — TD Finhacks

Android app using NFC technology for social micro-financing. Built a Karma system to incentivize repayment; negative karma reduces borrowing limits. Custom backend tracks transactions and karma ratings.

HackTheValley API

Event Management API

Production event management API for HackTheValley, a University of Toronto hackathon. Built with Go for high performance and deployed for the annual event.

The Journey

From intern to Staff Engineer — nine years of building at scale.

Path to Principal Engineer
Staff
  1. Staff Software Engineer — AI Advice Centre

    LLM ensemble labelling, knowledge distillation, Arctic Embed fine-tuning

  2. Staff Software Engineer — Security Platforms

    Centralized vulnerability triage across global subsidiaries

  3. Lead Software Engineer — SRE & Chaos

    Chaos engineering & SRE enrichment pipelines

  4. Senior Software Developer — Dev Experience

    Enterprise ChatOps, mTLS gateway & innersource SDKs

  5. Software Developer — Dev Experience

    Internal Q&A platform & WebSocket pub-sub Slackbot framework

  6. Software Developer Intern → SRE Intern

    ChatOps automation with RASA NLU + ML-based SRE pipeline

Technical Skills

10 categories

AI / ML Systems

  • LLM Ensemble Labelling
  • Knowledge Distillation
  • Arctic Embed 2.0
  • Embedding Fine-tuning
  • ONNX Runtime
  • SetFit
  • SVM (RBF)
  • Intent Classification
  • Semantic Clustering
  • RAG
  • Prompt Engineering
  • RASA NLU
  • Clarifai

Languages

  • Python
  • Java
  • Go (Golang)
  • JavaScript
  • TypeScript
  • SQL
  • Shell/Bash
  • C
  • C++
  • Ruby

MLOps & Data Pipelines

  • Apache Airflow
  • S3 Data Pipelines
  • Continuous Training
  • Model Evaluation
  • Snowflake
  • Apache Kafka

Cloud & Containers

  • OpenShift (OCP)
  • Kubernetes
  • Docker
  • Azure AKS
  • AWS (Transcribe, Bedrock, S3)
  • PCF

Frameworks

  • Spring Boot
  • FastAPI
  • Flask
  • Django
  • React
  • Gin
  • ASP.NET Core
  • RabbitMQ

Databases

  • PostgreSQL
  • SQL Server
  • Elasticsearch
  • MongoDB
  • Redis
  • MariaDB
  • MySQL
  • Oracle DB

DevOps & CI/CD

  • GitHub Actions
  • Jenkins
  • Ansible
  • HashiCorp Vault
  • Artifactory
  • SonarQube
  • IBM UDeploy
  • Git

SRE & Observability

  • Gremlin
  • Tanium
  • PagerDuty
  • ServiceNow
  • ELK Stack
  • Prometheus
  • Grafana
  • Moogsoft
  • Dynatrace

Security

  • Snyk
  • NexusIQ
  • Kenna
  • Tenable
  • Aqua
  • Recorded Future
  • mTLS
  • PingFederate

AI Dev Tools

  • Claude
  • ChatGPT / GPT-4o
  • GitHub Copilot
  • Cursor

Education

Honours B.Sc. Statistics

University of Toronto

Toronto, ON

Double Minor: Geographic Information Systems (GIS) & Psychology

Activities VP of Development, AMACSS (Association of Mathematical & Computer Science Students)

Volunteering

3 organizations

Contributed to club infrastructure and development projects, mentored students in software engineering and site reliability engineering fundamentals.

Web Developer

Women in Computer Science, Statistics & Mathematics (WiCSM)

Built and maintained the organization's web presence, supporting an initiative that advocates for gender diversity in STEM at the University of Toronto.

Get in Touch

Let's work together
or just say hello.

Whether you're a hiring manager looking for a Principal-calibre engineer, a team exploring AI/ML systems at enterprise scale, or just want to connect — my inbox is open.