Machine Learning Engineer

Full Time
  • Full Time
  • Toronto

DeepRec.ai

Job Title: MLOps Engineer

Work Arrangement: Remote


Location: Toronto, Canada

Salary: Up-to $125,000 CAD


MLOps Engineer – Real-Time AI Systems


We’re looking for an experienced MLOps Engineer to help deploy and scale cutting-edge ML models for real-time video and audio applications. You’ll work alongside data scientists and engineers to build fast, reliable, and automated ML infrastructure.

Key Responsibilities

  • Build and manage ML pipelines for training, validation, and inference.
  • Automate deployment of deep learning and generative AI models.
  • Ensure model versioning, rollback, and reproducibility.
  • Deploy models on AWS, GCP, or Azure using Docker and Kubernetes.
  • Optimize real-time inference using TensorRT, ONNX Runtime, or PyTorch.
  • Use GPUs, distributed systems, and parallel computing for performance.
  • Create CI/CD workflows (GitHub Actions, Jenkins, ArgoCD) for ML.
  • Automate model retraining, validation, and monitoring.
  • Address data drift, latency, and compliance concerns.


What You Bring

  • 3+ years in MLOps, DevOps, or model deployment roles.
  • Strong Python and experience with ML frameworks (PyTorch, TensorFlow, ONNX).
  • Proficiency with cloud platforms, Docker, and Kubernetes.
  • Experience with ML tools like MLflow, Airflow, Kubeflow, or Argo.
  • Knowledge of GPU acceleration (CUDA, TensorRT, DeepStream).
  • Understanding of scalable, low-latency ML infrastructure.


Nice to Have

  • Experience with Ray, Spark, or edge AI tools (Triton, TFLite, CoreML).
  • Basic networking knowledge or CUDA programming skills.

Source

To apply, please visit the following URL: