CV

Basics

Name Athar Mahmoudi
Label Machine Learning Scientist
Email athar1@ualberta.ca
Phone (587) 501-8919

Technical Skills

Reinforcement Learning
Model-Free RL
Deep Reinforcement Learning (DQN, PPO, SAC)
Q-Learning
Reward Modeling (RLHF-style)
Personalized Reward Models
Reward Shaping
Transfer Learning in RL
Deep Learning & Time Series
Transformers
Decision Transformer
Long Short-Term Memory (LSTM)
Representation Learning (VAE, VQ-VAE)
Online Decision Transformer
Domain Adaptation
Domain Generalization
Optimization & AutoML
Bayesian Optimization
Hyperparameter Optimization (HBO)
AutoML pipelines
Research Methods
Explainable AI (SHAP)
Experimental Design (A/B Testing, Ablation Studies)
Statistical Analysis
Signal Processing
Uncertainty Estimation
Video Games
Games as Psychological Measures
Games for Therapy
Game Adaptation

Work

  • 2025.01 - Present

    Edmonton, Canada

    Machine Learning Scientist
    Advanced Sensor Research Inc. (ASR)
    Building end-to-end time-series machine learning pipelines for non-invasive glucose estimation.
    • Built end-to-end time-series machine learning pipelines for non-invasive glucose estimation. [PyTorch, TensorFlow, Hugging Face]
    • Developed and benchmarked Transformer-based sequence models alongside baselines (LSTM, TCN), including representation learning with autoencoder-style pretraining. [PyTorch]
    • Ran structured architecture and hyperparameter experiments; compared training protocols for stability across sensor noise and session variability. [PyTorch, Scikit-learn]
    • Designed evaluation splits to test generalization across subjects/sessions/sensor units; investigated domain generalization and domain adaptation strategies. [PyTorch, Pandas]
    • Implemented explainability and error analysis workflows (slice-based analysis, failure mode analysis, calibration/uncertainty checks). [PyTorch]
    • Performed exploratory data analysis and signal-quality investigations to identify data issues and guide data-cleaning and collection improvements. [SciPy]
  • 2022.06 - 2023.04

    Montreal, Canada

    Intern Research Scientist
    Samsung Research Montreal
    Designed and evaluated deep reinforcement learning methods for network load balancing and energy efficiency.
    • Designed and evaluated deep reinforcement learning methods for network load balancing in simulated environments. [PyTorch, OpenAI Gym, Stable Baselines3]
    • Implemented curriculum learning to improve training robustness and generalization to harder scenarios. [PyTorch, Stable Baselines3]
    • Implemented a Vector-Quantized Variational Autoencoder (VQ-VAE) for representation learning within RL pipelines. [PyTorch]
    • Adapted and integrated an Online Decision Transformer into an existing RL framework to enhance network load balancing and energy efficiency. [PyTorch, Hugging Face]
  • 2017.09 - 2018.09

    Tehran, Iran

    Software Engineer
    Pars Cognition
    Created engaging video game experiences aimed at cognitive assessment and development.
    • Developed mini-serious video games targeting cognitive science applications and research.

Education

  • 2018.09 - 2025.01

    Edmonton, Canada

    Doctor of Philosophy
    University of Alberta
    Computing Science
    Thesis: Automated Personalized Exposure Therapy Based on Physiological Measures Using Experience-driven Procedural Content Generation via Reinforcement Learning
    • Intro to Virtual/Augmented Reality and Telepresence
    • Machine Learning and the Brain
    • Image Processing and Analysis in Diagnostic Imaging
  • 2014.09 - 2017.08

    Tehran, Iran

    Master of Science,
    Shahid Beheshti University,
    Computer Engineering/Artificial Intelligence
    Thesis: Pattern Extraction and Analysis of the Behavior of Children with Autism Spectrum Disorders Using a Designed Video Game
    • Machine Learning
    • Neural Network
    • Pattern Recognition
    • Data Mining
    • Image Processing
    • Natural Language Processing
  • 2009.09 - 2014.08

    Tehran, Iran

    Bachelor of Science
    University of Tehran,
    Computer Engineering/Software Engineering
    Thesis: Designed and Developed a Video Game for Children with Autism to Improve their Receptive Language Skills.
    • Advanced Programming
    • Database Design
    • Data Structures
    • Artificial Intelligence
    • Operating Systems
    • Human-Computer Interaction
    • Intro to Multimedia
    • Intro to eLearning

Skills

Programming Languages
Python
MATLAB
C#
SQL
Development Tools & Platforms
Visual Studio Code
Git
Docker
Google Cloud Platform
AWS
Linux/Bash
Frameworks & Libraries
PyTorch
Scikit-learn
TensorFlow
Hugging Face
Stable Baselines3
OpenAI Gym
Game and Simulation Engines
Unity
GameMaker Studio
Virtual Reality (VR)

Publications

Projects

  • Transformer-Based Glucose Level Estimation
    Built end-to-end time-series pipelines using Transformer, LSTM, and autoencoder models for non-invasive glucose estimation from sensor data. [PyTorch, TensorFlow]
    • Benchmarked Transformer architectures against LSTM and TCN baselines
    • Implemented representation learning with autoencoder-style pretraining
  • RLHF-Inspired Personalized Reward Learning
    Proposed an approach inspired by RLHF that learns reward models from physiological feedback and user embeddings for faster agent personalization. [PyTorch]
    • Designed reward models incorporating physiological signals and user-specific embeddings
    • Implemented pretraining and fine-tuning pipelines for rapid adaptation to individual users
  • Domain Adaptation for Sensor Data
    Developed domain adaptation and generalization techniques to handle sensor variability across users, body locations, and sensor units. [PyTorch, Pandas]
    • Addressed domain shift where sensors behave differently per person and placement
    • Implemented transfer learning strategies across sensor configurations
  • Online Decision Transformer
    Adapted and integrated an Online Decision Transformer into RL frameworks for real-time sequential decision-making. [PyTorch, Hugging Face]
    • Extended offline Decision Transformer for online learning scenarios
    • Applied to network load balancing and energy efficiency optimization
  • VQ-VAE for Time-Series Representation Learning
    Implemented Vector-Quantized Variational Autoencoder for clustering and representation learning on high-dimensional time-series data. [PyTorch]
    • Designed discrete latent representations for improved clustering
    • Applied to network traffic patterns for efficient data compression
  • Explainability and Error Analysis Framework
    Built workflows for model interpretability including slice-based analysis, failure mode detection, and uncertainty quantification. [PyTorch, SHAP]
    • Implemented systematic error analysis to identify prediction failure patterns
    • Designed calibration checks to assess model confidence reliability
  • Experience-Driven PCG via Reinforcement Learning
    Developed RL-based procedural content generation that adapts virtual environments based on real-time physiological feedback. [PyTorch, OpenAI Gym, Stable Baselines3]
    • Trained DQN and PPO agents to generate personalized therapeutic content
    • Integrated physiological sensors for closed-loop adaptation
  • Curriculum Learning for RL
    Implemented curriculum learning strategies to improve RL training robustness and generalization to harder scenarios. [PyTorch, Stable Baselines3]
    • Designed progressive task difficulty scheduling for stable learning
    • Improved agent generalization through structured training curricula
  • Stress Detection from Physiological Signals
    Built classification models to detect stress states from PPG and physiological responses using ML and deep learning. [PyTorch, Scikit-learn]
    • Compared traditional ML methods with LSTM-based approaches
    • Achieved 70.6% accuracy in VR-based stress classification
  • Heart Rate Estimation from Video
    Developed CNN-based remote photoplethysmography (rPPG) for non-contact heart rate estimation from facial video. [PyTorch]
    • Implemented end-to-end deep learning pipeline for video-based vital signs
    • Enabled non-intrusive physiological monitoring without wearables
  • Player Experience Modeling
    Built data-driven models to predict player affect from gameplay videos for adaptive game content generation. [PyTorch]
    • Developed label-free approach using Let's Play videos
    • Validated through human studies correlating predictions with affective measures
  • Adaptive VR Environment
    Created VR environments with real-time parameter adaptation based on user interactions and physiological responses. [Unity, C#]
    • Integrated physiological sensors for closed-loop environment control
    • Applied to exposure therapy for anxiety disorders
  • Behavior Pattern Analysis
    Applied ML techniques to identify and classify user behavior patterns in interactive game environments. [MATLAB]
    • Extracted features to differentiate behavioral patterns
    • Applied to autism research for early screening applications
  • Hidden Semi-Markov Model for Mobility Tracking
    Implemented probabilistic models to track mobility patterns with missing data and multiple observation sequences. [Java]
    • Handled incomplete and noisy sequential data
    • Improved tracking accuracy through advanced statistical modeling
  • Serious Games for Cognitive Development
    Developed educational and therapeutic video games targeting cognitive assessment and social skill development. [Unity, GameMaker]
    • Designed game mechanics for children with autism spectrum disorders
    • Created engaging experiences for cognitive science research

Volunteer

  • 2025
    Reviewer
    PeerJ Journal
    Reviewed academic articles submitted to the PeerJ journal, ensuring the quality and validity of research in AI and related fields.
  • 2025
    Program Committee Member
    AIIDE Conference 2025
    Provided peer reviews for research submissions focused on Artificial Intelligence and Interactive Digital Entertainment.
  • 2025
    Reviewer
    Transactions on Affective Computing journal
    Served as a reviewer for ransactions on Affective Computing journal, providing peer reviews for submissions in human-computer interaction and related fields.
  • 2024
    Program Committee Member
    Experimental AI in Games (EXAG)
    Active committee member served as a reviewer for EXAG workshops focused on experimental AI in games.
  • 2024
    Reviewer
    CHI Conference 2024
    Served as a reviewer for CHI Conference 2024, providing peer reviews for submissions in human-computer interaction and related fields.
  • 2019

    Edmonton, Canada

    Treasurer
    Computer Science Graduate Student Association (CSGSA)
    Managed the financial operations of the CSGSA, a voluntary group offering support and activities for Computing Science graduate students at the University of Alberta.

Awards

References

Dr. Matthew Guzdial
Assistant Professor in Computing Science at the University of Alberta, specializing in creative artificial intelligence and machine learning.
Professor Pierre Boulanger
Director of the Advanced Human-Computer Interfaces Laboratory at the University of Alberta, with over 40 years of expertise in 3D computer vision and virtual reality systems.
Dr. Di Wu
Senior Staff Research Scientist at Samsung AI Center Montreal and Adjunct Professor at McGill University, focusing on reinforcement learning and AI for telecommunications.
Dr. Hadi Moradi
Assistant Professor at the University of Tehran's Electrical and Computer Engineering School, specializing in climbing robots and intelligent systems.