Hieu Quoc Nguyen

Hieu Quoc Nguyen

Full Stack Data Scientist

University of Waterloo

About Me

Hi, I’m a Full Stack Data Scientist with a firm grounding in statistics and computer science from the University of Waterloo. My focus and commitment center on solving complex data science problems, with a special focus on Natural Language Processing (NLP). My experience spans across vital sectors such as finance, cybersecurity, and the commercial domain name industry.

I have a proven track record in developing and deploying end-to-end, high-performance machine learning and deep learning services. These projects were not only completed on time, and within budget but also designed for robust performance, scalability and high availability on cloud platforms.

I’m deeply passionate about statistical/machine learning, deep learning (particularly in the context of NLP and Reinforcement Learning), and machine learning/data engineering.

Interests
  • Statistical/Machine Learning
  • Deep Learning (NLP + RL)
  • Machine Learning/Data Engineering
Education
  • Bachelor of Mathematics, 2019

    University of Waterloo

Skills

Machine/Deep Learning
Python
Database (SQL/noSQL)
AWS
Github
Docker

Experiences

 
 
 
 
 
Tech Co-founder
b(x) Theory Inc.
July 2022 – September 2023 Toronto, Canada
Our team researched and developed machine learning services to help our clients manage risks and identify opportunities in restructuring/special situations.
 
 
 
 
 
Data Scientist
GoDaddy Inc.
April 2021 – July 2022 Santa Clara, United States (Remote)
Our team researched, developed & deployed (end to end) deep neural nets for GoDaddy’s core global domain business, namely domain names auto completion, generation, ranking, fraud detection & domain names valuation.
 
 
 
 
 
Data Scientist
Royal Bank of Canada (RBC) - DNA Group
January 2020 – April 2021 Toronto, Canada (Remote)
Our team collaborated with RBC’s Canadian Banking Operations to develop and deploy NLP solutions, which was designed to improve current business process & saved significant cost in manual labour annually.
 
 
 
 
 
Various Internships in Data Science and Software Engineering
RBC Capital Markets, RBC Global Cybersecurity, and Marsh Canada Ltd
May 2017 – September 2019 Toronto, Canada

Research/Projects

*
Generative AI with Large Language Models Course Notes
Comprehensive course notes of “Generative AI with Large Language Models (LLMs)” Course, offered by DeepLearning.ai
Generative AI with Large Language Models Course Notes
AI Research Agent - R0D1
R0D1 is an autonomous research agent powered by Flask API. The end-to-end pipeline from Docker containerization, image push to AWS ECR, and deployment to AWS ECS Fargate with a load balancer is fully automated with Github Actions
AI Research Agent - R0D1
End-to-End Python ETL Pipeline
This project demonstrates the construction of an end-to-end Python ETL (Extract, Transform, Load) pipeline using AWS services. The pipeline is designed to extract Toronto real estate property data from the Zillow Rapid API and process it through various AWS components, such as EC2, S3, Lambda, Redshift, and QuickSight
End-to-End Python ETL Pipeline
Alpha Go
Mastering the game of Go with deep neural networks and tree search
Alpha Go
Latent Dirichlet Allocation (LDA) Algorithms
Full Derivation of Latent Dirichlet Allocation (LDA) with Variational E-M Algorithms
Latent Dirichlet Allocation (LDA) Algorithms
Graph Convolution Networks (GCN)
An Introduction to Graph Convolution Networks (GCN)
Graph Convolution Networks (GCN)