MDS Okanagan

Covering all stages of the data science value chain, UBC’s Okanagan campus Master of Data Science program prepares graduates to thrive in one of the world’s most in-demand fields. Over 10 months, you’ll learn how to extract and analyze data in all its forms, how to turn data into knowledge, and how to clearly communicate your recommendations to decision-makers.

Program Benefits

Highlights Across All MDS Programs:

10-month, full-time, accelerated program offers a short-term commitment for long-term gain
Condensed one-credit courses allow for in-depth focus on a limited set of topics at one time
Capstone project gives students an opportunity to apply their skills
Real-world data sets are integrated in all courses to provide practical experience across a range of domains

Highlights Specific To Okanagan Campus Option:

Curriculum is designed by computer science and statistics experts, emphasizing optimization and statistics with a focus on operations research
Courses are taught by renowned computer science and statistics faculty, giving students access to experts across a broad skill set
With a cohort limited to 60 students, this program offers a collaborative and intimate learning environment with focus on student success
The Okanagan campus offers students the opportunity to study at a top 40 university in a smaller setting, situated in a diverse region of natural beauty, and bordering the city of Kelowna, a hub of economic development
The Okanagan region hosts 2,000 tech start-ups, providing networking and employment opportunities
Fluency with both open source software and commercial software, including Tableau and Microsoft products (Excel, Azure, SQL Server).

Curriculum

The program structure includes 24 one-credit courses offered in four-week segments. Courses are lab-oriented and delivered in-person with some blended online content.

At the end of the six segments, an eight-week, six-credit capstone project is also included, allowing students to apply their newly acquired knowledge, while working alongside other students with real-life data sets.

Fall: September - December

Block 1 (4 weeks, 4 credits)

Computing Platforms for Data Science | DATA 530

Installation and configuration of data science software. Advanced data analysis using Excel. Analysis of data using libraries in R, Python, and cloud services.

Instructor(s):

Ifeoma Adaji

Programming for Data Science | DATA 531

Programming in R and Python including iteration, decisions, functions, data structures, and libraries that are important for data exploration and analysis.

Instructor(s):

Gema Rodrigues-Peres

Scripting and Reporting | DATA 541

Command line scripting including bash and Linux/Unix. Reporting and visualization.

Instructor(s):

Khalad Hasan

Modelling and Simulation I | DATA 580

Pseudorandom number generation, testing and transformation to other discrete and continuous data types. Introduction to Poisson processes and the simulation of data from predictive models, as well as temporal and spatial models.

Instructor(s):
Xiaoping Shi

Block 2 (4 weeks, 4 credits)

Algorithms and Data Structures | DATA 532

How to choose and use appropriate algorithms and data structures such as lists, queues, stacks, hash tables, trees and graphs to solve data science problems. Key concepts include recursion, searching and sorting, and asymptotic complexity.

Instructor(s):

Gema Rodrigues-Peres

Databases and Data Retrieval | DATA 540

How to use and query relational SQL and NoSQL databases for analysis. Experience with SQL, JSON, and programming with databases.

Instructor(s):

Ifeoma Adaji

Privacy, Security and Professional Ethics | DATA 553

The legal, ethical, and security issues concerning data, including aggregated data. Proactive compliance with rules and, in their absence, principles for the responsible management of sensitive data. Case studies.

Instructor(s):
Mostafa Mohamed

Predictive Modelling | DATA 570

Introduction to regression for Data Science, including: simple linear regression, multiple linear regression, interactions, mixed variable types, model assessment, simple variable selection, k-nearest-neighbours regression.

Instructor(s):
Xiaoping Shi

Block 3 (4 weeks, 4 credits)

Collaborative Software Development | DATA 533

How to exploit practices from collaborative software development techniques in data scientific workflows. Appropriate use of abstraction and classes, the software life cycle, unit testing / continuous integration, quality control, version control, and packaging for use by others.

Instructor(s):

Khalad Hasan

Data Collection | DATA 543

Fundamental techniques in the collection of data. Focus will be devoted to understanding the effects of randomization, restrictions on randomization, repeated measures, and blocking on the model fitting.

Instructor(s):
Emelie Gustafsson

Resampling and Regularization | DATA 571

Resampling techniques and regularization for linear models, including Bootstrap, jackknife, cross-validation, ridge regression, and lasso.

Instructor(s):

Jeff Andrews

Modelling and Simulation II | DATA 581

Markov chains and their applications, for example, queueing and Markov Chain Monte Carlo.

Instructor(s):
Ladan Tazik

Winter: January - April

Block 4 (4 weeks, 4 credits)

Web and Cloud Computing | DATA 534

DATA 534
How to use the web as a platform for data collection, computation, and publishing. Accessing data via scraping and APIs. Using the cloud for tasks that are beyond the capability of your local computing resources.

Instructor(s):
TBA

Data Wrangling | DATA 542

Converting data from the form in which it is collected to the form needed for analysis. How to clean, filter, arrange, aggregate, and transform diverse data types, e.g. strings, numbers, and date-times.

Instructor(s):

Fatemeh Hendijani Fard

Data Visualization I | DATA 550

Data visualization to produce effective graphs and images. Use of open source libraries in Python and R and commercial products such as Tableau.

Instructor(s):

Irene Vrbik

Supervised Learning | DATA 572

Introduction to supervised machine learning. Key concepts include: logistic regression, k-nearest-neighbours classification, discriminant analysis, decision trees and random forests.

Instructor(s):

Shan Du

Block 5 (4 weeks + 1 week break, 4 credits)

Data Visualization II | DATA 551

Advanced concepts in data visualization, using business intelligence and data analysis software. Key concepts include interactive visualization and production of visualizations for mobile and web.

Instructor(s):

Fatemeh Hendijani Fard

Communication and Argumentation | DATA 552

How to present and interpret data science findings. Drawing on the scholarship of language and cognition, this course is about how effective data scientists write, speak, and think.

Instructor(s):
TBA

Unsupervised and Semi-supervised Learning | DATA 573

How to analyse data with unknown responses. Distance measures, hierarchical clustering, k-means, mixture models.

Instructor(s):

Jeff Andrews

Advanced Predictive Modelling | DATA 583

Advanced study in predictive modelling techniques and concepts, including multiple linear regressions, splines, smoothing, and generalized additive models.

Instructor(s):
John Thompson

Block 6 (4 weeks, 4 credits)

Bayesian Inference | DATA 582

Introduction to Bayesian paradigm and tools for Data Science. Topics include Bayes theorem, prior, likelihood and posterior. A detailed analysis of the cases of binomial, normal samples, normal linear regression models. A significant focus will be on computational aspects of Bayesian problems using software packages.

Instructor(s):

Irene Vrbik

Optimization | DATA 585

Modeling using mathematical programming. Key concepts include fundamental continuous and discrete optimization algorithms; optimization software for small to medium scale problems; and optimization algorithms for data science.

Instructor(s):

Yves Lucet

Advanced Machine Learning | DATA 586

Advanced machine learning methods and concepts, including neural networks, backpropagation, and deep learning.

Instructor(s):

Shan Du

Special Topic | DATA 589

Advanced or specialized topic in Data Science with applications to specific data sets. Analysis of Big Data using Hadoop and Spark.

Instructor(s):
TBA

Spring: May - June

Capstone Project (8-10 weeks, 6 credits)

Capstone Project | DATA 599

A mentored group project based on real data and questions from a partner within or outside the university. Students will formulate questions and design and execute a suitable analysis plan. The group will work collaboratively to produce a reproducible analysis pipeline, project report, presentation and possibly other products, such as a dashboard.

Instructor(s):

MDS Staff

Meet Mitchell

What attracted Mitchell to the Master of Data Science program at UBC Okanagan was the capstone project as it gave him experience in each stage of project creation. It allowed Mitchell to take what he learned in the classroom and apply it in the real world.

Review Admission Requirements Contact Us With Questions

Breadcrumb

MDS Okanagan

Program Benefits

Highlights Across All MDS Programs:

Highlights Specific To Okanagan Campus Option:

Curriculum

Fall: September - December

Block 1 (4 weeks, 4 credits)

Block 2 (4 weeks, 4 credits)

Block 3 (4 weeks, 4 credits)

Winter: January - April

Block 4 (4 weeks, 4 credits)

Block 5 (4 weeks + 1 week break, 4 credits)

Block 6 (4 weeks, 4 credits)

Spring: May - June

Capstone Project (8-10 weeks, 6 credits)

Meet Mitchell