MDS Okanagan

Covering all stages of the value chain, UBC’s Okanagan campus Master of Data Science program prepares graduates to thrive in one of the world’s most in-demand fields. Over 10 months, you’ll learn how to extract and analyze data in all its forms, how to turn data into knowledge, and how to clearly communicate your recommendations to decision-makers.

Program Benefits

Highlights Across All MDS Programs:

  • 10-month, full-time, accelerated program offers a short-term commitment for long-term gain
  • Condensed one-credit courses allow for in-depth focus on a limited set of topics at one time
  • Capstone project gives students an opportunity to apply their skills
  • Real-world data sets are integrated in all courses to provide practical experience across a range of domains

Highlights Specific To Okanagan Campus Option:

  • Curriculum is designed by computer science and statistics experts, emphasizing optimization and statistics with a focus on operations research
  • Courses are taught by renowned computer science and statistics faculty, giving students access to experts across a broad skill set
  • With a cohort limited to 40 students, program offers a more intimate learning environment
  • The Okanagan campus offers students the opportunity to study at a top 40 university in a smaller setting, situated in a diverse region of natural beauty, and bordering the city of Kelowna, a hub of economic development
  • With 2,000 tech start-ups launching in the region in the last year, networking and employment opportunities are abundant

Curriculum

The program structure includes 24 one-credit courses offered in four-week segments. Courses are lab-oriented and delivered in-person with some blended online content.

At the end of the six segments, an eight-week capstone project is also included, allowing students to apply their newly acquired knowledge, while working alongside other students with real-life data sets.

Fall: September - December

Block 1 (4 weeks)

Programming for Data Science

Programming in R and Python including iteration, decisions, functions, data structures, and libraries that is important for data exploration and analysis.

Instructor(s): TBD
Computing Platforms for Data Science

Installation and configuration of data science software. Advanced data analysis using Excel. Analysis of data using libraries in R, Python, and cloud services.

Instructor(s): TBD
Scripting and Reporting

Command line scripting including bash and Linux/Unix. Reporting and visualization.

Instructor(s): TBD
Modelling and Simulation I

Pseudorandom number generation, testing and transformation to other discrete and continuous data types. Introduction to Poisson processes and the simulation of data from predictive models, as well as temporal and spatial models.

Instructor(s): TBD

Block 2 (4 weeks)

Predictive Modelling

Introduction to regression for Data Science, including: simple linear regression, multiple linear regression, interactions, mixed variable types, model assessment, simple variable selection, k-nearest-neighbours regression.

Instructor(s): TBD
Modelling and Simulation II

Markov chains and their applications, for example, queueing and Markov Chain Monte Carlo.

Instructor(s): TBD
Algorithms and Data Structures

How to choose and use appropriate algorithms and data structures such as lists, queues, stacks, hash tables, trees and graphs to solve data science problems. Key concepts include recursion, searching and sorting, and asymptotic complexity.

Instructor(s): TBD
Databases and Data Retrieval

How to use and query relational SQL and NoSQL databases for analysis. Experience with SQL, JSON, and programming with databases.

Instructor(s): TBD

Block 3 (4 weeks)

Data Wrangling

Converting data from the form in which it is collected to the form needed for analysis. How to clean, filter, arrange, aggregate, and transform diverse data types, e.g. strings, numbers, and date-times.

Instructor(s): TBD
Resampling and Regularization

Resampling techniques and regularization for linear models, including Bootstrap, jackknife, cross-validation, ridge regression, and lasso.

Instructor(s): TBD
Privacy, Security and Professional Ethics

The legal, ethical, and security issues concerning data, including aggregated data. Proactive compliance with rules and, in their absence, principles for the responsible management of sensitive data. Case studies.

Instructor(s): TBD
Collaborative Software Development

How to exploit practices from collaborative software development techniques in data scientific workflows. Appropriate use of abstraction and classes, the software life cycle, unit testing / continuous integration, quality control, version control, and packaging for use by others.

Instructor(s): TBD

Winter: January - April

Block 4 (4 weeks)

Data Visualization I

Data visualization to produce effective graphs and images.

Instructor(s): TBD
Data Collection

Fundamental techniques in the collection of data. Focus will be devoted to understanding the effects of randomization, restrictions on randomization, repeated measures, and blocking on the model fitting.

Instructor(s): TBD
Web and Cloud Computing

How to use the web as a platform for data collection, computation, and publishing. Accessing data via scraping and APIs. Using the cloud for tasks that are beyond the capability of your local computing resources.

Instructor(s): TBD
Supervised Learning

Introduction to supervised machine learning. Key concepts include: logistic regression, k-nearest-neighbours classification, discriminant analysis, decision trees and random forests.

Instructor(s): TBD

Block 5 (4 weeks + 1 week break)

Communication and Argumentation

How to present and interpret data science findings. Drawing on the scholarship of language and cognition, this course is about how effective data scientists write, speak, and think.

Instructor(s): TBD
Unsupervised and Semi-supervised Learning

How to analyse data with unknown responses. Distance measures, hierarchical clustering, k-means, mixture models.

Instructor(s): TBD
Data Visualization II

Advanced concepts in data visualization, using business intelligence and data analysis software. Key concepts include interactive visualization and production of visualizations for mobile and web.

Instructor(s): TBD
Bayesian Inference

Introduction to Bayesian paradigm and tools for Data Science. Topics include Bayes theorem, prior, likelihood and posterior. A detailed analysis of the cases of binomial, normal samples, normal linear regression models. A significant focus will be on computational aspects of Bayesian problems using software packages.

Instructor(s): TBD

Block 6 (4 weeks)

Advanced Predictive Modelling

Advanced study in predictive modelling techniques and concepts, including multiple linear regressions, splines, smoothing, and generalized additive models.

Instructor(s): TBD
Optimization

Modeling using mathematical programming. Key concepts include fundamental continuous and discrete optimization algorithms; optimization software for small to medium scale problems; and optimization algorithms for data science.

Instructor(s): TBD
Advanced Machine Learning

Advanced machine learning methods and concepts, including neural networks, backpropagation, and deep learning.

Instructor(s): TBD
Special Topic

Advanced or specialized topic in Data Science with applications to specific data sets.

Instructor(s): TBD

Spring: May - June

Capstone Project (8-10 weeks)

Capstone Project

A mentored group project based on real data and questions from a partner within or outside the university. Students will formulate questions and design and execute a suitable analysis plan. The group will work collaboratively to produce a project report, presentation, and possibly other products, such as a web application.

Instructor(s): MDS Staff

Did You Know?

“The Okanagan region is one of the fastest growing tech sectors in Canada. With 24% growth in tech businesses over the last five years, the sector now contributes $1.67 billion to the local economy.”

Review Admissions RequirementsMeet Our Faculty