Data Science and Analytics (MSc)
Overview
This unique one-year Master of Science (MSc) degree program enables students to develop interdisciplinary skills and gain a deep understanding of technical and applied knowledge in data science and analytics. Graduates are highly trained, qualified data scientists who can pursue careers in industry, government or research.
Degree awarded: MSc
Administered by: Yeates School of Graduate and Postdoctoral Studies
Data Science and Analytics graduate program website
(PDF file) Data Science and Analytics graduate program calendar 2024-25
Curriculum
Course Code | Degree Requirements: Master of Science | Credits |
---|---|---|
|
Thesis |
(Milestone) |
DS8001 |
Design of Algorithms and Programming for Massive Data |
1 |
DS8002 | Machine Learning |
1 |
DS8004 | Data Mining and Prescriptive Analytics |
1 |
DS8016 | Directed Studies |
Pass/Fail |
Two Elective credits |
2 | |
OR | ||
Major Research Paper (MRP) |
(Milestone) | |
DS8012 |
Research Skills |
Pass/Fail |
DS8005 |
Soft Skills, Communication and Ethics |
Pass/Fail |
DS8001 |
Design of Algorithms and Programming for Massive Data |
1 |
DS8002 |
Machine Learning |
1 |
DS8003 |
Management of Big Data and Big Data Tools |
1 |
DS8004 |
Data Mining and Prescriptive Analytics |
1 |
|
Two Elective credits |
2 |
Electives
Course code | Course name | Credits |
---|---|---|
BP8113 |
Advanced Imaging |
1 |
CP8202 |
Advanced Software Engineering |
1 |
CP8203 |
Advanced Database Systems |
1 |
CP8206 |
Soft Computing and Machine Intel |
1 |
CP8304 |
Distributed Systems |
1 |
CP8305 |
Knowledge Discovery |
1 |
CP8311 |
Genetic Programming |
1 |
CP8314 |
Advanced Artificial Intelligence |
1 |
DS8006 |
Social Media Analytics |
1 |
DS8007 |
Advanced Data Visualization |
1 |
DS8008 |
NLP (Text Mining) |
1 |
DS8009 |
Special Topics in Data Science and Analytics |
1 |
DS8010 |
Interactive Learning in Decision Process |
1 |
DS8011 |
Bayesian Statistics and Machine Learning |
1 |
DS8013 |
Deep Learning |
1 |
DS8014 |
Graph Mining |
1 |
DS8015 |
Machine Learning non Data Science Student |
1 |
EF8903 |
Applied Econometrics |
1 |
EF8913 |
Empirical Topics in International Finance |
1 |
EF8914 |
Financial Econometrics |
1 |
EF8933 |
Empirical Topics Int’l Trade |
1 |
EF8937 |
Labour Economics |
1 |
EF8944 |
Panel Data and NL Model Analysis |
1 |
EF8945 |
Nonparametric Data Analysis |
1 |
ME8118 |
Info Sys Analysis & Design |
1 |
ME8127 |
Optimization Models |
1 |
ME8140 |
Simulation Theory/Methodology |
1 |
MT8310 |
Special Topics Info Sys Mgmt |
1 |
SA8901 |
Geospatial Data Analytics |
1 |
SA8911 |
Geodemographics |
1 |
Thesis
The student is required to conduct advanced research on a topic related to data science. The topic is chosen in consultation with thesis supervisor, and the student presents research plan in writing before research starts. The student must submit the completed research in a thesis format to an examination committee and make an oral presentation of the thesis. The student is expected to furnish evidence of competence in research and a sound understanding of data science associated with the research. This is “Milestone”.
Major Research Project
The student is required to conduct an applied advanced research project. The project will be carried out under the guidance of a supervisor. On completion of the project, the results are submitted in a technical report format to an examining committee and the student will make an oral presentation of the report to the committee for assessment and grading of the report. The student is expected to provide evidence of competence in the carrying out of a technical project and present a sound understanding of the material associated with the research project. This is a “Milestone.” Pass/Fail
DS8001- Design of Algorithms and Programming for Massive Data
NP-completeness, approximation algorithms and parallel algorithms. Study of algorithmic techniques and To introduce students to the theory and design of algorithms to acquire and process large dimensional data.
Advanced data structures, graph algorithms, and algebraic algorithms. Complexity analysis, complexity classes, and modeling frameworks that facilitate the analysis of massively large amounts of data. Introduction to information retrieval, streaming algorithms and analysis of web searches and crawls. 1 Credit
DS8002 – Machine Learning
Overview of artificial learning systems. Supervised and unsupervised learning. Statistical models. Decision trees. Clustering. Feature extraction. Artificial neural networks. Reinforcement learning. Applications to pattern recognition and data mining. 1 Credit
DS8003 – Management of Big Data and Big Data Tools
The course will discuss data management techniques for storing and analyzing very large amounts of data. The emphasis will be on columnar databases and on Map Reduce as a tool for creating parallel algorithms that can process very large amounts of data. Big Data applications, Columnar stores, distributed databases, Hadoop, Locality Sensitive Hashing (LSH), Dimensionality reduction, Data streams, unstructured data processing, NoSQL, and NewSQL 1 Credit
DS8004 Data Mining and Prescriptive Analytics
The course teaches to use data to recommend optimum course of action to achieve the optimum outcome and to formulate new products and services in a data driven manner. The course will cover all these issues and will illustrate the whole process by examples. Special emphasis will be given to data mining and computational techniques as well as optimization and stochastic optimization techniques. Prerequisite: DS8002 1 Credit
DS8005 Soft Skills, Communication and Ethics
The course will focus on communicating and presenting data analytics and modeling results. It aims at building the competency in story telling from numbers. The course also covers ethical and social impacts of data science, analytics and AI. Prerequisite: DS8012 Pass/Fail
DS8006 Social Media Analytics
The course will cover fundamental concepts and tools in Social Network Analysis by showing how AI, math, and statistical methods are used to study them. The topics include: weblog analysis, centrality in social networks, influence, sentiment analysis and opinion mining, information cascades, multimedia analysis, reasoning and prediction with social media and modeling behaviour. The lab component of the class will use R or Python to develop and analyze network models. Prerequisite: DS8002 1 Credit
DS8007 Advanced Data Visualization
Overview of data visualization. Basic visualization design and evaluation principles. Learn to acquire, parse, and analyze large datasets. Techniques for visualizing multivariate, temporal, text-based, geospatial, hierarchical, and network/ graph data using tools such as ggplot2, R, D3, etc. 1 credit
DS8008 NLP (Text Mining)
The course covers important topics in text mining including: basic natural language processing techniques, document representation, text categorization and clustering, document summarization, sentiment analysis, social network and social media analysis, probabilistic topic models and text visualization. Prerequisites: DS8002 and DS8003 1 credit.
DS8009 Special Topics in Data Science and Analytics
This course consists of lectures, seminars and readings covering the latest advances and research in data science and analytics. The course description will be announced prior to scheduling of the course. 1 credit.
DS8010 Interactive Learning in Decision Process
This course focuses on topics related to reinforcement learning. The course will cover making multiple-stage decisions under uncertainty, heuristic search in planning, Markov decision processes, dynamic programming, temporal-difference learning including Q-learning, Monte Carlo reinforcement learning methods, function approximation methods, and the integration of learning and planning. Other topics can be included as well.
Prerequisites: DS 8002 1 Credit
DS8011 Bayesian Statistics and Machine Learning
This course will cover modern machine learning techniques from a Bayesian probabilistic perspective. Bayesian probability allows us to model and reason about all types of uncertainty. The result is a powerful, consistent framework for approaching many problems that arise in machine learning, including parameter estimation, model comparison, and decision making. We will begin with a high-level introduction to Bayesian inference, then proceed to cover more-advanced topics. Prerequisites: DS 8002 1 Credit
DS8012 Research Skills
This course will be an introduction to research preparation, experimental design, methods of data collection, exploratory data analysis, and understanding threats to validity of results with aim to prepare student for MRP work. Pass/Fail
DS8013 Deep Learning
The course aims to present the mathematical, statistical and computational challenges of building stable representations for highdimensional data, such as images, text and data. The topics include: Convolutional neural networks. Autoencoders, their sparse, denoising variants, and their training. Regularization methods for preventing overfitting. Stacked autoencoders and end-to-end networks. Recurrent and recursive networks. Multimodal approaches. Deep architectures for vision, speech, natural language processing, and reinforcement learning. Prerequisite: DS8002. 1 Credit
DS8014 Graph Mining
The course aims to present the mathematical, statistical and computational challenges of building stable representations for highdimensional data, such as images, text and data. The topics include: Convolutional neural networks. Autoencoders, their sparse, denoising variants, and their training. Regularization methods for preventing overfitting. Stacked autoencoders and end-to-end networks. Recurrent and recursive networks.
Multimodal approaches. Deep architectures for vision, speech, natural language processing, and reinforcement learning. Prerequisite: DS8002
DS8015 Machine Learning non Data Science Student
This course will introduce students to the theory and design of machine learning algorithms using Python. The course will cover Python Fundamentals, Data Structures, Functions and Functional Programming, Python Libraries, Exploratory Data Analysis, Statistical Inference, Introduction to Machine Learning, Unsupervised Learning, Supervised Learning: Regression, Supervised Learning: Classification, Dimensionality Reduction. 1 credit
DS8016 Directed Studies
This course assists the student with the development of the Thesis through the proposal, preliminary literature review, outline, and reporting stages. It is tailored to the needs of each student and the work in this course will be used as a foundation for the Thesis. Students are required to select an advisor and present a formal report, or take a formal examination, at the end of the class. Directed studies course is a prerequisite for starting Thesis work, and requires approval from PD. Pass/Fail
For course descriptions of non DS courses, go to the Program offering the course. BP – Biomedical Physics CP – Computer Science EF – Economics ME – Mechanical and Industrial Engineering MT – Master of Science in Management SA – Spatial Analysis