Data Science (DATA)

DATA*6100  Introduction to Data Science  Fall Only  [0.50]  

This course introduces methods of modern statistics such as splines, generalized additive models, principal components analysis, and classifiers. Students learn resampling methods such as bootstrap, cross-validation, boosting, and bagging. Methods of model selection include search-and-score and regularization. Students also practice communicating technical ideas to a non-technical audience, including via data visualization.

Department(s): Department of Mathematics and Statistics  
Location(s): Guelph  
DATA*6200  Data Manipulation and Visualization  Fall Only  [0.50]  

This course provides a hands-on introduction to the manipulation and visualization of complex data sets using a programming language. Efficient techniques for importing and exporting data in various formats, data acquisition, data integrity, and good analysis practices are discussed. Several programming tools and libraries are introduced to restructure, transform, and fuse disparate data types for visualization and data summaries in table format. The basics of manipulating space-time data are also covered.

Restriction(s): Restricted to Master of Data Science students.  
Department(s): Department of Mathematics and Statistics  
Location(s): Guelph  
DATA*6300  Analysis of Big Data  Unspecified  [0.50]  

This course introduces software tools and data science techniques for analyzing big data. It covers big data principles, state-of-the-art methodologies for large data management and analysis, and their applications to real-world problems. Modern and traditional machine learning techniques and data mining methods are discussed, and the ethical implications of big data analysis are examined. May be offered in conjunction with CIS*6180.

Restriction(s): Credit may be obtained for only one of CIS*6180 or DATA*6300.  
Department(s): School of Computer Science  
Location(s): Guelph  
DATA*6400  Machine Learning for Sequential Data Processing  Unspecified  [0.50]  

This course emphasizes machine learning for sequential data processing. It covers common challenges and pre-processing techniques for sequential data such as text, biological sequences, and time series data. Students are exposed to machine learning techniques, including classical methods and more recent deep learning models, so that they obtain the background and skills needed to confront real-world applications of sequential data processing. May be offered in conjunction with CIS*6190.

Restriction(s): Credit may be obtained for only one of CIS*6190 or DATA*6400.  
Department(s): School of Computer Science  
Location(s): Guelph  
DATA*6500  Analysis of Spatial-Temporal Data  Summer Only  [0.50]  

This course introduces software tools and data science techniques for analyzing big geospatial data. It provides an overview of raster-based geographic information systems (GIS) for identifying patterns and clusters in spatial-temporal data using state-of-the-art software and programming languages. Concepts such as kriging/Gaussian processes, variograms, and autoregressive correlation structures are discussed. Data summaries and visualizations specific to spatial-temporal problems are introduced.

Restriction(s): Restricted to Master of Data Science students.  
Department(s): Department of Mathematics and Statistics  
DATA*6600  Applications of Data Science  Summer Only  [0.50]  

This interdisciplinary, team-taught seminar course gives students the opportunity to synthesize information, research methods, and present cutting-edge applications of data science. Learning outcomes include identifying reliable sources, understanding and presenting relevant contemporary data science methods, thinking critically about practical implementations of data science, and collaborating effectively with peers. Emphasis is placed on communicating technical content and insights effectively to a non-technical audience.

Prerequisite(s): DATA*6200 and DATA*6300  
Restriction(s): Restricted to Master of Data Science students.  
Department(s): Department of Mathematics and Statistics  
Location(s): Guelph  
DATA*6700  Data Science Project  Unspecified  [1.00]  

This course is a one-semester research project course for students in the Master of Data Science program. Students plan, develop, and write a faculty- or industry-led research paper, and present their work. The project should advance knowledge or practice in data science or a closely related area, and address a real-world problem faced by industry. The project should focus on data science in the spatial and temporal dimension(s), and its topic must be approved by the course instructor.

Restriction(s): Instructor consent required.  
Department(s): Department of Mathematics and Statistics  
Location(s): Guelph