Skip to Main Content

MSIT.5320 Managing Large Data Sets (Formerly 94.532)

Id: 037827 Credits Min: 3 Credits Max: 3

Description

The amount of data generated by businesses, science, Web, and social networks is growing at a very fast rate. This course will cover the algorithms and database techniques required to extract useful information from this flood of data. Data mining, which is the automatic discovery of interesting patterns and relationships in data, is a central focus of the course. Topics covered in data mining include association discovery, clustering, classification, and anomaly detection. Special emphasis will be given to techniques for data warehousing where extremely large datasets (e.g.,many terabytes) are processed. The course also covers Web mining. Topics covered include analysis of Web pages and links (like Google) and analysis of large social networks (like Facebook).

Prerequisites

Students must already have completed a bachelor's degree in a related discipline to enroll in this course and in a graduate career.

View Current Offerings