94.532 Managing and Mining Large Data Sets

Course Details
Min Credits 3
Max Credits 3
Course ID 37827
Status Active

The amount of data generated by businesses, science, the Web, and social networks is growing rapidly. This course covers the algorithms and database techniques required to extract useful information from this flood of data. Data mining, the automatic discovery of interesting patterns and relationships in data, is a central focus of the course. Data mining topics include association discovery, clustering, classification, and anomaly detection. Special emphasis is given to data warehousing techniques for processing extremely large datasets (e.g., many terabytes). The course also covers Web mining, including analysis of Web pages and links (as done by Google) and analysis of large social networks (such as Facebook).

Pre/Co-Requisites: Students must have completed a bachelor's degree in a related discipline and be in a graduate career to enroll in this course.