Automatic Syllabus Classification using Support Vector Machines

Xiaoyan Yu
Weiguo Fan
Yubo Yuan
Manuel Pérez-Quiñones
Edward A. Fox
William Cameron
Lillian Cassel
Handbook of Research on Text and Web Mining Technologies, Information Science Reference(2008)

Abstract

Syllabi are important educational resources. Gathering syllabi that are freely available and creating useful services on top of the collection presents great value for the educational community. However, searching for a syllabus on the Web using a generic search engine is an error-prone process and often yields too many irrelevant links. In this chapter, we describe our empirical study on automatic syllabus classification using Support Vector Machines (SVM) to filter noise out from search results. We describe various steps in the classification process from training data preparation, feature selection, and classifier building using SVMs. Empirical results are provided and discussed. We hope our reported work will also benefit people who are interested in building other genre-specific repositories.

Research Areas