a) Two groups of project algorithms are given for the Big Data
Technology Subject and the Web Mining Subject, respectively.
For each subject, you are required to select ONE of the project
algorithms, implement the algorithm, and evaluate the
effectiveness of your implementation. In case you have joined
two subjects, TWO algorithms are required to be worked on.
For evaluation purpose, you may choose or design your own
appropriate evaluating datasets.
For those students who pursuit excellent grades, it is highly
recommended that you choose a large-scale dataset, conduct
your experiments on big data platform, and evaluate the
effectiveness and efficiency of your selected algorithm.
Samples of large-scale datasets can be found from the following
website http://web.stanford.edu/class/cs224w/resources.html.
(You may download others from other sources.)