6350 Spring 2009 Project/Term-Paper

You can form a team of up-to three persons and you will get the same score. Each term is expected to 1) write a term paper or 3) a survey paper or 3) carry out a design/implementation project.

 

The ideal process of a term paper should have the following steps:

Possible ideas for a term paper (you are not limited to these):

(1)   Identify a well defined problem, e.g. aggregation over sliding window, propose a new algorithm or processing paradigm (potentially better in some typical scenarios) to solve the problem, and validate your algorithm through experimental comparison with existing algorithms.

(2)   Identify a potentially useful operator (for many applications) and the methods to process them. For example, “group nearest neighbor” operator is useful to help a group of people to find a common meeting place, taking into the individual travelling distance constraints.

(3)   Find a dataset, identify a set of information/patterns/rules you want to get from the dataset, propose a slew of algorithms to get them. Address the performance issues, e.g. accuracy and computational time.

The ideal process of a survey paper should have the following steps:

Possible topics for a survey paper (you are not limited to these):

(1)   Concept drifting in stream data mining.

(2)   Blocking operators in stream data processing.

(3)   Stream clustering

(4)   Processing dynamic nearest neighbor queries

The ideal process of a project should have the following steps:

Possible topics for a survey paper (you are not limited to these):

(1)   Implement a new data type and operator in SECONDO database system (http://dna.fernuni-hagen.de/Secondo.html/)

(2)   Find a large set of documents, cluster them incrementally, and compare the results with clustering the whole documents.

(3)   Implement two to three algorithms found in published papers, e.g. GREEDY or FIFO on operator scheduling, simulate data input and compare results.

(4)   Implement a simple publication/subscription system using  triggers and stored functions in a traditional database system, e.g. MySQL.

 

Instructions on final report:

Follow the outlines from the research papers covered in the course. The report should be 6 pages or more and may have 5 sections:

For a survey paper, the report should be 6 pages or more that may include:

For implementation project, a demonstration paper of 6 pages or more is expected which may include:

In addition, you should also submit the implantation package with the documented source code, a readme file, datasets (if any), and any related documents useful for the project. A sample demo paper is available from: http://research.microsoft.com/~melnik/pub/adonet-demo_SIGMOD07.pdf

 

Instructions on oral presentation: