| |
CSCE 5200 Information Retrieval and Web Search
|
|
.
.
.
.
.
announcements
syllabus
Download syllabus as a [pdf]
|
Instructor:
|
Rada Mihalcea, Research Park F228, email: rada at cs.unt.edu |
|
TA:
|
Veronica Perez-Rosas, vp0091 at unt.edu
|
|
Class hours:
|
TTh 12:30-01:50pm
|
|
Instructor office hours:
|
Th 03:00-05:00pm or by appointment
|
|
TA office hours:
|
T 11am-12pm; Th 2-3pm
|
|
Course description:
|
This course will cover traditional material, as well as recent advances in Information Retrieval (IR), the study of indexing, processing, and querying textual data. Basic retrieval models, algorithms, and IR system implementations will be covered. The course will also address more advanced topics in "intelligent" IR, including Natural Language Processing techniques, and "smart" Web agents.
|
class notes
|
Date
|
Lecture
|
Reading material
|
NB
|
|
01/17/2012
|
Course overview
(ppt)
|
-
|
-
|
|
01/19/2012
|
Introduction to IR models and methods [ppt]
|
-
|
-
|
|
01/24/2012
|
Short Perl tutorial [ppt]
|
One of the tutorials below [see the "Links" section]
|
-
|
|
01/26/2012
|
Short Perl tutorial [ppt]
|
One of the tutorials below [see the "Links" section]
|
-
|
|
01/31/2012
|
Short Perl tutorial [ppt]
Text processing [ppt]
|
Porter stemmer
Chap.2: The term vocabulary & postings lists
|
-
|
|
02/02/2012
|
Text processing [ppt]
|
Porter stemmer
Chap.2: The term vocabulary & postings lists
|
-
|
|
02/07/2012
|
Text properties [ppt]
|
Chap.2: The term vocabulary & postings lists
|
Assignment 1 issued
|
|
02/09/2012
|
Web Spidering [ppt]
Practical problems in web spidering [ppt]
|
Chap.5: Index compression, sect.5.1
Chap.20: Web crawling and indexes
Optional reading: Baeza-Yates chapter 6.3
|
-
|
|
02/14/2012
|
Boolean model and extensions [ppt]
|
Chap.1: Boolean retrieval
|
-
|
|
02/16/2012
|
Vector space model [ppt]
|
Chap.6: Scoring, term weighting and the vector space model
|
-
|
|
02/21/2012
|
Vector space model [ppt]
|
Chap.6: Scoring, term weighting and the vector space model
|
-
|
|
02/23/2012
|
Term weighting schemes
|
Chap.6: Scoring, term weighting and the vector space model
[Sparck-Jones] Term weigthing approaches, pg. 323
|
Assignment 1 due.
Assignment 2 issued
|
assignments
readings
-
(required) Introduction to Information Retrieval
(online version available)
Christopher D. Manning, Prabhakar Raghavan and Hinrich Schutze
Cambridge University Press, 2008.
- (recommended) Readings in Information Retrieval
K.Sparck Jones and P. Willett
Morgan Kaufmann, 1997
- (recommended) Modern Information Retrieval
Ricardo Baeza-Yates and Berthier Ribeiro-Neto
Addison-Wesley, 1999
links
|