Putting Information Retrieval Theory Into Practice

Conference

2003 Annual Conference

Location

Nashville, Tennessee

Publication Date

June 22, 2003

Start Date

June 22, 2003

End Date

June 25, 2003

ISSN

2153-5965

Conference Session

Web Systems and Web Services

Page Count

10

Page Numbers

8.958.1 - 8.958.10

DOI

10.18260/1-2--12086

Permanent URL

https://strategy.asee.org/12086

Download Count

470

Paper Authors

Xiannong Meng

Abstract
NOTE: The first page of text has been automatically extracted and included below in lieu of an abstract

Session 1658

Putting Information Retrieval Theory into Practice – A Web Search Engine Project for an Undergraduate Computer Science Elective Course

Xiannong Meng
Computer Science Department
Bucknell University
Lewisburg, PA 17837

Abstract

This paper describes a semester project for an undergraduate computer science senior elective course, CSCI 379 Computer Science Topics – Information Retrieval and Web Search, taught at Bucknell University in the fall semester of 2002. In this course, students working in groups developed a Web search engine using information retrieval theory. The project includes implementing a basic Web server which acts as the dispatcher, an indexing component which builds the inverted index used for search, a ranking component which ranks the documents based on term frequency (tf) and inverse document frequency (idf), a retrieval component which takes the user query and retrieves the documents based on the ranking, and a crawling component which collects documents from the Web. The project is very practical in that students have to build a complete system, yet it involves many theoretical aspects of information retrieval, algebra, and probability. It is an ideal project for a senior-level course because it requires students to combine the knowledge they have gained throughout their college years.
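To make the indexing, ranking, and retrieval components more concrete, the following is a minimal Python sketch of an inverted index with tf-idf scoring. It is not the course's implementation: the function names (`tokenize`, `build_index`, `search`), the three-page corpus, and the specific weighting (raw term frequency multiplied by the natural log of N over document frequency) are all assumptions chosen for illustration. The Web server and crawler are omitted.

```python
import math
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Indexing: build an inverted index, term -> {doc_id: term frequency}."""
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, count in Counter(tokenize(text)).items():
            index[term][doc_id] = count
    return index


def search(query, index, num_docs):
    """Ranking and retrieval: sum tf * idf over the query terms.

    Assumed weighting: idf = ln(num_docs / document frequency).
    Returns (doc_id, score) pairs, highest score first.
    """
    scores = defaultdict(float)
    for term in tokenize(query):
        postings = index.get(term, {})
        if not postings:
            continue  # term occurs in no document
        idf = math.log(num_docs / len(postings))
        for doc_id, tf in postings.items():
            scores[doc_id] += tf * idf
    # Sort by descending score, breaking ties by document id.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical three-page corpus, for illustration only.
docs = {
    "a.html": "web search engine project",
    "b.html": "web server and web crawler",
    "c.html": "probability and algebra",
}
index = build_index(docs)
results = search("web crawler", index, len(docs))
```

With this toy corpus, `b.html` ranks above `a.html` for the query "web crawler" because it contains "web" twice and is the only page containing "crawler". Under this weighting a term that appears in every document gets an idf of zero and so contributes nothing to the score; real systems often smooth or normalize these weights.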

1 Introduction

The widespread use of the Web has increased interest in college undergraduate courses such as information retrieval and computer networks. Students in these courses learn the basic concepts of the Web, information retrieval (IR) theory, and the Internet. They hope to gain an understanding of how the Internet and its applications work and to practice the basic programming skills needed to build Internet-related applications. Web search engines provide ideal case studies for such courses. Implementation of a successful Web search engine requires a combination of information retrieval theory and solid network programming skills. This paper describes a semester project for an undergraduate CS senior elective course, CSCI 379 Computer Science Topics – Information Retrieval and Web Search. In this course, students working in groups developed a functional Web search engine. The project includes implementing a basic Web server which acts as the dispatcher, an indexing component which builds the inverted index used for search, a ranking component which ranks the documents based on term frequency (tf) and inverse document frequency (idf), a retrieval component which takes the user query and retrieves the

“Proceedings of the 2003 American Society for Engineering Education Annual Conference & Exposition Copyright © 2003, American Society for Engineering Education”

Meng, X. (2003, June), Putting Information Retrieval Theory Into Practice. Paper presented at 2003 Annual Conference, Nashville, Tennessee. 10.18260/1-2--12086

ASEE holds the copyright on this document. It may be read by the public free of charge. Authors may archive their work on personal websites or in institutional repositories with the following citation: © 2003 American Society for Engineering Education. Other scholars may excerpt or quote from these materials with the same citation. When excerpting or quoting from Conference Proceedings, authors should, in addition to noting the ASEE copyright, list all the original authors and their institutions and name the host city of the conference. - Last updated April 1, 2015