Research Papers Library

Automatic Extraction of Keywords for a Multimedia Search Engine Using the Chi-Square Test

We present a method to automatically acquire a set of keywords that characterise a large multimedia collection. Our method compares captions associated with pictures in the collection with a model of general English language. The words that deviate from the model are very specific of the captions and thus make appropriate keywords. Professional annotators evaluated our results and concluded that more than 97% of our top 2,000 one-word keywords were truly descriptive of the collection. We also mined the collection’s query logs and extracted keywords that reflect the most important indexing terms from the users’ perspective. Our method offers a strategy for selecting the keywords that make up the indices of multimedia search engines.

Download PDF


World's leading professional association of Internet Research Specialists - We deliver Knowledge, Education, Training, and Certification in the field of Professional Online Research. The AOFIRS is considered a major contributor in improving Web Search Skills and recognizes Online Research work as a full-time occupation for those that use the Internet as their primary source of information.

Get Exclusive Research Tips in Your Inbox

Receive Great tips via email, enter your email to Subscribe.