- Amazon Student members save an additional 10% on Textbooks with promo code TEXTBOOK10. Enter code TEXTBOOK10 at checkout. Here's how (restrictions apply)
Introduction to Information Retrieval Hardcover – Jul 7 2008
|New from||Used from|
Special Offers and Product Promotions
Frequently Bought Together
Customers Who Bought This Item Also Bought
No Kindle device required. Download one of the Free Kindle apps to start reading Kindle books on your smartphone, tablet, and computer.
To get the free app, enter your e-mail address or mobile phone number.
"This is the first book that gives you a complete picture of the complications that arise in building a modern web-scale search engine. You'll learn about ranking SVMs, XML, DNS, and LSI. You'll discover the seedy underworld of spam, cloaking, and doorway pages. You'll see how MapReduce and other approaches to parallelism allow us to go beyond megabytes and to efficiently manage petabytes."
Peter Norvig, Director of Research, Google Inc.
"Introduction to Information Retrieval is a comprehensive, up-to-date, and well-written introduction to an increasingly important and rapidly growing area of computer science. Finally, there is a high-quality textbook for an area that was desperately in need of one."
Raymond J. Mooney, Professor of Computer Sciences, University of Texas at Austin
"Through compelling exposition and choice of topics, the authors vividly convey both the fundamental ideas and the rapidly expanding reach of information retrieval as a field."
Jon Kleinberg, Professor of Computer Science, Cornell University
H.Levkowitz, Choice Magazine
"Introduction to Information Retrieval is a comprehensive, authoritative, and well-written overview of the main topics in IR. The book offers a good balance of theory and practice, and is an excellent self-contained introductory text for those new to IR."
Olga Vechtomova, Computational Linguistics
Coherent and up -to -date, this textbook for advanced undergraduate and beginning graduate students in computer science covers all aspects of the design and implementation of systems for gathering, indexing, and searching documents; methods for evaluating systems; and an introduction to the use of machine learning methods on text collectionsSee all Product Description
Inside This Book(Learn More)
What Other Items Do Customers Buy After Viewing This Item?
Top Customer Reviews
Easy to read and allows for quick learning.
Most Helpful Customer Reviews on Amazon.com (beta)
In these two books, they describe the theory behind a vast toolbox which can be used to construct new tools/products for the Internet. Now I can go back to them when the need arises.
For starters, I appreciate the detailed theoretical explanations of topics that I could not find in other texts, and the references to related work are especially helpful. One of the other books I read was Information Retrieval by Grossman, which is an older book but has a more condensed style compared to this. Grossman's discussion of clustering was more high level and referenced a few more papers that I found useful. That helped increase my interest to read through these chapters in which offer greater detail.
Before I felt like I could place each topic in its appropriate context, I had to spend six months of reading both the books, playing with code and finding s/w packages, searching the research literature, reading papers and other books, and then cycling back to the books. Here's are some suggestions for things I'd like to see:
1. A set of recomended programming tools: in some books on Perl -- such as the chapter "Natural Language Tools" in pages 149-171 in "Advanced Perl Programming" by Simon Cozens (O'Reilly) -- you get a very "quick & dirty" introduction to maybe 20-30% of the concepts in these two books along with ways to implement and play around with them. Although Perl has many natural language processing tools, the Cozens book cuts to the chase, explains which are the best tools, and shows you how to use them. I think knowing such shortcuts aids in learning how to apply and improve on them. The more complex and sophisticated topics, the more likely to make it out into the real world if they are easy to play with.
2. More data/examples on what does/doesn't work with end-users: Numbers, graphs, and charts are all good stuff. I always appreciate it when the authors referenced quantitative comparisons, real-world products, and history of Internet. One of the reasons I had to consult the research literature was to broaden my understanding of quantitative comparisons between different techniques involving end-users, which were typically done in the context of complete systems studies that users could try out.
This book not only describes how to build a search engine (including crawling, indexing, ranking, classification, and clustering), but also has many of the insights you can only get from lengthy experience using these techniques at large scale.
Definitely my new favorite book on search. If you work in search or just have an interest in the field, it is a great read.
I knew from the free sample that this book was what I was looking for. Thinking this would be a completely a new field to me, I was surprised how much I already knew. Some of it is not relevant to corpus linguists (result ranking for example), but if you're a corpus linguist and want to build an index for your corpus, I doubt you'll find a better book than this.
And the Kindle edition is done well, which is not always the case. Websites are hyperlinked and you can jump to the next or previous section with the 5-way controller.
I like the book most for the following two reasons :
(1) detailed explanation into the level of implementation in many cases (data structures//memory size etc..)
(2) good review on practice vs. theory. The authors present diverse attractive theories, and on the other hand, discusses why sometimes just simpler methods are hard to be beaten down by those more complicated methods from their experience in practice.
I like that!
Look for similar items by category
- Books > Computers & Technology > Databases > Beginning & Introductory
- Books > Computers & Technology > Networking & Cloud Computing > Network Administration
- Books > Computers & Technology > Programming > Languages & Tools
- Books > Computers & Technology > Programming > Software Design, Testing & Engineering > Object-Oriented Design
- Books > Computers & Technology > Software > Word Processors & Editors
- Books > Textbooks > Computer Science & Information Systems > Database Storage & Design
- Books > Textbooks > Computer Science & Information Systems > Object-Oriented Software Design
- Books > Textbooks > Computer Science & Information Systems > Programming Languages