Webbots, Spiders, and Screen Scrapers: A Guide to Developing Internet Agents with PHP/CURL (Paperback)

by Michael Schrenk (Author)
4.0 out of 5 stars (2 customer reviews)
List Price: CDN$ 49.95
Price: CDN$ 31.47 & eligible for FREE Super Saver Shipping on orders over CDN$ 39.
You Save: CDN$ 18.48 (37%)
Product Description

The Internet is bigger and better than what a mere browser allows. "Webbots, Spiders, and Screen Scrapers" is for programmers and businesspeople who want to take full advantage of the vast resources available on the Web. There's no reason to let browsers limit your online experience, especially when you can easily automate online tasks to suit your individual needs.

Learn how to write webbots and spiders that do all this and more:
- Programmatically download entire websites
- Effectively parse data from web pages
- Manage cookies
- Decode encrypted files
- Automate form submissions
- Send and receive email
- Send SMS alerts to your cell phone
- Unlock password-protected websites
- Automatically bid in online auctions
- Exchange data with FTP and NNTP servers

Sample projects using standard code libraries reinforce these new skills. You'll learn how to create your own webbots and spiders that track online prices, aggregate different data sources into a single web page, and archive the online data you just can't live without. You'll learn inside information from an experienced webbot developer on how and when to write stealthy webbots that mimic human behavior, tips for developing fault-tolerant designs, and various methods for launching and scheduling webbots. You'll also get advice on how to write webbots and spiders that respect website owner property rights, plus techniques for shielding websites from unwanted robots.

As a bonus, visit the author's website to test your webbots on sample target pages, and to download the scripts and code libraries used in the book.

Some tasks are just too tedious (or too important!) to leave to humans. Once you've automated your online life, you'll never let a browser limit the way you use the Internet again.


Customer Reviews

2 Reviews
5 star: (1)
4 star: (0)
3 star: (1)
2 star: (0)
1 star: (0)

Average Customer Review: 4.0 out of 5 stars (2 customer reviews)

Most helpful customer reviews

5.0 out of 5 stars A great introduction and more!, Jun 6 2007
By Colin J. Mccubbin (BC, Canada)
This is the book I wish I'd read last year before struggling with regex to scrape daily weather and snowfall data from my local ski hill's website and place it in a database to plot trends.

The explanations and sample applications were clear, and the libraries, available for download at the book's website, are a great help towards simplifying the process of both downloading (using the cURL functions in PHP) and extracting the required data from the page. It greatly simplifies the process of isolating specific bits of information and relieves many of the headaches that using regular expressions causes.

All in all I found this book to be an inspiration and am now looking forward to rewriting my weather scraper using the techniques described.
 
3.0 out of 5 stars Good basic intro, with a catch, April 27 2007
By Paul M. Reinheimer "Author" (Montréal, Quebec, Canada)
I picked up this book full of enthusiasm. Spiders are just plain cool: they go out and start downloading data for you, reading webpages, and even understanding them a little. My enthusiasm was dashed a little, however, on page four: "You may use any of the scripts in this book for your own personal use, as long as you agree not to redistribute them... and agree not to sell or create derivative products under any circumstances." I develop in PHP professionally, and a lot of the code I write ends up being used somewhere on some sort of for-profit basis, which pretty effectively prevents me from using any code between the covers (at its strictest reading, I'm not sure I can even change the code).

The book does a great job of introducing different sorts of web agents that you can create programmatically (more than just spiders) and introduces all sorts of interesting projects along those lines. Throughout the book, a series of libraries written by the author are leveraged to make the retrieval and parsing of the various pages much easier. While newer developers will enjoy being able to concentrate on the big picture, I found myself itching for more information on the nitty-gritty.

Some of the projects explored include: price monitoring, image capturing (want to be your own Google image search? :) ), link verification, spiders, and snipers. Each of the projects received its own chapter, which effectively covered many of the topics involved.

Overall, I would recommend this book to beginner to intermediate PHP developers looking to tackle the world of web agents. It's a good primer on the related topics and, at the very least, will give you some ideas of the complexities involved. As their skill grows, they will probably find themselves either moving past the libraries included with the book or modifying them greatly. My biggest complaint is the lack of coverage of the robots.txt file: some talk is given to it in terms of blocking robots from your own site, but I didn't see any code that actually dealt with parsing it for your own robot.