Website scraping to improve data quality

by David Cameron on November 19, 2008

Do you find yourself complaining that the data you get on your feed isn’t to the same standard that you see on the Vendor website?


Do you worry that your content being the same as everyone else’s will hurt you in the search engine positions?

Someone has written here how they use the supplied vendor feed and advanced use of Excel to scrape for particular fields that may not be available via a feed in order to differentiate their own site.

I imagine that the best scraping would best be done on the webserver to serve up live stock positions for hard to find items … remember the Nintendo Wii availability checkers that sprung up?

I have never tried scraping, but it appears that this somewhat legitimate use versus posting up sites of nothingness and plagiarised content if you know what I mean.

Have you used any scraping techniques in the past? I would be interested in your experience – please comment either here or in the Forum

Leave a Comment

You can use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Send To Twitter

Spam Protection by WP-SpamFree

Subscribe without commenting

Previous post:

Next post: