Spider or Bot for collecting data (ID:5672)
Project Creator: |
wozie
FC Member For 6444 Days
Credits 120 Completed Proj. Num. 4 / 15 Total payment USD 1,090.00 Avg Daily Online 0.03 h (From 21/5/2007) Available on MSN/Skype No Last Login 7/1/2013 Peers Rating 100.00% |
---|---|
Budget: | 250 - 500 |
Created: | 4/26/2009 2:20:53 PM EST |
Bidding Ends: | 5/8/2009 2:20:53 PM EST ( Expired ) |
Development Cycle: | 15 Days |
Bid Count: | 5
|
Average Bid: | 420.00 |
Project Description:
Hi, I am looking for a script to run on my linux server that visits a website that I specify and go through all the pages and spiders the data and saves it to a mysql database. The script needs to only collect information I specify from the site, (so searches a page for some start code and then for the end code to stop grabbing), also needs to be capible of collecting 3/4 pieces of information from one page. Example, if I wanted to get a virus database website, it gets the TITLE, TYPE, DESCRIPTION, SOLUTION and maybe a URL or Image (this can be saved to a directory on my server with a random name and this name stored in my DB). I need to be able to change the script later to collect data from other websites. Newly added descriptions: Revision: A Windows based system would be ok instead of a Linux system. If the program is written in Linux, it would need to *hide* the IP address of my server requesting the websites scraped data (if the website I am scraping blocks my IP address). |
|
Job Type | Java, PHP, Other |
Attached Files: | N/A |