Search engine, add-url, search, edit and remove pages with adverts (ID:6218)
Project Creator: | wozie (FC member for 6444 days; credits: 120; completed projects: 4 / 15; total payment: USD 1,090.00; average daily online: 0.03 h since 21/5/2007; available on MSN/Skype: no; last login: 7/1/2013; peer rating: 100.00%) |
---|---|
Budget: | Not Sure/Confidential |
Created: | 11/8/2009 4:59:44 AM EST |
Bidding Ends: | 11/25/2009 4:59:44 AM EST ( Expired ) |
Development Cycle: | 14 Days |
Bid Count: | 1 |
Average Bid: | 300.00 |
Project Description:
Hi, I am looking for a search engine written in PHP / MySQL / AJAX / JavaScript. People will come along and enter their website URL to be spidered and added to the search engine index. Once a site has been spidered and checked for certain content, it is added to the search engine database. When a user enters his URL:
1. The search engine verifies that the URL is valid.
2. The URL goes into a pending spider queue database.
3. The spider is run by a CRON job every hour and spiders the URLs in the pending spider queue (if the spider is already running, a second instance of the job is not started).
4. The spider follows all links in the website, indexes all pages, and follows links to other websites by adding those URLs to the pending spider queue.
5. The script needs to connect to an ad-blocking database (https://easylist.adblockplus.org/easylist.txt) and download the latest ad definitions. It then checks the web pages, and if the website has ANY advertisements in its pages, the site is not indexed and is not added to the database. If the website does not have any advertisements, it IS added to the database.
6. The users have a basic control panel so they can add/remove URLs.
The script needs to be easy on CPU usage (i.e. spider 1 site at a time), and the databases need to be optimized and quick to display results. The front page is a search box, simple and easy to use like Google, with a few links like |
|
Job Type: | PHP, Javascript, Other |
Attached Files: | N/A |
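
Steps 1, 2 and 5 of the project description could be sketched roughly as follows. This is only an illustration of the logic in JavaScript, not the requested implementation: the posting asks for PHP and MySQL on the server, so the in-memory queue here merely stands in for the pending spider queue table. All function and class names are hypothetical. The ad check is heavily simplified: it assumes the EasyList text has already been downloaded, reads only plain `||domain^` rules from it, and ignores rule options, exception rules and element-hiding rules.

```javascript
// Hypothetical helper names throughout; none of them come from the posting.

// Step 1: accept only absolute http(s) URLs.
function isValidUrl(input) {
  try {
    const u = new URL(input);
    return u.protocol === "http:" || u.protocol === "https:";
  } catch (e) {
    return false; // not parseable as an absolute URL
  }
}

// Step 2: pending spider queue (in-memory stand-in for a database table).
class PendingQueue {
  constructor() {
    this.seen = new Set(); // normalized URLs already queued
    this.items = [];       // FIFO order
  }
  add(url) {
    if (!isValidUrl(url)) return false;
    const key = new URL(url).href; // normalized form
    if (this.seen.has(key)) return false;
    this.seen.add(key);
    this.items.push(key);
    return true;
  }
  next() {
    return this.items.shift(); // undefined when the queue is empty
  }
}

// Step 5 (simplified): collect domains from plain "||domain^" rules only.
function parseAdDomains(listText) {
  const domains = new Set();
  for (const raw of listText.split(/\r?\n/)) {
    const m = /^\|\|([a-z0-9.-]+)\^$/i.exec(raw.trim());
    if (m) domains.add(m[1].toLowerCase());
  }
  return domains;
}

// True if the host or any parent domain of it is in the set.
function hostMatches(host, domains) {
  const parts = host.toLowerCase().split(".");
  for (let i = 0; i < parts.length; i++) {
    if (domains.has(parts.slice(i).join("."))) return true;
  }
  return false;
}

// True if any src attribute in the HTML points at a listed ad domain.
function pageHasAds(html, pageUrl, domains) {
  const attr = /\bsrc\s*=\s*["']([^"']+)["']/gi;
  let m;
  while ((m = attr.exec(html)) !== null) {
    let host;
    try {
      host = new URL(m[1], pageUrl).hostname; // resolve relative references
    } catch (e) {
      continue; // skip values that cannot be resolved to a URL
    }
    if (hostMatches(host, domains)) return true;
  }
  return false;
}
```

A crawler built on this would take one URL at a time from `next()`, fetch the page, and index it only when `pageHasAds` returns `false`. The single-instance guard for the hourly CRON job (step 3) is not covered here.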