Search Cloud For Nutch! (ID:2889)
Project Creator: |
Kano
FC Member For 6536 Days
Credits 20 Completed Proj. Num. 0 / 3 Total payment USD Avg Daily Online 0.00 h (From 21/5/2007) Available on MSN/Skype No Last Login 8/29/2007 Peers Rating 0.00% ![]() ![]() ![]() |
---|---|
Budget: | Less than 250 |
Created: | 8/17/2007 2:48:15 PM EST |
Bidding Ends: | 8/24/2007 2:48:15 PM EST ( Expired ) |
Development Cycle: | 14 Days |
Bid Count: | 0
|
Average Bid: | |
Project Description:
What I need created: I need a dynamically updated Search Cloud that works like a standard tag cloud, I need it to be able to work with nutch 0.7.2 with tomcat 4.x and java 1.4.x. the searches all log through tomcat's catalina.out sequence needing to be read: query: query catalina.out snippit 050913 180935 11 query request from x.x.x.xxx 050913 180935 11 query: wow 050913 180935 11 searching for 20 raw hits 050913 180935 11 total hits: 0 050913 180939 12 query request from x.x.x.xxx 050913 180939 12 query: slow 050913 180939 12 searching for 20 raw hits 050913 180939 12 total hits: 0 end catalina.out snippit from this document the search cloud would produce 2 words wow and slow each having an equal score of 1 each having an equal size I want the text size range to be 60% to 140% I want the color range to be black at 60% and Green at 140% Needs to be able to use multiple word queries ie: brown dog It needs to be able to only parse the last 100000 entries of the catalina.out file for newest querys (make 100000 a variable in case we want it to be 1 million or one hundred), current file is over 200mb from several hundred thousand querys with internal testing. production file will be several Gbytes it needs to be automatically updated, every 24hrs, the system is not currently linux, its a windows environment with cygwin that means it has to pase the data make the cloud code and update it on the pages every 24hrs the links for the cloud term results is search.jsp page where queries work like this: http://search.myserver.com/search.jsp?query=search&hitsPerPage=10&hitsPerSite=0&clustering=&pivot=0 for multiple words its like this: http://search.myserver.com/search.jsp?query=term1+term2&hitsPerPage=10&hitsPerSite=0&clustering=&pivot=0 with the same program that does the search cloud I also need the program to autofill the search bar as people search. look at the search on www.mininova.org, thats exactly what I want when you start to fill out a search it fills it in from the search querys of the catalina.out file once again it should have a list of the last 100000 queries like the cloud to use for the autofill so basically 2 part program that uses the same datasource We are opeing up a search engine by rebuilding nutch and making it our own. If this works out we will have alot more work for you. This Project we will pay $50.00. Thanks Michael |
|
Job Type | |
Attached Files: | N/A |