screen scrape and report (ID:4103)
Project Creator: |
oman138
FC Member For 6077 Days
Credits 70 Completed Proj. Num. 4 / 16 Total payment USD 805.00 Avg Daily Online 0.06 h (From 21/5/2007) Available on MSN/Skype Yes Last Login 5/30/2013 Peers Rating 100.00% |
---|---|
Budget: | Less than 250 |
Created: | 3/28/2008 6:39:31 PM EST |
Bidding Ends: | 4/4/2008 6:39:31 PM EST ( Expired ) |
Development Cycle: | 7 Days |
Bid Count: | 5 |
Average Bid: | 514.00 |
Project Description:
Hi, attached you will find a flow chart and sample scripts to get things started. I'm mainly interested in automating this process. The difficulty I'm having is creating a smooth flow between getting to the data, scraping it, outputting it to a file, parsing it (with curl or a similar program), and then compiling it into a CSV or text file that I can load into a database and output as a report. I'm not a programmer, and I don't know how to add screen scraping and file output for a simple Access database. Later I plan on upgrading to a MySQL or SQL database. I saved my raw script as HTML, JavaScript, Perl and PHP. I would like to generate about 5 thousand reports per day. Based on my raw script, how long will it take you to complete the project?

I need a script or program written or designed to do the following automation:
1. Open a web browser.
2. Go to the designated website.
3. Enter the ID numbers.
4. At the designated screen, scrape the data.
5. Output the data to a CSV or text file.
6. Parse the content and keep specific data only.
7. Move the new data to a database in a specific format for reporting purposes.
8. Upload the data to a designated website as HTML/PHP.

I should be able to adjust the script or program, because it will access different types of web pages, and I should be able to make changes to the code. If you have a better suggestion, please let me know.

For the first site:
1. Go to http://courts.phila.gov
2. Click on http://fjdweb2.phila.gov/fjd1/repl1/zk_fjd_public_qry_00.zp_main_idx.html
3. Click on "Display Civil Docket Report".
4. Click on "agree".
5. Enter case ID 050101005
6. Capture the data on screen.
7. Output it to a txt or CSV file, or any other format you prefer. I prefer a simple text file, like Notepad, for fewer code conflicts.

I only need the following to remain; everything else can be parsed out:
- Case ID: #########
- Plaintiff info
- Defendant info and address
- Case type

I need it to run about 4-5 thousand reports. It has to be able to run daily and automatically.

I also need it to go to different sites, such as:
1. Go to http://www2.montcopa.org/montco/site/default.asp
2. Click on http://www2.montcopa.org/montco/cwp/view,a,3,q,4033.asp
3. Then http://mway.montcopa.org/mway/site/default.asp
4. Then http://mway.montcopa.org/mway/cwp/view,a,3,Q,11670.asp
5. Then http://webapp.montcopa.org/PSI/Viewer/Search.aspx?c=CaseSearch&panel=CaseNumber
6. Enter case ID 200501001, 200501001, 2005*****

If possible, can it grab IDs from an external text or CSV file? Or can I tell it to enter all IDs between 08**-09**, for example 200501001, 200501002, 200501003? Output should be 1 file per day per site.

Newly added descriptions: If possible, I would like to use the Selenium engine. Please review it at http://selenium.openqa.org |
|
Job Type | .NET, Java, PHP, Python, Javascript, Other |
Attached Files: | 20080328183747.gif |
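
As a rough illustration of the parts of the request that do not depend on any particular site (taking IDs from a file or a numeric range, keeping only the wanted fields, and writing one file per day per site), a minimal Python sketch might look like the following. Everything site-specific here is an assumption: the field labels in `FIELD_PATTERNS` are hypothetical and would have to be matched to the real docket pages, and `fetch_text` is a placeholder for whatever browser-automation or HTTP step (Selenium, for instance) actually retrieves the page text for a case ID.

```python
import csv
import datetime
import re
from pathlib import Path
from typing import Callable, Dict, Iterable, Iterator, Optional


def ids_from_range(first: int, last: int, width: int = 9) -> Iterator[str]:
    """Yield zero-padded case IDs from first to last, inclusive."""
    for number in range(first, last + 1):
        yield str(number).zfill(width)


def ids_from_file(path: Path) -> Iterator[str]:
    """Yield one case ID per non-empty line of an external text file."""
    with path.open(encoding="utf-8") as handle:
        for line in handle:
            case_id = line.strip()
            if case_id:
                yield case_id


# Hypothetical labels: the real pages will use their own wording and layout.
FIELD_PATTERNS = {
    "case_id": re.compile(r"Case ID:\s*(\d{9})"),
    "case_type": re.compile(r"Case Type:\s*(.+)"),
    "plaintiff": re.compile(r"Plaintiff:\s*(.+)"),
    "defendant": re.compile(r"Defendant:\s*(.+)"),
    "defendant_address": re.compile(r"Address:\s*(.+)"),
}


def parse_report(text: str) -> Dict[str, str]:
    """Keep only the wanted fields; anything not matched is dropped."""
    record = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        record[name] = match.group(1).strip() if match else ""
    return record


def daily_output_path(out_dir: Path, site: str, day: datetime.date) -> Path:
    """Build the 'one file per day per site' name, e.g. phila_2008-03-28.csv."""
    return out_dir / f"{site}_{day.isoformat()}.csv"


def run_site(
    site: str,
    case_ids: Iterable[str],
    fetch_text: Callable[[str], str],  # placeholder for the scraping step
    out_dir: Path,
    day: Optional[datetime.date] = None,
) -> Path:
    """Fetch, parse and write every case ID for one site into one CSV file."""
    day = day or datetime.date.today()
    out_dir.mkdir(parents=True, exist_ok=True)
    path = daily_output_path(out_dir, site, day)
    with path.open("w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=list(FIELD_PATTERNS))
        writer.writeheader()
        for case_id in case_ids:
            writer.writerow(parse_report(fetch_text(case_id)))
    return path
```

With this shape, switching sites would mean supplying a different `fetch_text` function and a different set of patterns, while the ID handling and daily file naming stay the same. The resulting CSV could then be imported into Access or MySQL; scheduling the daily run (Windows Task Scheduler or cron) is outside this sketch.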