[advanced search]
 

Go Back   NamePros.com > Discussion > Web Design & Development > Programming

Programming PHP, Perl, Ruby on Rails, AJAX, HTML, XHTML, CSS, JavaScript, MySQL and any other coding topics.


Closed Thread
 
LinkBack Thread Tools
Old 05-10-2007, 09:11 AM   #1 (permalink)
New Member
 
Join Date: May 2007
Posts: 4
0.00 NP$ (Donate)

Personaltrainer is an unknown quantity at this point


Building a web site crawler

Hi,
We are in the process of building a cutomised site crawlers. We are quiet successful in building one. But I have a question for the expert coders. Is it possible to fetch last modified data of a page from anywhere if so how is it done?
__________________

Templates with CMS at 50$ http://www.affordablewebsolutions.com/ready-templates.php
Personaltrainer is offline  
Old 05-10-2007, 02:12 PM   #2 (permalink)
Traveller
 
-NC-'s Avatar
 
Join Date: Mar 2007
Location: Yet another city
Posts: 1,392
614.57 NP$ (Donate)

-NC- has much to be proud of-NC- has much to be proud of-NC- has much to be proud of-NC- has much to be proud of-NC- has much to be proud of-NC- has much to be proud of-NC- has much to be proud of-NC- has much to be proud of-NC- has much to be proud of-NC- has much to be proud of

Animal Cruelty Animal Rescue Ethan Allen Fund Protect Our Planet
Found this:

14.29 Last-Modified

The Last-Modified entity-header field indicates the date and time at which the origin server believes the variant was last modified.

Last-Modified = "Last-Modified" ":" HTTP-date

http://www.w3.org/Protocols/rfc2616/rfc2616-sec14.html

Might only work for static pages where the server can tell when the content has changed though. (e.g. web server has no idea if the database content has changed)
-NC- is offline  
Old 05-12-2007, 02:50 PM   #3 (permalink)
NamePros Member
 
Join Date: Feb 2006
Location: Online
Posts: 111
100.80 NP$ (Donate)

Barts has a spectacular aura aboutBarts has a spectacular aura about


The http last-modified header only works if your webserver supports it. If you are crawling sites on servers that are not yours, you shouldn't rely on that header. But besides that I have the feeling that you would like to only fetch the differences since the last update. Right? If so, than the answer is that you should code it yourself. Fetch a page, save that copy in a database (or file if you like) and compare the saved copy with the online version. Thazzz how you should do it.
__________________
A soul?... I've got not use for such frivolities.
Barts is offline  
Old 05-28-2007, 06:44 AM   #4 (permalink)
New Member
 
Join Date: May 2007
Posts: 4
0.00 NP$ (Donate)

Personaltrainer is an unknown quantity at this point


Thanks buddies, But I could not find a practical solution for this. May be its yet to be resolved for dynamic sites
__________________

Templates with CMS at 50$ http://www.affordablewebsolutions.com/ready-templates.php
Personaltrainer is offline  
Old 05-30-2007, 11:20 PM   #5 (permalink)
NamePros Regular
 
DylanButler's Avatar
 
Join Date: Jan 2006
Location: San Diego, CA
Posts: 704
0.00 NP$ (Donate)

DylanButler is a splendid one to beholdDylanButler is a splendid one to beholdDylanButler is a splendid one to beholdDylanButler is a splendid one to beholdDylanButler is a splendid one to beholdDylanButler is a splendid one to beholdDylanButler is a splendid one to behold


more posts from you guys please
__________________
:great: -Dylan Butler

EXAMP - San Diego Web Design
DylanButler is offline  
Closed Thread


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Site Sponsors
Advertise your business at NamePros

All times are GMT -7. The time now is 10:02 PM.


Powered by: vBulletin® Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.
Search Engine Friendly URLs by vBSEO 3.3.0
Template-Modifications by TMS
vBCredits v1.4 Copyright ©2007 - 2008, PixelFX Studios

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85