[advanced search]
 

Go Back   NamePros.com > Discussion > Domain Names > Domain Newbies

Domain Newbies New to domain names? Have your questions answered here.


Closed Thread
 
LinkBack Thread Tools
Old 05-19-2008, 11:34 AM   #1 (permalink)
NamePros Regular
 
RUPERT's Avatar
 
Join Date: Nov 2007
Posts: 563
0.00 NP$ (Donate)

RUPERT is a splendid one to beholdRUPERT is a splendid one to beholdRUPERT is a splendid one to beholdRUPERT is a splendid one to beholdRUPERT is a splendid one to beholdRUPERT is a splendid one to beholdRUPERT is a splendid one to behold


Removing myself from Archive.org's history-lookup thing.

Could anyone please explain how I do this?

According to their site, I need to place a simple robots.txt file on my Web server, but I'm not exactly sure what they mean by this, or how this is done.


Thanks
RUPERT is online now  
Old 05-19-2008, 06:01 PM   #2 (permalink)
Senior Member
 
nielsencl's Avatar
 
Join Date: Jul 2006
Location: Minneapolis
Posts: 1,371
1,500.43 NP$ (Donate)

nielsencl has a brilliant futurenielsencl has a brilliant futurenielsencl has a brilliant futurenielsencl has a brilliant futurenielsencl has a brilliant futurenielsencl has a brilliant futurenielsencl has a brilliant futurenielsencl has a brilliant futurenielsencl has a brilliant futurenielsencl has a brilliant futurenielsencl has a brilliant future


All you have to do is create a simple text file and name it robots.txt. Then you have to add this information to the file:

User-agent: ia_archiver
Disallow: /

Upload the file to the folder where your site files are.

This info is from their page about this, but they don't exactly spell it out as clearly as they could.

If you are already in Archive.org I don't think this will remove any information they have for your site.
nielsencl is offline  
Old 05-19-2008, 06:22 PM   #3 (permalink)
NamePros Legend
 
weblord's Avatar
 
Join Date: Dec 2005
Location: Philippines - www.Nabaza.com
Posts: 19,840
21,700.43 NP$ (Donate)

weblord Has achieved greatnessweblord Has achieved greatnessweblord Has achieved greatnessweblord Has achieved greatnessweblord Has achieved greatnessweblord Has achieved greatnessweblord Has achieved greatnessweblord Has achieved greatnessweblord Has achieved greatnessweblord Has achieved greatnessweblord Has achieved greatness

Autism Protect Our Planet
The Internet Archive is not interested in offering access to Web sites or other Internet documents whose authors do not want their materials in the collection. To remove your site from the Wayback Machine, place a robots.txt file at the top level of your site (e.g. www.yourdomain.com/robots.txt) and then submit your site below.

The robots.txt file will do two things:

1. It will remove all documents from your domain from the Wayback Machine.
2. It will tell us not to crawl your site in the future.

To exclude the Internet Archive’s crawler (and remove documents from the Wayback Machine) while allowing all other robots to crawl your site, your robots.txt file should say:

User-agent: ia_archiver
Disallow: /

Robots.txt is the most widely used method for controlling the behavior of automated robots on your site (all major robots, including those of Google, Alta Vista, etc. respect these exclusions). It can be used to block access to the whole domain, or any file or directory within. There are a large number of resources for webmasters and site owners describing this method and how to use it. Here are some:

* http://www.robotstxt.org/
* http://pageresource.com/zine/robotstxt.htm

Once you have put a robots.txt file up, submit your site (www.yourdomain.com) on the form on http://pages.alexa.com/help/webmaste...tml#crawl_site.

The robots.txt file must be placed at the root of your domain (www.yourdomain.com/robots.txt). If you cannot put a robots.txt file up, read our exclusion policy. If you think it applies to you, send a request to us at info@archive.org.
http://www.archive.org/about/exclude.php
weblord is offline  
Closed Thread


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is Off
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Site Sponsors
Advertise your business at NamePros

All times are GMT -7. The time now is 03:45 AM.


Powered by: vBulletin® Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.
Search Engine Friendly URLs by vBSEO 3.3.0
Template-Modifications by TMS
vBCredits v1.4 Copyright ©2007 - 2008, PixelFX Studios

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85