NameSilo

Extract domain names from a large text file?

Spacemail by SpaceshipSpacemail by Spaceship
Watch

Gene

Gene PimentelTop Member
Impact
485
I'm looking for an easy way to extract all domain names from a large TXT file that contains lots of other info besides the domain names. It would take forever to do it manually. I need a piece of software that will pull out all the domains (.com, .net, .org, and other extensions) so I am left with just a list of the domains. Any suggestions? Thanks!
 
0
•••
The views expressed on this page by users and staff are their own, not those of NamePros.
AfternicAfternic
Hi Gene.
I have just the tool for you:

http://www.midano.com/domainInventory.asp

Copy/paste your raw text into the form and run the script. It will extract all domains and sort/count/calculate percentages by extension.
Let me know if the script needs any tweeking.

Leo.
 
0
•••
Are you talking about processing a zone file ?
 
0
•••
Gene,
If your text is formatted in some way, you could import the text file to either MS Excel or MS Access and then assuming the text formats correctly, just delete the columns or info you don't need. I've done this lots of times with text...
 
0
•••
sdsinc said:
Are you talking about processing a zone file ?
THAT would likely crash my script. Or even hang the server. The application will timeout - that's for sure.
 
0
•••
0
•••
0
•••
Midano said:
Hi Gene.
I have just the tool for you:

http://www.midano.com/domainInventory.asp

Copy/paste your raw text into the form and run the script. It will extract all domains and sort/count/calculate percentages by extension.
Let me know if the script needs any tweeking.
Thanks Leo, but for some reason, it didn't work. It found only two domains out of 1100. It is a comma delimited file... is that the problem?

yandig said:
Gene,
If your text is formatted in some way, you could import the text file to either MS Excel or MS Access and then assuming the text formats correctly, just delete the columns or info you don't need. I've done this lots of times with text...
Thanks Scott, I don't have either installed on this computer... will have to try to get it.

ManicGirl said:
This online tool works pretty good...

http://www.ohashi.us/domaincleaner/
That came close to working, but for some reason it picked up 1097 out of 1100 names... I'll add the three others manually if I can figure out which ones they are! Thanks!

Edit: I found the problem -- The three it didn't pick up were .ws domains. All in all a great tool though!
 
Last edited:
0
•••
ManicGirl said:
This online tool works pretty good...

http://www.ohashi.us/domaincleaner/

doesn't seem to do anything but major extensions.

Here are some other online list cleaners:

http://www.enteryourkeyword.com/DomainFilter.php
http://www.clickmojo.com/tools/clean.php
http://www.expireddomainspy.com/domains/member/namecleaner.php (demo, demo)

And here's some freeware that does a great job, but sometimes you have to tweak the settings a little depending on the input:
http://www.domainpunch.com/products/domainfilter/

Hope this helps,
Jorge
 
0
•••
0
•••
0
•••
Gene,
Have you tried mine? How many out of 1100?
 
0
•••
Midano said:
Gene,
Have you tried mine? How many out of 1100?

Hi Leo, yes, yours was the first I tried... see above
 
0
•••
Gene said:
Hi Leo, yes, yours was the first I tried... see above
FIXED!
The script freaked out because of the amount of double quotes surrounding every field (thanks for the sample). Please try again:
http://www.midano.com/domainInventory.asp
Extracted 100 of 100 from your sample.
 
0
•••
Midano said:
FIXED!
The script freaked out because of the amount of double quotes surrounding every field (thanks for the sample). Please try again:
http://www.midano.com/domainInventory.asp
Extracted 100 of 100 from your sample.
Works great now, thanks! Only problem is it just runs all the domains together in one large paragraph instead of one per line. Any way to fix that?
 
0
•••
You can also just paste them directly into Domain Name Analyzer (The entire raw text, although it does pick up some extra stuff from time-to-time) or Available Domains Standard or Available Domains Pro (Programs, generic names, I know) and they are pretty good at picking names out of lists.
-Allan
 
0
•••
Gene said:
Works great now, thanks! Only problem is it just runs all the domains together in one large paragraph instead of one per line. Any way to fix that?
Sure:
http://www.midano.com/domainInventory.asp
The output is now being generated in both formats - string of text and one-per-line (sorted alphabetically). How's that?
 
0
•••
Midano said:
Gene said:
Works great now, thanks! Only problem is it just runs all the domains together in one large paragraph instead of one per line. Any way to fix that?
Sure:
http://www.midano.com/domainInventory.asp
The output is now being generated in both formats - string of text and one-per-line (sorted alphabetically). How's that?

Looks great, my new favorite list sorter!
 
0
•••
Midano said:
Sure:
http://www.midano.com/domainInventory.asp
The output is now being generated in both formats - string of text and one-per-line (sorted alphabetically). How's that?

That's what I'm talking about! Thanks a million. There is one more thing though. Is it possible to have the output list match the input character case (upper/lower case)? In other words, the domain file I inset has all the domain words capitalized, like "YachtLovers.com" but the resulting output list formats it like: "yachtlovers.com". It would be highly tedious to have to manually change the case on 1100 domains. I know I'm asking for a lot, but thought you would want to hear about what is wanted in a utility like this. Thanks again for all your work.
 
0
•••
Jeeze Gene, give the guy a break! :p

I agree, this would be a nice feature to have.
 
0
•••
Dynadot โ€” .com Registration $8.99Dynadot โ€” .com Registration $8.99
Appraise.net
Unstoppable Domains
Domain Recover
DomainEasy โ€” Zero Commission
  • The sidebar remains visible by scrolling at a speed relative to the pageโ€™s height.
Back