AOL Proudly Releases Massive Amounts of Private Data | Profitable for SEO

SpaceshipSpaceship
Watch

Gnet

VIP Member
Impact
9
AOL Proudly Releases Massive Amounts of Private Data

AOL must have missed the uproar over the DOJ’s demand for “anonymized” search data last year that caused all sorts of pain for Mcft and Google. That’s the only way to explain their release of data that includes 20 million web queries from 650,000 AOL users.

The data includes all searches from those users for a three month period this year, as well as whether they clicked on a result, what that result was and where it appeared on the result page. It’s a 439 MB compressed download, expanded to just over 2 gigs. The data is available here (SEE BELOW) and the output is in ten text files, tab delineated.

The utter stupidity of this is staggering. AOL has released very private data about its users without their permission. While the AOL username has been changed to a random ID number, the abilitiy to analyze all searches by a single user will often lead people to easily determine who the user is, and what they are up to. The data includes personal names, addresses, social security numbers and everything else someone might type into a search box.


HERE IT IS - Lots of mirrors - 439mb file - Enjoy!
This collection consists of ~20M web queries collected from ~650k users over three months.
The data is sorted by anonymous user ID and sequentially arranged.

The goal of this collection is to provide real query log data that is based on real users. It could be used for personalization, query reformulation or other types of search research.



If you don't understand the implications of this file and what you can do with it, don't bother with it. For those of you who are into SEO and running a profitable website, you'll find a lot of useful information here.
 
0
•••
The views expressed on this page by users and staff are their own, not those of NamePros.
GoDaddyGoDaddy
That's a buttload of data to Analyze. Nice post though and rep added.
 
0
•••
Yea i just extracted and saw and ill try my self to find something if not will hire someone hehe
 
0
•••
Does anyone know of a script/program or can create one that can parse these text file results for further analysis? For example, I know I can use the pivottable options on excel to parse these repeated quarries into one line but the limitations of excel do not allow this for the full text file list(s) at once. I would have to manaully split each file into probably hundreds to use in excel. I can use access but think that the pivot table also comes from excel so it still does not work.
 
0
•••
Nop i dont as im not a coder, but i do know that someone who is in this line of work has used this info to his profit im still trying to find out who and how.

Will update when i find something out.
 
0
•••
Does it contain very private information like my account details? I use AOL and i TOLD MY MUM THAT AOL ARE DODGEY AND SHE JUST IGNORED ME. I ASKED HER OVER 1 YEAR AGO FOR AN ISP CHANGE AND SHE STILL IGNORED ME. APART FROM BEING SLOW, it also does this. I am downloading the file now and i am going to search my own address and phone numbers in here to check it is not my info in it.

Geez, AOL SUCKS!!
 
0
•••
try: aolsearchdatabase.com

Have a cheap laugh at users such as: 17627832
 
0
•••
This guy is after xxx: 15251566

Robert
 
0
•••
Scary thing is, theres a number for all of us. Our infos just isn't online quite yet.
 
0
•••
Yes and the even scarier thing is that AOL released this info on its will and not forced to or hacked for.
 
0
•••
i downloaded a copy for future reference.
 
0
•••
Dluzion said:
Nop i dont as im not a coder, but i do know that someone who is in this line of work has used this info to his profit im still trying to find out who and how.

Will update when i find something out.


Ok. or any other ideas on how to parse the results.
 
0
•••
Dynadot — .com TransferDynadot — .com Transfer
Appraise.net

We're social

Escrow.com
Spaceship
Rexus Domain
CryptoExchange.com
Domain Recover
CatchDoms
DomainEasy — Zero Commission
DomDB
  • The sidebar remains visible by scrolling at a speed relative to the page’s height.
Back