Help understanding what data is commonly for sale

Hi all! I’m an IT person working with someone who’s 30+ years old and for various reasons is only now starting to use technology like phones and laptops.

They’re naturally quite an anxious person, and due to their tech-illiteracy are prone to misunderstanding things in the digital world, which brings me to why I’m posting!

They’ve latched onto digital/online privacy, growing very anxious that anyone, anywhere, anytime can see what they’re doing and have done online, and has access to an MI5/FBI-esque dossier on their activities.

My goal is to try and quell these concerns by informing them more about what data companies like Google and Facebook collect, how it’s used, and who has access to it. To that end, it would be of great help if anyone has a moment to fact-check what I’ve found out so far and comment on a few questions!

I’ve focused most of my research on Google because there is a lot written about them and their processes, and Google is a name the person I’m working with will recognise.

What information are Google and similar companies collecting?

Pretty much everything you can think of. This list by security.org is the best breakdown I’ve seen so far.

Then, from what I’ve read, this data is normally split into two categories:

  • Information you create/content you generate. This would be emails you send through Gmail, posts you make on Instagram or images you upload to Google Images.

  • Data that’s collected about you: your IP address, the times at which you’re using services, a log of topics you’ve shown interest in, etc.

What happens to all this data?

The line seems blurry, but information from the first category that you would expect to be private, such as the content of your emails, typically is. However, this varies depending on the company and how devious their privacy policy is. Information from the second category is much more obviously used for marketing, which is understandable.

Information that can be used for marketing is then collated into a profile along with a few identifiers.

How is this data accessed?

There are three main ways to access it:

  • The profiles are then aggregated into larger demographics and advertisers can advertise to those demographics – which sounds reasonable, assuming the individual profiles remain in-house at Google/other companies.

  • Real-time bidding, which is more invasive because the data is much more specific to particular users and shared with many more parties as part of the process.

  • Google’s “Customer Match” service, which allows you to upload existing information you know about a user and advertise to them if Google has a match for the data you’ve supplied.
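
For the third route, my understanding of Google’s published Customer Match guidance is that advertisers normalise each email address (trim whitespace, lowercase) and hash it with SHA-256 before upload, so the match is made on hashes rather than a plain-text list. A minimal sketch of that preparation step; the function name and the sample addresses are made up for illustration:

```python
import hashlib


def normalise_and_hash_email(email: str) -> str:
    """Trim, lowercase, then SHA-256 hash an email address (hex digest)."""
    cleaned = email.strip().lower()
    return hashlib.sha256(cleaned.encode("utf-8")).hexdigest()


# Hypothetical customer list an advertiser already holds.
customer_emails = ["  Jane.Doe@Example.com ", "jane.doe@example.com"]

hashed = [normalise_and_hash_email(e) for e in customer_emails]

# Both spellings collapse to the same hash, which is what makes matching possible.
print(hashed[0] == hashed[1])
print(len(hashed[0]))  # a SHA-256 hex digest is 64 characters long
```

The point is only that the matching key is something the advertiser already had before going to Google.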

My question here is:

How much information is revealed during the bidding process? Is it just some IDs, year of birth, gender, a string of keywords for interests, and their location, or, looking at the OpenRTB documentation, is there more concerning data exchanged as part of the “data” and “ext” attributes?

Who has access to this data?
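
To make the question concrete, here is a rough sketch of the user-facing parts of a bid request as I read the OpenRTB 2.x spec. The field names (`device.ip`, `device.ifa`, `device.geo`, `user.id`, `user.yob`, `user.gender`, `user.keywords`, `user.data`, `ext`) come from the spec; every value is invented, and `describe_exposure` is just a hypothetical helper for listing what is populated:

```python
import json

# Illustrative only: field names follow OpenRTB 2.x, all values are invented.
bid_request = {
    "id": "example-request-1",
    "device": {
        "ua": "Mozilla/5.0 (example)",
        "ip": "203.0.113.7",  # documentation-range IP address
        "ifa": "00000000-0000-0000-0000-000000000000",  # advertising ID
        "geo": {"lat": 51.5, "lon": -0.1, "country": "GBR"},
    },
    "user": {
        "id": "exchange-user-123",
        "yob": 1990,
        "gender": "F",
        "keywords": "gardening,cooking",
        "data": [  # optional segments supplied by a data provider
            {"name": "example-provider",
             "segment": [{"id": "42", "name": "home-improvement"}]}
        ],
        "ext": {},  # exchange-specific extras; contents are not standardised
    },
}


def describe_exposure(request: dict) -> list[str]:
    """Return dotted paths of the populated device/user fields."""
    paths = []
    for section in ("device", "user"):
        for key, value in request.get(section, {}).items():
            if value not in (None, "", [], {}):
                paths.append(f"{section}.{key}")
    return paths


print(json.dumps(describe_exposure(bid_request), indent=2))
```

What actually goes into “data” and “ext” depends on the exchange, which is the part I can’t pin down from the spec alone.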

Err, well… this is where I’m starting to struggle.

Anyone involved in the RTB process would have access to individual, but anonymised (ish) data.

Anyone can pay Google to advertise to a demographic that contains you, but that’s not too bad either.

Anyone could use Customer Match to advertise to you, but I assume Google wouldn’t hand over much if any data as part of this process?

The data would end up with a data broker, and could ultimately be bought and accessed by anyone who creates an account. That leads me to my next question: which forms of data (individual profiles, anonymised profiles, profiles aggregated into demographics) can be bought from brokers, and what would this data contain?

And, the million-pound question: how realistic is it that someone who knows you could find and purchase your data? Finding people from their data is proven to be possible, but that’s very different from finding the data for a person.

Any thoughts, insight or advice would be much appreciated!

Here are the field names from a basic real-world dataset, taken from a database with information on basically every adult in the USA:

  • AddressID
  • AfricanAmericanProfessionals
  • airconditioning
  • americanexpresscard
  • AmericanExpressGoldPremium
  • AnimalWelfareCharitableDonation
  • ApparelChildrens
  • ApparelMens
  • ApparelMensBigAndTall
  • ApparelWomens
  • ApparelWomensPetite
  • ApparelWomensPlusSizes
  • areacode
  • Arts
  • ArtsAndAntiquesAntiques
  • ArtsAndAntiquesArt
  • ArtsOrCulturalCharitableDonation
  • AssimilationCodes
  • AutomotiveAutoPartsAndAccessories
  • automotivebuff
  • Autowork
  • Aviation
  • BANK_CARD_HOLDER
  • BANK_CARD_PRESENCE_IN_HOUSEHOLD
  • BeautyCosmetics
  • BoatingSailing
  • bookbuyer
  • bookreader
  • BooksAndMagazinesMagazines
  • BooksAndMusicBooks
  • BooksAndMusicBooksAudio
  • BroaderLiving
  • businessowner
  • CampingHiking
  • Career
  • CareerImprovement
  • carrier_route
  • cats
  • censusblock
  • CensusMedianHomeValue
  • CensusMedianHouseholdIncome
  • censustract
  • charitable
  • CharitableDonations_Other
  • ChildrenAge00_02
  • ChildrenAge00_02Female
  • ChildrenAge00_02Male
  • ChildrenAge00_02Unknown
  • ChildrenAge03_05
  • ChildrenAge03_05Female
  • ChildrenAge03_05Male
  • ChildrenAge03_05Unknown
  • ChildrenAge06_10
  • ChildrenAge06_10Female
  • ChildrenAge06_10Male
  • ChildrenAge06_10Unknown
  • ChildrenAge11_15
  • ChildrenAge11_15Female
  • ChildrenAge11_15Male
  • ChildrenAge11_15Unknown
  • ChildrenAge16_17
  • ChildrenAge16_17Female
  • ChildrenAge16_17Male
  • ChildrenAge16_17Unknown
  • ChildrensApparelInfantsAndToddlers
  • ChildrensCharitableDonation
  • ChildrensInterests
  • ChildrensLearningAndActivityToys
  • ChildrensProductsGeneral
  • ChildrensProductsGeneralBabyCare
  • ChildrensProductsGeneralBackToSchool
  • ChristianFamilies
  • cityname
  • citynameabbr
  • CollectiblesandAntiquesGrouping
  • CollectiblesAntiques
  • CollectiblesArts
  • CollectiblesCoins
  • CollectiblesGeneral
  • CollectiblesSportsMemorabilia
  • CollectiblesStamps
  • CollectorAvid
  • collect_specialfoodsbuyer
  • CommonLiving
  • CommunityCharities
  • COMMUNITY_INVOLVEMENT_CAUSES_SUPPORTED_FINANCIALLY
  • computerowner
  • COMPUTERS
  • ComputingHomeOfficeGeneral
  • ConsumerElectronics
  • cookingenthusiast
  • CookingFoodGrouping
  • CookingGeneral
  • COOKING_GOURMET
  • countycode
  • countyname
  • Crafts
  • crafts_hobbmerchbuyer
  • CRA_IncomeClassificationCode
  • CreditCardholderUnknownType
  • CREDIT_CARD_INDICATOR
  • CreditCardNewIssue
  • CreditCardUser
  • Credit_RangeOfNewCredit
  • CreditRating
  • CulturalArtisticLiving
  • CurrentAffairsPolitics
  • DATE
  • deeddateofrefinanceday
  • deeddateofrefinancemonth
  • deeddateofrefinanceyear
  • del_point_check_digit
  • DietingWeightLoss
  • DiscoverGoldPremium
  • DiscoverRegular
  • DIYLiving
  • dogs
  • do_it_yourselfers
  • donatesbymail
  • donatestoenvironmentalcauses
  • DONOTCALL
  • dpv_code
  • DVDsVideos
  • dwellingtype
  • EducationOnline
  • ElectronicsandComputingTVVideoMovieWatcher
  • ElectronicsComputersGrouping
  • ElectronicsComputingAndHomeOffice
  • EMAIL
  • EMAILFLAG
  • Email_Score
  • EnvironmentalIssuesCharitableDonation
  • ENVIRONMENT_OR_WILDLIFE_CHARITABLE_DONATION
  • Equestrian
  • estimatedcurrenthomevaluecode
  • estimatedincomecode
  • ethniccode
  • EthnicConfidenceCode
  • ethnicgroup
  • ExerciseAerobic
  • exerciseenthusiast
  • ExerciseHealthGrouping
  • ExerciseRunningJogging
  • ExerciseWalking
  • femalemerchbuyer
  • Females_18_24
  • Females_25_34
  • Females_35_44
  • Females_45_54
  • Females_55_64
  • Females_65_74
  • Females_75_Plus
  • Fishing
  • FoodsNatural
  • FoodWines
  • GamesBoardGamesPuzzles
  • GamesComputerGames
  • GamesVideoGames
  • Gaming
  • GamingCasino
  • Gardener
  • GARDENING
  • GARDENING2
  • gardening_farmingbuyer
  • GAS_DEPARTMENT_RETAIL_CARD_HOLDER
  • GasDeptRetailCardHolder
  • GasolineOrRetailCardGoldPremium
  • GASOLINE_OR_RETAIL_CARD_REGULAR
  • generalcontributor
  • GenerationsInHousehold
  • golfenthusiasts
  • Grandchildren
  • HealthAndBeauty
  • health_institutioncontributor
  • HealthMedical
  • HeavyBusinessTravelers
  • Highbrow
  • HighEndAppliances
  • hightechleader
  • HIGH_TECH_LIVING
  • hispaniccountrycode
  • HistoryMilitary
  • HomeandGarden
  • homedecoratingenthusiast
  • HomeFurnishingsDecorating
  • homeheatindicator
  • HomeImprovement
  • HomeImprovementGrouping
  • HomeLiving
  • homeownerprobabilitymodel
  • homepurchasedateday
  • homepurchasedatemonth
  • homepurchasedateyear
  • homepurchaseprice
  • homepurchasepricecode
  • homeswimmingpoolindicator
  • homeyearbuilt
  • housenumber
  • HousePlants
  • Hunting
  • HuntingShooting
  • IndividualId
  • Individual_Match_Flag
  • InferredAge
  • InferredHouseholdRank
  • IntendToPurchaseHDTVSatelliteDish
  • IntendtoPurchaseHomeImprovement
  • InternationalAidCharitableDonation
  • Investing_Active
  • InvestingFinanceGrouping
  • investment
  • InvestmentEstimatedResidentialPropertiesOwned
  • InvestmentsForeign
  • InvestmentsPersonal
  • InvestmentsRealEstate
  • investmentstocksecurities
  • IP
  • Jewelry
  • languagecode
  • latitude
  • lengthofresidence
  • lengthofresidencecode
  • LifestylesInterestsandPassionsCollectibles
  • livingunitid
  • LoanToValue
  • longitude
  • Luggage
  • Magazines
  • MailOrderBuyer
  • mailresponder
  • malemerchbuyer
  • Males_18_24
  • Males_25_34
  • Males_35_44
  • Males_45_54
  • Males_55_64
  • Males_65_74
  • Males_75_Plus
  • MastercardGoldPremium
  • MastercardRegular
  • MembershipClub
  • MilitaryMemorabiliaWeaponry
  • mortgageamountinthousands
  • mortgageamountinthousandscode
  • mortgagelendername
  • mortgagelendernameavailable
  • mortgageloantype
  • mortgagerate
  • mortgageratetype
  • MostRecent2ndLenderCode
  • MostRecentLenderCode
  • MostRecentLenderName2nd
  • MostRecentMortgage2ndInterestRate
  • MostRecentMortgage2ndInterestRateType
  • MostRecentMortgage2ndLoanTypeCode
  • MostRecentMortgageAmount2nd
  • MostRecentMortgageDate2nd
  • MostRecentMortgageInterestRate
  • Motorcycling
  • MovieCollector
  • MovieMusicGrouping
  • msa
  • Musicalinstruments
  • MusicAvidListener
  • MusicCollector
  • MusicHomeStereo
  • MusicPlayer
  • NAME
  • Nascar
  • NCOA_Effective_date
  • Networth
  • newsandfinancial
  • NumberOfAdults
  • NumberOfChildren
  • NumberOfLinesOfCredit
  • numberofpersonsinlivingunit
  • NumberOfSources
  • occupationgroup
  • OnlinePurchasingIndicator
  • opportunityseekers
  • OtherPetOwner
  • outdoorenthusiast
  • OutdoorsGrouping
  • outdoorsportslover
  • Parenting
  • PassProspectorValueHomeValueMortgageFile
  • personagecode
  • persondateofbirthday
  • persondateofbirthmonth
  • persondateofbirthyear
  • personeducation
  • personexactage
  • personfirstname
  • persongender
  • personlastname
  • personmaritalstatus
  • personmiddleinitial
  • personoccupation
  • PersonSurnameSuffix
  • persontitleofrespect
  • pets
  • Phone
  • photography
  • PhotographyAndVideoEquipment
  • PO_BOX_FLAG
  • PoliticalCharitableDonation
  • PoliticalConservativeCharitableDonation
  • politicalcontributor
  • PoliticalLiberalCharitableDonation
  • postdirection
  • predirection
  • PREMIUM_CARD_HOLDER
  • PresenceOfBankCard
  • presenceofchildren
  • PresenceOfCreditCard
  • presenceofgoldorplatinumcreditcard
  • PresenceOfPremiumCreditCard
  • PresenceOfUpscaleRetailCard
  • primaryaddress
  • ProfessionalLiving
  • Purchase2ndMortgageAmount
  • Purchase2ndMortgageInterestRate
  • Purchase2ndMortgageInterestRateType
  • Purchase2ndMortgageLoanTypeCode
  • PurchaseLenderCode
  • PURCHASE_LENDER_NAME
  • PurchaseMortgageDate
  • RDI
  • RDID
  • ReadingAudioBooks
  • ReadingGeneral
  • ReadingGrouping
  • ReadingMagazines
  • READING_RELIGIOUS_INSPIRATIONAL
  • ReadingScienceFiction
  • refinanceamountinthousands
  • refinanceamountinthousandscode
  • refinancelendername
  • refinancelendernameavailable
  • refinanceloantype
  • refinanceratetype
  • religioncode
  • religiouscontributor
  • ReligiousInspirational
  • religiousmagazine
  • RespondedtoCatalog
  • ScienceSpace
  • ScubaDiving
  • secondaryaddress
  • secondaryaddresspresent
  • SelfImprovement
  • SeniorAdultInHousehold
  • Sewer
  • SewingKnittingNeedlework
  • SingleParent
  • Smoker
  • Snowskiing
  • SohoIndicator
  • SpectatorSportsAutoMotorcycleRacing
  • SpectatorSportsBaseball
  • SpectatorSportsBasketball
  • SpectatorSportsFootball
  • SpectatorSportsHockey
  • SpectatorSportsSoccer
  • SpectatorSportsTVSports
  • SportsandLeisure
  • SportsGrouping
  • SportyLiving
  • state
  • streetname
  • streetsuffix
  • sweepstakes
  • Telecommunications
  • Tennis
  • TheaterPerformingArts
  • timezone
  • transactiontype
  • TRAVEL
  • TravelAndEntertainmentCardHolder
  • TravelCruiseVacations
  • TravelDomestic
  • traveler
  • TravelGrouping
  • TravelInternational
  • TVCable
  • TVSatelliteDish
  • unitdesignator
  • unitdesignatornumber
  • Unknowngender_18_24
  • Unknowngender_25_34
  • Unknowngender_35_44
  • Unknowngender_45_54
  • Unknowngender_55_64
  • Unknowngender_65_74
  • Unknowngender_75_Plus
  • UPSCALE_DEPARTMENT_STORE_CARD_HOLDER
  • UpscaleLiving
  • URL
  • valuehunter
  • veteraninhousehold
  • VeteransCharitableDonation
  • VisaGoldPremium
  • VisaRegular
  • Water
  • WirelessPhone
  • Woodworking
  • WorkingWoman
  • Xaxis
  • Yaxis
  • YoungAdultInHousehold
  • YoungMensApparel
  • YoungWomensApparel
  • Zaxis
  • Zip_4
  • ZipCode
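
To get a feel for how much of that list is directly identifying versus modelled interest flags, a quick sketch like the following splits column names by keyword. The keyword list is my own rough guess at what counts as an identifier, and the sample is just a handful of the fields above, not the full file:

```python
# Rough guess at substrings that mark a directly identifying column.
IDENTIFIER_HINTS = ("name", "email", "phone", "address", "latitude",
                    "longitude", "dateofbirth", "housenumber", "zip")

# A handful of the field names listed above.
sample_fields = ["personfirstname", "personlastname", "EMAIL", "Phone", "IP",
                 "latitude", "longitude", "persondateofbirthyear",
                 "Gardener", "Smoker", "CreditRating", "religioncode"]


def split_fields(fields):
    """Split field names into (identifying, other) by case-insensitive substring match."""
    identifying, other = [], []
    for field in fields:
        lowered = field.lower()
        if lowered == "ip" or any(hint in lowered for hint in IDENTIFIER_HINTS):
            identifying.append(field)
        else:
            other.append(field)
    return identifying, other


identifying, other = split_fields(sample_fields)
print("Identifying:", identifying)
print("Other:", other)
```

Substring matching is crude (run over the full list it would also catch fields like cityname or mortgagelendername), so treat the split as a starting point rather than a classification.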