I work with Ad Data a lot in my job, and there's a lot of misconceptions about w...

chaps · 2026-03-06T00:22:37 1772756557

I worked in ad-tech for a year before I left the tech industry as a whole. I've also done a fair bit of investigative journalism.

Let me share a thing:

Factual, a company that specializes in hyperlocal geofencing, uses geofencing much smaller than the self-regulation that their industry allows in their own rules. I learned this after a coworker quit because our company was allowing ad targeting to people using these smaller geofences. The whole company had an all-hands about it where the CEO of the company told everyone that we were not going to stop using Factual nor the smaller-than-allowed geofences because we, ourselves, were not the ones to produce those geofences. We were just a man in the middle helping to build a system to track people at high resolution.

Please try to reconcile with what your industry has and continues to destroy.

gruez · 2026-03-06T00:52:07 1772758327

>Please try to reconcile with what your industry has and continues to destroy.

I don't see anything contradictory between your comment and the OP. Having an amoral CEO who condones breaking geotargeting self-regulation doesn't contradict OP's claim that it's hard to tie geotargeting data in bidstreams back to a particular person.

majormajor · 2026-03-06T03:57:35 1772769455

Only one person/company has to solve any given hard problem before they can sell it to interested parties. Who might lose it in a data leak, or package it up and re-sell it, etc, etc.

chaps · 2026-03-06T01:03:48 1772759028

Sure, hard. But, um, lots of things are hard.

For example, it was very hard for me to identify myself in an anonymized public dataset of vehicle trips, but I did. It was also hard to FOIA for the documents showing them writing SQL to spot my trip.. but I did.

Hard doesn't mean impossible.

vvanpo · 2026-03-06T08:28:04 1772785684

It sounds like there is a story here, have you written about this somewhere?

chaps · 2026-03-06T16:23:55 1772814235

There definitely is and I've definitely pitched it to places. The Intercept had interest but told me that they wanted me to build the story out more to be less focused on Chicago. I understand where they were coming from (and the others who said the same thing) but it wasn't possible for me to continue doing freelance work, so no stories ended up being published about it at all.

arghwhat · 2026-03-06T08:03:45 1772784225

First thing would be that a small geofence (i.e., a narrow church on available data) is entirely orthogonal to having high precision, high quality location data available.

I won't claim with certainty that this is the case, but it seems likely that Factual was overselling their capabilities. That, or they relied specifically on having users grant high precision location data access and had nothing otherwise.

Apps that already need location data are probably the most likely sources of collecting such data - food apps, dating apps, chat apps you have sent your location in, ...

chaps · 2026-03-06T18:23:17 1772821397

"Apps that already need location data are probably the most likely sources of collecting such data"

Yes, and many companies have access to both feeds.....

dygd · 2026-03-05T23:41:06 1772754066

> Each SDK might be tattling on you, but unless you give them a key to match you across apps, each signal from each app is unique

You'd be surprised what can be done when data from different source is fused together.

Large-Scale Online Deanonymization with LLMs: https://news.ycombinator.com/item?id=47139716

Robust De-anonymization of Large Sparse Datasets: https://www.cs.cornell.edu/~shmat/shmat_oak08netflix.pdf

sroussey · 2026-03-06T00:58:16 1772758696

There are whole companies that de-anon ad data as a service. Which gives the lots of data brokers the ability to not do the last mile and feel good about themselves. It’s a joke.

janalsncm · 2026-03-06T02:52:07 1772765527

I remember when the first article was posted. Their method requires two parallel corpuses e.g. people who write on LinkedIn (under their real name) and Reddit.

Also, people who post under their real name are likely to write with their real voice:

> Any deanonymization setup with ground truth introduces distributional biases. In our cross-platform datasets, the pro-files are likely easier to deanonymize than an average profile: the very fact that ground truth exists implies that the user may not have cared about anonymity in the first place. Similarly, two split-profiles of a single user are inherently alike, whereas two pseudonymous accounts of the same person (e.g., an official and a pseudonymous alt account) might expose more heterogeneous micro-data.

ducttape12 · 2026-03-05T19:06:55 1772737615

Neither the government nor an ad agency needs to know where I am, no matter how "rough" the data is. It's none of their business.

bigbuppo · 2026-03-05T23:48:00 1772754480

But dude... just think of all the optimal personalized mattres sales they can do with that data. I mean, people that use the bathroom at 3:57pm for seven minutes are 0.00138% more likely to buy a new mattress within the next six months. They need that data. Think of all the unsold mattresses.

legitster · 2026-03-05T19:27:30 1772738850

At this point, your device is not giving anyone your location without explicit permission. So it really just comes down to your IP Address, which services do need.

nerdsniper · 2026-03-06T05:41:55 1772775715

Verizon and AT&T were literally selling the realtime location of your device without any ability for users turn it off. https://arstechnica.com/tech-policy/2025/09/court-rejects-ve...

You're still that confident that no one else is selling your location data without your knowledge?

golem14 · 2026-03-05T20:36:10 1772742970

I think your is statement is inaccurate to the point of being intentionally misleading:

Many devices, when running, and in some cases even if turned off but connected to their battery, will ping cell towers (maybe even BLE/Wifi) and get triangulated by the network infrastructure (such as cell towers) without actively broadcasting the GPS location.

That's why I don't quite understand why the gubernment needs to have finer grained data (esp around the US/Mexican border). Precision location info would only be needed if you need to track people in densely populated areas.

jonas21 · 2026-03-05T22:18:32 1772749112

That location information is not available to apps or ad networks without user consent. The government can access it from the carrier with a warrant, but that's not what we're discussing here.

techdmn · 2026-03-05T23:34:49 1772753689

Carriers have also sold customer location data, no search warrant required. Though we can rest assured that the FCC has slapped the carriers' wrists with the utmost seriousness.

lesuorac · 2026-03-05T23:49:48 1772754588

And sold it to not just the government but anybody _claiming_ to be a bounty hunter (and some other professions).

tempaccount5050 · 2026-03-06T00:04:36 1772755476

Couldn't you just maintain a list of cell tower IPs and figure it out with traceroute?

wat10000 · 2026-03-06T14:59:06 1772809146

IP doesn't handle roaming very well. If you got routed onto the internet directly from your local cell tower, then your connections would drop whenever you switched to a different tower, which is somewhat suboptimal. Cell networks handle it at a lower level and route your traffic through a central location which serves as the origin of your IP traffic. Geolocate your IP while on cell data and you'll probably see something pretty far away from where you are. My phone's IP address at the moment is about 400 miles away from the actual phone.

sroussey · 2026-03-06T00:59:54 1772758794

Cell towers are not working at the IP level, so no

golem14 · 2026-03-06T00:26:07 1772756767

I think that's very much what is discussed in this whole thread.

legitster · 2026-03-05T23:16:06 1772752566

Cell-site location information (CSLI) is not available to apps or adware and is protected by the Fourth Amendment.

janalsncm · 2026-03-06T02:55:12 1772765712

You may want to look into the Third Party Doctrine.

If the government wants to tap your phone they need a warrant. If they want to buy it from a willing seller like Verizon they don’t.

kube-system · 2026-03-06T00:04:21 1772755461

It was freely sold up until a handful of years ago

golem14 · 2026-03-06T00:25:16 1772756716

Yes, but it is available to the gubernment ? Especially this gubernment?

notnullorvoid · 2026-03-05T21:54:52 1772747692

IP Address is all you need to get fairly accurate (town or neighborhood) location for most of North America.

But it is necessary to send it somewhere, otherwise the internet wouldn't work.

Unfortunately it seems to have become accepted for our devices to communicate constantly and often with services we never explicitly started communication with (like Ad networks used in Apps).

Permission systems on devices should care about Network connections just as much as Location. Ideally when installing an app you'd get the list of domains it requests to communicate with, and you could toggle them. Bonus points if the app store made it a requirement to identify which Domains are third parties and the category like an Ad service.

danaris · 2026-03-06T07:37:16 1772782636

That would be great—for about 0.3% of us, those who both care about privacy and have the knowledge and time to go through those sorts of listings.

No; what we need is to ban this data collection entirely.

titzer · 2026-03-06T00:32:33 1772757153

If you use Google Location Services, which is stock install on basically all Android devices, it absolutely is uploading "anonymized" GPS data all the time.

unethical_ban · 2026-03-05T23:25:43 1772753143

IPv6 addresses, particularly hardlines, are often accurate down to the block.

tlavoie · 2026-03-05T23:03:19 1772751799

I think the issue here is one of informed consent. You might say, "OK, this makes sense" when agreeing to location data for a weather app. In the context of whether it's going to hail soon, location is reasonable. What you only see in those GDPR-type banners is that the data is being re-sold off to 1001 "partners", none of whom are important for my hail-to-head concerns. Never mind all the cases where it's re-sold on to all the governments and personal-level creeps through aggregators.

tencentshill · 2026-03-05T21:13:56 1772745236

Then you are obligated to obscure that with a trusted no-log VPN too.

d4mi3n · 2026-03-06T00:08:56 1772755736

Well, in the case of a company trying to market to you, it literally _is_ their business. It makes them money.

The problem is that we have markets where we: - Incentivize organizations to pursue profits at the expense of everything else, which includes social good and civic rights - Rarely hold bad actors accountable (and almost never in a timely manner)

Which means, given enough time, we're always going to trend to whatever makes the most money. Targeted advertising makes money, and will continue to do so unless or until we collectively decide to make it a greater risk to profits than it is today.

spike021 · 2026-03-06T02:55:17 1772765717

i'm not confident they know where i am at all. i routinely get ads on social media for places (super random US states, cities, etc.) nowhere near where i live (SF Bay Area).

jojobas · 2026-03-05T23:32:54 1772753574

The government does need to know where the people building their lives on breaking the law are. Don't think CBP wants to know where you are.

wat10000 · 2026-03-06T14:53:29 1772808809

The government wants to know that. They don't need to know.

CPB doesn't care where I am. Unless they make a mistake and think I'm an illegal immigrant. Or they decide to teach a lesson to someone who's critical of them.

gib444 · 2026-03-06T04:57:11 1772773031

> unless you give them a key to match you across apps

Eg by running standard Android? That doesn't have eg secure app spawning, so apps can profile app initialisation data AIUI

And probably 10 other things behind the scenes that GrapheneÓS plugs?

ece · 2026-03-06T13:10:35 1772802635

Exactly, people are going to be logged into these apps with trackable identifiers. You can see it with tracker control on android, download a new app and see existing apps report things to new trackers, which seems to be happening at the sdk level.

uncletammy · 2026-03-06T14:47:33 1772808453

> Each SDK might be tattling on you, but unless you give them a key to match you across apps, each signal from each app is unique

Aren't there many examples of these? For example IMEI, IMSI, phone number, etc?

Even without "unique" signals, isn't it fairly trivial to identify a user with a handful of "not very unique" signals? User-agent, a few recent IP addresses, browser capabilities, list of installed apps, device operating system properties, etc?

cm2012 · 2026-03-05T23:23:28 1772753008

1000% agreed with this