zlacker

[parent] [thread] 12 comments
1. mholt+(OP)[view] [source] 2025-02-07 18:15:13
Yeah. ZIP codes are sets in the abstract-dimensional space of carrier delivery points. I suppose you could think of them as lines, but definitely not polygons.
replies(1): >>cogman+L2
2. cogman+L2[view] [source] 2025-02-07 18:28:37
>>mholt+(OP)
Zip codes (in the US) are machine readable numbers a mail sorter can use to send a parcel to the right delivery truck for final delivery. In the US, they represent the hierarchy of postal centers with the most significant digit representing the primary hub for a region and the smallest number the actual post office that will be in charge of delivering the letter (or truck if you do the extended post code).

They don't represent geography at all, they represent the organizational structure of USPS.

They work by making the address on a letter almost meaningless. For some smaller population zip codes you can practically just put the name and zip code down and achieve delivery.

replies(4): >>Spivak+wc >>alsodu+3d >>mywitt+Nf >>mattfo+Fm
◧◩
3. Spivak+wc[view] [source] [discussion] 2025-02-07 19:25:29
>>cogman+L2
Right but this ends up being a good approximation for geography because the reality of logistics is that you end up doing a cute n-ary search of the geography. When you know the regional hub you can say for certain a huge chunk of the US the zip code doesn't represent. And then you keep n-secting. Sometimes the land-mass you get at the end is specific enough for your uses.

You're not going to wind up with a situation where zip codes with the same regional marker end up on different coasts.

replies(2): >>mattfo+cn >>makeit+lH
◧◩
4. alsodu+3d[view] [source] [discussion] 2025-02-07 19:28:11
>>cogman+L2
I agree that they weren't explicitly meant to represent geography, but implicitly they do, right? Are there cases where this is violated?

In other words, is it safe to assume that for entity in a zip code is less than x distance away from the closest entity in the same zip code?

replies(4): >>freyfo+WF >>makeit+JG >>perryg+yL >>maxeri+BP1
◧◩
5. mywitt+Nf[view] [source] [discussion] 2025-02-07 19:44:53
>>cogman+L2
> For some smaller population zip codes you can practically just put the name and zip code down and achieve delivery.

A 5+4 formatted ZIP code maps to just a handful of addresses. In cities with larger populations, the +4 could map to a single building, and in more sparely populated place, it might include houses on a handful of roads.

For smaller datasets, ZIP+4 might as well be a unique household identifier. I just checked a 10 million address database and 60% of entries had a unique ZIP+4, so one other bit of PII would be enough to be a 99.99% unique identifier per person.

With a geo-coded ZIP+4 database, you could locate people with a precision that's proportional to the population density of their region.

replies(1): >>mattfo+Sm
◧◩
6. mattfo+Fm[view] [source] [discussion] 2025-02-07 20:27:41
>>cogman+L2
Well put
◧◩◪
7. mattfo+Sm[view] [source] [discussion] 2025-02-07 20:28:50
>>mywitt+Nf
Yeah but we have that already in the census hierarchy. Plus you have to pay to access Zip+4 geospatial data and it changes sometime as frequently as quarterly
◧◩◪
8. mattfo+cn[view] [source] [discussion] 2025-02-07 20:30:03
>>Spivak+wc
Just use a spatial query. That’s what they are made for.
◧◩◪
9. freyfo+WF[view] [source] [discussion] 2025-02-07 22:34:29
>>alsodu+3d
it is safe to assume nothing.

Please see: https://opencagedata.com/guides/how-to-think-about-postcodes...

I write this as someone who grew up in the ZIP code 09180

◧◩◪
10. makeit+JG[view] [source] [discussion] 2025-02-07 22:40:16
>>alsodu+3d
It might be true, but does it help if the x varies from "on a nearby mountain" to "within a street block", and you sometimes have every habitants closer to another zip code than theirs ?
◧◩◪
11. makeit+lH[view] [source] [discussion] 2025-02-07 22:44:15
>>Spivak+wc
> You're not going to wind up with a situation where zip codes with the same regional marker end up on different coasts.

Couldn't this happen for military or proxy codes (PO boxes or other) ?

◧◩◪
12. perryg+yL[view] [source] [discussion] 2025-02-07 23:19:51
>>alsodu+3d
> less than x distance away

zip codes don't even need to be contiguous. It's a mail delivery route, not a polygon.

There are 5 cases where the assumption is violated:

- Non-contiguous areas

- Zip codes that are a single point (some big companies get their own zip with a single mailbox, e.g. GE in Schenectady, NY is zip 12345)

- Zip codes that are a single line (highway-based delivery routes)

- Overlapping boundaries (since mail routes are linear, choosing a polygon representation is arbitrary and often not unique in space)

- Residents of some zip codes are not stationary (e.g. houseboats)

In short, asking questions about the area of a zip code is a category error - zip codes do not have a uniform representation in space. And we should be highly skeptical of any geospatial analysis that assumes polygons.

◧◩◪
13. maxeri+BP1[view] [source] [discussion] 2025-02-08 13:10:16
>>alsodu+3d
They do provide a location with whatever error bars on it.

What they do not have is any sort of spatial consistency, they are a convenience for mail sorting. So if you start analyzing patterns across zip codes, you are pulling in information that is likely useless for or harmful to answering your question.

[go to top]