Hacker News

4 years ago by jedimastert

I'm baffled they used any plurals at all. I feel like "only one form of a word" should have been near top priority for generating the dictionary. No plurals, no conjugations (even better would be to pick one conjugation, say, infinitive) maybe even no adjective/adverb forms of verb.

I wonder how many words in the english language would be left? I know the english language is massive

4 years ago by cybergibbons

The word list already is 40k long. That's beyond most people's vocab and includes really awkward to spell words.

IMO, if the solution is to use words, then What4Words would have had a word list of less than 3000, resulting in a word list with less confusable words and more accessible to children and people who struggle to read or write.

4 years ago by pmoriarty

Spelling is the Achilles' heel of all word-based systems.

People who have trouble with spelling (such as non-native speakers of whatever language the words come from or children) may not be able to rely on word-based systems. Word-based systems are also going to be hampered by speakers of different accents.

Letter- and number-based systems are probably always going to be much more robust, especially when used with a standard phonetic alphabet[1]. There could even be a checksum letter/number to make the system even more robust. Unfortunately, such systems will never be as memorable or as easy to say as a few words (spelling issues aside).

[1] - https://en.wikipedia.org/wiki/NATO_phonetic_alphabet

4 years ago by jeroenhd

Spelling is very much an underappreciated problem. Written English is particularly bad, sometimes requiring memorization that's not much unlike Chinese characters, because of the written language not adapting to the vowel shifts and changes in pronunciation, as well as the mess of a history the language has gone through as it developed in the UK.

Children, dyslexics, non-native speakers, all will have a hard time writing down many words even if they're part of the top 1000 list.

With the right word set (avoiding homophones) and the presence of autocorrect (or an input only allowing the limited word list), you could probably create a pretty resilient system if you only take the most common words (top 1k would likely be sufficient). You'll need a longer address, but remembering six words is a lot easier than remembering six letters.

Sadly, the entire concept is flawed and doomed for as long as the goons of What Three Words operate their business like a failed media company, sending out threats, falsifying legal documents to enforce takedown requests, and lawyering up to anyone who even considers applying "their" algorithm on their own. "Their" idea may be patentable in the US, but in areas of the world where there is no such patent, these goons cannot take down the competition without lying and dishonesty and they've shown to do anything to prevent any competitor from entering the market.

4 years ago by vonuebelgarten

The sad part is that this problem was solved decades ago with grid locators like Maidenhead[1]. It's a purely mathematical conversions with no lookup tables, cover the entire world and alternate pairs of letters with pairs of digits (so no confusing or language and culture dependent words).

Code becomes longer according to the required precision, so eight digits are enough for a typical city neighborhood, ten digits goes to a 30x20m block that is good for disambiguation even in the most dense urban areas. Adding more digits will work too, but that just more work.

A typical location would be something like: fm19oc75hv

[1] https://en.wikipedia.org/wiki/Maidenhead_Locator_System

4 years ago by flir

> Spelling is the Achilles' heel of all word-based systems.

Stick with nouns? Then you can use icons as supplements. Ball, Pen, Light bulb, Burger, Guitar.

Possibly easier to translate between languages, too.

4 years ago by fpig

I don't disagree, but not sure why you associate poor spelling with non-native speakers.

4 years ago by jfrunyon

Bingo. Either more words, or fewer grid squares (f.e. instead of 3x3m grid, a 4x4m grid). I truly don't understand why they thought they needed 3m (10ft) of precision: my home has something like 10 What3Words addresses (80 if you count the yard and the other side of the duplex!). The old outhouse at the family ranch has more than one!

IMO, they also should have built redundancy (some form of error checking at least) into the encoding.

4 years ago by noir_lord

Funnily enough I did that math the other day when this kicked off.

It comes out to 4th root of ((510.1 trillion)/9) which is ~2743.8

Where it gets interesting is that it's only ~4752 for 1mx1m cells :)

Another use for `Correct Horse Battery Staple` I guess.

4 years ago by kybernetikos

I did one of these for fun at the beginning of the year, and ended up needing to spend way more time on the wordlist than I'd expected. In the end I felt that a list of 4096 words was a decent compromise between accuracy and is still fairly managable for trying to remove words that are too easy to mistake for each other. It lets you do everywhere on earth to slightly more accuracy than what3words in 4 words.

Something that what3words does is not have an obvious hierarchy of words (e.g. where the first one covers a larger area, and subsequent words home in). I didn't like that, but I understand why they do it - if a single word is going to cover a large area, you have to be extra careful that you don't choose something offensive for a particular region. By having no obvious structure, they get away with being less careful on the wordlist.

4 years ago by eterm

The issue of hierarchy is that it's orthogonal to having very different results for nearby areas.

With a hierarchy you immediately run into, "Am I in gibbons.apple.banana or was it gibbons.apple.bandana" which is just down the road.

Without hierarchy it jumps out that one of those results is improbable if tallied with any knowledge of roughly where the person is.

4 years ago by open0

English wiktionary had over 500k entries as of a few years back.

A "word family" is the term for the related forms of a word resulting from derivation, inflection, and other processes. Usually "conjugation" only refers to inflection of verbs whereas "declension" is used for inflection of nouns. And "lexeme" or "headword" are sometimes used to refer to some canonical form of a word family.

https://en.wikipedia.org/wiki/Word_family

https://en.wikipedia.org/wiki/Lexeme

4 years ago by kybernetikos

I did a words lat/long system as a couple day project at Christmas. https://wherewords.id/

I used the google S2 mapping, and spent quite a bit of time on my own wordlist. I like the S2 mapping - it lets you use fewer words to refer to a bigger area. I write a little bit about it here: https://wherewords.id/+about

I would have really liked to have found some good research on 'words that are hard to mistake when spoken aloud across various accents', but in the end I just spent ages going through my word list with various libraries and manually. And having friends point me to locations that had incredibly rude or offensive wherewords!

Really there should be a free, open source word<->location system that we all just standardise on and build into GPS systems and maps, because a common, free system would be genuinely useful.

4 years ago by rozab

How many times has the wheel been reinvented with these phonetic word list schemes? Loads of crypto apps use them, e.g. for bitcoin wallet recovery. Do they all roll their own scheme? Someone should attempt to make a standard if none exists.

Interoperability isn't even that important, I just want to pull something into my project.

4 years ago by kybernetikos

I originally planned to use the crypto bip39 wordlist but ultimately it proved not to be suitable. I talk a little about it https://wherewords.id/+about

4 years ago by rozab

Great summary, thanks

4 years ago by snypher

Crypto wallets may use BIP39 protocol, it might be close to a 'standard'?

https://github.com/bitcoin/bips/blob/master/bip-0039.mediawi...

Edit: 2048 English wordlist is here https://github.com/bitcoin/bips/blob/master/bip-0039/english...

4 years ago by ryebit

In addition to BIP39 cited below, the EFF also published some useful wordlists a few years ago... https://www.eff.org/deeplinks/2016/07/new-wordlists-random-p...

One has the nice property that all words have unique 3 letter prefix. But not as many in prefix list (1296) as their "long" list (7776).

That said I'm kinda partial to BIP39... first four letters are unique, and words are more uniform than EFF prefix list.

But it looks like GPS addressing schemes like w3w need a MUCH larger list by an order of magnitude.

4 years ago by kybernetikos

> But it looks like GPS addressing schemes like w3w need a MUCH larger list by an order of magnitude.

If you're trying to get down to 3 words yes, but if you're happy with 4 words (and I am), then the long list would be more than enough. One problem I found was that I think people don't want strongly negative phrases being used to describe where they live (rural poverty assault, evil disease island). The BIP39 wordlist can create word groups that are very offensive, or that would feel rascist if applied to particular parts of the world. It also has words that are easily confused for other words like era/error, son/sun, alter/altar, aisle/I'll, floor/flaw, .

I did use words from BIP39, but had to remove quite a few in the end for my wordlist (e.g. blast, load, black, finger, female etc.) because of the unfortunate clusters it could create.

Ultimately, I think coming up with a good wordlist is still a bit of an unsolved problem. The ideal wordlist for something like this

1. can't form obviously rascist or overtly sexual word clusters

2. can be easily distinguished in spoken communication

3. can be easily distinguished in written communication

4. doesn't have words that sound similar to other words in any of the most common accents

5. doesn't have geographic words (it'd be confusing)

6. is mainly positive or neutral words

7. consists of words that are common and easy to spell, avoiding words that are commonly misspelled and where there are different standard ways of spelling the words depending on region

8. has words that are not too long, a small number of syllables

9. doesn't contain words that are concatenations of other words in the wordlist

Obviously not easy if you need a significant number of them.

4 years ago by kybernetikos

> I just want to pull something into my project

My experimentation mostly lives here: https://github.com/kybernetikos/wherewords/tree/main/lib/wor...

If you're coding in JS, I'd use https://www.npmjs.com/package/verbal-id for 10bits, bip39 for 11 bits, and maybe my wordlist for 12 bits.

4 years ago by MzHN

Wow, I love this! It is such a great application of S2, which is an awesome concept itself. A drilldown of the words would be a neat way to explore the mapping. In other words: first you see the largest areas, and you can click on them, then you see areas/words within that area, and as you drill down you form the wherewords location.

4 years ago by kybernetikos

You're right that would be cool. I haven't done anything on it since the beginning of the year.

There is a small easter egg though - wherewords.id works with fewer than the four words, so for example, if you go to https://wherewords.id/system/festival/ you'll see it highlight a particular part of New York City, while https://wherewords.id/system covers a big part of New York State and Pennsylvania.

4 years ago by teachingassist

What3Words accounts at Companies House in the UK are extremely interesting.

They burned through just over £15,000,000 in 2019, for a total revenue just under £400,000 from 100+ employees.

4 years ago by ealexhudson

I thought you were kidding, but apparently not. Even worse : £320k of that revenue was from sales to Daimler AG (a shareholder) and is described as "services". I would love to understand the business model here, they've maintained £23M ish cash at hand since end 2017 but I'm not clear what the play here is...

4 years ago by trollied

I think that £23m is just capital from issued shares. Anyway, it's certainly not viable. I guess it remains to be seen how long investors will keep on chucking money at it to keep it afloat.

Filing history, if anyone else is interested: https://find-and-update.company-information.service.gov.uk/c...

4 years ago by ealexhudson

No, it's definitely cash and equivalents. The shareholder account is virtually £50m at this point, and they had nothing like that cash until the raises in 2017.

4 years ago by mbirth

IIRC Daimler wanted to integrate w3w into their onboard navigation units.

4 years ago by Doctor_Fegg

They've raised over £80m. Astonishing.

https://tech.eu/brief/london-based-location-tech-what3words-...

4 years ago by tgv

You mean people thought they were going to make a load of money with this? And that they spend 5 times more money on this simplistic google maps ripoff than the 40 employee company I work for, which runs a £250k profit? I am baffled.

4 years ago by mavhc

Costs a lot to get all the free publicity from journalists, got to avoid them pointing out it's a private company trying to control access to a coordinate system, or mentioning any alternatives.

4 years ago by rdpintqogeogsaa

Going by other people's experience with them[0], I wonder how long this post will stay up.

[0] https://twitter.com/AaronToponce/status/1387933438305394690

4 years ago by ykat7

Aaron posted an update. As of an hour ago they "consider the matter closed": https://twitter.com/AaronToponce/status/1388828107407245312

4 years ago by detaro

Given that cybergibbons has spent the past weeks looking into this very loudly and publicly, don't expect him to back down.

4 years ago by boramalper

For future reference, https://web.archive.org/web/20210502132217/https://cybergibb...

4 years ago by dmitrykoval

Story by TC - https://techcrunch.com/2021/04/30/what3words-legal-threat-wh...

4 years ago by NelsonMinar

A bunch of replies to this are "I implemented my own version of What3Words". That sport has been around for a few years now. https://what3emojis.com/ is probably the most long-lived; I'm writing from telephone bus eggplant. (Hacker News' lack of emoji support makes interop here a little difficult.)

Unfortunately http://www.what3fucks.com/ seems to have ended their incredible journey.

It's a really stupid idea for addressing, triply so when you consider the pathetic little proprietary word database that's What3Words tool for extracting rents. Geohash is the oldest system that solves the problem of "give me a short textual name for a place" and has a nice ability to get more precise.

4 years ago by daleharvey

A location near me

geohash: gcuvz30z w3w: drift.march.donor

I had to switch tabs 3 times to copy across the geohash since the page I found it on wouldnt let me copy and paste, I remembered the w3w immediately.

I am 100% against something like a global addressing standard being owned by a single company, however the amount of people ignoring w3w's extremely obvious utility and pointing to pluscodes, geohashing, grid references, geocoords etc is pretty tiring.

We have a standards body for emojis, cant we just agree on an open w3w implementation.

4 years ago by jasonwatkinspdx

Nelson is right. A bunch of us have played around with designing our own human friendly geohash scheme.

If you divide earth up into 4m x 4m cells using something like the S2 projection, you can uniquely address these using phrases of 3 words drawn from a database just under 32,000 words. The S2 projection ensures that the words in the higher levels of the hierarchy will change quite slowly. In most local contexts there will be only a single commonly repeated word. In a small number of more exceptional points there'd be just 2 or 3 unique values in use. So in practice you could use longer addresses and a smaller dictionary if you liked. This is probably a good idea for ergonomic reasons, as we can pick a set of words that are readily translated between languages, weed out any that are overly similar, etc.

I never uploaded the code anywhere, and it's on a laptop that died, but I busted out a toy version of the above in an afternoon a couple years ago just for fun. I used 4 word addresses and a database of about 2,500 of the most common english words.

W3W's entire value proposition is that they should earn a global rent off the use of addresses vs the above. It's absurd. And they intentionally avoid hierarchical organization of the addresses specifically to make it necessary to use their app/service to look crap up, which makes their service hilariously worse than the scheme described above.

With an S2 style hierarchy people will intuitively begin to recognize the common words in their living area at each level, and can use them in conversation without even consulting software. This is completely impossible with W3W's scheme.

The problem isn't figuring out a good scheme. Most of the people on this forum can manage that in a couple hours. The problem is the same with all other standardization efforts: getting a critical mass of people who give enough of a crap to use the standard.

This is an area where Google could ship something in maps and it'd have a decent chance of adoption outside of it. If you work on maps consider pitching a more ergonomic version of plus codes.

4 years ago by tobr

> I remembered the w3w immediately.

But in what situation is this useful? How is it more convenient or useful than a regular address? Even if you’re somewhere where there are no addresses, and you need to describe a location that has no other descriptive characteristics, isn’t it a lot easier to just drop a pin/bookmark on the map in your smartphone? Especially considering you can’t reasonably run the algorithm without a digital device of some kind.

4 years ago by avianlyric

Classic example is emergency services.

Having the ability to turn your location into three words that will easily survive communication over a phone with dodge signal is very useful.

You could argue that that lat/lon or a street address should be fine. But memorising a two sequences of numbers, then repeating them accurately over a poor phone line is hard.

Street addresses are also hard, there’s frequently not a nearby road sign, or worse, the road name is ambiguous (your city may have multiple streets with the same name). Hell there’s no guarantee that the emergency services even have your street name in their database if it’s a new street, or it’s name was changed recently.

So basically any scenario where you need to transmit a location accurately between two parties who are operating two different mapping systems, and communicating over an unreliable comms link (phone, radio etc), is a good use case for W3W. W3W is like the phonetic alphabet for locations.

Personally I hate the fact the W3W are extracting rents from this services. But I can’t deny the utility of their service.

4 years ago by daleharvey

Transferring the location from one place to another

So aside from the fact that my address has around a 50/50 success rate with people trying to find it (My address is a building + street name that has a mirror building + street name across the other side of a motorway)

Transferring the address, the obvious example people give is emergency services but this applies to so many situations, I volunteer with a litter picking group and they have found w3w extremely useful to communicate locations, you "can" transfer a pin but you need those devices set up to do a transfer already, with w3w I can check the website for the location on my way out the door and enter it to my map on the way, people who arent setup as contacts can transfer it without having to trade phone numbers etc.

Its hard to understand how someone could not find a way to easily memorize and communicate a specific location a very useful thing?

4 years ago by physicsguy

It makes sense more in some countries than others.

In Japan, for e.g. there are no street numbers.

In the U.K. we have postal codes that generally refer to a street. But they sometimes can be only a portion of a street, sometimes they’re a building, sometimes they’re a large business who might be across multiple sites. At other times (usually rural) they’re very very long roads, so sticking the address into a Sat Nav takes you to that road but not the point on the road that you want to travel to. Because the system is so mixed, 99/100 nthe address is absolutely fine, but occasionally it isn’t. I agree dropping a pin is more useful though.

Ireland had no postal codes at all until recently.

4 years ago by kitd

But in what situation is this useful?

On the phone? There's a bit I don't like about W3W, but efficiency of communication is their killer feature.

4 years ago by real-dino

> cant we just agree on an open w3w implementation.

They don't want that to happen, at least not an open source implementation. They want to be in the position of Pantone and colour.

4 years ago by cybergibbons

Why 3 words? Why not 4?

4 years ago by aembleton

Because it is enough to provide sufficient accuracy.

4 years ago by Twisell

The fundamental flaw for me goes deeper.

Both services mimic the principle of URL shortener applied to geographic location. (Even if, as you point out, one is public domain machine readable and the other is proprietary human readable one).

But why are url shortener so popular in the first place? Why are they perfectly usable even if competing services exits? (and I'd bet most popular are self-generated like youtu.be/dQw4w9WgXcQ).

I think they are popular mainly because they rely on the ubiquitous HTTP protocol to be fully interoperable with almost any HTTP client.

There is no such universal protocol or library to manage geospatial data provided in "custom hashed format". But all of the various tools available will gladly accept a latitude/longitude XY coordinate pair because they are perfectly machine readable and while not human friendly they actually mean something so that professionals can make sense of the numbers if needed. Also as far as variable precision is needed, let's throw a round(,) function and be set with it.

So as of now both solution are essentially some kind of private API with no widely available and stadardised support to revert to a usable latitude/longitude. So I would guess many GIS power users look at this suspiciously secretly hoping that no one will ever share such obfuscated data format to them for any critical use.

NB : For more complex data representations there is a whole standard defining organisation https://www.ogc.org that define horribly complex (yet useful) data structures and gladly make the standard freely and publicly available. Maybe they could enforce a flavor of geohash into a FOSS standard so that tooling could refer to it, but I'm not sure it's on their roadmap.

4 years ago by rburhum

Hi Nelson, maybe you can explain this to me: why so much hate towards what3words? Chris once came to our office to see if there was a fit for us to use their invention. We did not find that it solved a pain point for us, so nothing came out of it... but the amount of rage that I see that they have been accumulating towards the years. Is there a story behind it beyond "I want to do something similar and can implement it, but <patent>?" Thanks in advance

4 years ago by NelsonMinar

Hi Ragi! My complaint with What3Words is 85% their proprietary business model. They are literally trying to convince less developed countries to adopt their system in lieu of developing their own addressing systems. So that What3Words can forever force that country and anyone sending mail to it to pay them fees. It is evil.

This is not theoretical; back in 2017 they were bragging at how they'd convinced the nations of Kiribati, Mongolia, Sint-Maarten, Côte d’Ivoire, Djibouti, Tonga, Nigeria and the Solomon Islands to use them as a national standard. I'm certain that the actual usage is exaggerated on their part but imagine if it were really true. Addressing is far too important to be held by a proprietary company. Addressing is a national service that countries should provide as part of being a nation, and there's a long history of countries developing addressing systems that does not rely on some predatory intellectual property system.

5% of my annoyance is What3Words pretends like they invented some brilliant new idea that's unique and should be IP protected. No, the idea is older than them. And the implementation is obvious to anyone with ordinary skill in the art of addressing. Anyone competent with GIS concepts can design a similar system in a few days' work. And there are many better alternatives.

The other 10% of my complaint is their static addressing model. It's not good that two houses nearby have completely different addresses; real addressing always has hierarchical names and there's a reason for that. There's also a problem using words in that they are localized to the country's language. That's better than forcing English everywhere but makes interoperability way more confusing. Finally there's all the problems people keep finding with their word lists; that's covered well by other writers so I won't go into it.

PS: I misspoke when I said Geohash is the oldest similar system. MGRS dates to the 1940s. There may be older ones, too.

4 years ago by wwalexander

A standard for global addressing should not rely on a proprietary algorithm or wordlist, as it means that there is no legal method for using the standard besides using (paying for) What3Words’ API. This fits the common definition of rent-seeking behavior.

This couples with the technological issues outlined in the article above. I believe the vitriol against W3W is fueled by the combination of rent-seeking behavior on a standard of poor technical quality.

I’m also bothered that one of the main use cases W3W’s marketing highlights is for specifying locations in emergencies, a totally non-commercial/public-good application. W3W presents itself convincingly to laymen as this sort of public-good open standard, but they’re just another startup with a proprietary product they’re hoping to monopolize.

4 years ago by myself248

Furthermore, there's a payola scheme behind the scenes, to make sure automakers implement it in their infotainment systems and stuff, which they hope will force others to license it.

It's a great idea, a mediocre implementation, and a set of business practices that make the antichrist jealous.

On top of that, their bloodthirsty pursuit of any criticism means that legitimate discussions of the system's flaws can't happen in the open. The lack of that criticism may have encouraged some services to adopt W3W without realizing its flaws, and there's already some hinting that such a flaw may have already interfered with actual rescue operations:

https://www.bbc.com/news/technology-56901363

Thankfully the rescue was able to proceed once they switched to another coordinate system, and that case didn't result in a fatality. How many other such cases are there? We might never know. The company may have literal blood on its hands at this point, but fixing the flaws would require discussing them first, which we can't do as long as they're empowered to shut down discourse.

4 years ago by jedberg

The hate is because they patented it. It is a useful idea, but it would be far more useful if the algorithm and wordlist were public domain and didn't require a license to use it.

4 years ago by howdoidothehack

There is an alternative to what3fucks https://www.fourkingmaps.co.uk/ idk how long it'll be up for though

4 years ago by micheljansen

I learned about pluscodes because of this and they seem quite elegant: https://maps.google.com/pluscodes/

4 years ago by ehsankia

The algorithm is also extremely simple, I've seen code golf with less than 100 char in most languages, and realistically just a couple lines of code.

I like that you can recursively add more precision after the plus, as well as truncate some of the start if you provide context like the country or city. For example all of Montreal lives within the "87Q8" block, so I can just truncate that away if I specify that I'm in Montreal.

4 years ago by wyager

Doesn’t the plus code algorithm rely on tiling the globe with a space filling curve? I’d be surprised if that could be done simply.

4 years ago by cyphar

No, the plus code system is purely grid based.

4 years ago by CA0DA

pluscodes are also known as "Open Location Code" and the github repo is: https://github.com/google/open-location-code

4 years ago by ianpenney

My personal opinion of W3W, while a cool experiment, has always been: over-engineered. To hear they're raising their hackles is disappointing.

From Amateur Radio: Maidenhead grid squares + NATO phonetic alphabet are technical but adequate for purpose. It's hard to confuse "Foxtrot November Zero Three" with "Delta Mike Zero Four" even if you're communicating over a VHF/UHF satellite flying overhead in LEO.

You can get more or less specific by adding or removing symbols. Eg: "EK64ab" or just "EK". It's a challenge to log radio contacts from all of the major [AZ##] grids. https://twitter.com/GridMasterHeat

4 years ago by maxehmookau

As an idea, what3words appears to exist only to make money for its owners.

The UK already has an open, easy to use, well documented and battle-tested way of defining any point in the country. Grid references.

What3Words doesn't actually solve a problem that isn't already solved.

4 years ago by gmac

Yes! The national mapping agency (Ordnance Survey) make the algorithm for highly accurate GPS conversion (OSTN02) available for free.

This is my implementation in C, for example: https://github.com/jawj/OSTN02c

Daily digest email

Get a daily email with the the top stories from Hacker News. No spam, unsubscribe at any time.

What3Words – The Algorithm