Date: 10/10/2018 23:38:06
From: party_pants
ID: 1287385
Subject: Human Word Population

There are an estimated 20,182,000 words in all languages combined. Since some languages contain the same word (sometimes with a totally different meaning) this drops down to about 15,000,000 unique words.

Reply Quote

Date: 10/10/2018 23:40:54
From: sarahs mum
ID: 1287386
Subject: re: Human Word Population

party_pants said:


There are an estimated 20,182,000 words in all languages combined. Since some languages contain the same word (sometimes with a totally different meaning) this drops down to about 15,000,000 unique words.

and slang?

Reply Quote

Date: 10/10/2018 23:56:08
From: Arts
ID: 1287390
Subject: re: Human Word Population

http://shakespeare-online.com/biography/wordsinvented.html

“The English language owes a great debt to Shakespeare. He invented over 1700 of our common words by changing nouns into verbs, changing verbs into adjectives, connecting words never before used together, adding prefixes and suffixes, and devising words wholly original. “

https://io9.gizmodo.com/no-william-shakespeare-did-not-really-invent-1-700-eng-1700049586?IR=T

“no”

Reply Quote

Date: 11/10/2018 00:07:54
From: Michael V
ID: 1287394
Subject: re: Human Word Population

In Jamaica, one of the words for “poofter” is “chichiman”. In Djubagai (a north-eastern Australian language) the word is “gigiman”. They are not-nice words in either language. Interesting, nonetheless.

Also, they are homophones…

Reply Quote

Date: 11/10/2018 02:36:02
From: mollwollfumble
ID: 1287463
Subject: re: Human Word Population

There are more than 5 million acronyms and abbreviations, according to https://www.acronymfinder.com/

With more than 1,000,000 definitions, Acronym Finder is the world’s largest and most comprehensive dictionary of acronyms, abbreviations, and initialisms. Combined with the Acronym Attic, Acronym Finder contains more than 5 million acronyms and abbreviations.

Information Technology (IT)
Information technology, Internet/Web, telecommunications, computing & computer science, hardware, software, etc. (over 86,000 definitions)
Examples: AJAX, CMM, DHCP, FTP, HTTP, PDA, RSS, SDK, TCP, WWW

Military & Government
Local, national and international governments, military, defense, defense industry, weapons systems, etc. (over 154,000 definitions)
Examples: DoD, ICBM, ICE, NHS, MoD, NOAA, NSA, OSHA, NZQA

Business & Finance
Business, finance, accounting, marketing, real estate, shipping, companies, stock markets, products, etc. (over 76,000 definitions)
Examples: BOE, CEO, EBIDTA, FOB, GAAP, IKEA, IPO, MLS, P&L, TVM

Science & Medicine
Popular science, hard science, medicine, nature, engineering, physics, space, astronomy, geology, chemistry, etc. (over 145,000 definitions)
Examples: ACL, DNA, HEPA, LASER, MRI, PTFE, SSRI, TIA, TENS, VOC

Organizations & Schools
Local, national, and international organizations, schools, colleges, universities, education, non-profits, NGOs, etc. (over 195,000 definitions)
Examples: ALA, ANWB, BBB, IEEE, MoMA, NEA, UCLA, UN, WTO

Slang & Pop Culture
Slang, chat, instant messaging, newsgroups, sports, people, pop culture, etc. (over 41,000 definitions)
Examples: AFAIK, BRB, IIRC, IMHO, JFK, LOL, MVP, RBI, ROFL

Reply Quote

Date: 11/10/2018 03:08:15
From: mollwollfumble
ID: 1287464
Subject: re: Human Word Population

GitHub is a text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion

> There are more than 5 million acronyms and abbreviations, according to https://www.acronymfinder.com/

I once made a list of all the words in the English version of wikipedia.
Preparatory work for making a good spell-checker.
Hold on, found it.
The word list from the English wikipedia (text file) is 49 GB.
After sorting, this reduces to a 310 MB text file.
After extracting only those words that appear 100 or more times in English wikipedia, I get 9.23 MB containing 593,598 words.

So my sorted word list from wikipedia must have a length of something like 593,598 * 310 / 9.23 = 20 million words.

The longest word lists available on the web are maintained by hackers for the purpose of cracking passwords. Or, as they put it “password recovery”. They can be a lot bigger than 20 million words.

Reply Quote

Date: 11/10/2018 03:41:54
From: mollwollfumble
ID: 1287472
Subject: re: Human Word Population

mollwollfumble said:


GitHub is a text file containing 479k English words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion

> There are more than 5 million acronyms and abbreviations, according to https://www.acronymfinder.com/

I once made a list of all the words in the English version of wikipedia.
Preparatory work for making a good spell-checker.
Hold on, found it.
The word list from the English wikipedia (text file) is 49 GB.
After sorting, this reduces to a 310 MB text file.
After extracting only those words that appear 100 or more times in English wikipedia, I get 9.23 MB containing 593,598 words.

So my sorted word list from wikipedia must have a length of something like 593,598 * 310 / 9.23 = 20 million words.

The longest word lists available on the web are maintained by hackers for the purpose of cracking passwords. Or, as they put it “password recovery”. They can be a lot bigger than 20 million words.

Words starting with “z” that appear exactly 100 times in the english version of wikipedia.

Reply Quote

Date: 11/10/2018 07:02:25
From: The Rev Dodgson
ID: 1287475
Subject: re: Human Word Population

party_pants said:


There are an estimated 20,182,000 words in all languages combined. Since some languages contain the same word (sometimes with a totally different meaning) this drops down to about 15,000,000 unique words.

I’m surprised there are only about 15,000,000 words.

That’s not even enough for one each in Australia.

I wonder how many words the most multi-lingual person in the world knows.

Reply Quote

Date: 11/10/2018 09:48:06
From: mollwollfumble
ID: 1287506
Subject: re: Human Word Population

Reply Quote