1996-07-09 - Re: Word lists for passphrases

Header Data

From: “Mark M.” <markm@voicenet.com>
To: “Igor Chudov @ home” <ichudov@algebra.com>
Message Hash: 52a4420afa4f0d488b2a4c938794f051b57e79bf9b930df3908776b00a53c0e9
Message ID: <Pine.LNX.3.94.960709010857.1246B-100000@gak>
Reply To: <199607090210.VAA07394@manifold.algebra.com>
UTC Datetime: 1996-07-09 08:08:16 UTC
Raw Date: Tue, 9 Jul 1996 16:08:16 +0800

Raw message

From: "Mark M." <markm@voicenet.com>
Date: Tue, 9 Jul 1996 16:08:16 +0800
To: "Igor Chudov @ home" <ichudov@algebra.com>
Subject: Re: Word lists for passphrases
In-Reply-To: <199607090210.VAA07394@manifold.algebra.com>
Message-ID: <Pine.LNX.3.94.960709010857.1246B-100000@gak>
MIME-Version: 1.0
Content-Type: text/plain


On Mon, 8 Jul 1996, Igor Chudov @ home wrote:

> Ben Holiday wrote:
> > If you have access to a shell, and to the news spool, you can generate
> > some quick lists by hopping into the directory of any newsgroup that
> > interests you and doing:
> > 
> > cat * | tr -cs A-Za-z '\n' | tr A-Z a-z | sort | uniq > my-big-ol-wordlist
> > 
> > With most unixes that will generate an alphabetized list of all the unique
> > words in your source text, converted to lowercase. I've had some problems
> > with tr on a few machines, however. Adding a '-c' after 'uniq' will tell
> > you how many times each word occured (useful for grepping out words that
> > appear too infrequently, or too frequently) .. 
> Actually I am fairly sure that your selection of words will be mediocre
> at best. There are words (such as nethermost, insatiable, insufferable)
> that are almost never used in news.

According to Altavista:

nethermost   - 45
insatiable   - 200
insufferable - 200

I know I have too much free time.

- -- Mark

markm@voicenet.com              | finger -l for PGP key 0xe3bf2169
http://www.voicenet.com/~markm/ | d61734f2800486ae6f79bfeb70f95348
"Freedom is the freedom to say that two plus two make four.  If that
is granted, all else follows."  --George Orwell, _1984_

Version: 2.6.3
Charset: noconv