From: Anonymous <nowhere@bsu-cs>
To: cypherpunks@toad.com
Message Hash: aaa49b11559b58cf5cf54da33a60a98e8b1c8c6a7f43910216c32f770734b444
Message ID: <199405200013.TAA07260@bsu-cs.bsu.edu>
Reply To: N/A
UTC Datetime: 1994-05-20 00:14:17 UTC
Raw Date: Thu, 19 May 94 17:14:17 PDT
Subject: No Subject
MIME-Version: 1.0
Content-Type: text/plain
Newsgroups: sci.crypt,alt.security,alt.privacy
From: schneier@chinet.chinet.com (Bruce Schneier)
Subject: "Interesting Stuff" Checkers at the NSA
Message-ID: <Cq2934.q0@chinet.chinet.com>
Organization: Chinet - Public Access UNIX
Date: Thu, 19 May 1994 17:40:15 GMT
This is from a flyer that NSA people have been distributing:
NATIONAL SECURITY AGENCY -- TECHNOLOGY TRANSFER
Information Sorting and Retrieval by Language or Topic
Description: This technique is an extremely simple, fast,
completely general method of sorting and retrieving machine-
readable text according to language and/or topic. The
method is totally independent of the particular languages or
topics of interest, and relies for guidance solely upon
exemplars (e.g., existing documents, fragments, etc.)
provided by the user. It employs no dictionaries, keywords,
stoplists, stemmings, syntax, semantics, or grammar;
nevertheless, it is capable of distinguishing among closely
related topics (previously considered inseparable) in any
language, and it can do so even in text containing a great
many errors (typically 10 - 15% of all characters). The
technique can be quickly implemented in software on any
computer system, from microprocessor to supercomputer, and
can easily be implemented in inexpensive hardware as well.
It is directly scalable to very large data sets (millions of
documents).
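The flyer does not say how the technique works. One approach with similar properties, assumed here purely for illustration and not confirmed by the flyer, is to compare character n-gram frequency profiles: each text is reduced to counts of overlapping character sequences, and a new text is assigned to whichever exemplar has the most similar profile. The function names, the trigram size, and the sample exemplars below are all hypothetical.

```python
from collections import Counter
from math import sqrt


def ngram_profile(text, n=3):
    """Count overlapping character n-grams; no tokenizing, stemming, or stoplist."""
    text = " ".join(text.lower().split())  # normalize case and whitespace only
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine(p, q):
    """Cosine similarity between two n-gram count vectors (0.0 if either is empty)."""
    dot = sum(count * q[gram] for gram, count in p.items())
    norm = sqrt(sum(c * c for c in p.values())) * sqrt(sum(c * c for c in q.values()))
    return dot / norm if norm else 0.0


def classify(text, exemplars, n=3):
    """Return the label of the exemplar whose profile is closest to the text."""
    profile = ngram_profile(text, n)
    return max(exemplars,
               key=lambda label: cosine(profile, ngram_profile(exemplars[label], n)))


# Hypothetical exemplars; in practice these would be existing documents.
exemplars = {
    "english": "the quick brown fox jumps over the lazy dog and then runs through the forest",
    "german": "der schnelle braune fuchs springt ueber den faulen hund und rennt durch den wald",
}

# A query with several character errors, standing in for noisy text.
print(classify("teh dog runz throgh the forist with the fox", exemplars))
```

Because a single wrong character disturbs only the few n-grams that contain it, most of a profile survives a moderate error rate, which is one plausible reading of the flyer's claim about error tolerance.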
Commercial Application:
Language and topic-independent sorting and retrieval of
documents satisfying dynamic criteria defined only by
existing documents.
Clustering of topically related documents, with no
prior knowledge of the languages or topics that may be
present. If desired, this activity can automatically
generate document selectors.
Specialized sorting tasks, such as identification of
duplicate or near-duplicate documents in a large set.
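The near-duplicate task in the last item can be sketched with a standard technique, character shingling with Jaccard similarity. This is an assumption chosen for illustration, not something the flyer describes; the function names, the shingle length of 5, the 0.7 threshold, and the sample documents are all hypothetical.

```python
from itertools import combinations


def shingles(text, n=5):
    """Set of overlapping character n-grams after case and whitespace normalization."""
    text = " ".join(text.lower().split())
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def jaccard(a, b):
    """Size of the intersection over size of the union (1.0 for two empty sets)."""
    return len(a & b) / len(a | b) if a or b else 1.0


def near_duplicates(docs, threshold=0.7, n=5):
    """Return pairs of document ids whose shingle sets overlap at least `threshold`."""
    sets = {doc_id: shingles(text, n) for doc_id, text in docs.items()}
    return [(x, y) for x, y in combinations(sorted(sets), 2)
            if jaccard(sets[x], sets[y]) >= threshold]


# Hypothetical sample documents: "a" and "b" differ only in the final character.
docs = {
    "a": "The committee will meet on Tuesday to discuss the annual budget.",
    "b": "The committee will meet on Tuesday to discuss the annual budget!",
    "c": "Heavy rain is expected across the region over the weekend.",
}
print(near_duplicates(docs))
```

This sketch compares every pair of documents, so its cost grows quadratically with the size of the set; a collection of millions of documents would need some form of indexing or hashing that is not shown here.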
National Security Agency
Research and Technology Group - R
Office of Research and Technology Applications (ORTA)
9800 Savage Road
Fort George G. Meade, MD 20755-6000
(301) 688-0606
If this is the stuff they're giving out to the public, I can only
imagine what they're keeping for themselves.
Bruce
**************************************************************************
* Bruce Schneier
* Counterpane Systems For a good prime, call 391581 * 2^216193 - 1
* schneier@chinet.com
**************************************************************************