From: Anonymous <nowhere@bsu-cs>
 To: cypherpunks@toad.com
 Message Hash: aaa49b11559b58cf5cf54da33a60a98e8b1c8c6a7f43910216c32f770734b444
 Message ID: <199405200013.TAA07260@bsu-cs.bsu.edu>
 Reply To: N/A
 UTC Datetime: 1994-05-20 00:14:17 UTC
 Raw Date: Thu, 19 May 94 17:14:17 PDT
From: Anonymous <nowhere@bsu-cs>
Date: Thu, 19 May 94 17:14:17 PDT
To: cypherpunks@toad.com
Subject: No Subject
Message-ID: <199405200013.TAA07260@bsu-cs.bsu.edu>
MIME-Version: 1.0
Content-Type: text/plain
Newsgroups: sci.crypt,alt.security,alt.privacy
From: schneier@chinet.chinet.com (Bruce Schneier)
Subject: "Interesting Stuff" Checkers at the NSA
Message-ID: <Cq2934.q0@chinet.chinet.com>
Organization: Chinet - Public Access UNIX
Date: Thu, 19 May 1994 17:40:15 GMT
This is from a flyer that NSA people have been distributing:
     NATIONAL SECURITY AGENCY --  TECHNOLOGY TRANSFER
     Information Sorting and Retrieval by Language or Topic
     Description:  This technique is an extremely simple, fast,
     completely general mathod of sorting and retrieving machine-
     readable text according to language and/or topic.  The
     method is totally independent of the particular languages or
     topics of interest, and relies for guidance solely upon
     exemplars (e.g., existing documents, fragments, etc.)
     provided by the user.  It employs no dictionaries keywords,
     stoplists, stemmings, syntax, semantics, or grammar;
     nevertheless, it is capable of distinguishing among closely
     related toopics (previously considered inseparable) in any
     language, and it can do so even in text containing a great
     many errors (typically 10 - 15% of all characters).  The
     technique can be quickly implemented in software on any
     computer system, from microprocessor to supercomputer, and
     can easily be implemented in inexpensive hardware as well. 
     It is directly scalable to very large data sets (millions of
     documents).
     Commercial Application:
          Language and topic-independent sorting and retieval of
          documents satisfying dynamic criteria defined only by
          existing documents.
          Clustering of topically related documents, with no
          prior knowledge of the languages or topics that may be
          present.  It desired, this activity can automatically
          generate document selectors.
          Specializing sorting tasks, such as identification of
          duuplicate or near-duplicate documents in a large set.
     National Security Agency
     Research and Technology Group - R
     Office of Research and Technology Applications (ORTA)
     9800 Savage Road
     Fort George G. Meade, MD  20755-6000
     (301) 688-0606
If this is the stuff they're giving out to the public, I can only
imagine what they're keeping for themselves.
Bruce
**************************************************************************
* Bruce Schneier
* Counterpane Systems         For a good prime, call 391581 * 2^216193 - 1
* schneier@chinet.com
**************************************************************************
Return to May 1994
Return to “Anonymous <nowhere@bsu-cs>”
1994-05-20 (Thu, 19 May 94 17:14:17 PDT) - No Subject - Anonymous <nowhere@bsu-cs>