Re: [math-fun] [EXTERNAL] ISO a perfect pangram

3 Aug 2017

      Better algorithm:
No need to restrict the word list, gets a full search, fairly fast, no tree search.

Divide words into two sets, those with a A, and those without.
Form all legal combinations of each set.
(Treat the words with earliest letter B as a group, attempting merge with A list
and with (1 element, no bits, no words).   Then merge in the Cs, etc.
  This avoids trying to form the power set directly.)
No list can have more than 2^25 elements.

Sort the two lists.  Reverse the no-A list.

Do a "merge" of the two lists, looking for an arithmetic sum of exactly 2^26-1.
Record successes.  Either redo the run to notice the ingredients in the matches,
or, in the original list build, record the words or bit patterns for each item.

Rich

________________________________________
From: math-fun [math-fun-bounces+rschroe=sandia.gov@mailman.xmission.com] on behalf of Keith F. Lynch [kfl@KeithLynch.net]
Sent: Wednesday, August 2, 2017 10:46 PM
To: math-fun@mailman.xmission.com
Subject: [EXTERNAL] [math-fun] ISO a perfect pangram

Bill Gosper's recent anagrams reminded me of my search for a perfect
pangram, i.e. a sentence in English containing every letter once
and only once.  For instance, "Cwm fjord bank glyphs vext quiz" and
"Mr. Jock, TV quiz PhD, bags few lynx," neither of which is really
satisfactory.

My approach was to find a complete list of words, discard any words
containing duplicate letters (e.g. "containing," as it contains two
"n"s (and also two "i"s)), turn each of them into a 32-bit integer
which indicates which letters are present ("of" becomes 2^4 + 2^13 as
it contains the 5th and 14th letters).  Anagrams are merged (e.g. the
numbers for "opts," "post," "pots," "spot," "stop," and "tops" are
the same).  A depth-first tree search is then done during which the
numbers are ORed together.  If an AND gets a non-zero result, the
branch is immediately abandoned.  If 2^26-1 is reached, the result is
logged, so that I can manually try to arrange the words (or anagrams
of them) into a coherent sentence.

I tried this about 30 years ago, on a 286.  Needless to say, I didn't
get anywhere.  It probably would have run for eons.

I'm considering trying again, on the same general plan, but only
with the most common ten thousand words.  Better yet, the most
common thousand words containing "a", the most common thousand words
containing "b", ..., and the most common thousand words containing
"z," with the many duplicates merged, for a total of perhaps ten
thousand.  I might also include the most common thousand words *not*
containing "a", the most common thousand words not containing "b",
..., and the most common thousand words not containing "z."  Again,
duplicates would be merged.  And words that are anagrams would move
up in the rankings as the usage of each anagram would be summed as
if they were all the same word.

With memory as cheap as it is, I might also include every *pair* of
words in the list as a word, even though that would increase memory
usage from 40 kilobytes to 400 megabytes.

Has anyone tried this already?  Does anyone have a better algorithm?
Thanks.

_______________________________________________
math-fun mailing list
math-fun@mailman.xmission.com
https://mailman.xmission.com/cgi-bin/mailman/listinfo/math-fun

Re: [math-fun] [EXTERNAL] ISO a perfect pangram

Schroeppel, Richard