Re: [math-fun] Lexical distance?
Besides comparing identical places in two strings, one might also care how close they are to being rearrangements of each other. E.g., one might want to consider XYXYXY and YXYXYX much closer than XYXYXY and ZZZZZZ, even though both pairs differ in every place.
Assuming finite strings from a finite alphabet in which all pairs of letters are equally "distant", one might define lexical distance as the fewest "moves" to go from one string to another, where a move is either 1) the transposition T of two adjacent string elements, or 2) the substitution S of one letter by another in the same place. In general the two types of moves would be weighted differently, depending on the application.
D = p*(#T) + (1-p)*(#S)
For an alphabet where letters have various distances, this could be changed slightly to reflect that.
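Dan's distance can be computed directly for short strings by treating each string as a node and running Dijkstra's algorithm over the two move types. This brute-force sketch (the function name and cost convention are mine, not anything standard) explores all strings of the given length, so it is exponential and only a way to experiment with small cases:

```python
import heapq

def lexical_distance(a, b, alphabet, p=0.5):
    """Minimum total cost to turn a into b, where an adjacent
    transposition T costs p and a single-letter substitution S costs
    1 - p.  Brute-force Dijkstra over the string state space, so only
    practical for short strings and small alphabets."""
    if len(a) != len(b):
        raise ValueError("T and S both preserve length")
    dist = {a: 0.0}
    heap = [(0.0, a)]
    while heap:
        d, s = heapq.heappop(heap)
        if s == b:
            return d
        if d > dist[s]:
            continue  # stale heap entry
        moves = []
        # 1) adjacent transpositions, cost p
        for i in range(len(s) - 1):
            if s[i] != s[i + 1]:
                moves.append((p, s[:i] + s[i + 1] + s[i] + s[i + 2:]))
        # 2) substitutions, cost 1 - p
        for i, c0 in enumerate(s):
            for c in alphabet:
                if c != c0:
                    moves.append((1 - p, s[:i] + c + s[i + 1:]))
        for cost, t in moves:
            if d + cost < dist.get(t, float("inf")):
                dist[t] = d + cost
                heapq.heappush(heap, (d + cost, t))
    return float("inf")
```

With p = 0.5 this gives 1.5 for XYXYXY vs. YXYXYX (three adjacent transpositions) but 3.0 for XYXYXY vs. ZZZZZZ (six substitutions), matching the intuition above that the first pair is much closer.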
--Dan
Marc wrote:
<< A very rough kind of "lexical distance" between strings (possibly infinite) is just the number of symbols by which they differ.
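For equal-length strings, the quoted measure is the Hamming distance; a minimal sketch:

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x != y for x, y in zip(a, b))
```

Note that it rates both pairs above equally far apart: hamming("XYXYXY", "YXYXYX") and hamming("XYXYXY", "ZZZZZZ") are both 6, which is exactly the defect Dan's weighted-move distance addresses.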
_____________________________________________________________________ "It don't mean a thing if it ain't got that certain je ne sais quoi." --Peter Schickele
_______________________________________________ math-fun mailing list math-fun@mailman.xmission.com http://mailman.xmission.com/cgi-bin/mailman/listinfo/math-fun
These are cool string metrics (thanks, I'll remember them!) but they don't scale to infinite strings for the application I envision.

OK, here's a sketch of something a lot like what I'm after: Usually when you sum a series, for pi, say, the leftmost digits tend to settle down first. Consequently, the approximations move closer in distance to the limit, and we say it converges in the familiar delta-epsilon way. Now instead of summing a series, imagine a process where the digits are all fizzing around, maybe even chaotically, but as time goes by a greater percentage (whatever that means) match the digits of the limit value. For example, imagine each bit gets randomly flipped from its correct value with decreasing frequency.

That feels a lot like what I mean intuitively by convergence, but it's all a bit sketchy. So I'm interested in learning of relevant prior art, best practices, or simply better ways to talk about this.
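One way to make the sketch concrete is a toy simulation (the names and the 1/(t + 2) flip schedule are my own assumptions, nothing standard): flip each bit of the limit value independently with a probability that shrinks over time, so that no prefix ever settles permanently, yet the fraction of correct bits tends to 1.

```python
import random

def fizzing_states(limit_bits, steps, seed=0):
    """Yield successive noisy views of limit_bits.  At step t each bit
    is flipped independently with probability 1/(t + 2), so flips never
    stop entirely but become ever rarer."""
    rng = random.Random(seed)
    for t in range(steps):
        p_flip = 1.0 / (t + 2)
        yield [b ^ (rng.random() < p_flip) for b in limit_bits]

def match_fraction(state, limit_bits):
    """Fraction of positions agreeing with the limit -- Marc's
    'greater percentage match' notion of closeness."""
    hits = sum(s == b for s, b in zip(state, limit_bits))
    return hits / len(limit_bits)
```

Running this on, say, 10,000 bits shows the match fraction climbing from about 1/2 toward 1, even though the leftmost bits keep flickering forever, so the sequence never converges in the usual per-position (delta-epsilon) sense.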
="Tom Karzes" <karzes@sonic.net>
This reminds me of a very well-known distance measure: The one used by the game Mastermind (or Bulls and Cows, or MIT folks might remember it from the old computer program Moo). Here, distance is given by a pair of numbers: The first is the number of symbols that match and are in the same position, and the second is the number of remaining symbols that match but are not in the same position. There are of course many ways in which these two numbers could be combined to produce a single number.
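Tom's pair of numbers can be computed with a multiset intersection over the non-matching positions; a short sketch (the function name is mine):

```python
from collections import Counter

def mastermind_distance(guess, secret):
    """Return (exact, misplaced): exact counts symbols matching in the
    same position; misplaced counts the remaining symbols present in
    both strings but in different positions (multiset overlap)."""
    exact = sum(g == s for g, s in zip(guess, secret))
    # multiset overlap of the leftovers at non-matching positions
    rest_g = Counter(g for g, s in zip(guess, secret) if g != s)
    rest_s = Counter(s for g, s in zip(guess, secret) if g != s)
    misplaced = sum((rest_g & rest_s).values())
    return exact, misplaced
```

On Dan's example it behaves as desired: XYXYXY vs. YXYXYX scores (0, 6) — every symbol is present, just displaced — while XYXYXY vs. ZZZZZZ scores (0, 0).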
Tom
Dan Asimov writes: [message quoted in full above]
participants (3)
- Dan Asimov
- Marc LeBrun
- Tom Karzes