text-similarity-cli

Compare any two text files and get a detailed similarity score using three algorithms. Zero dependencies.

install

$ pip install text-similarity-cli

Three algorithms, one score

Counts the minimum character edits needed to transform one string into another. Best for short texts and code.

Character-level

Compares unique word sets. Score = intersection / union. Best for keyword overlap and topic similarity.

Token-level

Builds term-frequency vectors and measures the angle between them. Best for longer documents and essays.

Vector-level

Levenshtein 72.4%

Jaccard 68.1%

Cosine 81.3%

Average 73.9%

Verdict: Highly similar

Flag	Default	Description
`--algo`	all	levenshtein, jaccard, cosine, or all
`--json`	off	Output results as JSON
`--no-color`	off	Disable ANSI color output
`--threshold N`	none	Exit code 1 if average score is below N%
`--version`		Print version and exit