The Text Repetition Finder looks in the provided text for text segments that occur more than once.
Segments can be of any length (made up of single words, word pairs, triples, etc). There is no maximum length. This allows the finder to extract multi-word terms instead of only finding individual words without their context.
The below block of text shows the repetitions found by the finder. The results are listed with the repetition count shown next to them.
The finder will ignore segments that are subsets of larger segments.
For example, if the term 'abdominal aortic aneurysm' occurs multiple times in a text, the results will not include the subset repetitions (shown in grey).
An exception is made for subset segments that also exist elsewhere in the text, not just as part of a larger segment. They are then included in results with a count reflecting all instances of the segment.
For example, if the term 'abdominal aortic aneurysm' occurs multiple times, but the term 'abdominal' also occurs on its own several times, the results would look more like this (greyed out results still being ignored):
You can raise the Minimum Segment Length to avoid detecting short segments. For example, you might not care about segments of short words such as he, the, an, are, I, etc.
If you're using the finder to get ideas for your text expander, focus only on segment lengths that are worth automating. We recommended setting the segment length to 7, adjusting up or down depending on your typing speed.
The finder ignores case, so 'repetition' and 'Repetition' are considered identical by the finder.
Special characters are replaced with a space prior to repetition-finding, so 'clear-cut' and 'clear cut' are considered identical by the finder.
The Text Repetition Finder runs within your browser and does not send text or information over the internet for processing. Your text stays private to you.