PepperText: Text Repetition Finder

How Does It Work?

How The Text Repetition Finder Works

Segments

The Text Repetition Finder looks in the provided text for text segments that occur more than once.

Segments can be of any length (made up of single words, word pairs, triples, etc). There is no maximum length. This allows the finder to extract multi-word terms instead of only finding individual words without their context.

Example

The below block of text shows the repetitions found by the finder. The results are listed with the repetition count shown next to them.

Subset Segments

The finder will ignore segments that are subsets of larger segments.

For example, if the term 'abdominal aortic aneurysm' occurs multiple times in a text, the results will not include the subset repetitions (shown in grey).

Exception

An exception is made for subset segments that also exist elsewhere in the text, not just as part of a larger segment. They are then included in results with a count reflecting all instances of the segment.

For example, if the term 'abdominal aortic aneurysm' occurs multiple times, but the term 'abdominal' also occurs on its own several times, the results would look more like this (greyed out results still being ignored):

Finer Details

Minimum Segment Character Length

You can raise the Minimum Segment Character Length to avoid detecting short segments. For example, you might not care about segments of short words such as he, the, an, are, I, etc.

Segment Length for Text Expanders

If you're using the finder to get ideas for your text expander, focus only on segment lengths that are worth automating. We recommended setting the segment length to 7, adjusting up or down depending on your typing speed.

Case Sensitivity

The finder ignores case, so 'repetition', 'Repetition', and 'REPETITION' are considered identical by the finder.

Special Characters

Special characters are replaced with a space prior to repetition-finding, so 'clear-cut' and 'clear cut' are considered identical by the finder.

Privacy

The Text Repetition Finder runs within your browser and does not send text or information over the internet for processing. Your text stays private to you.