Path: csiph.com!fu-berlin.de!uni-berlin.de!not-for-mail
From: Serhiy Storchaka <storchaka@gmail.com>
Newsgroups: comp.lang.python
Subject: Re: Detecting repeated subsequences of identical items
Date: Thu, 21 Apr 2016 14:56:07 +0300
Lines: 14
Message-ID: <mailman.10.1461239778.23626.python-list@python.org>
References: <571843f9$0$1585$c3e8da3$5496439d@news.astraweb.com> <nfaf4n$l8k$1@ger.gmane.org>
Mime-Version: 1.0
Content-Type: text/plain; charset=windows-1252; format=flowed
Content-Transfer-Encoding: 7bit
User-Agent: Mozilla/5.0 (X11; Linux i686; rv:38.0) Gecko/20100101 Thunderbird/38.6.0
In-Reply-To: <571843f9$0$1585$c3e8da3$5496439d@news.astraweb.com>
Precedence: list
Xref: csiph.com comp.lang.python:107448

On 21.04.16 06:07, Steven D'Aprano wrote:
> Now I want to group subsequences. For example, I have:
>
> "ABCABCABCDEABCDEFABCABCABCB"
>
> and I want to group it into repeating subsequences.

[...]

> How can I do this? Does this problem have a standard name and/or solution?

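This is part of what lossless data compression algorithms do: they find
repeated subsequences and replace them with short references. See, for
example, the Lempel-Ziv (LZ) family and LZW.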
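
To give a rough idea of how LZW picks up repeats, here is a minimal sketch
in Python. It is only an illustration, not a tuned implementation and not
a direct answer to the grouping question. The function names lzw_compress
and lzw_decompress are mine, and I assume the input only contains
characters with code points below 256. Each time a phrase that is not yet
in the table shows up, it gets a new code, so repeated runs such as "ABC"
end up being emitted as single codes.

```python
def lzw_compress(text):
    """Return a list of integer codes for text.

    Assumption: every character has a code point below 256.
    """
    # Start with one table entry per single character.
    table = {chr(i): i for i in range(256)}
    phrase = ""
    codes = []
    for ch in text:
        candidate = phrase + ch
        if candidate in table:
            # Keep extending the current phrase while it is already known.
            phrase = candidate
        else:
            # Emit the longest known phrase and remember the new one.
            codes.append(table[phrase])
            table[candidate] = len(table)
            phrase = ch
    if phrase:
        codes.append(table[phrase])
    return codes


def lzw_decompress(codes):
    """Rebuild the original string from a list of LZW codes."""
    if not codes:
        return ""
    table = {i: chr(i) for i in range(256)}
    phrase = table[codes[0]]
    out = [phrase]
    for code in codes[1:]:
        if code in table:
            entry = table[code]
        elif code == len(table):
            # The code refers to the phrase that is being defined right now.
            entry = phrase + phrase[0]
        else:
            raise ValueError("invalid LZW code: %r" % code)
        out.append(entry)
        # Mirror the table growth done by the compressor.
        table[len(table)] = phrase + entry[0]
        phrase = entry
    return "".join(out)


if __name__ == "__main__":
    sample = "ABCABCABCDEABCDEFABCABCABCB"
    packed = lzw_compress(sample)
    print(packed)
    print(lzw_decompress(packed) == sample)
```

Codes of 256 and above stand for phrases that have been seen before, so
looking at where they appear in the output shows which subsequences were
found to repeat.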