SPOJ 8222 NSUBSTR Substrings

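A simple application of the suffix automaton (SAM).

In a SAM, every path from the root to a node spells a substring of the original string, and the number of ways to reach an accepting state from that node equals the number of times that substring occurs. The accepting states are found by walking up the suffix links (fa) from the last node; a DP over the automaton then yields the occurrence count of every substring.

If a string of length x occurs k times, each of its shorter prefixes occurs at least k times as well, so F(i) >= F(i+1). The final pass therefore updates the answer for each shorter length with the answer for the next longer length.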
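The idea above can be sketched without recursion. The following is a minimal, self-contained sketch, not the post's own code: it assumes an index-based automaton stored in a `std::vector`, and the names `maxOccurrencesByLength`, `State`, `paths` and `best` are made up for illustration. It counts paths to accepting states by processing states in decreasing order of `len` (every transition leads to a state with a larger `len`), then applies the longer-to-shorter update.

```cpp
#include <algorithm>
#include <array>
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper (not from the post): returns F(1), ..., F(|s|)
// for a string s of lowercase Latin letters.
std::vector<int> maxOccurrencesByLength(const std::string& s) {
    struct State {
        int len = 0;                 // length of the longest string in the state
        int link = -1;               // suffix link, -1 for the root
        std::array<int, 26> next;    // transitions, -1 when absent
        State() { next.fill(-1); }
    };
    const int n = static_cast<int>(s.size());
    std::vector<State> st;
    st.reserve(2 * n + 1);
    st.emplace_back();               // state 0 is the root
    int last = 0;

    for (char ch : s) {
        assert(ch >= 'a' && ch <= 'z');
        const int c = ch - 'a';
        const int cur = static_cast<int>(st.size());
        st.emplace_back();
        st[cur].len = st[last].len + 1;
        int p = last;
        while (p != -1 && st[p].next[c] == -1) {
            st[p].next[c] = cur;
            p = st[p].link;
        }
        if (p == -1) {
            st[cur].link = 0;
        } else {
            const int q = st[p].next[c];
            if (st[q].len == st[p].len + 1) {
                st[cur].link = q;
            } else {
                // Split q: the clone keeps q's transitions and suffix link.
                const int clone = static_cast<int>(st.size());
                const State copy = st[q];
                st.push_back(copy);
                st[clone].len = st[p].len + 1;
                while (p != -1 && st[p].next[c] == q) {
                    st[p].next[c] = clone;
                    p = st[p].link;
                }
                st[q].link = clone;
                st[cur].link = clone;
            }
        }
        last = cur;
    }

    const int m = static_cast<int>(st.size());
    // Accepting states are exactly those on the suffix-link chain from last.
    std::vector<int> paths(m, 0);
    for (int v = last; v != -1; v = st[v].link) paths[v] = 1;

    // Counting sort of the states by len (ascending).
    std::vector<int> bucket(n + 2, 0), order(m);
    for (int v = 0; v < m; ++v) ++bucket[st[v].len];
    for (int l = 1; l <= n; ++l) bucket[l] += bucket[l - 1];
    for (int v = m - 1; v >= 0; --v) order[--bucket[st[v].len]] = v;

    // Longest states first, so every target of a transition is already final.
    std::vector<int> best(n + 1, 0);
    for (int i = m - 1; i >= 0; --i) {
        const int v = order[i];
        for (int c = 0; c < 26; ++c)
            if (st[v].next[c] != -1) paths[v] += paths[st[v].next[c]];
        best[st[v].len] = std::max(best[st[v].len], paths[v]);
    }
    // Longer lengths update shorter ones.
    for (int len = n - 1; len >= 1; --len)
        best[len] = std::max(best[len], best[len + 1]);
    return std::vector<int>(best.begin() + 1, best.end());
}
```

Calling `maxOccurrencesByLength("ababa")` yields the five values of the sample output. Processing states by decreasing `len` is one way to avoid a deep call stack on long inputs; other orderings that respect the transitions would work as well.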

Substrings

Time Limit: 1000MS

Memory Limit: Unknown

64bit IO Format: %lld & %llu

Description

You are given a string S which consists of 250000 lowercase latin letters at most. We define F(x) as the maximal number of times that some string with length x appears in S. For example for string 'ababa' F(3) will be 2 because there is a string 'aba' that occurs twice. Your task is to output F(i) for every i so that 1<=i<=|S|.

Input

String S consists of at most 250000 lowercase latin letters.

Output

Output |S| lines. On the i-th line output F(i).

Example

Input:
ababa

Output:
3
2
2
1
1
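To make the definition of F concrete, here is a small brute-force sketch that counts every substring directly. It is only a reference checker for short strings (it enumerates O(n^2) substrings) and is not the approach used in the solution; the name `bruteForceF` is hypothetical.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical reference checker (not from the post).
// Returns F(1), ..., F(|s|); suitable for small inputs only.
std::vector<int> bruteForceF(const std::string& s) {
    const std::size_t n = s.size();
    std::vector<int> f(n, 0);
    for (std::size_t len = 1; len <= n; ++len) {
        // Count occurrences of every substring of this length.
        std::unordered_map<std::string, int> seen;
        for (std::size_t i = 0; i + len <= n; ++i) {
            const int count = ++seen[s.substr(i, len)];
            f[len - 1] = std::max(f[len - 1], count);
        }
    }
    return f;
}
```

For 'ababa' this reproduces the sample: 'a' occurs 3 times, 'ab', 'ba' and 'aba' occur twice, and the two longest lengths occur once.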

Source

Immagination

#include <iostream>
#include <cstdio>
#include <cstring>
#include <algorithm>

using namespace std;

const int maxn=600000;

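// One state of the suffix automaton.
struct SAM_Node
{
    SAM_Node *fa,*next[26];   // fa: suffix link; next: transitions on 'a'..'z'
    int len,id,pos;           // len: longest string in this state; id: index in SAM_node; pos: end position set at creation
    SAM_Node(){}
    SAM_Node(int _len)
    {
        len=_len; fa=0;
        memset(next,0,sizeof(next));
    }
};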

SAM_Node SAM_node[maxn*2],*SAM_root,*SAM_last;
int SAM_size;

SAM_Node *newSAM_Node(int len)
{
    SAM_node[SAM_size]=SAM_Node(len);
    SAM_node[SAM_size].id=SAM_size;
    return &SAM_node[SAM_size++];
}

SAM_Node *newSAM_Node(SAM_Node *p)
{
    SAM_node[SAM_size]=*p;
    SAM_node[SAM_size].id=SAM_size;
    return &SAM_node[SAM_size++];
}

void SAM_init()
{
    SAM_size=0;
    SAM_root=SAM_last=newSAM_Node(0);
    SAM_node[0].pos=0;
}

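// Extend the automaton with character x; len is the new prefix length.
void SAM_add(int x,int len)
{
    SAM_Node *p=SAM_last,*np=newSAM_Node(p->len+1);
    np->pos=len; SAM_last=np;
    // Add the transition on x along the suffix links until one already exists.
    for(;p&&!p->next[x];p=p->fa)
        p->next[x]=np;
    if(!p)
    {
        // x never appeared after any suffix: link to the root.
        np->fa=SAM_root;
        return ;
    }
    SAM_Node *q=p->next[x];
    if(q->len==p->len+1)
    {
        // q already represents exactly the needed suffix.
        np->fa=q;
        return ;
    }
    // Otherwise split q: nq is a copy of q with the shorter length.
    SAM_Node *nq=newSAM_Node(q);
    nq->len=p->len+1;
    q->fa=nq; np->fa=nq;
    // Redirect the transitions that pointed to q.
    for(;p&&p->next[x]==q;p=p->fa)
        p->next[x]=nq;
}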

char str[maxn];
bool vis[maxn];
int r[maxn],dp[maxn];

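// Memoized DFS: r[id] = number of paths from p to an accepting state.
// Note: the recursion depth can approach |S| in the worst case.
int Find(SAM_Node *p)
{
    if(vis[p->id]==true)
        return r[p->id];
    vis[p->id]=true;
    r[p->id]=0;
    for(int i=0;i<26;i++)
    {
        if(p->next[i])
            r[p->id]+=Find(p->next[i]);
    }
    return r[p->id];
}

// Compute the occurrence count r[] of every state.
void get_count()
{
    memset(r,0,sizeof(r));
    memset(vis,0,sizeof(vis));
    SAM_Node *p=SAM_last;
    vis[p->id]=true;
    // The accepting states are the suffix-link chain starting at SAM_last.
    // They are visited from longest to shortest, so every accepting state
    // reachable from p has already received its +1.
    for(;p;p=p->fa)
    {
        Find(p);
        r[p->id]++;   // the empty path: p itself is accepting
    }
}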

int main()
{
    while(scanf("%s",str)!=EOF)
    {
        int n=strlen(str);
        SAM_init();
        for(int i=0;i<n;i++)
            SAM_add(str[i]-'a',i+1);
        get_count();
        memset(dp,0,sizeof(dp));
        // dp[len] = best count among states whose longest string has length len
        for(int i=0;i<SAM_size;i++)
        {
            dp[SAM_node[i].len]=max(dp[SAM_node[i].len],r[SAM_node[i].id]);
        }
        // longer lengths update shorter ones
        for(int i=n-1;i>=1;i--)
        {
            dp[i]=max(dp[i],dp[i+1]);
        }
        for(int i=1;i<=n;i++)
        {
            printf("%d\n",dp[i]);
        }
    }
    return 0;
}

Time: 2024-12-25 21:04:59
