HDOJ 4416 Good Article Good sentence

题解转自:http://blog.csdn.net/dyx404514/article/details/8807440

2012杭州网络赛的一道题,后缀数组后缀自己主动机都行吧。

题目大意:给一个字符串S和一系列字符串T1~Tn,问在S中有多少个不同子串满足它不是T1~Tn中随意一个字符串的子串。

思路:我们先构造S的后缀自己主动机,然后将每个Ti在S的SAM上做匹配,类似于LCS,在S中的每个状态记录一个变量deep,表示T1~Tn,在该状态能匹配的最大长度是多少,将每个Ti匹配完之后,我们将S的SAM做拓扑排序,自底向上更新每个状态的deep,同一时候计算在该状态上有多少个子串满足题目要求。详细过程例如以下:

1:对于当前状态,设为p,设p的par为q,则更新q->deep为q->deep和p->deep中的较大值。

2:若p->deep<p->val,则表示在状态p中,长度为p->deep+1~p->val的子串不是T1~Tn中随意字符串的子串,所以答案加上p->val-p->deep。否则表示状态p中全部字串均不满足要求,跳过就可以。

(注意若p->deep==0,表示状态p中全部的子串均满足题目要求,可是答案不是加上p->val-0,而是加上 p->val-p->par->val,这表示状态p中的字符串个数,所以对于p->deep==0要特殊处理)

最后输出答案就可以。

Good Article Good sentence

Time Limit: 6000/3000 MS (Java/Others)    Memory Limit: 32768/32768 K (Java/Others)

Total Submission(s): 2112    Accepted Submission(s): 587

Problem Description

In middle school, teachers used to encourage us to pick up pretty sentences so that we could apply those sentences in our own articles. One of my classmates ZengXiao Xian, wanted to get sentences which are different from that of others, because he thought the
distinct pretty sentences might benefit him a lot to get a high score in his article.

Assume that all of the sentences came from some articles. ZengXiao Xian intended to pick from Article A. The number of his classmates is n. The i-th classmate picked from Article Bi. Now ZengXiao Xian wants to know how many different sentences she could pick
from Article A which don‘t belong to either of her classmates?Article. To simplify the problem, ZengXiao Xian wants to know how many different strings, which is the substring of string A, but is not substring of either of string Bi. Of course, you will help
him, won‘t you?

Input

The first line contains an integer T, the number of test data.

For each test data

The first line contains an integer meaning the number of classmates.

The second line is the string A;The next n lines,the ith line input string Bi.

The length of the string A does not exceed 100,000 characters , The sum of total length of all strings Bi does not exceed 100,000, and assume all string consist only lowercase characters ‘a‘ to ‘z‘.

Output

For each case, print the case number and the number of substrings that ZengXiao Xian can find.

Sample Input

3
2
abab
ab
ba
1
aaa
bbb
2
aaaa
aa
aaa

Sample Output

Case 1: 3
Case 2: 3
Case 3: 1

Source

2012 ACM/ICPC Asia Regional Hangzhou Online

#include <iostream>
#include <cstdio>
#include <cstring>
#include <algorithm>

using namespace std;

const int maxn=110000;

struct SAM_Node
{
    SAM_Node *fa,*next[26];
    int len,id,pos;
    SAM_Node(){}
    SAM_Node(int _len)
    {
        len=_len;
        fa=0; memset(next,0,sizeof(next));
    }
};

SAM_Node SAM_node[maxn*2],*SAM_root,*SAM_last;
int SAM_size;

SAM_Node *newSAM_Node(int len)
{
    SAM_node[SAM_size]=SAM_Node(len);
    SAM_node[SAM_size].id=SAM_size;
    return &SAM_node[SAM_size++];
}

SAM_Node *newSAM_Node(SAM_Node *p)
{
    SAM_node[SAM_size]=*p;
    SAM_node[SAM_size].id=SAM_size;
    return &SAM_node[SAM_size++];
}

void SAM_init()
{
    SAM_size=0;
    SAM_root=SAM_last=newSAM_Node(0);
    SAM_node[0].pos=0;
}

void SAM_add(int x,int len)
{
    SAM_Node *p=SAM_last,*np=newSAM_Node(p->len+1);
    np->pos=len; SAM_last=np;
    for(;p&&!p->next[x];p=p->fa)
        p->next[x]=np;
    if(!p)
    {
        np->fa=SAM_root;
        return ;
    }
    SAM_Node *q=p->next[x];
    if(q->len==p->len+1)
    {
        np->fa=q;
        return ;
    }
    SAM_Node *nq=newSAM_Node(q);
    nq->len=p->len+1;
    q->fa=nq; np->fa=nq;
    for(;p&&p->next[x]==q;p=p->fa)
        p->next[x]=nq;
}

char A[maxn],B[maxn];
int c[maxn*2],LCS[maxn*2];
SAM_Node *top[maxn*2];

int main()
{
    int T_T,T,cas=1;
    scanf("%d",&T_T);
    while(T_T--)
    {
        scanf("%d",&T);
        scanf("%s",A);
        int len=strlen(A);
        SAM_init();
        for(int i=0;i<len;i++)
            SAM_add(A[i]-'a',i+1);

        memset(c,0,sizeof(c));
        memset(LCS,0,sizeof(LCS));
        memset(top,0,sizeof(top));

        for(int i=0;i<SAM_size;i++)
            c[SAM_node[i].len]++;
        for(int i=1;i<=len;i++)
            c[i]+=c[i-1];
        for(int i=0;i<SAM_size;i++)
            top[--c[SAM_node[i].len]]=&SAM_node[i];
        while(T--)
        {
            scanf("%s",B);
            int len2=strlen(B);
            int temp=0; SAM_Node *now=SAM_root;

            for(int i=0;i<len2;i++)
            {
                int x=B[i]-'a';
                if(now->next[x])
                {
                    temp++;
                    now=now->next[x];
                    LCS[now->id]=max(LCS[now->id],temp);
                }
                else
                {
                    while(now&&!now->next[x])
                        now=now->fa;
                    if(now)
                    {
                        temp=now->len+1;
                        now=now->next[x];
                        LCS[now->id]=max(LCS[now->id],temp);
                    }
                    else
                    {
                        temp=0; now=SAM_root;
                    }
                }
            }
        }
        long long int ans=0;
        for(int i=SAM_size-1;i>=1;i--)
        {
            SAM_Node *p=top[i];
            if(LCS[p->id])
            {
                if(p->fa)
                    LCS[p->fa->id]=max(LCS[p->fa->id],LCS[p->id]);
                if(LCS[p->id]<p->len)
                {
                    ans+=p->len-LCS[p->id];
                }
            }
            else
            {
                ans+=p->len-p->fa->len;
            }
        }
        printf("Case %d: %I64d\n",cas++,ans);
    }
    return 0;
}
时间: 2024-10-14 16:36:19

HDOJ 4416 Good Article Good sentence的相关文章

hdu 4416 Good Article Good sentence (后缀数组)

题目大意: 给出一个A串和很多个B串,求出A中有多少个子串,是所有的B中没有出现的. 思路分析: 后缀数组的作用很容易的求出来整个串中不同的子串个数. 现在要求的是A中不同的,且在B中没有出现过的. 先把AB 串全部连接,跑一遍suffix array.然后求出有多少个不同的子串. 然后再单独用B 串跑 suffix array.再求出单独在B 中有多少个不同的 子串. 然后结果就是 ans1 - ans2 ... 需要注意的问题就是,连接的时候需要把每一个串后面加一个特殊符.但是求不同串的时候

[hdu 4416]Good Article Good sentence

最近几天一直在做有关后缀自动机的题目 感觉似乎对后缀自动机越来越了解了呢!喵~ 这题还是让我受益颇多的,首先搞一个后缀自动机是妥妥的了 可是搞完之后呢? 我们来观察 step 这个变量,每个节点的 step 是从根节点到此节点所经过的最长步数 那么也就是以该点为结尾的最长的后缀长度 如何统计不被 Bi 串包含的子串呢? 其实很简单,维护每个节点所能匹配的最长的字符串长度 然后 节点->step-max(该节点所能匹配的最长的字符串长度, 节点->fail->step) 就是答案了 因为

hdu 4416 Good Article Good sentence(后缀自动机)

题目链接:hdu 4416 Good Article Good sentence 题意: 给你一个串A和n个串B,问你A有多少个子串不是这n个B的子串. 题解: 将A串建立后缀自动机,对于每个B串都拿去匹配一下,并记录后缀自动机中每个节点的最大匹配长度. 然后拓扑排序,更新每个节点的fail节点.最后对于每个节点的贡献就是ml[i]-max(is[i],mx[f[i]]) (is[i]是该节点的最大匹配长度) 1 #include<bits/stdc++.h> 2 #define F(i,a,

HDOJ 题目4416 Good Article Good sentence(后缀数组求a串子串在b串中不出现的种类数)

-每周六晚的BestCoder(有米!) Good Article Good sentence Time Limit: 6000/3000 MS (Java/Others)    Memory Limit: 32768/32768 K (Java/Others) Total Submission(s): 2784    Accepted Submission(s): 785 Problem Description In middle school, teachers used to encour

hdu 4416 Good Article Good sentence(后缀数组&amp;思维)

Good Article Good sentence Time Limit: 6000/3000 MS (Java/Others)    Memory Limit: 32768/32768 K (Java/Others) Total Submission(s): 2308    Accepted Submission(s): 649 Problem Description In middle school, teachers used to encourage us to pick up pre

HDU 4416 Good Article Good sentence(后缀自动机)

[题目链接] http://acm.hdu.edu.cn/showproblem.php?pid=4416 [题目大意] 给出一个字符串,然后,给出一个字符串集合,问在该字符串中出现,且不在字符串集合中出现的子串总数. [题解] 将集合中所有的子串在自动机上跑,保存匹配到的位置的最长匹配, 用于在parent树上计算每个位置的最长匹配,对于一个位置, 如果不存在匹配,那么他对答案的贡献就是其value值, 如果存在匹配且匹配长度小于其长度那么取其差作为答案的贡献.最后输出即可. [代码] #in

Good Article Good sentence HDU - 4416 (后缀自动机)

Good Article Good sentence \[ Time Limit: 3000 ms\quad Memory Limit: 32768 kB \] 题意 给出一个 \(S\) 串,在给出 \(n\) 个 \(T\) 串,求出 \(S\) 串中有多少子串没有在任意一个 \(T\) 串中出现过 思路 \(\quad\) 首先可以对 \(S\) 串构建后缀自动机,然后在插入 \(n\) 个 \(T\) 串,每两个串之间用 \(27\) 隔开,然后可以求出这个自动机上每个节点出现的最左位置

HDOJ 3507 Print Article

斜率优化DP Print Article Time Limit: 9000/3000 MS (Java/Others)    Memory Limit: 131072/65536 K (Java/Others) Total Submission(s): 5519    Accepted Submission(s): 1707 Problem Description Zero has an old printer that doesn't work well sometimes. As it is

hdu4416---Good Article Good sentence

Good Article Good sentence Time Limit: 6000/3000 MS (Java/Others) Memory Limit: 32768/32768 K (Java/Others) Total Submission(s): 2586 Accepted Submission(s): 728 Problem Description In middle school, teachers used to encourage us to pick up pretty se