09 Finding a Motif in DNA

Problem

Given two strings s and t, t is a substring of s if t is contained as a contiguous collection of symbols in s (as a result, t must be no longer than s).

The position of a symbol in a string is the total number of symbols found to its left, including itself (e.g., the positions of all occurrences of 'U' in "AUGCUUCAGAAAGGUCUUACG" are 2, 5, 6, 15, 17, and 18). The symbol at position i of s is denoted by s[i].
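The 1-based convention can be checked in Python, whose own indices start at 0. This is only a minimal sketch; the helper name `symbol_positions` is an assumption made for illustration and is not part of the problem statement.

```python
def symbol_positions(s, symbol):
    """Return the 1-based positions of every occurrence of symbol in s."""
    # enumerate() counts from 0, so add 1 to get the problem's 1-based position
    return [i + 1 for i, c in enumerate(s) if c == symbol]

print(symbol_positions("AUGCUUCAGAAAGGUCUUACG", "U"))  # [2, 5, 6, 15, 17, 18]
```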

A substring of s can be represented as s[j:k], where j and k represent the starting and ending positions of the substring in s; for example, if s = "AUGCUUCAGAAAGGUCUUACG", then s[2:5] = "UGCU".
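Note that this notation is 1-based and includes both endpoints, whereas a Python slice is 0-based and excludes its end. A small sketch of the conversion, assuming a hypothetical helper named `substring`:

```python
def substring(s, j, k):
    """Return s[j:k] in the problem's notation: 1-based, both ends inclusive."""
    # Python slices are 0-based with an exclusive end, so only the start shifts
    return s[j - 1:k]

s = "AUGCUUCAGAAAGGUCUUACG"
print(substring(s, 2, 5))  # UGCU
```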

The location of a substring s[j:k] is its beginning position j; note that t will have multiple locations in s if it occurs more than once as a substring of s (see the Sample below).

Given: Two DNA strings s and t (each of length at most 1 kbp).

Return: All locations of t as a substring of s.

Sample Dataset

GATATATGCATATACTT
ATAT

Sample Output

2 4 10
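One possible way to reproduce this output uses only the built-in `str.find`, restarting the search one character after each hit so that overlapping occurrences (locations 2 and 4 here) are both reported. This is a sketch under that assumption; the function name `find_locations` is hypothetical.

```python
def find_locations(s, t):
    """Return the 1-based locations of t in s, overlaps included."""
    locations = []
    start = s.find(t)
    while start != -1:
        locations.append(start + 1)  # convert 0-based index to 1-based location
        # resume one character later so overlapping occurrences are kept
        start = s.find(t, start + 1)
    return locations

print(*find_locations("GATATATGCATATACTT", "ATAT"))  # 2 4 10
```

Resuming at `start + len(t)` instead would skip the occurrence at location 4, because it overlaps the one at location 2.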

#-*-coding:UTF-8-*-
### 9. Finding a Motif in DNA ###

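# Method 1: use finditer from the third-party regex module
import regex
# regex is a more powerful alternative to the built-in re module

matches = regex.finditer('ATAT', 'GATATATGCATATACTT', overlapped=True)
# overlapped=True yields every match, including overlapping ones
for match in matches:
    print(match.start() + 1)  # convert 0-based index to 1-based location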

# Method 2: Brute Force Search
seq = 'GATATATGCATATACTT'
pattern = 'ATAT'

def find_motif(seq, pattern):
    position = []
    # + 1 so the final window, ending at the last symbol of seq, is also checked
    for i in range(len(seq) - len(pattern) + 1):
        if seq[i:i + len(pattern)] == pattern:
            position.append(str(i + 1))  # report 1-based locations

    print('\t'.join(position))

find_motif(seq, pattern)

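# Method 3: built-in re module with a lookahead
import re
seq = 'GATATATGCATATACTT'
print([i.start() + 1 for i in re.finditer('(?=ATAT)', seq)])
# (?=...) is a zero-width lookahead: it succeeds at a position only if the
# text that follows matches the expression, and it consumes no characters,
# so overlapping occurrences are all found.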

Time: 2025-01-12 08:11:49
