Dissertations / Theses on the topic 'Multiple alignment'
Create a spot-on reference in APA, MLA, Chicago, Harvard, and other styles
Consult the top 50 dissertations / theses for your research on the topic 'Multiple alignment.'
Next to every source in the list of references, there is an 'Add to bibliography' button. Press on it, and we will generate automatically the bibliographic reference to the chosen work in the citation style you need: APA, MLA, Harvard, Chicago, Vancouver, etc.
You can also download the full text of the academic publication as pdf and read online its abstract whenever available in the metadata.
Browse dissertations / theses on a wide variety of disciplines and organise your bibliography correctly.
Starrett, Dean. "Optimal Alignment of Multiple Sequence Alignments." Diss., The University of Arizona, 2008. http://hdl.handle.net/10150/194840.
Full textSammeth, Michael. "Integrated multiple sequence alignment." [S.l.] : [s.n.], 2005. http://deposit.ddb.de/cgi-bin/dokserv?idn=98148767X.
Full textAuer, Jens. "Metaheuristic Multiple Sequence Alignment Optimisation." Thesis, University of Skövde, School of Humanities and Informatics, 2004. http://urn.kb.se/resolve?urn=urn:nbn:se:his:diva-899.
Full textThe ability to tackle NP-hard problems has been greatly extended by the introduction of Metaheuristics (see Blum & Roli (2003)) for a summary of most Metaheuristics, general problem-independent optimisation algorithms extending the hill-climbing local search approach to escape local minima. One of these algorithms is Iterated Local Search (ILS) (Lourenco et al., 2002; Stützle, 1999a, p. 25ff), a recent easy to implement but powerful algorithm with results comparable or superior to other state-of-the-art methods for many combinatorial optimisation problems, among them the Traveling Salesman (TSP) and Quadratic Assignment Problem (QAP). ILS iteratively samples local minima by modifying the current local minimum and restarting
a local search porcedure on this modified solution. This thesis will show how ILS can be implemented for MSA. After that, ILS will be evaluated and compared to other MSA algorithms by BAliBASE (Thomson et al., 1999), a set of manually refined alignments used in most recent publications of algorithms and in at least two MSA algorithm surveys. The runtime-behaviour will be evaluated using runtime-distributions.
The quality of alignments produced by ILS is at least as good as the best algorithms available and significantly superiour to previously published Metaheuristics for MSA, Tabu Search and Genetic Algorithm (SAGA). On the average, ILS performed best in five out of eight test cases, second for one test set and third for the remaining two. A drawback of all iterative methods for MSA is the long runtime needed to produce good alignments. ILS needs considerably less runtime than Tabu Search and SAGA, but can not compete with progressive or consistency based methods, e. g. ClustalW or T-COFFEE.
Siu, Wing-yan, and 蕭穎欣. "Multiple structural alignment for proteins." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2008. http://hub.hku.hk/bib/B4068748X.
Full textSiu, Wing-yan. "Multiple structural alignment for proteins." Click to view the E-thesis via HKUTO, 2008. http://sunzi.lib.hku.hk/hkuto/record/B4068748X.
Full textCarroll, Hyrum D. "Biologically Relevant Multiple Sequence Alignment." Diss., CLICK HERE for online access, 2008. http://contentdm.lib.byu.edu/ETD/image/etd2623.pdf.
Full textNguyen, Ken D. "Multiple Biolgical Sequence Alignment: Scoring Functions, Algorithms, and Evaluations." Digital Archive @ GSU, 2011. http://digitalarchive.gsu.edu/cs_diss/62.
Full textZola, Jaroslaw. "Parallel server for multiple sequence alignment." Grenoble INPG, 2005. http://www.theses.fr/2005INPG0187.
Full textLn this work we investigate application of parallel processing and web-caching as a method to improve the efficiency of multiple sequence alignment. We develop a generic framework for distributed and local cache implementation, and we design decentralised caching system storing intermediate results of sequence alignment. Finally, we create a parallel server for multiple sequence alignment which utilises above techniques to speedup processing of large sequence sets. The server is based on the PhylTree method which is a generic scheme for multiple sequence alignment with simultaneous phylogeny, developed in the Laboratory ID-IMAG. Ln our work we propose also sorne extensions of PhylTree, like for example the application of simulated annealing to improve the efficiency of phylogenetic analysis
DeBlasio, Daniel Frank. "Parameter Advising for Multiple Sequence Alignment." Diss., The University of Arizona, 2016. http://hdl.handle.net/10150/612932.
Full textGuasco, Luciano M. "Multiple sequence alignment correction using constraints." Master's thesis, Faculdade de Ciências e Tecnologia, 2010. http://hdl.handle.net/10362/5143.
Full textOne of the most important fields in bioinformatics has been the study of protein sequence alignments. The study of homologous proteins, related by evolution, shows the conservation of many amino acids because of their functional and structural importance. One particular relationship between the amino acid sites in the same sequence or between different sequences, is protein-coevolution, interest in which has increased as a consequence of mathematical and computational methods used to understand the spatial, functional and evolutionary dependencies between amino acid sites. The principle of coevolution means that some amino acids are related through evolution because mutations in one site can create evolutionary pressures to select compensatory mutations in other sites that are functionally or structurally related. With the actual methods to detect coevolution, specifically mutual information techniques from the information theory field, we show in this work that much of the information between coevolved sites is lost because of mistakes in the multiple sequence alignment of variable regions. Moreover, we show that using these statistical methods to detect coevolved sites in multiple sequence alignments results in a high rate of false positives. Due to the amount of errors in the detection of coevolved site from multiple sequence alignments, we propose in this work a method to improve the detection efficacy of coevolved sites and we implement an algorithm to fix such sites correcting the misalignment produced in those specific locations. The detection part of our work is based on the mutual information between sites that are guessed as having coevolved, due to their high statistical correlation score. With this information we search for possible misalignments on those regions due to the incorrect matching of amino acids during the alignment. The re-alignment part is based on constraint programming techniques, to avoid the combinatorial complexity when one amino acid can be aligned with many others and to avoid inconsistencies in the alignments. In this work, we present a framework to impose constraints over the sequences, and we show how it is possible to compute alignments based on different criteria just by setting constraint between the amino acids. This framework can be applied not only for improving the alignment and detection of coevolved regions, but also to any desired constraints that may be used to express functional or structural relations among the amino acids in multiple sequences. We show also that after we fix these misalignments, using constraints based techniques, the correlation between coevolved sites increases and, in general, the new alignment is closer to the correct alignment than the MSA alignment. Finally, we show possible future research lines with the objective of overcoming some drawbacks detected during this work.
Ye, Yongtao, and 叶永滔. "Aligning multiple sequences adaptively." Thesis, The University of Hong Kong (Pokfulam, Hong Kong), 2014. http://hdl.handle.net/10722/206465.
Full textpublished_or_final_version
Computer Science
Master
Master of Philosophy
Lu, Yue. "Improving the quality of multiple sequence alignment." [College Station, Tex. : Texas A&M University, 2008. http://hdl.handle.net/1969.1/ETD-TAMU-3111.
Full textRodrigues, Kaio Wagner Lima, and 92991221146. "Removing DUST using multiple alignment of sequences." Universidade Federal do Amazonas, 2016. https://tede.ufam.edu.br/handle/tede/6557.
Full textApproved for entry into archive by Secretaria PPGI (secretariappgi@icomp.ufam.edu.br) on 2018-08-23T19:08:57Z (GMT) No. of bitstreams: 3 license_rdf: 0 bytes, checksum: d41d8cd98f00b204e9800998ecf8427e (MD5) kaio-tese.pdf: 3615178 bytes, checksum: dc547b203670c1159f46136e021a4825 (MD5) kaio-folha-de-aprovacao.jpg: 3343904 bytes, checksum: b00e5c4807f5a7e10eddc2eed2de5f12 (MD5)
Approved for entry into archive by Divisão de Documentação/BC Biblioteca Central (ddbc@ufam.edu.br) on 2018-08-24T13:43:58Z (GMT) No. of bitstreams: 3 license_rdf: 0 bytes, checksum: d41d8cd98f00b204e9800998ecf8427e (MD5) kaio-tese.pdf: 3615178 bytes, checksum: dc547b203670c1159f46136e021a4825 (MD5) kaio-folha-de-aprovacao.jpg: 3343904 bytes, checksum: b00e5c4807f5a7e10eddc2eed2de5f12 (MD5)
Made available in DSpace on 2018-08-24T13:43:58Z (GMT). No. of bitstreams: 3 license_rdf: 0 bytes, checksum: d41d8cd98f00b204e9800998ecf8427e (MD5) kaio-tese.pdf: 3615178 bytes, checksum: dc547b203670c1159f46136e021a4825 (MD5) kaio-folha-de-aprovacao.jpg: 3343904 bytes, checksum: b00e5c4807f5a7e10eddc2eed2de5f12 (MD5) Previous issue date: 2016-09-21
FAPEAM - Fundação de Amparo à Pesquisa do Estado do Amazonas
A large number of URLs collected by web crawlers correspond to pages with duplicate or near-duplicate contents. These duplicate URLs, generically known as DUST (Different URLs with Similar Text), adversely impact search engines since crawling, storing and using such data imply waste of resources, the building of low quality rankings and poor user experiences. To deal with this problem, several studies have been proposed to detect and remove duplicate documents without fetching their contents. To accomplish this, the proposed methods learn normalization rules to transform all duplicate URLs into the same canonical form. This information can be used by crawlers to avoid fetching DUST. A challenging aspect of this strategy is to efficiently derive the minimum set of rules that achieve larger reductions with the smallest false positive rate. As most methods are based on pairwise analysis, the quality of the rules is affected by the criterion used to select the examples and the availability of representative examples in the training sets. To avoid processing large numbers of URLs, they employ techniques such as random sampling or by looking for DUST only within sites, preventing the generation of rules involving multiple DNS names. As a consequence of these issues, current methods are very susceptible to noise and, in many cases, derive rules that are very specific. In this thesis, we present a new approach to derive quality rules that take advantage of a multi-sequence alignment strategy. We demonstrate that a full multi-sequence alignment of URLs with duplicated content, before the generation of the rules, can lead to the deployment of very effective rules. Experimental results demonstrate that our approach achieved larger reductions in the number of duplicate URLs than our best baseline in two different web collections, in spite of being much faster. We also present a distributed version of our method, using the MapReduce framework, and demonstrate its scalability by evaluating it using a set of 7.37 million URLs.
Um grande número de URLs obtidas por coletores corresponde a páginas com conteúdo duplicado ou quase duplicado, conhecidas em Inglês pelo acrônimo DUST, que pode ser traduzido como Diferentes URLs com Texto Similar. DUST são prejudiciais para sistemas de busca porque ao serem coletadas, armazenadas e utilizadas, contribuem para o desperdício de recursos, a criação de rankings de baixa qualidade e, consequentemente, uma experiência pior para o usuário. Para lidar com este problema, muita pesquisa tem sido realizada com intuito de detectar e remover DUST antes mesmo de coletar as URLs. Para isso, esses métodos se baseiam no aprendizado de regras de normalização que transformam todas as URLs com conteúdo duplicado para uma mesma forma canônica. Tais regras podem ser então usadas por coletores com o intuito de reconhecer e ignorar DUST. Para isto, é necessário derivar, de forma eficiente, um conjunto mínimo de regras que alcance uma grande taxa de redução com baixa incidência de falsos-positivos. Como a maioria dos métodos propostos na literatura é baseada na análise de pares, a qualidade das regras é afetada pelo critério usado para selecionar os exemplos de pares e a disponibilidade de exemplos representativos no treino. Para evitar processar um número muito alto de exemplos, em geral, são aplicadas técnicas de amostragem ou a busca por DUST é limitada apenas a sites, o que impede a geração de regras que envolvam diferentes nomes de DNS. Como consequência, métodos atuais são muito suscetíveis a ruído e, em muitos casos, derivam regras muito específicas. Nesta tese, é proposta uma nova técnica para derivar regras, baseada em uma estratégia de alinhamento múltiplo de sequências. Em particular, mostramos que um alinhamento prévio das URLs com conteúdo duplicado contribui para uma melhor generalização, o que resulta na geração de regras mais efetivas. Através de experimentos em duas diferentes coleções extraídas da Web, observa-se que a técnica proposta, além de ser mais rápida, filtra um número maior de URLs duplicadas. Uma versão distribuída do método, baseada na arquitetura MapReduce, proporciona a possibilidade de escalabilidade para coleções com dimensões compatíveis com a Web.
Rausch, Tobias [Verfasser]. "Dissecting multiple sequence alignment methods : the analysis, design and development of generic multiple sequence alignment components in SeqAn / Tobias Rausch." Berlin : Freie Universität Berlin, 2010. http://d-nb.info/1024541460/34.
Full textLightner, Carin Ann. "A Tabu Search Approach to Multiple Sequence Alignment." NCSU, 2008. http://www.lib.ncsu.edu/theses/available/etd-05312008-191232/.
Full textZhang, Xiaodong. "A Local Improvement Algorithm for Multiple Sequence Alignment." Ohio University / OhioLINK, 2003. http://rave.ohiolink.edu/etdc/view?acc_num=ohiou1049485762.
Full textSun, Hong. "DETECTING MULTIPLE PROTEIN FOLDING TRAJECTORIES AND STRUCTURAL ALIGNMENT." The Ohio State University, 2011. http://rave.ohiolink.edu/etdc/view?acc_num=osu1319744262.
Full textFloden, Evan 1985. "Alignment uncertainty, regressive alignment and large scale deployment." Doctoral thesis, Universitat Pompeu Fabra, 2018. http://hdl.handle.net/10803/665300.
Full textUn alineament de seqüència múltiple (MSA) proporciona una descripció de la relació entre seqüències biològiques on les columnes representen una ascendència compartida a través d'un conjunt implicat d'esdeveniments evolutius. La majoria de la investigació en el camp s'ha centrat a millorar la precisió dels alineaments dins del marc d'alineació progressiva i ha permès inferències poderoses, incloent-hi la reconstrucció filogenètica, el modelatge d'homologia i la predicció de malalties. Malgrat això, quan s'aplica als conjunts de dades de genòmica moderns, que sovint comprenen desenes de milers de seqüències, sorgeixen nous reptes en la construcció d'un MSA precís. Aquests problemes es poden generalitzar per formar tres problemes bàsics. En primer lloc, a mesura que augmenta el nombre de seqüències, les metodologies d'alineació progressiva presenten una disminució espectacular de la precisió de l'alineació. A més, per a un conjunt de dades, existeixen molts MSA com a possibles solucions un problema que s'agreuja amb un nombre creixent de seqüències a causa de la incertesa d'alineació. Finalment, les dificultats tècniques obstaculitzen el desplegament d'aquests fluxos de treball d'anàlisi genòmica, especialment de manera reproduïble, sovint presenten una gran barrera per als professionals fins i tot qualificats. Aquest treball té com a objectiu abordar aquesta trifecta de problemes a través d'un servidor web per a l'extensió ràpida d'homologia basada en MSA, dos nous mètodes per a la millora de l'arrencada filogenètica permeten incorporar incertesa d'alineació, un nou procediment d'alineació que millora els alineaments a gran escala anomenat MSA regressivu i, finalment, un marc de flux de treball permet el desplegament d'anàlisis reproduïbles a gran escala a través de clústers i computació al núvol anomenat Nextflow. En conjunt, es pot veure que aquest treball proporciona tant avanços conceptuals com tècniques que proporcionen millores substancials als mètodes MSA existents i les conseqüències resultants.
Zhang, Ching. "Genetic algorithm approaches for efficient multiple molecular sequence alignment." Thesis, National Library of Canada = Bibliothèque nationale du Canada, 1998. http://www.collectionscanada.ca/obj/s4/f2/dsk2/tape17/PQDD_0013/NQ30660.pdf.
Full textZhou, Rong. "Memory-efficient graph search applied to multiple sequence alignment." Diss., Mississippi State : Mississippi State University, 2005. http://library.msstate.edu/etd/show.asp?etd=etd-06282005-015428.
Full textLloyd, G. Scott. "Accelerated Large-Scale Multiple Sequence Alignment with Reconfigurable Computing." BYU ScholarsArchive, 2011. https://scholarsarchive.byu.edu/etd/2729.
Full textHerman, Joseph L. "Multiple sequence analysis in the presence of alignment uncertainty." Thesis, University of Oxford, 2014. http://ora.ox.ac.uk/objects/uuid:88a56d9f-a96e-48e3-b8dc-a73f3efc8472.
Full textAhmed, Nova. "Parallel Algorithm for Memory Efficient Pairwise and Multiple Genome Alignment in Distributed Environment." Digital Archive @ GSU, 2004. http://digitalarchive.gsu.edu/cs_theses/2.
Full textHöhl, Michael. "Is multiple sequence alignment required for accurate inference of phylogeny? /." [St. Lucia, Qld.], 2006. http://www.library.uq.edu.au/pdfserve.php?image=thesisabs/absthe19790.pdf.
Full textDeBlasio, Daniel. "NEW COMPUTATIONAL APPROACHES FOR MULTIPLE RNA ALIGNMENT AND RNA SEARCH." Master's thesis, University of Central Florida, 2009. http://digital.library.ucf.edu/cdm/ref/collection/ETD/id/4070.
Full textM.S.
School of Electrical Engineering and Computer Science
Engineering and Computer Science
Computer Science MS
Sarver, Michael. "STRUCTURE-BASED MULTIPLE RNA SEQUENCE ALIGNMENT AND FINDING RNA MOTIFS." Bowling Green State University / OhioLINK, 2006. http://rave.ohiolink.edu/etdc/view?acc_num=bgsu1151076710.
Full textDeBlasio, Dan, and John Kececioglu. "Core column prediction for protein multiple sequence alignments." BIOMED CENTRAL LTD, 2017. http://hdl.handle.net/10150/623957.
Full textHe, Jintai. "MULTIPLE SEQUENCES ALIGNMENT FOR PHYLOGENETIC TREE CONSTRUCTION USING GRAPHICS PROCESSING UNITS." Available to subscribers only, 2008. http://proquest.umi.com/pqdweb?did=1674095441&sid=1&Fmt=2&clientId=1509&RQT=309&VName=PQD.
Full text"Department of Computer Science." Keywords: GPU computing, Sequence alignment. Includes bibliographical references (p. 34). Also available online.
Kemena, Carsten 1983. "Improving the accuracy and the efficiency of multiple sequence alignment methods." Doctoral thesis, Universitat Pompeu Fabra, 2012. http://hdl.handle.net/10803/128678.
Full textEl alineamiento es uno de los métodos básicos en la comparación de secuencias biológicas, y a menudo el primer pasó en análisis posteriores. Por su posición privilegiada al principio de muchos estudios, la calidad del alineamiento es de gran importancia, de hecho cada resultado basado en un alineamiento depende en gran medida de la calidad de ´este. Este hecho se ha confirmado en diversos artículos recientes, en los cuales se ha investigado los efectos de la elección del método de alineamiento en la reconstrucción filogenética y la estimación de la selección positiva. En esta tesis, presento varios proyectos enfocados en la implementación de mejoras tanto en los métodos de alineamiento múltiple de secuencias como en la evaluación de estos. Concretamente, he tratado problemas como la evaluación de alineamientos estructurales de proteínas, la construcción de alineamientos estructurales y precisos de ARN y también el alineamiento de grandes conjuntos de secuencias.
Elias, Isaac. "Computational problems in evolution : Multiple alignment, genome rearrangements, and tree reconstruction." Doctoral thesis, KTH, Numerical Analysis and Computer Science, NADA, 2006. http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-4170.
Full textReconstructing the evolutionary history of a set of species is a fundamental problem in biology. This thesis concerns computational problems that arise in different settings and stages of phylogenetic tree reconstruction, but also in other contexts. The contributions include:
• A new distance-based tree reconstruction method with optimal reconstruction radius and optimal runtime complexity. Included in the result is a greatly simplified proof that the NJ algorithm also has optimal reconstruction radius. (co-author Jens Lagergren)
• NP-hardness results for the most common variations of Multiple Alignment. In particular, it is shown that SP-score, Star Alignment, and Tree Alignment, are NP hard for all metric symbol distances over all binary or larger alphabets.
• A 1.375-approximation algorithm for Sorting By Transpositions (SBT). SBT is the problem of sorting a permutation using as few block-transpositions as possible. The complexity of this problem is still open and it was a ten-year-old open problem to improve the best known 1.5-approximation ratio. The 1.375-approximation algorithm is based on a new upper bound on the diameter of 3-permutations. Moreover, a new lower bound on the transposition diameter of the symmetric group is presented and the exact transposition diameter of simple permutations is determined. (co-author Tzvika Hartman)
• Approximation, fixed-parameter tractable, and fast heuristic algorithms for two variants of the Ancestral Maximum Likelihood (AML) problem: when the phylogenetic tree is known and when it is unknown. AML is the problem of reconstructing the most likely genetic sequences of extinct ancestors along with the most likely mutation probabilities on the edges, given the phylogenetic tree and sequences at the leafs. (co-author Tamir Tuller)
• An algorithm for computing the number of mutational events between aligned DNA sequences which is several hundred times faster than the famous Phylip packages. Since pairwise distance estimation is a bottleneck in distance-based phylogeny reconstruction, the new algorithm improves the overall running time of many distancebased methods by a factor of several hundred. (co-author Jens Lagergren)
Wang, Jingjing. "A PARALLEL APPROACH TO MULTIPLE SEQUENCES ALIGNMENT AND PHYLOGENETIC TREE LABELING." OpenSIUC, 2010. https://opensiuc.lib.siu.edu/theses/246.
Full textChoi, Kwangbom. "P-Coffee a new divide-and-conquer method for multiple sequence alignment /." NCSU, 2005. http://www.lib.ncsu.edu/theses/available/etd-01182005-060947/.
Full textPaten, Benedict John. "Large-scale multiple alignment and transcriptionally-associated pattern discovery in vertebrate genomes." Thesis, University of Cambridge, 2007. http://ethos.bl.uk/OrderDetails.do?uin=uk.bl.ethos.612811.
Full textLassmann, Timo. "Algorithms for building and evaluating multiple sequence alignments /." Stockholm, 2006. http://diss.kib.ki.se/2006/91-7140-887-8/.
Full textWheeler, Travis John. "EFFICIENT CONSTRUCTION OF ACCURATE MULTIPLE ALIGNMENTS AND LARGE-SCALE PHYLOGENIES." Diss., The University of Arizona, 2009. http://hdl.handle.net/10150/195143.
Full textChiri, Ali. "Business Alignment Strategies for Middle East Real Estate Construction Projects." ScholarWorks, 2017. https://scholarworks.waldenu.edu/dissertations/4609.
Full textHelal, Manal Computer Science & Engineering Faculty of Engineering UNSW. "Indexing and partitioning schemes for distributed tensor computing with application to multiple sequence alignment." Awarded by:University of New South Wales. Computer Science & Engineering, 2009. http://handle.unsw.edu.au/1959.4/44781.
Full textYing, Chung Li, and 鍾立穎. "Multiple Sequence Alignment using Pairwise Suboptimal Alignment." Thesis, 2005. http://ndltd.ncl.edu.tw/handle/78872248303598965142.
Full text國立臺灣科技大學
資訊工程系
94
Multiple sequence alignment is an important tool to analysis biological sequence from searching similar sequence in database to protein structure. The optimal solution of dynamic programming is not always real biological solution when the number of sequence is increasing. Another method is progressive algorithm, it combined most similar sequence and then added next similar sequence. But the order of combining sequence have different alignment. Due to the optimal alignment is not always the best alignment in biological alignment, combining the pairwise suboptimal alignment have the possibility to find a better solution. The method also can decrease the time complexity. On the other hand, there is a possibility to find better alignment when we take a few time to try all combination.
Wang, Shu 1973. "On multiple sequence alignment." Thesis, 2007. http://hdl.handle.net/2152/3715.
Full textHan-Chiou, Yu, and 邱毓翰. "Constrained Multiple Sequences Alignment." Thesis, 2002. http://ndltd.ncl.edu.tw/handle/18516751892104681181.
Full text國立清華大學
資訊工程學系
90
We design a new algorithm of computing constrained multiple sequence alignment (CMSA) for guaranteeing that generated alignment satisfies the user- specified constraints that some particular residues should be aligned together. The first step of our strategy is design a constrained pairwise sequence alignment. Next, based on the concept of progressive alignment, we use the constrained pairwise sequence alignment to progressively merge the sequences. The time complexity of our CMSA algorithm for aligning K sequences is O(Kn4),where n is the maximum of lengths of sequences. We experimented our algorithm on RNases sequences with known structure and results of our experiment are all the important residues of active sites are well aligned together.
Xiao, Bo Weng, and èåæ. "Block Alignment: An Approach for Multiple Sequence Alignment Containing Clusters." Thesis, 2004. http://ndltd.ncl.edu.tw/handle/19616460760964410139.
Full text國立暨南國際大學
資訊工程學系
92
Multiple sequence alignment is a fundamental problem in computational molecular biology. It has been known as an NP-hard problem. To find its optimal solution will take a lot of time. For a reasonable wait and an acceptable solution, we have progressive methods. These methods perform pairwise alignment, and then combine them in to a multiple sequence alignment. In this thesis, we focus on multiple sequence alignment containing clusters. We try to take another view point to deal with sequence alignment. We use a matrix to present a sequence. Every sequence will be represented as a matrix. After two sequences (matrices) are aligned, the result of the alignment will again be represented by a matrix and then the original two sequences (matrices) will be discarded. That is, the result of aligning a set of sequences will always be considered as a block and represented by a matrix. This is thus different from the old ways in which only two sequences are aligned, not a group of aligned sequences and another group of aligned sequences. In this thesis, we will show some experimental results to test our proposed method. Block alignment outperforms those progressive methods for sequences containing clusters.
Hsu, Ta-Chin, and 徐大欽. "Multiple Sequence Alignment via Phylogenetic Tree." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/59691564789727790301.
Full text東海大學
數學系
94
Simultaneous alignment of numerous nucleotide or amino acid sequence is a necessary tool when analyzing the structure of the sequences in the field of molecular biology. Multiple sequence alignment can be used to search for and depict the conserved domain in the protein family, examine and prove if there is any homogeneity between the new sequence and the existed protein family, to help predict the secondary or tertiary structures of new sequence, and to serve as the first step in molecular evolutionary analysis. The problems of large amounts of calculated time and space needed in the multidimensional dynamic programming make the multiple sequence alignment difficult to perform. The practical methods presented so far are just heuristic, and they can’t be assured that the results have any biological meaning. However, evaluating the multiple sequence alignment always depends on scoring scheme and gap penalty which reflect the evolutionary relation. Therefore, the use of phylogenetic trees to guide multiple sequence alignment captures the evolutionary relation is an essential way. This thesis discusses the simultaneous construct of phylogenetic tree and multiple sequence alignment based on the principle of parsimony. This idea was first put forward by Sankoff in 1973. In 1989, Hein presented the feasible algorithm by further using sequence graph combined with dynamic programming. We modified the methods that Hein constructed his guide tree in the algorithm. According to the edit distances for every pair of sequences, we build the guide tree by neighboring joining method and apply it to guide the order of alignment. Also, we use three kinds of perturbation of tree topology to reconstruct the phylogenetic tree and multiple alignments according to the principle of parsimony.
Wang, San-yuan, and 王三源. "Constrained Multiple RNA Secondary Structure Alignment." Thesis, 2006. http://ndltd.ncl.edu.tw/handle/w9z3st.
Full text靜宜大學
資訊管理學系研究所
94
Multiple RNA secondary structure alignment and multiple sequence alignment are two important problems in computational biology. In the present criteria of the two problems, they just focus on absolute optimal solution, such as longest arc-preserving common subsequence and tree edit distance. In the sequence alignment problem, to align a set of sequences such that specified structures must be aligned together. This is desirable since one has the knowledge about the structures that are necessary for some function to work. This is the gold of the sequence alignment with constraints problem. In this thesis, we propose three multiple RNA secondary structure alignment algorithms for different applications. 1. The algorithm which is directed against the constraints of structure can let the user-specified constraints be aligned together. It’s time complexity is O(rkn2+k2n) where r, k, n are the number of constraints, sequences and the length of the longest sequence. 2. In the algorithm which is directed against the constraints of sequence, we add some annotated characters to the RNA sequences according to the structure information to produce the annotated RNA sequence. Then we align the annotated RNA sequences with constraints. The user can get not only the similarity of RAN but also the relationship of the RNA family. It’s time complexity is O(rk2n2). 3. Besides, we also propose the stem-loop algorithm which is directed against the feature s of tRNA structure. It’s time complexity is O(k2q+kq2) where q is the number of stem-loop.
Pátek, Zdeněk. "Multiple sequence alignment pomocí genetických algoritmů." Master's thesis, 2012. http://www.nusl.cz/ntk/nusl-304119.
Full text林世杰. "Multiple resolution and scale sequence alignment." Thesis, 2001. http://ndltd.ncl.edu.tw/handle/74036857840582056102.
Full text國立清華大學
資訊工程學系
89
We design a new sequence alignment system that base on L. Delcher’s [1] idea to aligning very long sequences. Because the traditional sequence alignment methods [12] spend too much time and memory space when align very long sequences. We use an idea, longest common subsequence (LCS), instead of suffix tree [5], and then we can solve sequence alignment problems more flexible. But it spends more time than using suffix tree. We improve LCS by using hash table, and it is a kind of trade off method between time and space. We also find some properties of LCS. So we can combine two shorter LCS’s results to get a longer LCS’s approximate result. In our experiment, we show our method is an effective method in practice.
"CSA-X: Modularized Constrained Multiple Sequence Alignment." Thesis, 2015. http://hdl.handle.net/10388/ETD-2015-10-2276.
Full textJiang, Yanan master of cellular and molecular biology. "Manual alignment of IVS sequences and its implication in multiple sequence alignment." Thesis, 2011. http://hdl.handle.net/2152/ETD-UT-2011-12-4706.
Full texttext
Sammeth, Michael [Verfasser]. "Integrated multiple sequence alignment / by Michael Sammeth." 2005. http://d-nb.info/98148767X/34.
Full textChen, Ying-li, and 陳穎立. "Multiple Alignment Analysis: Using Recursive Dynamic Programming." Thesis, 2005. http://ndltd.ncl.edu.tw/handle/88573776921218031318.
Full text義守大學
資訊管理學系碩士班
93
Sequence alignment analysis is the most important work for gene and protein research. Dynamic programming is one of the most used algorithms to study pairwise sequence alignments. However, when the number of sequences (k) is greater than 2, two obstacles hinder the application of this method: (1)The computing time and storage space requirements increase proportionally to O(2k.nk) and O(nk), respectively, n being the length of the sequence. (2)The difficulty in the coding to deal with dynamic k. This study presents a modified algorithm, the recursive dynamic programming, to resolve the second obstacle.
Cheng, Mei-Ling, and 程美齡. "Improving Multiple Alignment of RNA Tertiary Structures." Thesis, 2012. http://ndltd.ncl.edu.tw/handle/52922523689895268330.
Full text國立交通大學
生物資訊及系統生物研究所
100
Recently, there is a fast growing interest in noncoding RNAs (ncRNAs) because they play a lot of essential roles in many cellular processes, even though the transcripts of these ncRNAs are not translated into proteins. Actually, the function of most available ncRNAs is still unknown and needs to be determined. Since molecular structures are typically more evolutionarily conserved than sequences, detecting structural similarities among RNA three-dimensional (3D) structures can bring more significant insights into their functional and even evolutionary relationships that would not be detected by sequence information alone. Therefore, the purpose of this study is to design a software tool that can efficiently and accurately compute the structural similarity of multiple RNA 3D structures. Our method first uses a new structure alphabet to transform RNA 3D structures into 1D SA-encoded sequences and then uses a traditional multiple sequence alignment tool CLUSTAL W and a new BLOSUM-like scoring matrix to align the SA-encoded sequences of several RNAs for detecting the structural similarity of these RNAs. Next, we have implemented the above method into a software tool called iMARTS. Finally, we have tested iMARTS on some RNA 3D structures and compared its experimental results with those obtained by our previously developed tool MARTS. Consequently, the experimental results show that iMARTS indeed has a better performance when compared with MARTS. Therefore, we believe that iMARTS can serve as a useful tool in the study of structural biology.