Skip to main content

Consensus sequence Contents Biological significance Sequence analysis Software See also References Navigation menuWebLogoGestalt WorkbenchInstitute for Systems Biology"Consensus Sequence Zen"185246415130839

BioinformaticsDNA


molecular biologybioinformaticsnucleotideamino acidsequence alignmentsequence alignmentssequence motifsRNA polymerasenucleotidesgenometranscription factorspromotersgenesrestriction enzymespalindromicTransposonssplice sitesexonintronDNA binding sitepattern recognitiongeneticsmolecular biologybioinformaticssequence motifsregulatory sequencessignal sequencesevolutionDNAnotationpyrimidinepurinesequence logoWebLogoGestalt WorkbenchInstitute for Systems BiologyJalViewUGENE












Consensus sequence




From Wikipedia, the free encyclopedia






Jump to navigation
Jump to search


The most common variant of a genetic sequence across samples

In molecular biology and bioinformatics, the consensus sequence (or canonical sequence) is the calculated order of most frequent residues, either nucleotide or amino acid, found at each position in a sequence alignment. It represents the results of multiple sequence alignments in which related sequences are compared to each other and similar sequence motifs are calculated. Such information is important when considering sequence-dependent enzymes such as RNA polymerase.[1]




Contents





  • 1 Biological significance


  • 2 Sequence analysis

    • 2.1 Notation



  • 3 Software


  • 4 See also


  • 5 References




Biological significance


A protein binding site, represented by a consensus sequence, may be a short sequence of nucleotides which is found several times in the genome and is thought to play the same role in its different locations. For example, many transcription factors recognize particular patterns in the promoters of the genes they regulate. In the same way, restriction enzymes usually have palindromic consensus sequences, usually corresponding to the site where they cut the DNA. Transposons act in much the same manner in their identification of target sequences for transposition. Finally, splice sites (sequences immediately surrounding the exon-intron boundaries) can also be considered as consensus sequences.


Thus a consensus sequence is a model for a putative DNA binding site: it is obtained by aligning all known examples of a certain recognition site and defined as the idealized sequence that represents the predominant base at each position. All the actual examples shouldn't differ from the consensus by more than a few substitutions, but counting mismatches in this way can lead to inconsistencies.[2]


Any mutation allowing a mutated nucleotide in the core promoter sequence to look more like the consensus sequence is known as an up mutation. This kind of mutation will generally make the promoter stronger, and thus the RNA polymerase forms a tighter bind to the DNA it wishes to transcribe and transcription is up-regulated. On the contrary, mutations that destroy conserved nucleotides in the consensus sequence are known as down mutations. These types of mutations down-regulate transcription since RNA polymerase can no longer bind as tightly to the core promoter sequence.



Sequence analysis


Developing software for pattern recognition is a major topic in genetics, molecular biology, and bioinformatics. Specific sequence motifs can function as regulatory sequences controlling biosynthesis, or as signal sequences that direct a molecule to a specific site within the cell or regulate its maturation. Since the regulatory function of these sequences is important, they are thought to be conserved across long periods of evolution. In some cases, evolutionary relatedness can be estimated by the amount of conservation of these sites.



Notation


The conserved sequence motifs are called consensus sequences and they show which residues are conserved and which residues are variable. Consider the following example DNA sequence:


A[CT]NAYR

In this notation, A means that an A is always found in that position; [CT] stands for either C or T; N stands for any base; and A means any base except A. Y represents any pyrimidine, and R indicates any purine.


In this example, the notation [CT] does not give any indication of the relative frequency of C or T occurring at that position. An alternative method of representing a consensus sequence uses a sequence logo. This is a graphical representation of the consensus sequence, in which the size of a symbol is related to the frequency that a given nucleotide (or amino acid) occurs at a certain position. In sequence logos the more conserved the residue, the larger the symbol for that residue is drawn; the less frequent, the smaller the symbol. Sequence logos can be generated using WebLogo, or using the Gestalt Workbench, a publicly available visualization tool written by Gustavo Glusman at the Institute for Systems Biology.[2]



Software


Bioinformatics tools are able to calculate and visualize consensus sequences. Examples of the tools are JalView and UGENE.



See also


  • Position-specific scoring matrix


  • Regular expression — denoting multiple sequences of symbols in formal language theory

  • Sequence motif

  • Sequence logo


References




  1. ^ Pierce, Benjamin A. 2002. Genetics : A Conceptual Approach. 1st ed. New York: W.H. Freeman and Co.


  2. ^ ab Schneider TD (2002). "Consensus Sequence Zen". Appl Bioinform. 1 (3): 111–119. PMC 1852464. PMID 15130839..mw-parser-output cite.citationfont-style:inherit.mw-parser-output .citation qquotes:"""""""'""'".mw-parser-output .citation .cs1-lock-free abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/6/65/Lock-green.svg/9px-Lock-green.svg.png")no-repeat;background-position:right .1em center.mw-parser-output .citation .cs1-lock-limited a,.mw-parser-output .citation .cs1-lock-registration abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/d/d6/Lock-gray-alt-2.svg/9px-Lock-gray-alt-2.svg.png")no-repeat;background-position:right .1em center.mw-parser-output .citation .cs1-lock-subscription abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/a/aa/Lock-red-alt-2.svg/9px-Lock-red-alt-2.svg.png")no-repeat;background-position:right .1em center.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registrationcolor:#555.mw-parser-output .cs1-subscription span,.mw-parser-output .cs1-registration spanborder-bottom:1px dotted;cursor:help.mw-parser-output .cs1-ws-icon abackground:url("//upload.wikimedia.org/wikipedia/commons/thumb/4/4c/Wikisource-logo.svg/12px-Wikisource-logo.svg.png")no-repeat;background-position:right .1em center.mw-parser-output code.cs1-codecolor:inherit;background:inherit;border:inherit;padding:inherit.mw-parser-output .cs1-hidden-errordisplay:none;font-size:100%.mw-parser-output .cs1-visible-errorfont-size:100%.mw-parser-output .cs1-maintdisplay:none;color:#33aa33;margin-left:0.3em.mw-parser-output .cs1-subscription,.mw-parser-output .cs1-registration,.mw-parser-output .cs1-formatfont-size:95%.mw-parser-output .cs1-kern-left,.mw-parser-output .cs1-kern-wl-leftpadding-left:0.2em.mw-parser-output .cs1-kern-right,.mw-parser-output .cs1-kern-wl-rightpadding-right:0.2em











Retrieved from "https://en.wikipedia.org/w/index.php?title=Consensus_sequence&oldid=874233690"










Navigation menu


























(window.RLQ=window.RLQ||[]).push(function()mw.config.set("wgPageParseReport":"limitreport":"cputime":"0.120","walltime":"0.170","ppvisitednodes":"value":201,"limit":1000000,"ppgeneratednodes":"value":0,"limit":1500000,"postexpandincludesize":"value":2566,"limit":2097152,"templateargumentsize":"value":262,"limit":2097152,"expansiondepth":"value":7,"limit":40,"expensivefunctioncount":"value":1,"limit":500,"unstrip-depth":"value":1,"limit":20,"unstrip-size":"value":3854,"limit":5000000,"entityaccesscount":"value":1,"limit":400,"timingprofile":["100.00% 151.371 1 -total"," 75.06% 113.622 1 Template:Reflist"," 66.49% 100.647 1 Template:Cite_journal"," 23.28% 35.237 1 Template:Short_description"," 21.92% 33.186 1 Template:Pagetype"," 2.06% 3.116 1 Template:Main_other"],"scribunto":"limitreport-timeusage":"value":"0.076","limit":"10.000","limitreport-memusage":"value":1703528,"limit":52428800,"cachereport":"origin":"mw1275","timestamp":"20190319221824","ttl":2592000,"transientcontent":false););"@context":"https://schema.org","@type":"Article","name":"Consensus sequence","url":"https://en.wikipedia.org/wiki/Consensus_sequence","sameAs":"http://www.wikidata.org/entity/Q1295754","mainEntity":"http://www.wikidata.org/entity/Q1295754","author":"@type":"Organization","name":"Contributors to Wikimedia projects","publisher":"@type":"Organization","name":"Wikimedia Foundation, Inc.","logo":"@type":"ImageObject","url":"https://www.wikimedia.org/static/images/wmf-hor-googpub.png","datePublished":"2004-07-01T00:01:31Z","dateModified":"2018-12-17T23:25:38Z"(window.RLQ=window.RLQ||[]).push(function()mw.config.set("wgBackendResponseTime":113,"wgHostname":"mw1262"););

Popular posts from this blog

How to write a 12-bar blues melodyI-IV-V blues progressionHow to play the bridges in a standard blues progressionHow does Gdim7 fit in C# minor?question on a certain chord progressionMusicology of Melody12 bar blues, spread rhythm: alternative to 6th chord to avoid finger stretchChord progressions/ Root key/ MelodiesHow to put chords (POP-EDM) under a given lead vocal melody (starting from a good knowledge in music theory)Are there “rules” for improvising with the minor pentatonic scale over 12-bar shuffle?Confusion about blues scale and chords

What if the end-user didn't have the required library?What is setup.py?What is a clean, pythonic way to have multiple constructors in Python?What does Ruby have that Python doesn't, and vice versa?What is the reason for having '//' in Python?How do I create a namespace package in Python?How to package shared objects that python modules depend on?setuptools vs. distutils: why is distutils still a thing?Navigation in Windows 10 vs code not going to virtualenv library when the same library is installed at user levelPython create package for local usePackaging a project that uses multiple python versionsWhy is permission denied on pip install except for when “--user” is included at end of command?

Esgonzo ibérico Índice Descrición Distribución Hábitat Ameazas Notas Véxase tamén "Acerca dos nomes dos anfibios e réptiles galegos""Chalcides bedriagai"Chalcides bedriagai en Carrascal, L. M. Salvador, A. (Eds). Enciclopedia virtual de los vertebrados españoles. Museo Nacional de Ciencias Naturales, Madrid. España.Fotos