This Quiz Section introduces some informatics tools for genomics. In particular, we will explore genome browsing and BLAST searching. All you need to perform these activities can be found on the internet. We will use the UCSC Mouse Genome Browser and the NCBI BLAST Database.
On the right is an image of the Mouse Genome Browser. The red box indicates where you can make a query. You can query a piece of DNA by specifying the chromosome and base pair coordinates as shown. If you want to find a particular gene, you can just type in the gene name, such as TRPM8. | ![]() |
Here is a gene view of TrpM8. The red circle indicates the known gene sequence, displayed as a piece of genomic DNA with both exons and introns. Clicking on the Trpm8 inside the red circle brings up additional gene information, including the DNA bases. | ![]() |
Below the gene viewer are several parameters you can set. In this image, I am setting the map to display TrpM8 ESTs found in organisms other than mouse. Once the new parameters have been set, just hit the refresh button at the bottom of the gene viewer page to see the new results. | ![]() |
The NCBI BLAST Database contains many tools for searching and comparing gene and protein sequences. For the purposes of this quiz section, we are most concerned with the tools shown on the right. You can query for genes using DNA sequence (nucleotide blast), or query for proteins using amino acid sequence (protein blast), or query for proteins using DNA sequence (blastx). | ![]() |
The image on the right shows an example of using nucleotide blast. Sequence is entered in the box at the top. The database you want to search is chosen from the pull-down menu shown here. Once your parameters are set, just press the BLAST button in the lower left corner to begin your search. | ![]() |
![]() |
Above is an example of a query using the mouse TrpM8 mRNA sequence and searched against the Reference mRNA sequence (refseq_RNA). Notice that there is highly conserved sequence found in other organisms. Each result has an Accession number that you can click on to obtain more information about the sequence from the database. When viewing the results, Query coverage indicates what fraction of the nucleotides align to the database sequence. E value indicates the probability of this match happening by chance, thus values are better the closer they are to 0. Max ident is the identity between the sequences. In other words a value of 100% means the query and database sequences match exactly, and a value of 90% indicates that 10% of the bases differ between the two sequences. Below shows one of the sequence alignments found in the list. You can use these to visualize the similarity between two sequences. |
![]() |
GTATCCAACGGTTGTGTGAGTAAAATTCTGGGCAGGTATTACGAGACTGGCTCCATCAGA CCCAGGGCAATCGGAGGGAGTAAGCCAAGAGTGGCGACTCCAGAAGTTGTAAGCAAAATA GCCCAGTATAAACGGGAGTGCCCTTCCATCTTTGCTTGGGAAATCCGAGACAGATTATTA TCCGAGGGGGTCTGTACCAACGATAACATACCCAGT
TTCCTACGAAGGAGAGCAGAAATCAATGCTCCTATTGCAGATAAATACATCCTAACATGT ATCTTAACTTTTACTGATTGTCCTAGTCACAGTGGATGCAGTGAAAAAATATTATGGATT TTTATGTGATGAGAATCTTGTCTTGTATTTACTATTTTGTCCCATTTTCCACATACTCTA TTTGACAAAGTGTGGTGTTAACAGGAGAATAAGATTTTACAGTGGTTAACATACCATTCA GATATCACAAAGACTTATAACAATGAAGTTGTGACTAATTCCCAGCAGGCATAACGATGC AGGGAAGAAAGGGAAAAAAAGGAAAAAAAAAATCACCTGTGTGTGAAGCCTACCTACACT ATTCTTCAGCCTTTTTAAAATTAGTATTTTTTTTCATCCCGTTCATGTGGGTGGAAGCAA GCTGCTACATTTAAGCCAGGATCCAGCAAAACAATTGGACATGTTCTTCAATGATGAAGA TCACAAATATTTGAGAGCATGTAACTGGGCTCTTCAGTCTGCTTATGGTGAGTTAAATGA GCCTCTTTTCGAAGGATCAGGAGCATGAGCTGTTTTACAAAAGCTCACTGCAAAACACTT CATGCCTTACGTGGTTTACTGAAATACTAGAGGCAGTTAAATCTCTGTGCTGAACTGTTA ACATACATTTTTTCAAGCTCATTATCTATGTGACTACAGGACTTGATTCAGCTCACTCAG AACTACAAAATGATCTCTCTGGTCCCCATTGACATTGCCAGAATTTCATGAGACGACCCA TGTCAACTCACTTAGGACAGAAGAAGAACCTTTGAACAGAACATGAGATGTTGCAAATAC TCAGGAAACCCAAACAGCAATCAAAGATCAAGTGTCCTTTTAAGTCTTTCACTGGATCAA GTTCTAGATGTCTAACACCTCTGGTTTTCAGCCTGACCTTCTGCACTACTGTAAAACTGC CACTCCAAATGCCGATGCATCATAGTATCATATATTGTTGTAAAGTGTCAGAGAGCATAA TCAAGATTAGCTCAAAACCCTCAATAATTTACTCCTATAACTGCATTAAGCTATAAGTGT AATAACTAAAGTTATGCTTTACAAAGCATGATGCAAAGAAATACATTAAATGTCCGATGT ATTAAAAGAATCTATAGAGAGGCAGAAGTTCCTCCCCACATTCTGTATCAAACACAAGTA GTTACTGCTGATTACTCAGAAAGTGAAGCATGCATGTTCTGAAACAAAGATAGTATTTTC TCATTAAACTTCAAGAAATGGATTTTTTTTTTCCCAGCACTGTACCTCAGAGAGATTTTG GCCAGAGACTTATTTCTCCTTTGACCCTGCAGCTCACAGTACCAGCCTGTGCCATACTAG CGCAACTCCAGCATTCAGTACCCTGCCTTTGTCTTTCTCCTGTGGGCATGAACAGCAAAT ACCACAGCAGATTTACCTCTGGAGGCATTTCATAAGCCTCATTCTCAGGATCCACAGGAA TATCAGTATTATTCACCATTCCTTCCTGCAGAAAACCTTCTTCATTCTACATTTAAAACA GGGAGAGAAAAAAATAAGGTGTTAGATCAACTGCTTTTCCAATTTATTGCAAAGCATGTC TTTCCTCTCCACACAGCAGATACTGGCTGTAAAGGCATGCAGAAGTAAGAGAAGGACAAA AGGAATTGGATAGTATCAAGACACGAAAATGAAACATAATTCTATTATGTATTTTGTTAT AGGCAGATAACTCACCCTCCTACACCTTTATTTCATTTTACGGTTTTTACCCTTCTCATA TTACATTTCAAAGTGGCCAAAGGAGGAAGAACAACTATAATTTATTGCATGTTGTTGTTC CCCCCCCCCCCCCCTACAATTTTAGAGCCTAATGACATTGGGAAACGTTTTTCCCTTAAT GAGCCATATTTTTATATCCTTTATTGTGCTGAGTAGCACCTTACTCTGTGCAGTATATTT ATGACACATGACACTGCCTGAStep #1: Using the tools available in this quiz section, identify the exon(s) in the sequence.
1 mtarglalgl lllllcpaqv fsqscvwyge cgiaygdkry nceysgppkp lpkdgydlvq 61 elcpgfffgn vslccdvrql qtlkdnlqlp lqflsrcpsc fynllnlfce ltcsprqsqf 121 lnvtatedyv dpvtnqtktn vkelqyyvgq sfanamynac rdveapssnd kalgllcgkd 181 adacnatnwi eymfnkdngq apftitpvfs dfpvhgmepm nnatkgcdes vdevtapcsc 241 qdcsivcgpk pqpppppapw tilgldamyv imwitymafl lvffgaffav wcyrkryfvs 301 eytpidsnia fsvnasdkge asccdpvsaa fegclrrlft rwgsfcvrnp gcviffslvf 361 itacssglvf vrvttnpvdl wsapssqarl ekeyfdqhfg pffrteqlii rapltdkhiy 421 qpypsgadvp fgppldiqil hqvldlqiai enitasydne tvtlqdicla plspyntnct 481 ilsvlnyfqn shsvldhkkg ddffvyadyh thflycvrap aslndtsllh dpclgtfggp 541 vfpwlvlggy ddqnynnata lvitfpvnny yndteklqra qawekefinf vknyknpnlt 601 isftaersie delnresdsd vftvvisyai mflyislalg hikscrrllv dskvslgiag 661 ilivlssvac slgvfsyigl pltlivievi pflvlavgvd nifilvqayq rderlqgetl 721 dqqlgrvlge vapsmflssf setvafflga lsvmpavhtf slfaglavfi dfllqitcfv 781 sllgldikrq eknrldifcc vrgaedgtsv qasesclfrf fknsysplll kdwmrpivia 841 ifvgvlsfsi avlnkvdigl dqslsmpdds ymvdyfksis qylhagppvy fvleeghdyt 901 sskgqnmvcg gmgcnndslv qqifnaaqld nytrigfaps swiddyfdwv kpqssccrvd 961 nitdqfcnas vvdpacvrcr pltpegkqrp qggdfmrflp mflsdnpnpk cgkgghaays 1021 savnillghg trvgatyfmt yhtvlqtsad fidalkkarl iasnvtetmg ingsayrvfp 1081 ysvfyvfyeq yltiiddtif nlgvslgaif lvtmvllgce lwsavimcat iamvlvnmfg 1141 vmwlwgisln avslvnlvms cgisvefcsh itraftvsmk gsrveraeea lahmgssvfs 1201 gitltkfggi vvlafaksqi fqifyfrmyl amvllgathg liflpvllsy igpsvnkaks 1261 cateerykgt ererllnfQuestion #1: Find the closest homolog to NPC1 in C. elegans.