LOCUS E.coli 3592 bp RNA RNA 11-JAN-1980 DEFINITION Escherichia coli. ACCESSION No information KEYWORDS No information. SOURCE Escherichia coli. ORGANISM Escherichia coli. REFERENCE 1 AUTHORS No information JOURNAL E.coli:who:Published STANDARD No information COMMENTS Organism information Culture collection: ? Sequence information (bases 1 to 3592) Corresponding GenBank entry: J01695 Phylo:Eubacteria,Purple,gamma,3 BASE COUNT 762 a 639 c 912 g 591 t 688 others ORIGIN 1 |--|------ ---GGUUAAG CGACUAAGCG UACACGGUGG AUG|CCCUGG |CAGUCAG|A 61 G|GCGAUGAA GGA|CGUG(C UAAUC)U|GC GAUAAGCGUC GGU|AAGGU( GAUAUGA)AC 121 CGUUAUAACC GGCGAUU|UC -CGAAUGGG( GAAA-)CCC| AGUGUGU--- ----(UUCG- 181 )-------AC AC-ACU-AUC AUUAACU(GA -AUC-CA-UA )GGU-UAAUG AGGCGAAC|C 241 GGGGGAACU| GA(AACA)UC UAAGUACCCC GAGGAAAAGA AAUC(AA--C C)GAGA-U|U 301 CCCCCAGUAG C(GGCGA)GC GAACGGGGAG CAGCCCA--- ---------- ---------- 361 ---------- ---------- ---------- ---gagccug -----(---- -)-----aau 421 cagug--ugu guguuagug- GAAg-cguc- (uggaaa)-g gcgcgc|g(a ua)caGGG(U 481 GACAGC)CCC GUA-cacaaa aaugcacaug cugu---gag cucGAU---- GAGUAGGGCG 541 GGA(CACGUG GUA)UCCUGU CUGAAU-AUG GGGG(GACCA U--)CCUCCA AGGCUAAAUA 601 CUCCUGACUG ACCGAUAGUG AAC-CAGUAC C(GUGA)GGG AAAGGCGAAA AGAACCCC(G 661 GCGA)GGGGA GUGAAAAAGA ACCUGAAACC GUGUACGUAC A-AGCAGUGG GAGCACGC-- 721 ---------- ---------- ---------- --(uuag-)- ---------- ---------- 781 ---------- GCGUGUGACU GCGUACCU(U UUGUAU|A)A UGG|GUCAGC GACUUAUA-U 841 UCUGUAGCAA GGUUAACC-- (--GAAUA)- -GGGGAGCCG AAGG(GAAA) CCG------- 901 ---------- ---------- ---------- ---AGUCU(U AACU)GGGCG UuaaGUUGCA 961 GGGUAUAGAC CCGAAACCCG GUGAUCUAGC CAUGGG|CAG GUUGAAGGUU GGG(UAAC)A 1021 CUAACUGGAG GACCG-AA-C CGAC-UA-AU (GUUGAA-AA )AUUA-G|CG GAUGACUUGU 1081 GGCUGGGGGU (GAAAG)GCC AAUCAAACCG GG-AGAUAGC UGGUUCUCCC CGAAAGCUAU 1141 (UUAG)GUAG CGC|CUCGUG AAUU|CAU|- --CUCCGG|G GGUAG|AGC- ACUGUUUCGG 1201 CAA--GG|GG GUC(AUCCC) GACUUACCAA CCCG-AUGCA AACUGCGAAU A|CCGGAGAA 1261 UGU---UA|U CACGGG-AGA CACACGGC-G -GGU(GCUAA C)GUCC-GUC GUGAAGAGGG 1321 (AAACAA)CC CAGACCGCCA GCUAAGGU|C CCAAAGUCAU GGUUA-AGUG --------GG 1381 AAACGAUGUG GGAAGGCCCA GACAGCCAGG AUGUUGGC(U UAGAAGCA)G CCAUCAU|U( 1441 UAA)A|GAAA GC(GUAAUA) GC|UCACUGG UCGAGUCGGC CUGC|GCGGA AGAUGUAACG 1501 GGGCU-AAAC CAUG|C|ACC GAAGCU|GCG G|cagcgacg c-(uuau--) -gcguugu-- 1561 ugGGUAGGGG AG|CGUUCUG UAAGCCUGCG AAGGUGUG-C U(GUGA)GGC AUGCUGG|AG 1621 GUAUCAGAAG UGCGAAUGCU GAC|AUAAGU AACG--AUAA AGCGGG(UGA AAA-G)CCC- 1681 GCU|CGCCGG AA-GACCAAG GGUUCCUGUC CAACG-(UUA AU)CG|GGGC AGGGUGAGU| 1741 CGA|CCCCUA AGGCGAG-GC C(-GAAA)GG C---GUAGUC GAU--GGGAA ACAGG(UUAA 1801 UAUU)CCUGU |AC|uuggug uu-acu---- ---------- --------gc gaag-ggggg 1861 acggagaagg cuauguug|g ccggg(---- cga--cgguu g----u)ccc ggu|uuaagc 1921 gug--uaggc ugguuuucca gg(caaau)c c|ggaaaauc aaggcuga-- ggcgugauga 1981 |cga|ggcac (--uacg)gu gcugaagcaa caaaugcccu g|cuucc-ag gaaaagccuc 2041 uaagcauca- ggu-aacauc a--a|A|UCG UACCCCAAAC C(GACACA)G G|UGGUCAGG 2101 UA(-GAGAA- )UACCAAGGC GC||U-UGAG -AG-AACUCG GGUGAAGGAA CUAGGCAAAA 2161 UGGUGCCGUA AC(UUCG)GG AGAAGGCAC| GCUGAUaugu aggugagguc c(--cucgc) 2221 ggauggagcu gaaAUCAGU| C(GAA)GAUA CCAGCUGGCU G|CAACUGU( UUAUUAAAA) 2281 ACACAGCACU GUG|CAAACA C(GAA-A)GU GGACGUAUAC GGUGUGACGC CUGCCCGGU| 2341 GCCGGAAGGU UAAUUGAUGG GGUUAGC(GC AA)GCGAAGC UCUUGAUCGA AGCC|CCGGU 2401 AAAC|GGC|G GCCGU(AACU AUA)ACGGUC |C(UAA)GGU AGCGAAAUUC CUU|GUCGGG 2461 (UA-----AG U)UCCGAC|C UGCACGAAUG GCGUAAUGAU GGCCAGGCUG UCUCCACCCG 2521 AG-ACUCA|G UGAAAUU|-G AACU-CGCUG (-UGAAGA-U G)CAGUGUAC CCGCGGCAAG 2581 ACGGAAAGAC CCCGUGA|AC CUUUACUAU| AGCUUGACAC U|GAACAUUG AGCCUUGAUG 2641 UGUAGGAUAG GUGGGAGGCU UUG---AAGU GUGGAC(GCC A-)GUCUGCA U-GGAGCCGA 2701 CCUUGAAAUA CCACC|CUUU AAUGUUUGAU GUUCUAACGU UGACCCG(UA AUC-)CGGGU 2761 UGCGGACAGU GUCUGGUGGG UAGUUUGACU G(GGG)CGGU -C|UCCUCC( UAAAGAGUAA 2821 C)GGAGGA|G |C|ACGAAGG UUGGC|UAAU CCUGG(UCGG ACA)UCAGGA GGUUA|GU(G 2881 |C-AAUG)GC AUAAGCCAGC UUGACUGCGA GCG(---UGA CGG-)CGCGA GCAGGUGC(G 2941 AAA)GCAGGU |CAUAGUGAU CCGGUG-GUU CUG(---AAU GGA)AGG|GC CAUCGCUCAA 3001 C|GGAUAAAA GGUACUC|CG GG|GAUAACA G|GCUGAUAC CGCCCAAGA( GUUCAUA)UC 3061 GACGGCGGUG UUUGGCACCU CGAUGUC|GG CU|C|AUCAC |AUCC|UGGG GCU(GAAGUA 3121 )GGUCCCAAG GGU|AUGGC( UGUUC)GCCA UUUAAAGUGG UACGCGAGCU GGGUUUAGAA 3181 C|GUC(GUGA )GACAGUUCG GUCCCUAUCU |GCCGUGGG- |CG-CUGGAG AACU|GA|GG 3241 GGG--GCUGC UCC(UAGUAC (GAGA)GGAC C)GGAGUGG| ACGCAUCACU GGUGUUCGGG 3301 U|UGUCAU(G CCA)AUGGCA -CUGCCCGGU A-GCUAAAUG CGGAAGAGAU AAGUGCU(GA 3361 AAGCAUCUA) AGCACGAAAC UUGCCCC|GA G-AUGAGUUC UCC|CU|GAC CCU------- 3421 -------(UU A--)------ --------AG GGUCCUGAAG G|AA|CGUUG (AAGACGA)C 3481 GACGUUGAUA G|GCCGG|GU GUGUAAGCGC A-------(G CGA--)---- ---UGCGU|U 3541 GAGCUA|A|C CGGUACUAAU GAACCGUGAG |GCUUAACCU U------||| || // LOCUS EURYARCHAE 3592 bp RNA RNA ~?~???? DEFINITION . ACCESSION No information KEYWORDS No information. SOURCE No information. ORGANISM No information. REFERENCE 1 AUTHORS No information JOURNAL No information TITLE No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: DIVIDER BASE COUNT 0 a 0 c 0 g 0 t 3592 others ORIGIN 1 x~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~-~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 61 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 121 ~~~~~~~~~~ ~~~~~~~~~~ -~~~~~~~~~ ~~~~-~~~~~ ~~~~~~~~-- ----~~~~~- 181 ~~------~~ ~~-~~~~~~~ ~~~~~~~~~~ ~~~~-~~~~| ~~~~~~~~~~ ~~~~~~~~~~ 241 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~|~ ~~~~~~~--~ ~~~~~~-~~~ 301 ~~~~~~~~~~ ~~~~~~~~~~ -~~~~~~~~~ ~~~~~~~~~| ~(~~~-)~|~ ~~~~~~~~~~ 361 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~|~~~ ~~~~~~~~~~ -----(---- -)-----~~~ 421 ~~~~~--~~~ ~~~~~~~~~- ~~~~~~~~~- ~~~~~~~~-~ ~|~~~~~~~~ ~~~~~~~~~~ 481 ~~~~~-~~~~ ~~~-~~~~~~ ~~~~~~~~~~ ~~~~---~~~ ~~~~~~---- ~|~~~~~~~~ 541 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~|-~~~ ~~~~~~~~~~ ~--~~~~~~~ ~~~~~~~~~~ 601 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~-~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 661 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~|~~ ~-~~~~~~~~ ~~~~~~~~~~ 721 ~~~~~~~~~~ ~~-------- ---------- --~~~~~~~~ ~~~~~~~~~~ ~~~------- 781 ---------- ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~|~)~ ~~~~~~~~~~ ~~~~~~~~-~ 841 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~|~--- 901 ---------- ---------- ---------- ---~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 961 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1021 ~~~~~~~~~~ ~~~~~-~~-~ ~~~~-~~-~~ ~~~~~~~-|~ (~~~~-)~~~ ~~~~~~~~~~ 1081 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~~ ~~-~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1141 ~~~~-~~~~| ~~~~~~~-~~ ~~~~~~~~~~ --~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1201 |~~~-~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~-~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1261 ~~~~~~~~~~ ~~|~~~-~~~ ~~~~~~~~~~ -~~~~~~~~~ ~~~~-~-~~~ ~|~~~~~~~~ 1321 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~-~~~~ ~|~|~|--~~ 1381 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~|~~(~ ~-~~~~~~~~ ~~~~~~~~~~ 1441 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ ~~~~~~~(~~ 1501 ~~~~~~~|~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~| ~~~~~~|~~~ ~~~~~~~~-- 1561 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~~ 1621 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ -~~~--~~|~ |~~~~~~~~~ ~~~~~~~~~- 1681 ~~~~~~~~~~ ~~-~~~~~~~ ~|||||~~~~ ~~~~~~(~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1741 ~~~~~~~~~~ ~~~~~~~-~~ ~~~~~~~~~~ ~---~~~~~~ ~~~--~~~~~ ~~~~~~~~~~ 1801 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~-~~~~~ 1861 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~---- ~~~--~~~~~ ~----~~~~~ ~~~~~~~~~~ 1921 ~~~--~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~-- ~~~~~~~~~~ 1981 ~~~~~~~~~~ ~~-~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~-~~ ~~~~~~~~~~ 2041 ~~~~~~~~~~ ~~~~~~~~~~ ~--~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2101 ~~~~~~~~~- ~~~~~~~~~~ ~~~~~-~~~~ -~~-~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2161 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~-~~~~~~ 2221 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2281 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~-~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2341 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2401 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2461 ~~~-----~~ ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2521 ~~-~~~~~~~ ~~~~~~~~-~ ~~~~-~~~~~ ~-~~~~~~-~ ~~~~~~~~~~ ~~~~~~~~~~ 2581 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2641 ~~~~~~~~~~ ~~~~~~~~~~ ~~~---~~~~ ~~~~~~~~~~ ~-~~~~~~~~ ~-~~~~~~~~ 2701 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~-~~~~~~ 2761 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ -~~~~~~~~~ ~~~~~~~~~~ 2821 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2881 ~~-~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~---~~~ ~~~-~~~~~~ ~~~~~~~~~~ 2941 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~-~~~ ~~~(---~~~ ~~~)~~~~~~ ~~~~~~~~~~ 3001 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3061 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3121 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3181 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~- ~~~~~~~~~~ ~~~~~~~~~~ 3241 ~~~~-~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3301 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~-~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3361 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~-~~~~~~~~ ~~~~~~~~~~ ~~~------- 3421 -------~~~ ~~~~------ --------~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3481 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~-----~~ ~~~~-~~~~~ ---~~~~~~~ 3541 ~~~~~~|~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~|| || // LOCUS M.jannas.A 3592 bp RNA RNA 27-AUG-1996 DEFINITION Methanococcus jannaschii.. ACCESSION No information KEYWORDS No information. SOURCE Methanococcus jannaschii. ORGANISM Methanococcus jannaschii. REFERENCE 1 AUTHORS Bult,C.J., White,O., Olsen,G.J., Zhou,L., Fleischmann,R.D., TITLE Complete genome sequence of the methanogenic archaeon, JOURNAL Science 273 (5278), 1058-1073 (1996) STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: U67472 LOCUS U67472 14222 bp DNA BCT 28-JAN-1998 DEFINITION Methanococcus jannaschii section 14 of 150 of the complete genome. ACCESSION U67472 L77117 NID g2826253 KEYWORDS . SOURCE Methanococcus jannaschii. ORGANISM Methanococcus jannaschii Archaea; Euryarchaeota; Methanococcales; Methanococcaceae; Methanococcus. REFERENCE 1 (bases 1 to 14222) AUTHORS Bult,C.J., White,O., Olsen,G.J., Zhou,L., Fleischmann,R.D., Sutton,G.G., Blake,J.A., FitzGerald,L.M., Clayton,R.A., Gocayne,J.D., Kerlavage,A.R., Dougherty,B.A., Tomb,J., Adams,M.D., Reich,C.I., Overbeek,R., Kirkness,E.F., Weinstock,K.G., Merrick,J.M., Glodek,A., Scott,J.D., Geoghagen,N.S., Weidman,J.F., Fuhrmann,J.L., Nguyen,D.T., Utterback,T., Kelley,J.M., Peterson,J.D., Sadow,P.W., Hanna,M.C., Cotton,M.D., Hurst,M.A., Roberts,K.M., Kaine,B.B., Borodovsky,M., Klenk,H.P., Fraser,C.M., Smith,H.O., Woese,C.R. and Venter,J.C. TITLE Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii JOURNAL Science 273 (5278), 1058-1073 (1996) MEDLINE 96337999 REFERENCE 2 (bases 1 to 14222) AUTHORS Bult,C.J., White,O., Olsen,G.J., Zhou,L., Fleischmann,R.D., Sutton,G.G., Blake,J.A., FitzGerald,L.M., Clayton,R.A., Gocayne,J.D., Kerlavage,A.R., Dougherty,B.A., Tomb,J., Adams,M.D., Reich,C.I., Overbeek,R., Kirkness,E.F., Weinstock,K.G., Merrick,J.M., Glodek,A., Scott,J.D., Geoghagen,N.S., Weidman,J.F., Fuhrmann,J.L., Nguyen,D.T., Utterback,T., Kelley,J.M., Peterson,J.D., Sadow,P.W., Hanna,M.C., Cotton,M.D., Hurst,M.A., Roberts,K.M., Kaine,B.B., Borodovsky,M., Klenk,H.P., Fraser,C.M., Smith,H.O., Woese,C.R. and Venter,J.C. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD 20850, USA COMMENT On Jan 30, 1998 this sequence version replaced gi:1590907. FEATURES Location/Qualifiers source 1..14222 /organism="Methanococcus jannaschii" /db_xref="taxon:2190" rRNA complement(11233..14122) /gene="MJrrnA23S" /product="Ribosomal RNA" gene complement(11233..14122) /gene="MJrrnA23S" BASE COUNT 4449 a 2720 c 3034 g 4019 t BASE COUNT 649 a 830 c 1080 g 451 t 582 others ORIGIN 1 |---ggauac ucugccgggc c--ACCAGCC CACCUGGUGG AUG|GCUCGG |CUCGGGG|C 61 G|CCGAGGAA GGG|CGUG(G CAAGC)U|GC GAUAAGCCCG GGG|GAGGC( GCAGGCA)GC 121 CGUG-GAACC CGGGAUC|CC -CGAAUGGG( ACUU-)CCU| gcccc----- ----(auuu- 181 )--------- gg-ggc-gcu ccc----(-- -guu--a--- )-----ggga gcggGAAC|G 241 CGGGGAAAA| GA(AGCA)UC CGAGUACCCG CAGGAAAAGA AACC(aa--c a)GGGA-U|G 301 CCGGGAGUAG G(GGCGA)CC GAAACCGGCA CAGGGCAaac cgaa--uccc uaccc(-gua 361 a)ggguaggg a--gaugugg aguugc-agg g-cccccaau -----(---- -)------ac 421 agacccccac ugggaagcc- GAAgucccc- (uggaau)-g gggcgc|c(a ua)gaGGG(U 481 GAAAGC)CCC GUA-ggcgua accaguuggg ggucu---ug gggugucccu GAGUACCGCG 541 CGu(---ugg aua)uCGCGC GGGAAG-CUG GGAG(acauu agg)CUUCCA ACCCUaaAua 601 CG-UCCCGAG ACCGAUAGCG AAC-UAGUAC C(GUGA)GGG AAAGCUGAAA AGCACCCC(U 661 -UGC)GGGGG GUG-AAAAGA GCCUGAAACC AGGUGGGUac g-gaauggca cggccc--ga 721 aagg-----u aaccaccccg aaggaaacuc cc(gcga-)g ggag---gag uacgag-ggg 781 uggcaugccg --gggucgug ccGUCCGU(U UCGAAA|A)A CGG|GCCGGG GAGUGUAC-G 841 GGUGUGGCGA GCCUAAgggg (--uucaa)c cccGGAGGCG UaGG(GAAA) CCgacaugcc 901 cgcaGccc-- -(uuau)--- gggugagggg cggGGUCU(u aau-)GGGCC CGGAGUCACA 961 CCCGUACGAC CCGAAACCGG GCGAUCUAGG CCGGGG|UAG GGUGAAGCCC CUC(GCCA)G 1021 AGGGGUGGAG GCCCg-ca-g gg-g-ug-uu (accgcgcaa )agugcu|cc ucuGACCCCG 1081 GUCUAGGGGU (GAAAA)GCC AAUCGAGCCC GG-AGAUAGC UGGUUCCCCC CGAAAUAACU 1141 (CGCA)GGUU AGC|CGGGGG UUA-|GGU|- aGAUGGCG|G GGUAG|AGCC ACGGAUAGGG 1201 UGuuuaG|GG GGC(gaga-) GCC-U-Cg-g CACCCUGUCA AACUCCGAAC C|CGUCAUCG 1261 CCgu---a|g CCCCCG-AGU GAGGGCAU-A -CGG(G-UAA G)CCGU-AUG UCCGAGAGGG 1321 (GAACAA)CC CGGACCCG-G GUUAAGGC|C CCUAAGUGCC GGCUA-AGUG uaaa---uga 1381 gaaGGGAGUC CCUGGCCUAA GACAGCGGGg AGGUUGGC(U UAGAAGCA)G CCAUCCU|U( 1441 UAA)A|GAGU GC(GUAACA) GC|UCACCCG UCGAGGUCAG GGGC|CCCGA AGAUAAC-GG 1501 GGGCU-AAGC CGGC|C|GCC GAGACC|CGG G|ggggcu-- --(gaaa--) ----agccau 1561 ccGGUAGGGG GG|CGUCCCG CG-GGGGUAG AAGCUCGG-C C(GUGA)GGU CGGGUGG|AC 1621 CCCGUGGGAA CGAGAAUCCC GGC|AGUAGU AACagca-aa GUGGGG(UGA GAA-U)CCC- 1681 CAC|CGCCGA AG-GGGCCAG GUUUCCACAG CAACG-(GUC GU)CA|GCUG UGGGUUAGC| 1741 CGG|-UCCUA ACCCCCG-GG G(UAAUU)CC C--ugGGGGG GAAA-GGGAA GCGGG(UUAA 1801 UAUU)CCCGC |GC-caccgg gg-uacg--- ugcg(-gcaa )cgcaa-ggc cagc-uccug 1861 acgcuucggg guaggccg|a ccacc(---- -cc-cgucgg ------)ggu ggc|-caagc 1921 gca-uaagcc CGGGGAGUG- CC(GUAAU)G G|CGAGAACC G--ggcaa-a agcgugaug- 1981 |ggc|ccucc (--guua)gg aggguu-cgg cugagcccug -|gagcccgu gaaaagggag 2041 cuggcaa--- ggauccccgg u--g|A|CCG UACCCAGAAC C(GACACA)G G|UGCCCCUA 2101 GG(cgaguau )CCUAA-GGC GU||GUCGGG -AG-AAUCCG GCCCAGGGAA GUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g gucuucuc-- -(--uaagu) 2221 --gaggggac c--GCAG-U| C(GCA)GUGG CCAGGGGGGU C|CGACUGU( UUAAUAAAA) 2281 ACACAGGUCU UGG|CUAGCC C(GUA-A)GG GUGUGUACCA AGGCCGACGC CUGCCCAGU| 2341 GCCGGUACGU GAA----acc cggg---(ua ca)----acc ggg----cGA AGCG|CCGGU 2401 AAAC|GGC|G GGGGU(AACU AUA)ACCCUC |U(UAA)GGU AGCGAAAUUC CUU|GUCGGG 2461 (UA-----AG U)UCCGAC|C UGCAUGAAUG GCGUAACGAG ACCCCCACUG UCCCGGGCCG 2521 GA-ACCCG|G UGAACCU|-A CCAU-UCCGG (-UGCAAA-G G)CCGGAGAC CCCCAGUGGG 2581 AAGCGAAGAC CCCGUGG|AG CUUUACUGC| AGCCUGUCGU U|GGGGCAUG GCCGUGGGUG 2641 CACAGCGUAG GUGGGAGCCG UCG---AAGC CACCCC(UCC G-)GGGGUGG U-GGAGGCGC 2701 CCAUGGGACA CCACC|CACC CAUGGCCAUG UCCCuaac-- ----ccc(gu aaa-)ggg-- 2761 ---ggacACC GGCAGGUGGG CAGUUUGGCU G(GGG)CGGC AC|CCCCCU( GAAAAGGCAU 2821 C)AGGGGG|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AAC)UCCGCC GUGGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACUUGGU CGG(uaaaag agg-)CCGAC CAAGAGGC(G 2941 AAA)GCCGGG |CCUAGCGAA CCCCUGUGC- CUC(--accg aug)GGG|GC CAGGG-auaa 3001 c|aga--aAA GCUACCC|CG GG|GAUAACA G|AGUUGUCG CGGGCAAGA( GCCCAUA)UC 3061 GACCCCGCGG CUUGCUACAU CGAUGUC|GG UU|C|UUCCC |AUCC|UGGG CCU(GCAGCA 3121 )GGGCCCAAG GGU|GGGGC( UGUUC)GCCC AUUAAAGGGG AUCGUGAGCU GGGUUUAAAC 3181 C|GUC(GUGA )GACAGGUUG GUUGCUAUCU |GCUGGGGG- |UG-UUGGCC GCCU|GA|GG 3241 GGAA-GGUGG CUC(UAGUAC (GAGA)GGAA C)GAGCCGC| CGGCGCCUCU GGUCUACCGG 3301 U|UGUCC-(G ACA)-GGGCA -uuGCCGGGC A-GCUACGCG CUA-AGGGAU AAGGGCU(GA 3361 AGGCAUCUA) AGCCCGAAAC CCUCCCC|GA A-AAUAGGCG GCC|ag|ucc ---------- 3421 -------(uu cg-)------ ---------- ggga--cgag g|gc|UCUCC (UAUAAGA)G 3481 GAGGUUGAUA G|GCCGG|GG GUGUAAGCGC Cgagggc-(u uu---)-gcc cgaGGCGU|U 3541 CAGCCC|G|C CGGUACUAau cgccca--ag |ggcccggca gggu---||| || // LOCUS M.jannas.B 3592 bp RNA RNA 24-JAN-1996 DEFINITION Methanococcus jannaschii. ACCESSION No information KEYWORDS No information. SOURCE Methanococcus jannaschii. ORGANISM Methanococcus jannaschii. REFERENCE 1 AUTHORS Gocayne,J.D., Kerlavage,A.R., Dougherty,B.A., Tomb,J.-F., Aams ... JOURNAL Science STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: U67517 LOCUS U67517 12730 bp DNA BCT 28-JAN-1998 DEFINITION Methanococcus jannaschii section 59 of 150 of the complete genome. ACCESSION U67517 L77117 NID g2826310 KEYWORDS . SOURCE Methanococcus jannaschii. ORGANISM Methanococcus jannaschii Archaea; Euryarchaeota; Methanococcales; Methanococcaceae; Methanococcus. REFERENCE 1 (bases 1 to 12730) AUTHORS Bult,C.J., White,O., Olsen,G.J., Zhou,L., Fleischmann,R.D., Sutton,G.G., Blake,J.A., FitzGerald,L.M., Clayton,R.A., Gocayne,J.D., Kerlavage,A.R., Dougherty,B.A., Tomb,J., Adams,M.D., Reich,C.I., Overbeek,R., Kirkness,E.F., Weinstock,K.G., Merrick,J.M., Glodek,A., Scott,J.D., Geoghagen,N.S., Weidman,J.F., Fuhrmann,J.L., Nguyen,D.T., Utterback,T., Kelley,J.M., Peterson,J.D., Sadow,P.W., Hanna,M.C., Cotton,M.D., Hurst,M.A., Roberts,K.M., Kaine,B.B., Borodovsky,M., Klenk,H.P., Fraser,C.M., Smith,H.O., Woese,C.R. and Venter,J.C. TITLE Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii JOURNAL Science 273 (5278), 1058-1073 (1996) MEDLINE 96337999 REFERENCE 2 (bases 1 to 12730) AUTHORS Bult,C.J., White,O., Olsen,G.J., Zhou,L., Fleischmann,R.D., Sutton,G.G., Blake,J.A., FitzGerald,L.M., Clayton,R.A., Gocayne,J.D., Kerlavage,A.R., Dougherty,B.A., Tomb,J., Adams,M.D., Reich,C.I., Overbeek,R., Kirkness,E.F., Weinstock,K.G., Merrick,J.M., Glodek,A., Scott,J.D., Geoghagen,N.S., Weidman,J.F., Fuhrmann,J.L., Nguyen,D.T., Utterback,T., Kelley,J.M., Peterson,J.D., Sadow,P.W., Hanna,M.C., Cotton,M.D., Hurst,M.A., Roberts,K.M., Kaine,B.B., Borodovsky,M., Klenk,H.P., Fraser,C.M., Smith,H.O., Woese,C.R. and Venter,J.C. TITLE Direct Submission JOURNAL Submitted (27-AUG-1996) The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD 20850, USA COMMENT On Jan 30, 1998 this sequence version replaced gi:1591419. FEATURES Location/Qualifiers source 1..12730 /organism="Methanococcus jannaschii" /db_xref="taxon:2190" rRNA 9705..12653 /gene="MJrrnB23S" /product="Ribosomal RNA" gene 9705..12653 /gene="MJrrnB23S" BASE COUNT 3786 a 2392 c 3166 g 3386 t BASE COUNT 650 a 830 c 1080 g 452 t 580 others ORIGIN 1 |---ggauac ucugccgggc c--ACCAGCC CACCUGGUGG AUG|GCUCGG |CUCGGGG|C 61 G|CCGAGGAA GGG|CGUG(G CAAGC)U|GC GAUAAGCCCG GGG|GAGGC( GCAGGCA)GC 121 CGUG-GAACC CGGGAUC|CC -CGAAUGGG( ACUU-)CCU| gcccc----- ----(auuu- 181 )--------- gg-ggc-gcu ccc----(-- -guu--a--- )-----ggga gcggGAAC|G 241 CGGGGAAAA| GA(AGCA)UC CGAGUACCCG CAGGAAAAGA AACC(aa--c a)GGGA-U|G 301 CCGGGAGUAG G(GGCGA)CC GAAACCGGCA CAGGGCAaac cgaa--uccc uaucc(-gua 361 a)ggguaggg a--gaugugg aguugc-agg g-cccccaau -----(---- -)------au 421 agacccccac ugggaagcc- GAAgucccc- (uggaau)-g gggcgc|c(a ua)gaGGG(U 481 GAAAGC)CCC GUA-ggcgua accaguuggg ggucu----g gggugucccu GAGUACCGCG 541 CGu(---ugg aua)uCGCGC GGGAAG-CUG GGAG(acauu agg)CUUCCA ACCCUaaAua 601 CG-UCCCGAG ACCGAUAGCG AAC-UAGUAC C(GUGA)GGG AAAGCUGAAA AGCACCCC(U 661 -UGC)GGGGG GUG-AAAAGA GCCUGAAACC AGGUGGGUac g-gaauggca cggccc--ga 721 aagg-----u aaccaccccg aaggaaacuc cc(gcga-)g ggag---gag uacgag-ggg 781 uggcaugccg --gggucgug ccGUCCGU(U UCGAAA|A)A CGG|GCCGGG GAGUGUAC-G 841 GGUGUGGCGA GCCUAAgggg (--uucaa)c cccGGAGGCG UaGG(GAAA) CCgacaugcc 901 cgcaaccc-- -(uuau)--- gggugagggg cggGGUCU(u aau-)GGGCC CGGAGUCACA 961 CCCGUACGAC CCGAAACCGG GCGAUCUAGG CCGGGG|UAG GGUGAAGCCC CUC(GCCA)G 1021 AGGGGUGGAG GCCCg-ca-g gg-g-ug-uu (accgcgcaa )agugcu|cc ucuGACCCCG 1081 GUCUAGGGGU (GAAAA)GCC AAUCGAGCCC GG-AGAUAGC UGGUUCCCCC CGAAAUAACU 1141 (CGCA)GGUU AGC|CGGGGG UUA-|GGU|- aGAUGGCG|G GGUAG|AGCC ACGGAUAGGG 1201 UGuuuaG|GG GGC(gaga-) GCCCU-Cg-g CACCCUGUCA AACUCCGAAC C|CGUCAUCG 1261 CCgu---a|g CCCCCG-AGU GAGGGCAU-A -CGG(G-UAA G)CCGU-AUG UCCGAGAGGG 1321 (GAACAA)CC CGGACCCG-G GUUAAGGC|C CCUAAGUGCC GGCUA-AGUG uaaa---uga 1381 gaaGGGAGUC CCUGGCCUAA GACAGCGGGg AGGUUGGC(U UAGAAGCA)G CCAUCCU|U( 1441 UAA)A|GAGU GC(GUAACA) GC|UCACCCG UCGAGGUCAG GGGC|CCCGA AGAUAAC-GG 1501 GGGCU-AAGC CGGC|C|GCC GAGACC|CGG G|ggggcu-- --(gaaa--) ----agccau 1561 ccGGUAGGGG GG|CGUCCCG CG-GGGGUAG AAGCUCGG-C C(GUGA)GGU CGGGUGG|AC 1621 CCCGUGGGAA CGAGAAUCCC GGC|AGUAGU AACagca-aa GUGGGG(UGA GAA-U)CCC- 1681 CAC|CGCCGA AG-GGGCCAG GUUUCCACAG CAACG-(GUC GU)CA|GCUG UGGGUUAGC| 1741 CGG|-UCCUA ACCCCCG-GG G(UAAUU)CC C--ugGGGGG GAAA-GGGAA GCGGG(UUAA 1801 UAUU)CCCGC |GC-caccgg gg-uacg--- ugcg(-gcaa )cgcaa-ggc cagc-uccug 1861 acgcuucggg guaggccg|a ccacc(---- -cc-cgucgg ------)ggu ggc|-caagc 1921 gca-uaagcc CGGGGAGUG- CC(GUAAU)G G|CGAGAACC G--ggcaa-a agcgugaug- 1981 |ggc|ccucc (--guua)gg aggguu-cgg cugagcccug -|gagcccgu gaaaagggag 2041 cuggcaa--- ggauccccgg u--g|A|CCG UACCCAGAAC C(GACACA)G G|UGCCCCUA 2101 GG(cgaguau )CCUAA-GGC GU||GUCGGG -AG-AAUCCG GCCCAGGGAA GUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g gucuucuc-- -(--uaagu) 2221 --gaggggac c--GCAGGU| C(GCA)GUGG CCAGGGGGGU C|CGACUGU( UUAAUAAAA) 2281 ACACAGGUCU UGG|CUAGCC C(GUA-A)GG GUGUGUACCA AGGCCGACGC CUGCCCAGU| 2341 GCCGGUACGU GAA----acc cggg---(ua ca)----acc ggg----cGA AGCG|CCGGU 2401 AAAC|GGC|G GGGGU(AACU AUA)ACCCUC |U(UAA)GGU AGCGAAAUUC CUU|GUCGGG 2461 (UA-----AG U)UCCGAC|C UGCAUGAAUG GCGUAACGAG ACCCCCACUG UCCCGGGCCG 2521 GA-ACCCG|G UGAACCU|-A CCAU-UCCGG (-UGCAAA-G G)CCGGAGAC CCCCAGUGGG 2581 AAGCGAAGAC CCCGUGG|AG CUUUACUGC| AGCCUGUCGU U|GGGGCAUG GCCGUGGGUG 2641 CACAGCGUAG GUGGGAGCCG UCG---AAGC CACCCC(UCC G-)GGGGUGG U-GGAGGCGC 2701 CCAUGGGACA CCACC|CACC CAUGGCCAUG UCCCuaac-- ----ccc(gu aaa-)ggg-- 2761 ---ggacACC GGCAGGUGGG CAGUUUGGCU G(GGG)CGGC AC|CCCCCU( GAAAAGGCAU 2821 C)AGGGGG|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AAC)UCCGCC GUGGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACUUGGU CGG(uaaaag agg-)CCGAC CAAGAGGC(G 2941 AAA)GCCGGG |CCUAGCGAA CCCCUGUGC- CUC(--accg aug)GGG|GC CAGGG-auaa 3001 c|aga--aAA GCUACCC|CG GG|GAUAACA G|AGUUGUCG CGGGCAAGA( GCCCAUA)UC 3061 GACCCCGCGG CUUGCUACAU CGAUGUC|GG UU|C|UUCCC |AUCC|UGGG CCU(GCAGCA 3121 )GGGCCCAAG GGU|GGGGC( UGUUC)GCCC AUUAAAGGGG AUCGUGAGCU GGGUUUAAAC 3181 C|GUC(GUGA )GACAGGUUG GUUGCUAUCU |GCUGGGGG- |UG-UUGGCC GCCU|GA|GG 3241 GGAA-GGUGG CUC(UAGUAC (GAGA)GGAA C)GAGCCGC| CGGCGCCUCU GGUCUACCGG 3301 U|UGUCC-(G ACA)-GGGCA -uuGCCGGGC A-GCUACGCG CUA-AGGGAU AAGGGCU(GA 3361 AGGCAUCUA) AGCCCGAAAC CCUCCCC|GA A-AAUAGGCG GCC|ag|ucc c--------- 3421 -------(uu cg-)------ ---------- ggga--cgag g|gc|UCUCC (UAUAAGA)G 3481 GAGGUUGAUA G|GCCGG|GG GUGUAAGCGC Cgagggc-(u uu---)-gcc cgaGGCGU|U 3541 CAGCCC|G|C CGGUACUAau cgccca--ag |ggcccggca gggu--~||| || // LOCUS M.vannieli 3592 bp RNA RNA 03-JAN-1985 DEFINITION Methanococcus vannielli. ACCESSION No information KEYWORDS No information. SOURCE Methanococcus vannielli. ORGANISM Methanococcus vannielli. REFERENCE 1 AUTHORS No information JOURNAL M.vannieli:who:published STANDARD No information COMMENTS Organism information Culture collection: ? Sequence information (bases 1 to 3592) Corresponding GenBank entry: X02729 Phylo:Archaebacteria,Methanogens and relatives,Methanococcales BASE COUNT 744 a 682 c 910 g 622 t 634 others ORIGIN 1 |--------- ------uauc u--AUUACCC UACCUGGGGA AUG|GCUUGG |CUUGAAa|c 61 g|cCGAUGAA GGA|CGUG(G UAAGC)U|GC GAUAAGCCUA GGC|GAGGC( GCAUACA)GC 121 CUUU-GAACC UAGGAUU|UC -CGAAUGGG( ACUU-)CCU| ac-------- ----(uuuu- 181 )--------- ----gu-aau cc-----(-- -gua--a--- )------gga uuggUAAC|G 241 CGGGGGAUU| GA(AGCA)UC UUAGUACCCG CAGGAAAAGA AAUC(AA--C U)GAGA-U|U 301 CCGUUAGUAG A(GGCGA)UU GAACACGGAU CAGGGCAaac ugaa------ -uccc(-uuc 361 g)ggga---- ---gaugugg uguuau-agg g-ccuucuuu -----(---- -)------uc 421 gccug--uug agaaaagcu- GAAguugac- (uggaac)-g ucacac|u(a ua)gaGGG(U 481 GAAAGU)CCC GUA-agcgca aucgauucag guu-----ug aagugucccu GAGUACCGUG 541 CGu(---ugg aua)uCGCGC GGGAAU-UUG GGAG(Gcauc aA-)CUUCCA ACUCUAAAUA 601 cg-uUUCAAG ACCGAUAGCG UAC-UAGUAC C(GCGA)GGG AAAGCUGAAA AGCACCCU(u 661 -aac)AGGGU GUG-AAAAGA GCCUGAAACC AGGUAGGuau g-gaauggcg uggccc--ca 721 aagg-----c aacuguucug aaggaaaccg uc(gcaa-)g gcgg---cug uacgaa-gaa 781 caga--gcca --ggguugcg ucCUCCGU(U UCGAAA|A)A CGG|GCCGGG GAGUGUAU-U 841 GUUGUGGCGA GCUUAAgauc (--uucac)g aucGAAGGCG UAGG(GAAA) CCAacaaguc 901 cgcaga---- -(auc-)--- --uuuaggga cggGGUCU(u aa--)GGGCC CGGAGUCACA 961 GCAAUACGAC CCGAAACCGG GCGAUCUAGG CCGGGG|CAA GGUGAAGUCC CUC(AACU)G 1021 AGGGAUGGAG GCCUG-ca-g ag-u-ug-uu (gccguucga )agcacu|cu ucuGACCUCG 1081 GUCUAGGGGU (GAAAG)GCC AAUCGAGCCC GG-AGAUAGC UGGUUCCCCU CGAAGUGACU 1141 (CUCA)GGUC AGC|CAGAGU Uca-|ggu|- aguCGGCA|G GGUAG|AGC- ACUGAUAAGA 1201 ugguuaG|GG GAA(gaaa-) UUCCU-Cg-C UGUUUUGUCA AACUCCGAAC C|UGUCGUCG 1261 CCgu---a|g GCUCUG-AGU GAGGGCAU-A -CGG(G-UAA G)CUGU-AUG UCCGAGACGG 1321 (GAAUAG)CC GAGACUUG-G GUUAAGGC|C CCUAAAUGCC GAUUA-AGUG ugaa---cac 1381 gaaGGGCGUC CUUGGUCUAA GACAGCAGGG AGGUUGGC(U UAGAAGCA)G CCACCCU|U( 1441 UAA)A|GAGU GC(GUAACA) GC|UCACCUG UCGAGAUCAA GGGC|CCCGA AAAUGGA-CG 1501 GGGCU-AAAU CGGC|U|GCC GAGACC|CAA G|ggcacc-- --(gcaa--) ----ggugau 1561 ccCGUAGGGG GG|CGUUCUG CG-AGGGCAG AAGUUCGG-C U(GUGA)AGU CGAGUGG|AC 1621 CUCGUAGAAA UGAAGAUCCC GGU|AGUAGU AACagcauaa GUGGGG(UGA GAA-U)CCC- 1681 CAC|CGCCGA AG-GGGCAAG GGUUCCACAG CAAUG-(UUU GU)CA|GCUG UGGGUAAGC| 1741 CGG|-uCCUA ACUCUCG-AG G(UAACU)CC U--uuGAGAG GAAA-GGGAA ACAGG(UUAA 1801 UAUU)CCUGU |GC-caucua ga-uacg--- cgug(-gcaa )cacaa-ggu uagu-uucca 1861 acgcuucugg guaggcug|a guguu(---- -cu-ugucug ------)gac auu|-caagc 1921 uua-uaaguc CGGGGAGAG- UU(GUAAU)A A|CGAGAACC G--gauga-a agagugaug- 1981 |agc|ucucc (--guua)gg agaguu-cgg ccgaucucug -|gagcccgu gaaaagggaa 2041 cuagcaa--- ggauucuaga u--g|U|CCG UACCCAGAAC C(GACACU)G G|UGCCCCUA 2101 GG(ugaguau )CCUAA-GGC GU||AGCGGA -UG-AAUCUA GUCGAGGGAA GUCGGCAAAU 2161 UGGCUCCGUA AC(UUCG)GG AGAAGGAGU| GCCAGU---g aucuugu--- -(-uuaaau) 2221 ---augggau c--GCUGGU| C(GCA)GUGA CCAGGGAGGU C|CGACUGU( UUAAUACAA) 2281 ACAUAGGUCU UAG|CGAGCC U(GAA-A)AG GUGUGUACUA AGGCCGACGC CUGCCCAGU| 2341 GCUGGUACGU GAA----ccc cggu---(uc ca)----acc ggg----cGA AGCG|CCAGU 2401 AAAC|GGC|G GGGGU(AACU AUA)ACCCUC |U(UAA)GGU AGCGAAAUUC CUU|GUCGGG 2461 (CA-----AG U)UCCGAC|C UGCAUGAAUG GCGUAACGAG ACCUCCACUG UCCCCGACUA 2521 GA-AUCCG|G UGAACCU|-A CCAU-UCCGG (-CGCAAA-G G)CCGGAGAC UUCCAGUGGG 2581 AAGCGAAGAC CCCGUGG|AG CUUUACUGC| AGCCUGUCGU U|GGGGCAUG GUUGUGAGUG 2641 UACAGUGUAG GUGGGAGCCA UCG---AAAC CUUUUC(GCC A-)GGAAAGG U-GGAGGCGA 2701 UCCUGGGACA CCACC|CUCU CAUGACCAUG UUCCucac-- ----ccu(-- uuu-)agg-- 2761 ---ggacACC GGUAGGUGGG CAGUUUGGCU G(GGG)CGGU AC|CCUCCU( AAAAAUGCAU 2821 C)AGGAGG|G |C|CCAAAGG UUGGC|UCAA GCGGG(UCAG GAC)UCCGCU GUUGA|GU(G 2881 |U-AAGG)GC AAAAGCCAGC CUGACUUUGU UGC(caacaa aac-)GCAAC GAAGAGGC(G 2941 AAA)GCCGGG |CCUAACGAA CCCCUGUGC- CUC(--acug aug)GGG|GC CAGGG-auga 3001 c|aaa--aAA GCUACCC|CG GG|GAUAACA G|AGUUGUCG CGGGCAAGA( GCCCAUA)UC 3061 GACCCCGCGG CUUGCUACCU CGAUGUC|GG UU|U|UUCCC |AUCC|UGGG UCU(GCAGCA 3121 )GGACCCAAG GGU|GGGGC( UGUUC)GCCC AUUAAAGGGG AUCAUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUUG GUUGCUAUCU |GCUGGAUG- |UG-UAGGCU GUCU|GA|GG 3241 GAAA-GGUGG CUC(UAGUAC (GAGA)GGAA C)GGGCCGU| CGGCGCCUCU AGUCGAUCGG 3301 U|UGUCu-(G ACA)-AGGCA -cuGCCGAGC A-GCCACGCG CCA-AGAGAU AAGAGCU(GA 3361 AAGCAUCUA) AGCUCGAAAU UCAUCCU|GA A-AAUAGACA GCC|gu|uuc c--------- 3421 -------(uu cg-)------ ---------- ggaa--cgag a|ac|UCCCG (UAGAAGA)C 3481 GGGUUUGAUA G|GCUAG|GG GUGUACGCAU Caagg---(u ucuu-)---c cgaGAUGU|U 3541 CAGCCC|G|C UAGUACUAac aguucgag-- |agaua---a -------||| || // LOCUS M.thermo.1 3592 bp RNA RNA 17-JAN-1987 DEFINITION Methanobacterium thermoautotrophicum. ACCESSION No information KEYWORDS No information. SOURCE Methanobacterium thermoautotrophicum. ORGANISM Methanobacterium thermoautotrophicum. REFERENCE 1 AUTHORS No information JOURNAL Fri Apr 17 09:53:22 1987 checked STANDARD No information COMMENTS Organism information Culture collection: ? Sequence information (bases 1 to 3592) Corresponding GenBank entry: X05482 Phylo:Archaebacteria,Methanogens and relatives,Methanobacteriales BASE COUNT 616 a 662 c 1051 g 690 t 573 others ORIGIN 1 |--------- ---------c uuuuuUAUGC CGUCUGGGGG AUG|GCUUGG |CUUGAGu|c 61 g|cUGAUGAA GGC|CGUG(G CAAGC)U|GC GAUAAGCCCA GGG|gAGGA( GCAUGCA)UC 121 CUUg-GAUCC UGGGAUU|GC -CGAAUGGG( ACUU-)CCC| agccaaccc- ----(uucg- 181 )-----gggu ugugcu-acu cccu---(-- -guu-au--- )----gggga ggggGAAC|C 241 CGCCGAACU| GA(AACA)UC UUAGUAGGCG GAGGAAGAGA AAGC(AA-au U)GCGAcU|G 301 CCGUGAGUAA U(GGCGA)AU GAAAGCGGUG CAGGACAaac ugaa-ccccu ucgca(-gug 361 a)uguguugg gg-gaugugg uguugu--cg aucggugcgu -----(augg g)-----ggu 421 gccgggugug uggu--guu- GAAcuugggc (uggaau)gc ccgggc|c(g ua)gaGGG(U 481 UAAAGC)CCC GUA-gau--- gcccaugcuu ggcucccugc acc-uuuccu GAGUAGCGUC 541 CAu(---ugg aua)uUGGGC GUGAAG-CUG GGAG(Gcauc gA-)CUCCUA AUCCUAAAca 601 cg-uCUCAAG UCCGAUAGCG AAC-UAGUAC C(GUGA)GGG AAAGCUGAAA AGUACCCC(u 661 gaua)GGGGU GUG-AAAAGU GCCUGAAACC AGGCGGUGac a-gcccggca cggcau--gg 721 aaggaaugug gcugccccug uaagaaacca ug(guaa-)c augg---gag uauguguggg 781 ugguugaaca --gugucgug ucGUCCGU(C UUGAAA|C)A CGG|GCCAGG GAGUUUAGUG 841 GUUGUGGCGA GGCUAAGAag (ugugucg)c uuuguAGUCG UAGG(GAAA) CCGacagguc 901 cgcagcagcc -(uuug)-ug cugugaggga cggGGUCU(u aaua)GGGCC UGGAGUCACA 961 GCUCUAAAAC CCGAAGCCGG UCGAUCUAGC CCUGGG|UAG GGUGAAGUCG CUC(UUAC)G 1021 AGUGAUGGAG GCCCG-ca-g gg-g-ug-uu (gucgugcga )aacauu|cc ucuAACCUGG 1081 GGUUAGUGGU (GAAAG)GCC AAUCAAGGCC GG-UGACAGC UGGUUCCACC CGAAAUGGCU 1141 (CGUA)GGCC AGC|CUGACU gga-|gau|- aggUGGCG|G GGUAG|AGC- ACUUAUUGGG 1201 UGuuuag|gg gga(gaga-) UCCCU-CG-G CAUCCUGUAA AACUCCGAAC U|cgucaccg 1261 ucgu-uga|a gguugg-AGU CAGGGGCG-C -GGG(G-UAA G)CCUG-UGU CCCGAGAGAG 1321 (GAACAA)CU CAGACUGG-G GUUAAGGU|C CCUAAAUGCC GGCUA-AGUC u--------- 1381 -aaGGGGGUC UUUGGCCCUA GACAAUGGGA AGGUGGGC(U UAGAAGCA)G CCAUCCU|U( 1441 UAA)A|GAGU UC(GUAACA) GA|UCACCCA UCGAGGUCAA AGGC|ACCGA AAAUGGA-GG 1501 GGAAUUAAGC CGGC|U|ACC GAUACC|UCA G|agcaccac --(uggu--) --gugguggu 1561 cuUGUAGGGU GG|CGUCCGG UU-GGGGUUG AAGUGGGG-G C(GUGA)GCU CCUGUGG|AC 1621 CCGGCUGGAA UGAGGAUCCU GGU|AGUAGU AGCagcg-aa GUGAGG(UGU GAA-U)CCU- 1681 UAC|CGCCGG AG-GGGCUAG GGUUCCUUGG CAAUG-(UUC GU)CA|GCCA AGGGUUAGU| 1741 CGG|-uccUA Aggccgu-GG G(UAAUG)UC Cauuuugguc GAAA-GGGUA ACGGG(UUAA 1801 UAUU)CCUGU |AC-ggucca gg-uacu--- ugcg(-guga )cgcuggguu gggc-uucug 1861 acgcuuuggg guaggcug|a gcggg(---- -au-uuucgu ------)ccu guu|-uaagg 1921 guu-gaagcc UGGGGAGAG- CC(GUAAU)G G|CGAGAACC A-ugguga-a ggccugaau- 1981 |agc|caucc (cuugu-)gg gugguu-ugg cugugcccug -|gaguccuu gaaaagggag 2041 uccuucuug- ggauccugga u--c|G|CCG UACCGAGAUC C(GACACU)G G|UGCCCCUA 2101 GC(ugaguag )GCUAA-GGU GU||GUUGGG -GU-AACCUG GCUAAGGGAA AUCGGCAAAU 2161 UGGCCCCGUA AC(UUUG)GG AGAAGGGGU| GCCAGC---c au-------- -(---gcgg) 2221 --------au g--GCUGGU| C(GCA)GUGA CAGGGGGGGC C|CGACUGU( UUAAUAAAA) 2281 ACAUAGCUCC UAG|CUAGCC C(GUG-A)GG GUGUGUACUG GGGGCGACAC CUGCCCAGU| 2341 GCCGGCACGU GAA----gcc cugg---(uu ca)----acg ggg----uGA AGCG|CCGGU 2401 AAAC|GGC|G GGGGU(AACU AUA)ACCCUC |U(UAA)GGU AGCGAAAUGC CUU|GCCGGA 2461 (UA-----AG U)ACCGGC|C UGCAUGAAUG GUUGAACGAG GUCCCUACUG UCCCUAGCCA 2521 GG-ACCUA|G UGAAGCU|-G CUGU-UCUGG (-UGCACA-A G)CCAGAGAC UCCCAGUGGG 2581 AAGCGAAGAC CCCGUAG|AG CUUUACUGC| AGUCUGCUGU U|GGGGCUUG GUCAUGGGUA 2641 UGCAGUGUAG GUGGGAGGCG UCG--augCC AUGGUC(GCC A-)GGCUGUG GUGGAGUCGG 2701 UCAUGAGACA CCACC|UUCC UGUGACUGUG UCUCuaac-- --cccau(-g uuu-)guggg 2761 ---ggacAUC GGUAGAUGGG CAGUUUGGCU G(GGG)CGGC AC|GCGCUU( GAAAUGGUAU 2821 C)AAGCGC|G |C|CCUAAGG UCGGC|UCAG GCGGG(ACAG AGA)UCCGCU GUAGA|GU(G 2881 |U-AAGG)GC AUAAGCCGGC UUGACUGUGC UCC(uacuag uag-)GGGGU GCAGGUGC(G 2941 AGA)GCAGGG |CCUAGCGAA CCCCAGAGU- CCU(-cgucg gug)GGG|GC CUGGG-auga 3001 c|aga--aAA GCUACCU|CG GG|GAUAACU G|GGUGGUCG CAGGCAAGA( GCCCAUA)UC 3061 GACCCUGCGG CUUGCUACUU CGAUGUC|GG UU|C|UUUCC |AUCC|UGGG UGU(GCAGCA 3121 )GCACCCAAG GGU|GGGGU( UGUUC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUUG GUUGCUAUCU |ACUGGGAG- |UGUGuGGUU GCCU|GA|GG 3241 GGAA-GGUGG UUC(CAGUAC (GAGA)GGAA C)GGACCGU| CGGCGCCUCU GGUUUACCGG 3301 U|UAUCC-(G AGU)-GGGUA -uuGCCGGGC G-GCUACGCG CUAUGAUUAU AAAGGCU(GA 3361 AGGCAUCUA) AGCCUGAggu uuuCCCU|GA A-AAUAGGUG GCU|u-|--- ---------- 3421 -------(-- ---)------ ---------- ------gugg a|cu|GCGGG (UAGAAGA)C 3481 CUGUUUGUUG G|GGCGG|GG GUGUGAGCUU Cgaggccu(g uuuu-)gggc cgaGUUGU|U 3541 UAGCCU|G|C CGUUUCCAag guuuuu---- |--------u gucccu-||| || // LOCUS M.thermo.2 3592 bp RNA RNA 13-NOV-1997 DEFINITION Methanobacterium thermoautotrophicum.; . ACCESSION AE0009 0 AE00 666 KEYWORDS . SOURCE Methanobacterium thermoautotrophicum. ORGANISM Methanobacterium thermoautotrophicum. REFERENCE 1 (bases 1 to 15553) AUTHORS Prabhakar,S., McDougall,S., Shimer,G., Goyal,A., Pietrovski,S., Church,G.M., Daniels,C.J., Mao,J.-i., Rice,P., Nolling,J. and Reeve,J.N. Smith,D.R., Doucette-Stamm,L.A., Deloughery,C., Lee,H.-M., Dubois,J., Aldredge,T., Bashirzadeh,R., Blakely,D., Cook,R., Gilbert,K., Harrison,D., Hoang,L., Keagle,P., Lumm,W., Pothier ,B., Qiu,D., Spadafora,R., Vicare,R., Wang,Y., Wierzbowski,J., Gibson,R., Jiwani,N., Caruso,A., Bush,D., Safer,H., Patwell,D., TITLE comparative genomics Complete genome sequence of Methanobacterium thermoautotrophicum delta H: functional analysis and JOURNAL J. Bacteriol. 179, 7135-7155 (1997) STANDARD No information REFERENCE 2 (bases 1 to 15553) AUTHORS Smith,D.R. TITLE Direct Submission JOURNAL Submitted (10-AUG-1997) Genomics and Technology Development, Genome Therapuetics Corporation, 100 Beaver Street, Waltham, MA 02154-8448, USA STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AE000930 BASE COUNT 625 a 672 c 1048 g 684 t 563 others ORIGIN 1 |--------- ---------c uuauuUAUGC CGUCUGGGGG AUG|GCUUGG |CUUGAGu|c 61 g|cUGAUGAA GGC|CGUG(G CAAGC)U|GC GAUAAGCCCA GGG|gAGGA( GCAUGCA)UC 121 CUUg-GAUCC UGGGAUU|GC -CGAAUGGG( ACUU-)CCU| ggccgccccu gcac(ucu-- 181 )gugcggggg cugguc-acu cccc---(-- -uucucu--- )----gggga ggggGAAC|C 241 CGCUGAACU| GA(AACA)UC UUAGUAGGCG GAGGAAGAGA AAGC(AA-ua U)GCGA-U|G 301 CCGUGAGUAA U(GGCGA)AU GAAAGCGGUG GAGGACAaac ugaa--ucca ucaug(-gug 361 a)cguggugg a--gaugugg uguug--cgg a-cccccuua -----(---g g)-----ggu 421 uccaggugug guugg-guu- GAAcuugggc (uggaau)gc ccgggc|c(g ua)gaGGG(U 481 UAAAGC)CCC GUA-gac-uu gcug-ugccu ggcccugcag ggguguuccu GAGUAGCGUC 541 CAu(---ugg aua)uUGGGC GUGAAG-CUG GGAG(Gcauc gA-)CUCCUA AUCCUAAAca 601 cg-uCUCAAG UCCGAUAGCG AAC-UAGUAC C(GUGA)GGG AAAGCUGAAA AGUACCCC(u 661 gaua)GGGGU GUG-AAAAGU GCCUGAAACC AGGCGGUGac a-gcccggca cggcau--gg 721 aaggaaugag gcuugcccug uaagaaacca ug(gcaa-)c augg---gag uaugug-ggc 781 uggcugauca --gugucgug ucAUCCGU(C UUGAAA|C)A CGG|GCCAGG GAGUUUAGUG 841 GUUGUGGCGA GACUAAGAag (ugugucg)c uuugaAGUCG UAGG(GAAA) CCGacagguc 901 cgcagcagca -(ucug)-ug cugugaggga cggGGUCU(u aaua)GGGCC UGGAGUCACA 961 GCCCUAAAAC CCGAAGCCGG UCGAUCUAGC CCUGGG|UAG GAUGAAGUCG CUC(UUAC)G 1021 AGUGAUGGAG GUCCG-ca-g gg-g-ug-uu (gucgugcga )aacauu|cc ucuAACCUGG 1081 GGUUAGUGGU (GAAAG)GCC AAUCAAGGCC GG-UGACAGC UGGUUCCACC CGAAAUGGCU 1141 (CGUA)GGCC AGC|CUGACU gga-|ggu|- uggUGGCG|G GGUAG|AGC- ACUUAUUGGG 1201 UGuuuag|gg gga(gaaa-) UCCCU-CG-G CAUCCUGUAA AACUCCGAAC U|cgucaccg 1261 cugu-uga|a gguugg-AGU CAGGGGCG-C -GGG(G-UAA G)CCUG-UGU CCCGAGAGAG 1321 (GAACAA)CU CAGACUGG-G GUUAAGGU|C CCUAAAUGCC GGCUA-AGUC u--------- 1381 -aaGGGGGUC UUUGGCCCUA GACAAUGGGA AGGUGGGC(U UAGAAGCA)G CCAUCCU|U( 1441 UAA)A|GAGU UC(GUAACA) GA|UCACCCA UCGAGGUCAA AGGC|ACCGA AAAUGGA-GG 1501 GGAAUUAAGC CGGC|U|ACC GAUACC|UCA G|agcaccac --(uuuugu) --gugguggu 1561 cuUGUAGGGU GG|CGucccG UU-GGGGUUG AAGUGGGG-G C(GUGA)GCU CCUGUGG|AC 1621 CCUGCGGGAA UGAGGAUCCU GGU|AGUAGU AGCagca-aa GUGAGG(UGA GAA-U)CCU- 1681 UAC|CGCCGG AG-GGGCUAG GGUUCCUUGG CAAUG-(UUC GU)CA|GCCA AGGGUUAGU| 1741 CGG|-uccUA Aggccau-GG G(UAAUG)UC Cau-gugguc GAAA-GGGUA ACAGG(UUAA 1801 UAUU)CCUGU |AC-ggucca gg-uacu--- ugcg(-guga )cgcuggguu gggc-uucug 1861 acgcuucggg guaggcug|a gcggg(---- -au-uuucgu ------)ccu guu|-uaagg 1921 guu-gaagcc UGGGGAGAG- CC(GUAAU)G G|CGAGAACU Auugguga-a ggccugaau- 1981 |agc|cacuc (cuuguu)gg gugguu-cgg cugugcccug -|gaguccuu gaaaagggag 2041 uccuucuug- ggauccugga u--c|G|CCG UACCGAGAUC C(GACACU)G G|UGCCCCUA 2101 GC(ugaguag )GCUAA-GGC GU||GUUGGG -GU-AACCUG GCUAAGGGAA AUCGGCAAAU 2161 UAGCCCCGUA AC(UUUG)GG AGAAGGGGU| GCCAGC---c au-------- -(---guag) 2221 --------au g--GCUGGU| C(GCA)GUGA CAGGGGGGGC C|CGACUGU( UUAAUAAAA) 2281 ACAUAGCUCC UAG|CUAGCC C(GUG-A)GG GUGUGUACUG GGGGCGACAC CUGCCCAGU| 2341 GCCGGCACGU GAA----gcc cugg---(uu ca)----acg ggg----uGA AGCG|CCGGU 2401 AAAC|GGC|G GGGGU(AACU AUA)ACCCUC |U(UAA)GGU AGCGAAAUGC CUU|GCCGGA 2461 (UA-----AG U)ACCGGC|C UGCAUGAAUG GUUGAACGAG GUCCCUACUG UCCCUAGCCA 2521 GG-ACCUG|G UGAAGCU|-G CUGU-UCUGG (-UGCACA-A G)CCAGAGAC UCCCAGUGGG 2581 AAGCGAAGAC CCCGUAG|AG CUUUACUGC| AGUCUGCUGU U|GGGGCUUG GUCAUGGGUA 2641 UGCAGUGUAG GUGGGAGGUG UCG--augCC AUGGUC(GCC A-)GGCCGUG GUGGAGCCGG 2701 UCAUGAGACA CCACC|UUCC UGUGACUGUG UCUCuaac-- --ccccc(au ugu-)ggggg 2761 ---ggacAUC GGUAGAUGGG CAGUUUGGCU G(GGG)CGGC AC|GCGCUU( GAAAUGGUAU 2821 C)AAGCGC|G |C|CCUAAGG UCGGC|UCAG GCGGG(ACAG AGA)UCCGCU GUAGA|GU(G 2881 |U-AAGG)GC AUAAGCCGGC UUGACUGUGC UCC(uacuag uag-)GGUGU GCAGGUGC(G 2941 AGA)GCAGGG |CCUAGCGAA CCCCAGAGU- CCU(-cgucg gug)GGG|GC CUGGG-auga 3001 c|aga--aAA GCUACCU|CG GG|GAUAACU G|GGUGGUCG CAGGCAAGA( GCCCAUA)UC 3061 GACCCUGCGG CUUGCUACUU CGAUGUC|GG UU|C|UUUCC |AUCC|UGGG UGU(GCAGCA 3121 )GCACCCAAG GGU|GGGGU( UGUUC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUUG GUUGCUAUCU |ACUGGGAG- |UGUGuGGUU GCCU|GA|GG 3241 GGAA-GGUGG UUC(CAGUAC (GAGA)GGAA C)GGACCGU| CGGCGCCUCU GGUUUACCGG 3301 U|UAUCC-(G AGU)-GGGUA -uuGCCGGGC G-GCUACGCG CUAUGAUUAU AAAGGCU(GA 3361 AGGCAUCUA) AGCCUGAggu uuuCCCU|GA A-AAUAGGCG GCU|u-|--- ---------- 3421 -------(-- ---)------ ---------- ------gugg a|cc|GCGGG (UAGAAGA)C 3481 CUGUUUGUUG G|GGCGG|GG GUGUGAGCUU Cgaggccu(g uuuuu)gggu cgaGUUGU|U 3541 UAGCCU|G|C CGUUUCCAag guuuuu---- |--------u gucccu-||| || // LOCUS M.thermo.3 3592 bp RNA RNA 13-NOV-1997 DEFINITION Methanobacterium thermoautotrophicum.; . ACCESSION AE0009 0 AE00 666 KEYWORDS . SOURCE Methanobacterium thermoautotrophicum. ORGANISM Methanobacterium thermoautotrophicum. REFERENCE 1 (bases 1 to 15288) AUTHORS Gibson,R., Jiwani,N., Caruso,A., Bush,D., Safer,H., Patwell,D., Prabhakar,S., McDougall,S., Shimer,G., Goyal,A., Pietrovski,S., Church,G.M., Daniels,C.J., Mao,J.-i., Rice,P., Nolling,J. and Reeve,J.N. Smith,D.R., Doucette-Stamm,L.A., Deloughery,C., Lee,H.-M., Dubois,J., Aldredge,T., Bashirzadeh,R., Blakely,D., Cook,R., Gilbert,K., Harrison,D., Hoang,L., Keagle,P., Lumm,W., Pothier ,B., Qiu,D., Spadafora,R., Vicare,R., Wang,Y., Wierzbowski,J., TITLE thermoautotrophicum delta H: functional analysis and comparative genomics Complete genome sequence of Methanobacterium JOURNAL J. Bacteriol. 179, 7135-7155 (1997) STANDARD No information REFERENCE 2 (bases 1 to 15288) AUTHORS Smith,D.R. TITLE Direct Submission JOURNAL Submitted (10-AUG-1997) Genomics and Technology Development, Genome Therapuetics Corporation, 100 Beaver Street, Waltham, MA 02154-8448, USA STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AE000940 BASE COUNT 625 a 669 c 1051 g 690 t 557 others ORIGIN 1 |--------- ---------c uuauuUAUGC CGUCUGGGGG AUG|GCUUGG |CUUGAGu|c 61 g|cUGAUGAA GGC|CGUG(G CAAGC)U|GC GAUAAGCCCA GGG|gAGGA( GCAUGCA)UC 121 CUUg-GAUCC UGGGAUU|GC -CGAAUGGG( ACUU-)CCU| ggccgucccu gcac(ucu-- 181 )gugcggggg cugguc-acu ccccu--(-- -uuucuu--- )---ggggga ggggGAAC|C 241 CGCUGAACU| GA(AACA)UC UUAGUAGGCG GAGGAAGAGA AAGC(AA-ua U)GCGA-U|G 301 CCGUGAGUAA U(GGCGA)AU GAAAGCGGUG GAGGACAaac ugaa--ucca ucaug(-gug 361 a)cguggugg a--gaugugg uguug--cgg a-cccccugc -----(uuag g)-----ggu 421 uccaggugug guugg-guu- GAAcuugggc (uggaau)gc ccgggc|c(g ua)gaGGG(U 481 UAAAGC)CCC GUA-gac-uu gcug-ugccu ggcccugcgg ggguguuccu GAGUAGCGUC 541 CAu(---ugg aua)uUGGGC GUGAAG-CUG GGAG(Gcauc gA-)CUCCUA AUCCUAAAca 601 cg-uCUCAAG UCCGAUAGCG AAC-UAGUAC C(GUGA)GGG AAAGCUGAAA AGUACCCC(u 661 gaua)GGGGU GUG-AAAAGU GCCUGAAACC AGGCGGUGac a-gcccggca cggcau--gg 721 aaggaaugag gcuugcccug uaagaaacca ug(gcaa-)c augg---gag uaugug-ggc 781 uggcugauca --gugucgug ucAUCCGU(C UUGAAA|C)A CGG|GCCAGG GAGUUUAGUG 841 GUUGUGGCGA GACUAAGAag (ugugucg)c uuugaAGUCG UAGG(GAAA) CCGacagguc 901 cgcagcagca -(ucug)-ug cugugaggga cggGGUCU(u aaua)GGGCC UGGAGUCACA 961 GCCCUAAAAC CCGAAGCCGG UCGAUCUAGC CCUGGG|UAG GAUGAAGUCG CUC(UUAC)G 1021 AGUGAUGGAG GUCCG-ca-g gg-g-ug-uu (gucgugcga )aacauu|cc ucuAACCUGG 1081 GGUUAGUGGU (GAAAG)GCC AAUCAAGGCC GG-UGACAGC UGGUUCCACC CGAAAUGGCU 1141 (CGUA)GGCC AGC|CUGACU gga-|ggu|- uggUGGCG|G GGUAG|AGC- ACUUAUUGGG 1201 UGuuuag|gg gga(gaaa-) UCCCU-CG-G CAUCCUGUAA AACUCCGAAC U|cgucaccg 1261 cugu-uga|a gguugg-AGU CAGGGGCG-C -GGG(G-UAA G)CCUG-UGU CCCGAGAGAG 1321 (GAACAA)CU CAGACUGG-G GUUAAGGU|C CCUAAAUGCC GGCUA-AGUC u--------- 1381 -aaGGGGGUC UUUGGCCCUA GACAAUGGGA AGGUGGGC(U UAGAAGCA)G CCAUCCU|U( 1441 UAA)A|GAGU UC(GUAACA) GA|UCACCCA UCGAGGUCAA AGGC|ACCGA AAAUGGA-GG 1501 GGAAUUAAGC CGGC|U|ACC GAUACC|UCA G|agcaccac --(uuuugu) --gugguggu 1561 cuUGUAGGGU GG|CGucccG UU-GGGGUUG AAGUGGGG-G C(GUGA)GCU CCUGUGG|AC 1621 CCUGCGGGAA UGAGGAUCCU GGU|AGUAGU AGCagca-aa GUGAGG(UGA GAA-U)CCU- 1681 UAC|CGCCGG AG-GGGCUAG GGUUCCUUGG CAAUG-(UUC GU)CA|GCCA AGGGUUAAU| 1741 CGG|-uccUA Aggccau-GG G(UAAUG)UC Cau-gugguc GAAA-GGGUA ACAGG(UUAA 1801 UAUU)CCUGU |AC-ggucca gg-uacu--- ugcg(-guga )cgcuggguu gggc-uucug 1861 acgcuucggg guaggcug|a gcggg(---- -au-uuucgu ------)ccu guu|-uaagg 1921 guu-gaagcc UGGGGAGAG- CC(GUAAU)G G|CGAGAACU Auugguga-a ggccugaau- 1981 |agc|cacuc (cuuguu)gg gugguu-cgg cugugcccug -|gaguccuu gaaaagggag 2041 uccuucuug- ggauccugga u--c|G|CCG UACCGAGAUC C(GACACU)G G|UGCCCCUA 2101 GC(ugaguag )GCUAA-GGC GU||GUUGGG -GU-AACCUG GCUAAGGGAA AUCGGCAAAU 2161 UAGCCCCGUA AC(UUUG)GG AGAAGGGGU| GCCAGC---c au-------- -(---guag) 2221 --------au g--GCUGGU| C(GCA)GUGA CAGGGGGGGC C|CGACUGU( UUAAUAAAA) 2281 ACAUAGCUCC UAG|CUAGCC C(GUG-A)GG GUGUGUACUG GGGGCGACAC CUGCCCAGU| 2341 GCCGGCACGU GAA----gcc cugg---(uu ca)----acg ggg----uGA AGCG|CCGGU 2401 AAAC|GGC|G GGGGU(AACU AUA)ACCCUC |U(UAA)GGU AGCGAAAUGC CUU|GCCGGA 2461 (UA-----AG U)ACCGGC|C UGCAUGAAUG GUUGAACGAG GUCCCUACUG UCCCUAGCCA 2521 GG-ACCUG|G UGAAGCU|-G CUGU-UCUGG (-UGCACA-A G)CCAGAGAC UCCCAGUGGG 2581 AAGCGAAGAC CCCGUAG|AG CUUUACUGC| AGUCUGCUGU U|GGGGCUUG GUCAUGGGUA 2641 UGCAGUGUAG GUGGGAGGUG UCG--augCC AUGGUC(GCC A-)GGCCGUG GUGGAGCCGG 2701 UCAUGAGACA CCACC|UUCC UGUGACUGUG UCUCuaac-- -cccucg(-u uuu-)ugggg 2761 g--ggacAUC GGUAGAUGGG CAGUUUGGCU G(GGG)CGGC AC|GCGCUU( GAAAUGGUAU 2821 C)AAGCGC|G |C|CCUAAGG UCGGC|UCAG GCGGG(ACAG AGA)UCCGCU GUAGA|GU(G 2881 |U-AAGG)GC AUAAGCCGGC UUGACUGUGC UCC(uacuag uag-)GGGGU GCAGGUGC(G 2941 AGA)GCAGGG |CCUAGCGAA CCCCAGAGU- CCU(-cgucg gug)GGG|GC CUGGG-auga 3001 c|aga--aAA GCUACCU|CG GG|GAUAACU G|GGUGGUCG CAGGCAAGA( GCCCAUA)UC 3061 GACCCUGCGG CUUGCUACUU CGAUGUC|GG UU|C|UUUCC |AUCC|UGGG UGU(GCAGCA 3121 )GCACCAAAG GGU|GGGGU( UGUUC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUUG GUUGCUAUCU |ACUGGGAG- |UGUGuGGUU GCCU|GA|GG 3241 GGAA-GGUGG UUC(CAGUAC (GAGA)GGAA C)GGACCGU| CGGCGCCUCU GGUUUACCGG 3301 U|UAUCC-(G AGU)-GGGUA -uuGCCGGGC G-GCUACGCG CUAUGAUUAU AAAGGCU(GA 3361 AGGCAUCUA) AGCCUGAggu uuuCCCU|GA A-AAUAGGCG GCU|u-|--- ---------- 3421 -------(-- ---)------ ---------- ------gugg a|cc|GCGGG (UAGAAGA)C 3481 CUGUUUGUUG G|GGCGG|GG GUGUGAGCUU Cgaggccu(g uuuuu)gggu cgaGUUGU|U 3541 UAGCCU|G|C CGUUUCCAag guuuuu---- |--------u gucccu-||| || // LOCUS M.hungatei 3592 bp RNA RNA 11-JAN-191 DEFINITION Methanospirillum hungatei. ACCESSION No information KEYWORDS No information. SOURCE Methanospirillum hungatei. ORGANISM Methanospirillum hungatei. REFERENCE 1 (bases 1 to 15288) AUTHORS Burggraff,S., Ching,A., Stetter,K.O. and Woese,C.R. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: M81323 M61738 LOCUS MEHRGSUB 2910 bp ss-rRNA RNA 10-JAN-1992 DEFINITION Methanospirillum hungatei 23S large subunit ribosomal RNA gene, complete cds. ACCESSION M81323 M61738 KEYWORDS 23S large subunit ribosomal RNA. SOURCE Methanospirillum hungatei (strain JF1) rRNA. ORGANISM Methanospirillum hungatei Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Methanobacteriales; Methanobacteriaceae. REFERENCE 1 (bases 1 to 2910) AUTHORS Burggraff,S., Ching,A., Stetter,K.O. and Woese,C.R. TITLE The sequence of Methanospirillum hungatei 23S rRNA confirms the specific relationship between the extreme halophiles and the methanomicrobiales JOURNAL Unpublished (1991) STANDARD simple staff_entry FEATURES Location/Qualifiers rRNA 1..2910 /note="typical archaeal type" /product="23S large subunit ribosomal RNA" /gene="23S rRNA" BASE COUNT 792 a 640 c 846 g 631 t 1 others ORIGIN BASE COUNT 792 a 640 c 846 g 631 t 683 others ORIGIN 1 |--------- ---------u u--cuUACGC CUGUCAGUGG AUG|GCUCGG |UUCGGG-|u 61 g|cCGACGAA GGG|CGUG(C CAAGC)U|GC GAUAAGCUCC GGG|UAGAC( GCAUGGA)GU 121 CAUU-GAACC GGAGAUC|CC -CGAAUAGG( acau-)CCA| gaugc----- ----(auaaa 181 )--------- gc-auc-auu ccg----(-- -cca--u--- )-----cgga aaggGAAC|G 241 CCCCGAAUU| GA(AACA)UC UUAGUAGGGG CAGGAGAAGA AACC(aa--c c)GGGA-U|G 301 UCGUUAGUAG A(GGCGA)UC GAACACGACA GAGUUCAaac cgaa------ --ucc(-uuc 361 g)gga----- ---gaugugg uguau---gg a-ccgcggca -----(---- -)-----uaa 421 gauuu--gca uucagaacu- GAAguugcc- (uggaac)-g guauac|c(a ga)gaGGG(U 481 GACAGU)CCC GUA-UGUgua ugcaugcaga cuuu-----a gcgguauccu GAGUAGCGUG 541 GGu(---cgg aau)uCCCGC GUGAAU-GUG GGGG(ucauc aa-)CCUCCA AAACUAAAUA 601 cu-cCCCGAA ACCGAUAGCG UAG-UAGUAG C(GUGA)GCG AAAGCUGAAA AGCAACCC(u 661 ggaa)AGGUG UUG-AAAAGU GCCUGAAACU GACAGGUUau c-gcgugaua cggcac--ga 721 aagg----au cuucaaagcg aaggaacgag uu(gcga-)g acuc---aag uacggg-uuu 781 uguug--ccg --gugucgua ucGUACGU(U UUGAAG|A)A CGG|GCCAGA GAGUUUAU-U 841 CUGUUGGCGA CGGUUAAUuu (--cuaug)a aagAAGCCA- GAGG(GAAA) CCAacaaguc 901 cgcagcc--- -(guaa)--- -ggucaggga cgaCGUAC(u acca)GUGCG UGGAGUCAGC 961 AGGAUAAGAC CCGAAGCCCG GCGAUCUAUG CGUGGG|CAG GUUGAAGCGU GCC(GAAA)G 1021 GUGCGUGGAG GACCGcaa-g cg-g-uu-uu (gauaugcaa )aucauu|cg cgugACCUGC 1081 GUAUAGGAGU (GAAAG)AUU AAUCGAGCCG GG-CAUCAGC UGGUUCCUCU CGAAACAUGC 1141 (CGUA)GCAU GAC|CUGAUC uga-|gau|- cgacagug|A GGUAG|AGC- ACUGAUUGGG 1201 gaagcug|gg gaa(gaaa-) uuccu-ca-c ucuccuguca aaCUCCAAAU U|CCCUGUCA 1261 UCag-cga|c gaucggaagu ccgcauuA-C -GGG(G-UAA G)CUUG-UAA UGCGUAAGGG 1321 (AGACAA)CC CAGACCGU-G GUUAAUGU|C CCUCAGUGCA GGCUC-AGUG uaaac--acu 1381 gaaagUAGUC CUGGGUCAAA GACAACUGGG AGGUGAGC(U UAGAAGCA)G CUACCCU|U( 1441 UAA)A|AAGU GC(GUAACA) GC|UUACCAG UCAAGAUUCA GGGC|GCUGA AAAUGGA-CG 1501 GGGCUUAAGC CUGC|C|ACA GAUACC|ACG G|accauacg --(aau---) --uguaugau 1561 ggCGUAGAGA GG|CGUCCUG CA-UGGGCGG AAGCAGGG-U U(GCAA)GAU CCUGUGG|AC 1621 CGUGCAGGAA UGAAAAUUCU GGC|AGUAGU AGAagcuuaa GAUUGG(UGA GAA-U)CCA- 1681 AUC|CGCCGC AG-GGGCUAG GUUUCCUCGA CAAUG-(UUC GU)CA|GUCG AGGGUUAGU| 1741 CGG|-UCCUA AGACGUA-CC G(UAAUU)CG A--GuACGCC GAAA-GGGAA ACAGG(UUAA 1801 UAUU)CCUGU |AC-cuguau ca-------- ---a(-acaa )u-------- ------ccug 1861 acgcuuucgg auaggcau|u gcgga(---- -gu-caucgc ------)ucc guc|-uaagc 1921 cggauauGAA AUUGGAGUA- CC(GUAAU)G G|UGAGAAUU U--uucaa-a acggugaug- 1981 |---|---gg (--guaa)ca -------aug cugauuccag -|gagcccgu gaaaaggg-- 2041 ---------- ----ugauac g--g|U|CCG UACCGAGAAC U(GACACA)G G|UGCCCCUC 2101 GC(ugagaag )GCGAA-GGC GU||GUCGGG -AGUAAAUGU GUUAAGGGAA CUCGGCAAAU 2161 UAGCCCUGUA AC(UUUG)GG AUAAGGGGU| GCCUGC---c ca-------- -(--gcgau) 2221 --------ug g--GCAGGU| C(GCA)GUGA CAAGAUCGCU C|CGACUGU( CUAAUAACA) 2281 ACAUAGCAGA CUG|CAACUC C(GAA-A)GG ACUCGUAUAG UCUGUGAUUC CUGCCCAGU| 2341 GCGAGUAUCU GAA----cac cggg---(uu ca)----acc gga----CGA AGGA|CUCGU 2401 AAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAGUAC CUU|GUCGCU 2461 (UA-----AU U)GGCGAC|U UGCAUGAAUG GAAUAACGAG AGCGAUACUG UCCCUAACAC 2521 AU-ACCCG|U UGAACCU|-U UUGU-ACUAG (-UGCAGA-G A)CUAGUGAC UCCUUAUGGG 2581 AAGUGAAGAC CCCGUGG|AG CUUUACUGC| AGCCUGUCGU U|GGGUUACG GUAUUACCUG 2641 CGCAGGGUAG AUGGGAGACG UCG---AUCC AACCCU(UGU G-)GGGGUUG G-UUAGUCAC 2701 CGAUGAGACA CCAUC|CUGG UUUUAUUGUA AUUCuaac-- ----uga(-u uua-)uca-- 2761 ---ggacAUC GAUAGGUAGG CAGUUUGGGU G(GGG)CGCN AC|ACCCUC( GAAAAAAUAU 2821 C)AAGGGU|G |C|CCUAAGG UCAAC|UCAA GCGGG(UCAG AAA)CCCGCU GAAGA|GU(G 2881 |UCAAGA)GC AAAAGUUGGC CUGACGCGAU UUC(gcauag caa-)GAAAU CGCGAGAG(G 2941 AAA)CUCGGG |UCUAACGAA CCAAUACGC- CCU(--guug aug)AGG|GC UAUUG-acua 3001 c|aga--aAA GCUACCC|CG GG|GAUAACA G|AGUCGUCG CCGGCAAGA( GCACAUA)UC 3061 GACCCGGCGG CUUGCUACCU CGAUGUC|GG UU|C|UUUCC |AUCC|UGGC UGU(GCAGCA 3121 )GCAGCCAAG GGU|GAGGU( UGUUC)GCCU AUUAAAGGGG AUCGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GUUACUAUCU |AUAAGGAG- |UG-UUGGAA GUCU|GA|AG 3241 GCAA-GAAUG AAA(UAGUAC (GAGA)GGAA C)UUUCAUU| CGGCGCCACU GGUCUACCGG 3301 U|UGUCU-(G ACA)-AGGCA -acGCCGGGC A-GCUACGCG CCA-AGAGAU AAAAGCU(GA 3361 AAGCAUCUA) AGCUUGAAAC UCAGCCU|GA AcAAGagacu uca|u-|--- ---------- 3421 -------(-- ---)------ ---------- ------uaaa g|ac|ACGGG (UAAAAGA)C 3481 CCGGUUGAUA G|GUUCG|GG AUGUACGCAC Gaag----(g caa--)---- cgaCGUGU|U 3541 CAGUCC|G|C GAAUACUAau cgucuu---- |aa------u aacguc-||| || // LOCUS H.halobium 3592 bp RNA RNA 24-JAN-1986 DEFINITION Halobacterium halobium. ACCESSION No information KEYWORDS No information. SOURCE Halobacterium halobium. ORGANISM Halobacterium halobium. REFERENCE 1 (bases 1 to 15288) AUTHORS No information JOURNAL 152-161(1986) STANDARD No information COMMENTS Organism information Culture collection: ? Sequence information (bases 1 to 3592) Corresponding GenBank entry: X03407 Phylo:Archaebacteria,Methanogens and relatives,Halobacteria BASE COUNT 734 a 712 c 916 g 543 t 687 others ORIGIN 1 |--------- ------gugg c-uacUGUGC CACCUGGUGG AUA|GCUCGG |CUCGGA-|u 61 g|cCGACGAA GGA|CGUG(C CAAGC)U|GC GAUAAGCCUG AGG|gaGCC( GCACGGA)GG 121 CUaa-GAACU CAGGAUC|UC -CUAAUGGG( AAUC-)CCU| ---------- ----(auaac 181 )--------- -------aau ugc----(-- -cuu-gc--- )-----gcaa ugggGAAC|G 241 GCCGGAAUU| GA(AGCA)UC UCAGUACGGC CAGGAAGAGA AAUC(GA--A U)GAGA-C|G 301 CCGUUAGUAA U(GGCGA)AU GAACGCGGCA CAGUCCAaac cgaa------ --gcc(-uuc 361 g)ggc----- ---aaugugg uguuc---gg acugacuuu- -----(---- -)-----cau 421 cguuu--gac cguuc-gugu GAAgucucc- (ugaaac)-g gagcgc|g(a ua)caGGG(U 481 GACAGC)CCC GUAucac--g gaccaguacg acgu----gc gucagcucca GAGUAGCGGG 541 GGu(---ugg aaa)uCCCUC GUGAAUugUG GCAG(Gcauc gA-)CUGCCA AGACUAAGUA 601 cuc-UCCGAG ACCGAUAGUG AAC-AAGUAG U(GUGA)ACG AACGCUGAAA AGCACCCC(a 661 caaa)GGGGG GUGAAAUAGG GCUUGAAAUC AGGUGGCGau g-gagcgacg gggcau--aa 721 aagg------ ccucucuggg aacgacuuga gu(gcaa-)a cuca---ugg uaggac-cug 781 agaggagccg --auguuccg ucGUACGU(U UUGAAA|A)A CGA|GCUAGG GAGUGUGC-C 841 UGUUUGACGA GUCUAAccgg (--aguau)c cGGGAAGGCG UAGG(GAAA) CCAauauggc 901 cgcggc---- -(auu-)--- --gcgagggc cacCGUGU(u caa-)GCGCG GGGAGUCAAA 961 CGGGCACGAC CCGAAACCCG GUGAUCUACG CGUGGG|CAA GGUGAAGCAU GGC(GAAA)G 1021 CCAUGUGGAG GCCUGuua-g gg-uugg-ug (ucuuuc-aa )cacccu|cc cguGaCCUAC 1081 GUGUAGGGGU (GAAAG)GCC CAUCGAACCG GG-CAACAGC UGGUUCCAAC CGAAACAUGU 1141 (CGAA)GCAU GAC|CUCUGC cga-|ggu|- agUUCGUG|G GGUAG|AGCG ACCGAUUGGG 1201 GAguuCA|AC UCC(gaga-) GGAGU-UGuC UCCCCUGUCA AACUCCAAAC C|UACGGAcg 1261 ccgu-cga|c GCAGGG-AAU CCGGUGUG-C -GGG(G-UAA G)CCUG-UGC ACCGUGAGGG 1321 (AGACAA)CC CAGAGUUA-G GUUAAGGU|C CCAAAGUGCG AGCUA-AGUG cg-----auu 1381 GAAGGUGGUC UCGAGCCCUA GACAGCCGGG AGGUGAGC(U UAGAAGCA)G CUACCCU|C( 1441 UAA)G|AAAA GC(GUAACA) GC|UUACCGG CCGAGGUUCG AGGC|GCCCA AAAUGAU-CG 1501 GGGCUUAAGU UCGC|C|ACC GAGACC|UAA C|ggcacggg --(uaac--) --accgugau 1561 ccAGUAGGUU GG|CAUUCUG UU-CGGGUGG AAGCUCGG-G U(GAGA)ACU CGAGUGG|AC 1621 CGAGUGGAAA AGAAAAUCCU GGC|CAUAGU AGCagcguua GUCGGG(UAA GAA-U)CCC- 1681 GAU|GGCCGA AA-GAGCAAG GGUUCCUCGG CAAUG-(CUU AU)CA|GCCG AGGGUUAGC| 1741 CGA|-uCCUA AGGCCCG-UC G(UAAUU)CG A--gcggguc AAAA-GGGAA ACUGG(UUAA 1801 UAUU)CCAGU |GC-caccgu ac-------- --au(-ugaa )a-------- ------gucg 1861 acgccucgga gcagcuug|a gccgg(---- -gc-auucgc ------)ccg guc|-gaacc 1921 guc-gaaguu CGUGGAA-G- CC(GUAAU)G G|CAGGAAGC G--aacga-- -acgucggaa 1981 |ca-|---gg (--gaaa)cu -------caa guca-aucug -|gggcccgu gaaaaggc-- 2041 -----ga--- ----guacgg u--g|U|UCG UACCGAGAUC C(GACACA)G G|UGCUC-UG 2101 GC(agaggaa )GCCAA-GGC Cu||GUCGGG -AAUAACCGA CGUUAGGGAA UUCGGCAAGU 2161 UAGUCCCGUA AG(UUCG)CG AUAAGGGAU| GCCUGC---c ac-------- -(--gcaau) 2221 --------ga g--GCAGGU| C(GCA)GUGA CUCGGAGGCU C|CGACUGU( CUAAUAACA) 2281 ACAUAGGUGA CCG|CAAAUC C(GCA-A)GG ACGCGUACGG UCACUGAAUC CUGCCCAGU| 2341 GCGGGUAUCU GAA----cac ccag---(ua ca)----aug ggg----cGA AGGA|CCCGU 2401 UAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAGUAC CUU|GCCGCU 2461 (UC-----AG U)AGCGGC|U UGCAUGAAUG GAUCAACGAG AGCCUCACUG UCCCAACGUU 2521 GG-GCCCG|G UGAACUG|-U ACGU-UCCAG (-UGCGGA-G U)CUGGAGAC CCCCAAGGGG 2581 AAGCGAAGAC CCUAUAG|AG CUUUACUGC| AGGCUGUCGC U|GGGACACG GUCGCUGAUG 2641 UGCAGAGUAG GUAGGAGACg uuacacaggu aCGUGC(GCU A-)GCACGcc accGAGUCAC 2701 ACAUGAAACA CUACC|CGUC AGUGACUGUG ACCCucac-- ----ucc(-g gga-)gga-- 2761 ---ggacACC GGUAGCCGGG CAGUUUGACU G(GGG)CGGU AC|GCGCUU( GAAAAGAUAU 2821 C)GAGCGC|G |C|CCUAAGC CUAUC|UCAG CCGAG(UCAG AGA)CUCGGC GAAGA|GU(G 2881 |C-AAGA)GC AUAAGAUAGG CUGACAGUGU CCU(acacaa cga-)GGGAC GCUGACGC(G 2941 AAA)GC-UGG |UCUAGCGAA CCAAUUAGG- CUG(--cuug aug)CGG|CC AAUUG-cuga 3001 c|aga--aAA GCUACCU|UA GG|GAUAACA G|AGUCGUCA CUCGCAAGA( GCACAUA)UC 3061 GACCGAGUGG CU-GCUACCU CGAUGUC|GG UU|C|CCUCC |AUCC|UGCC CGU(GCAGAA 3121 )GCGGGCAAG GGU|GAGGU( UGUUC)GCCU AUUAAAGGAG GUCGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GCUGCUAUCU |AUUGGGGG- |UGuuuUGGU GCUU|GA|CA 3241 GGAA-CGUUC GUA(UAGUAC (GAGA)GGAA C)UACGAAC| GGGUGCCACU GGUGUAUCGG 3301 U|UGUCC-(G AGA)-GGGCA uguGCCGAGC A-GCUACGCA CCA-CGGGGU AAGAGCU(GA 3361 AUGCAUCUA) AGCUCGAAAC CCACCUG|GA A-AAGAAGCA CCA|c-|--- ---------- 3421 -------(-- ---)------ ---------- ------ugag a|cc|GCUCG (UAGAAGA)C 3481 GAGUUCGAUA G|ACUUG|GG GUGUACGCGU Cgag----(g caa--)---- cgaGACGU|U 3541 UAGCCC|G|C GAGUACUAac aggucaa--u |gccac---a c------||| || // LOCUS N.magadii 3592 bp RNA RNA 11-JAN-191 DEFINITION Natrialba magadii subsp. was Natronobacterium magadii. ACCESSION No information KEYWORDS No information. SOURCE Natrialba magadii subsp. was Natronobacterium magadii. ORGANISM Natrialba magadii. REFERENCE 1 (bases 1 to 15288) AUTHORS Lodwick D., Ross H.N.M., Walker J.A., Almond J.W., Grant W.D. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: X72495 ID NMRRNA standard; DNA; PRO; 5209 BP. XX AC X72495; XX DT 11-MAY-1993 (Rel. 35, Created) DT 13-APR-1994 (Rel. 39, Last updated, Version 7) XX DE N.magadii rRNA operon XX KW 16S ribosomal RNA; 16S rRNA gene. XX OS Natronobacterium magadii OC Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; OC Halobacteriales; Halobacteriaceae. XX RN [1] RA Lodwick D., Ross H.N.M., Walker J.A., Almond J.W., Grant W.D.; RT "Nucleotide Sequence of the 16S Ribosomal RNA Gene from the RT Haloalkaliphilic Archaeon (Archaebacterium) Natrobacterium RT magadii, and the Phylogeny of Halobacteria"; RL Syst. Appl. Microbiol. 14:352-357(1991). XX RN [3] RC sequence revised by author 15-NOV-93 RP 1-5209 RA Lodwick D.; RT ; RL Submitted (29-APR-1993) to the EMBL/GenBank/DDBJ databases. RL D. Lodwick, University of Leicester, Dept of Medicine Clinical Sc RL Bldg, Leicester Royal Infirmary, P O Box 65, Leicester LE2 7LX, UK XX FH Key Location/Qualifiers FH FT source 1..5209 FT /organism="Natronobacterium magadii" FT /strain="NCMB2190" FT rRNA 157..1622 FT /product="16S ribosomal RNA" FT tRNA 1750..1824 FT /product="transfer RNA-Ala" FT rRNA 2028..4938 FT /product="23S ribosomal RNA" FT rRNA 5072..5196 FT /product="5S ribosomal RNA" XX SQ Sequence 5209 BP; 1269 A; 1351 C; 1618 G; 971 T; 0 other; BASE COUNT 750 a 736 c 900 g 525 t 681 others ORIGIN 1 |--------- -----guggc u--acUGUGC CAGCUGGUGG AUC|GCUCGG |CUUGAG-|a 61 g|cUGAAGAC GGA|CGUG(C CAAGC)U|GC GAUAAGCCUC AGG|GACCC( GCACGGA)GG 121 GAAA-GAACU GAGGAUU|UC -CGAAUGGG( AAUC-)CCC| ---------- ----(accgc 181 )--------- -------aau ucg----(-- -uuc-gc--- )-----gcaa ugggGAAC|G 241 CCGAGAACU| GA(AACA)UC UCAGUAUCGG CAGGAAAAGA AAAC(gu-aa u)GUGA-U|G 301 UCGUUAGUAC U(GGCGA)AG GAACGCGAUA CAGUCCAaac cgaa------ --gcc(-uuc 361 g)ggc----- ---aaugugg uguuc---gg acugacgau- -----(---- -)-----cac 421 ucucc--gaa acucg-acac GAAgucucu- (uggaac)-a gagcac|g(a ga)caGGG(U 481 GACAGU)CCC GUAcugu--c gacgaguaag agac----ga gucagcucca GAGUAUCGGG 541 GGu(---ugg aua)uCCCUC GUGAAU-AUC CCAG(Gcauc gA-)CUGGGA AGACUAAACA 601 cuc-CUCAAG ACCGAUAGCG AAC-AAGUAG U(GUGA)ACG AACGCUGAAA AGCACCCC(A 661 CAAA)GGGAG GUGCAAUAGG GCGUGAAAUC AGUUGGCGau a-gagcgaca gagcau--ac 721 aagg------ uccgccgagu aacgaccgag ac(gcga-)g ucuc---uag uaggaa-ucg 781 acggaagccg --auguucug ucGUACGU(U UUGAAA|A)A CGA|ACCAGG GAGUGUGC-C 841 UGAUUGACGA GUCUAACucg (--agaau)c gaggaAGGCG UAGG(GAAA) CCGacauggc 901 cgcaguc--- -(uuac)--- -gacgagggc cgcCGUGU(u caa-)GCGCG GGGAGUCAAU 961 CGGGCACGAC CCGAAACUGG AUGAUCUAGA CGUGGG|CAA GGUGAAGCGU GCC(GAAA)G 1021 GCACGUGGAG GCCUGuua-g ug-uugg-ug (uccuac-aa )uacccu|ca cguGACCUAC 1081 GUCUAGGGGU (GAAAG)GCC CAUCGAAUCC AG-AAACAGC UGGUUCCAAC CGAAAGAUGU 1141 (CGAA)GCAU CAC|CUCUGC cga-|gau|- agUUCAUG|G GGUAG|AGCG ACGGAUUGGG 1201 GGaccgc|ac ucc(gaga-) ggagu-gcgc CCCCCUGUCc aacuccgaac c|UAUGAAcg 1261 ucguucga|c GCAGGG-AGU CCGGUGCA-C -GGG(G-UAA G)CCUG-UGU ACCGUAAGGG 1321 (AGACAA)CC CAGAGCUG-G GUUAAGGU|C CCAAAGUGUG GAUUA-AGUG cg-----auc 1381 GAAGGUGGUC UCAAGCCCUA GACAGCCGGG AGGUGAGC(U UAGAAGCA)G CUACCCU|C( 1441 UAA)G|AAAA GC(GUAACA) GC|UUACCGG CCGAGGUUUG AGGC|GCCCA AAAUGAU-CG 1501 GGGCUCAAAU CCAC|C|ACC GAGACC|GAG C|cguacccc --(ugac--) --aggguaau 1561 cgCGUAGGUU GG|CGUUCUG UU-CGGGUGG AAGCACGG-A U(GAGA)AUU CGUGUGG|AC 1621 CGUUCAGUAA CGAAAAUCCU GGU|CAUAGU AGCagcguua GUCGGG(Uua gac-c)CCC- 1681 GAC|GGCCGA AC-GAGUAAG GGUUCCUCAG CAAUG-(CUG AU)CA|GCUG AGGGUUAGC| 1741 CGG|-uccUA Agucugc-cc g(UAAGU)cg a--agcagac aaca-GGGAA AUAGG(UUAA 1801 UAUU)CCUAU |GC-cgguau gc-------- --aa(-uaaa )a-------- ------guug 1861 acgcuuuggg gccaccca|g gcugg(---- -gc-cuucgc ------)cca guu|g-aaca 1921 guc-gaagag CGUGGAA-G- CC(GUAAU)G G|CACGAAGC G--aucgaau ggcug-gaua 1981 |---|---ac (--gcaa)gc -------ugg guca-accca -|gagcccgu gaaaagac-- 2041 -----ga--- ----gcauac u--g|U|CCG UACCGAGAUC C(GACACA)G G|UACUCAUG 2101 GC(ggcgaaa )GCCAA-GGU CU||GUCGGG -AGCAACCGA CGUUAGGGAA UUCGGCAAGU 2161 UAGUCCCGUA CG(UUCG)CA AUAAGGGAU| GCCUGC---c uc-------- -(--ggaaa) 2221 --------ga g--GCAGGU| C(GCA)GUGA CUCGGGCGCU C|CGACUGU( CUAGUAACA) 2281 ACAUAGGUGA CCG|CAAAUC C(GCA-A)GG ACUCGUACGG UCACUGAAUC CUGCCCAGU| 2341 GCGGGUAUCU GAA----cac cccu---(ua ca)----agg gga----cGA AGGA|CCCGU 2401 UAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAGUAC CUU|GCCGCU 2461 (UC-----AG U)AGCGGC|U UGCAUGAAUG GAUCAACGAG AGCGCCACUG UCCCAACGUU 2521 GG-GCCCG|G UGAACUG|-U ACGU-UCCAG (-UGCGGA-G U)CUGGAGAC CCCCAAGGGG 2581 AAGCGAAGAC CCUAUAG|AG CUUUACUGC| AGGCUGUCAC U|GAGACGUG GUCGCCAUUG 2641 UGCAGCAUAG GUAGGAGGCa uuacacaggu aCCCGC(GCU A-)GCGGGcc accGAGCCAG 2701 CAUUGAAAUA CUACC|CGAU GGUGACUGCG ACUCucac-- ----ucc(-u ggc-)gga-- 2761 ---ggacACU GGUAGCCGGG CAGUUUGACU G(GGG)CGGU AC|GCGCCU( GAAAAGAUAU 2821 C)GGGCGC|G |C|CCCAAGA UUUCC|UCAC CCGCG(UCGG AGA)CGCGGG AAAGA|GC(G 2881 |C-AAGA)GC AUACGGAAGU CUGACAGUGU CCG(gcacaa cga-)CGGAC GCUGACGC(G 2941 AAA)GCGUGG |UCUAGCGAA CCAAUUAGG- CUG(--cuug aug)CGG|CC AAUUG-cuga 3001 c|aaa--aAA GCUACCU|UA GG|GAUAACA G|AGUCGUCA CCCGCAAGA( GCACAUA)UC 3061 GACCGGGUGG CUUGCUACCU CGAUGUC|GG UU|C|CCUCC |AUCC|UGCC CGU(GCAGAA 3121 )GCGGGCAAG GGU|GAGGU( UGUUC)GCCU AUUAAAGGAG GUCGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GCUGCUAUCU |AUUGGGGG- |UGuuAUGGU UCCU|GA|CG 3241 GGAA-CGUUC GUA(UAGUAC (GAGA)GGAA C)UACGAAU| GGGUGCCACU GGUGUACCGG 3301 C|UGUUC-(G AAA)-GAGCA cguGCCGGGC A-GCCACGCA CCA-CGGGGU AAGAGCU(GA 3361 ACGCAUCUA) AGCUCGAAAC CCACCUG|GA A-AAGAGGAG CCA|c-|--- ---------- 3421 -------(-- ---)------ ---------- ------cgag g|cc|ACUCG (UAGAAGA)C 3481 GAGAUCGAUA G|ACUCG|GG GUGUACGCGC Caag----(g caa--)---- cgaGGCGU|U 3541 GAGCCC|G|C GAGCACUAau cggcca---- |agccacac- -------||| || // LOCUS H.morrhuae 3592 bp RNA RNA ~?~???? DEFINITION Halococcus morrhuae. ACCESSION No information KEYWORDS No information. SOURCE Halococcus morrhuae. ORGANISM Halococcus morrhuae. REFERENCE 1 (bases 1 to 15288) AUTHORS No information JOURNAL Leffers,H., Kjems,J., Ostergaard,L., Larsen,N. and Garrett,R.A. STANDARD No information COMMENTS Organism information Culture collection: ? Sequence information (bases 1 to 3592) Corresponding GenBank entry: X05481 Phylo:Archaebacteria,Methanogens and relatives,Halobacteria Leffers,H., Kjems,J., Ostergaard,L., Larsen,N. and Garrett,R.A. Archaea; Euryarchaeota; Halobacteriales; Halobacteriaceae; Halococcu Evolutionary relationships amongst archaebacteria. A comparative study of 23 S ribosomal RNAs of a sulphur-dependent extreme thermophile, an extreme halophile and a thermophilic methanogen J. Mol. Biol. 195 (1), 43-61 (1987) BASE COUNT 711 a 771 c 939 g 506 t 665 others ORIGIN 1 |--------- -------ggc u--acUAUGC CAACUGGUGA AUA|GCUCGG |CUCGAG-|u 61 g|cCGAUGAA GGA|CGUG(C CAAGC)U|GC GAUAAGCUCA GGG|gaGCC( GCACGGA)GG 121 Cgaa-GAACC UGAGAUU|UC -CGAAUGGG( AAUC-)CCC| ---------- ----(accgc 181 )--------- -------aau ugc----(-- -cuucgc--- )-----gcaa ugggGAAC|G 241 CCGGGAACU| GA(AACA)UC UCAGUACCGG CAGGAAGAGA AACC(GA--A U)GGGA-U|G 301 UCGUUACUAA C(GGCGA)GU GAACGCGACA CAGUCCAaac cgaagcuuac ggggc(-uuu 361 -)gccccacg agcaaugugg uguuc---gg acugacucu- -----(---- -)-----cag 421 caccg--gau cgucugugu- GAAgucucc- (uggaac)-g gagcgu|g(a ga)caGGG(U 481 GAAAAC)CCC GUA-acac-a gaccaguacg gugu----gc gucagcucca GAGUAUCGGG 541 GGu(---ugg aua)uCCCUC GUGAAU-AUG GCCG(Gcauc gA-)CGGCCA AGACUAAaca 601 cau-CUCGAG ACCGAUAGUG AAC-AAGUAG C(GUGA)CGG AACGCUGAAA AGCACCCC(A 661 CAAA)GGGAG GUGCAAUAGG GCUUGAACUC AGUUGGCGau c-gagcgacg gggcau--aa 721 aagg------ cgucuuagug aacgacccgg gg(gcga-)c cccg---cag uagcaa-ccg 781 agacgagccg --auguuccg ucGUGCGU(U UUGAAA|A)A CGA|GCCAGG GAGUGUGU-C 841 UGCUUGACGA GUCUAA-CCg (--agcau)c ggggaAGGCA UAGG(GAAA) CCGacauggc 901 cgcagugcgu -(ucu-)-ac guaccagggc cgcCGUGU(u caa-)GCGCG GGGAGUCGAG 961 CGGACACGAC CCGAAUCCGG ACGAUCUACA CGAUGG|CAA GGUGAAGCGU GGC(GAAA)G 1021 CUGCGUGGAG GCCUGuua-g gg-augg-ug (uccuac-aa )uacccu|cc cguGACCAGU 1081 GUGUAGGGGU (GAAAG)GCC CAUCGAGUCC GG-CAACAGC UGGUUCCGAC CGAAACAUGU 1141 (CGAA)GCAU GAC|CUCCGC cga-|ggu|- agUUCGUG|G GGUAG|AGCG ACCGAUUGGA 1201 GGgacCG|AC UCC(gaga-) GGAGU-CGGC CCUCCUGUCG AACUCCGAAC C|UACGAAcg 1261 ccgu-cga|c gcgggg-AGA GCGGUGUG-C -GGG(G-UAA G)CCUG-UGC ACCAUGAGGG 1321 (AGACAA)CC CAGAGCUU-G GUUGAGGU|C CCCAAGUGUG GACUA-AGUG cg-----auc 1381 GAAGGUGGUU CCGAGCCAUA GACAGCCGGG AGGUGAGC(U UAGAAGCA)G CUACCCG|C( 1441 UAA)G|AAAA GC(GUAACA) GC|UUACCGG CCGAGGUUUG GAGC|GCCGA AAAUGAU-CG 1501 GGGCUCAAGU CCAC|C|ACC GAUACC|UAG C|ggcgcacu --(guau--) --agugc-au 1561 ccAGUAGGUC GG|CGCUCCG UU-UGGGUGG AAGUACGG-G C(GAGA)GUU CGUAUGG|AC 1621 CGAACGGUGA CGAUAAUCCU GGC|CACAGU AGCagcggua GUCGGG(Uua gaaau)CCC- 1681 GAC|GGCCUC AC-GGACAAG GGUUCCUCGG CAAUG-(UUC GU)CA|ACCG AGGGUUAGC| 1741 CGG|-uccuc agucuua-cC G(CAGUU)CG a--guaagac GAaucGGGAA ACUGG(UUAA 1801 UAUU)CCAGU |GC-ccgcgu gc-------- ---a(-auca )u-------- ------gccg 1861 acgccucggg gucgaccg|a gccgg(---- -gc-acucgc ------)ccg guc|gaauca 1921 cca-aaacuu CGUGGAA-G- CC(GUAAC)G G|CACGAAGC G--ggcga-a cggugagaua 1981 |---|---ga (--gaaa)uu -------cgg ucca-accug -|gggcccgu gaaaaggc-- 2041 ---------- ----gcacgc g--g|U|CCG UACCCAGAUC C(GACACA)G G|UGUCCgUG 2101 GC(ggcgaaa )GCCAA-GGC CU||GUCGGG -GAGAACCGA CGUUAGGGAA UUCGGCAAGU 2161 UAGUCCCGUA CG(UUCG)CA AUAAGGGAU| GCCUGC---c uc-------- -(--ggaau) 2221 --------ga g--GCAGGU| C(GCA)GCGA CUCGGACGCU C|CAACUGU( CUAAUAACA) 2281 ACACAGGUGA CCG|CAAAUC C(GCA-A)GG ACUCGUACGG UCACUGAAUC CUGCCCAGU| 2341 GCGGGUAUCU GAA----cac ccgg---(ua ca)----acg gga----cGA AGGA|CCCGU 2401 CAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAGUAC CUU|GCCGCU 2461 (UC-----AG U)AGCGGC|U CGCAUGAAUG GAUGAAUGAG AGCGUCACUG UCCCAACGUU 2521 GG-GCCCG|G UGAACUG|-U ACGU-UCCAG (-UGCGGA-G U)CUGGAGAC CCCCAAGGGG 2581 AAGCGAAGAC CCUAUAG|AG CUUUACUGC| AAGCUGUCGC U|GGGACACG GUCGCUAGUG 2641 UGCAGGAUAG GUAGGAGUCG ucacgcaggc aCCCGC(GCU A-)GCGGGcc gccGAGACAC 2701 CACUGAAAUA CUACC|CACU AGUGACUGUG ACCCuuac-- ----ucc(-g gga-)gga-- 2761 ---ggacACC GGUAGCUGGG CAGUUUGACU G(GGG)CGGU AC|GGGCUU( GAAAAGAUAU 2821 C)GAGCCC|G |C|CCGAAGA UCACC|UCAU CCGGG(UCGG AGA)CCCGGA AUAGA|GC(G 2881 |C-AAGA)GC AAAAGGUGGU CUGACAGUGU UCU(cuccaa cacg)AGAAC GCUGACGC(G 2941 AAA)GCGCGG |UCUAGCGAA CGGAUGAGC- CUG(--cuug aug)CGG|GC CAUUU-acga 3001 c|aga--aAA GCUACCU|UA GG|GAUAACA G|AGUCGUCA CCCGCAAGA( GCACAUA)UC 3061 GACCGGGUGG CUUGCUACCU CGAUGUC|GG UU|C|CCUCC |AUCC|UGCC CGU(GCAGCA 3121 )GCGGGCAAG GGU|GAGGU( UGUUC)GCCU AUUAAAGGAG GUCGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GCUGCUAUCU |AUUGGGGG- |UGucAUGGU ACCU|GA|CG 3241 GGAA-CGAUC GUA(UAGUAC (GAGA)GGAA C)UACGAUU| GGUCGCCACU GGUGUACCGG 3301 U|UGUCC-(G AGA)-GGGCA cguGCCGGGC A-GCCACGCG ACA-CGGGGU AAGAGCU(GA 3361 ACGCAUCUA) AGCUCGAAAC CCACCCG|GA A-AAGAGGUA CCg|c-|--- ---------- 3421 -------(-- ---)------ ---------- ------cgag g|uc|ACUCG (UAGAAGA)C 3481 GAGUUCGAUA G|ACUCG|GG GUGUACGCGC Cgag----(g caa--)---- cgaGGCGU|U 3541 UAGCCC|G|C GAGCACUAac agaccg---- |agcc----a cau----||| || // LOCUS H.marism.1 3592 bp RNA RNA 21-JAN-1989 DEFINITION Halobacterium marismortui. ACCESSION No information KEYWORDS No information. SOURCE Halobacterium marismortui. ORGANISM Halobacterium marismortui. REFERENCE 1 (bases 1 to 15288) AUTHORS No information JOURNAL Fri Jul 21 10:00:26 1989 STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: X137388 BASE COUNT 761 a 740 c 894 g 544 t 653 others ORIGIN 1 |gagucacaa cgacguuggc u--acUAUGC CAGCUGGUGG AUU|GCUCGG |CUCAGG-|c 61 g|cUGAUGAA GGA|CGUG(C CAAGC)U|GC GAUAAGCCAU GGG|GAGCC( GCACGGA)GG 121 CGAA-GAACC AUGGAUU|UC -CGAAUGAG( aaucu)CUC| ---------- ----(uaac- 181 )--------- -------aau ugc----(-- -uuc-gc--- )-----gcaa ugagGAAC|C 241 CCGAGAACU| GA(AACA)UC UCAGUAUCGG GAGGAACAGA AAAC(GC-aa u)GUGA-U|G 301 UCGUCAGUAA C(CGCGA)GU GAACGCGAUA CAGCCCAaac cgaa------ -gccc(-uca 361 c)gggc---- ---aaugugg uguca--ggg c-uaccucu- -----(---- -)-----cau 421 cgucc--gac caucu-cgac GAAgucucu- (uggaac)-a gagcGU|g(a ua)caGGG(U 481 GACAAC)CCC GUAcucg--a gaccaguacg acgu----gc gguagugcca GAGUAGCGGG 541 GGu(---ugg aua)uCCCUC GCGAAU-AAC GCAG(gcauc gA-)CUGCGA AGGCUAAACA 601 caa-CCUGAG ACCGAUAGUG AAC-AAGUAG U(GUGA)ACG AACGCUGCAA AGUACCCU(C 661 AGAA)GGGAG GCGAAAUAGA GCAUGAAAUC AGUUGGCGau c-gagcgaca gggcau--ac 721 aagg------ ucccuugacg aaugaccgag ac(gcga-)g ucuc---cag uaagac-uca 781 cgggaagccg --auguucug ucGUACGU(U UUGAAA|A)A CGA|GCCAGG GAGUGUGU-C 841 UGUAUGGCAA GUCUAACCgg (--aguau)c cgg-GAGGCA CAGG(GAAA) CCGacauggc 901 cgcagggc-- -(uuu-)--- gcccgagggc cgcCGUCU(u caa-)GGGCG GGGAGCCAUG 961 UGGACACGAC CCGAAUCCGG ACGAUCUACG CAUGGA|CAA GAUGAAGCGU GCC(GAAA)G 1021 GCACGUGGAA GUCUguua-g ag-uugg-ug (uccuac-aa )uacccu|cu cguGAUCUAU 1081 GUGUAGGGGU (GAAAG)GCC CAUCGAGUCC GG-CAACAGC UGGUUCCAAU CGAAACAUGU 1141 (CGAA)GCAU GAC|CUCCGC Cga-|ggu|- aGUCUGUG|A GGUAG|AGCG ACCGAUUGGU 1201 GUgucCG|CC UCC(gaga-) GGAGU-CGGC ACACCUGUCA AACUCCAAAC U|UACAGAcg 1261 ccguuuga|c GCGGGG-AUU CCGGUGCG-C -GGG(G-UAA G)CCUG-UGU ACCAGGAGGG 1321 (GAACAA)CC CAGAGAUA-G GUUAAGGU|C CCCAAGUGUG GAUUA-AGUG ua-auccucu 1381 GAAGGUGGUC UCGAGCCCUA GACAGCCGGG AGGUGAGC(U UAGAAGCA)G CUACCCU|C( 1441 UAA)G|AAAA GC(GUAACA) GC|UUACCGG CCGAGGUUUG AGGC|GCCCA AAAUGAU-CG 1501 GGACUCAAAU CCAC|C|ACC GAGACC|UGU C|cguaccgc --(ucau--) --augguaau 1561 cgAGUAGAUU GC|CGCUCUA AU-UGGAUGG AAGUAGGG-G U(GAGA)GCU CCUAUGG|AC 1621 CGAUUAGUGA CGAAAAUCCU GGC|CAUAGU AGCagcgaua GUCGGG(Uga g-a-A)CCCc 1681 GAC|GGCCUA AU-GGAUAAG GGUUCCUCAG CACUG-(CUG AU)CA|GCUG AGGGUUAGC| 1741 CGG|-uCCUA AGUCAUA-CC G(CAACU)CG A--cuaugac GAAAuGGGAA ACGGG(UUAA 1801 UAUU)CCCGU |GC-cacuau gc-------- --ag(-ugaa )a-------- ------guug 1861 acgcccuggg gucgauca|c gcugg(---- -gc-auucgc ------)cca guc|g-aacc 1921 guc-caacuc CGUGGAA-G- CC(GUAAU)G G|CAGGAAGC G--gacga-- -acggcggca 1981 |ua-|---gg (--gaaa)cg -------uga uuc--accug -|gggcccau gaaaagac-- 2041 -----ga--- ----gcauag u--g|U|CCG UACCGAGAAC C(GACACA)G G|UGUCCAUG 2101 GC(ggcgaaa )GCCAA-GGC CU||GUCGGG -AGCAACCAA CGUUAGGGAA UUCGGCAAGU 2161 UAGUCCCGUA CC(UUCG)GA AGAAGGGAU| GCCUGC---u cc-------- -(--ggaac) 2221 --------gg a--GCAGGU| C(GCA)GUGA CUCGGAAGCU C|GGACUGU( CUAGUAAUA) 2281 ACAUAGGUGA CCG|CAAAUC C(GCA-A)GG ACUCGUACGG UCACUGAAUC CUGCCCAGU| 2341 GCAGGUAUCU GAA----cac cucg---(ua ca)----aga gga----cGA AGGA|CCUGU 2401 CAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAGUAC CUU|GCCGCA 2461 (UC-----AG U)AGCGGC|U UGCAUGAAUG GAUUAACCAG AGCUUCACUG UCCCAACGUU 2521 GG-GCCCG|G UGAACUG|-U ACAU-UCCAG (-UGCGGA-G U)CUGGAGAC ACCCAGGGGG 2581 AAGCGAAGAC CCUAUGG|AG CUUUACUGC| AGGCUGUCGC U|GAGACGUG GUCGCCGAUG 2641 UGCAGCAUAG GUAGGAGUCG uuacagaggu aCCCGC(GCU A-)GCGGGcc acccAGACAA 2701 CAGUGAAAUA CUACC|CGUC GGUGACUGCG ACUCucac-- ----ucc(-g gga-)gga-- 2761 ---ggacACC GAUAGCCGGG CAGUUUGACU G(GGG)CGGU AC|GCGCUC( GAAAAGAUAU 2821 C)GAGCGC|G |C|CCUAUGG UCAUC|UCAG CCGGG(ACAG AGA)CCCGGC GAAGA|GU(G 2881 |C-AAGA)GC AAAAGAUGAC UUGACAGUGU UCU(ucccaa cga-)GGAAC GCUGACGC(G 2941 AAA)G-GUGG |UCUAGCGAA CCAAUUAGC- CUG(--cuug aug)CGG|GC AAUUG-auga 3001 c|aga--aAA GCUACCC|UA GG|GAUAACA G|AGUCGUCA CUCGCAAGA( GCACAUA)UC 3061 GACCGAGUGG CUUGCUACCU CGAUGUC|GG UU|C|CCUCC |AUCC|UGCC CGU(GCAGAA 3121 )GCGGGCAAG GGU|GAGGU( UGUUC)GCCU AUUAAAGGAG GUCGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GCUGCUAUCU |ACUGGGUG- |UGuaAUGGU GUCU|GA|CA 3241 AGAA-CGACC GUA(UAGUAC (GAGA)GGAA C)UACGGUU| GGUGGCCACU GGUGUACCGG 3301 U|UGUUC-(G AGA)-GAGCA cguGCCGGGU A-GCCACGCC ACA-CGGGGU AAGAGCU(GA 3361 ACGCAUCUA) AGCUCGAAAC CCACUUG|GA A-AAGAGACA CCG|c-|--- ---------- 3421 -------(-- ---)------ ---------- ------cgag g|uc|CCGCG (UACAAGA)C 3481 GCGGUCGAUA G|ACUCG|GG GUGUGCGCGU Cgag----(g uaa--)---- cgaGACGU|U 3541 AAGCCC|A|C GAGCACUAac agaccaa--- |agccaucau ucauacg||| || // LOCUS H.marism.2 3592 bp RNA RNA 01-DEC-1997 DEFINITION Haloarcula marismortui.; . ACCESSION AF0346 9 KEYWORDS . SOURCE Haloarcula marismortui. ORGANISM Haloarcula marismortui. REFERENCE 1 (bases 1 to 6178) AUTHORS Dennis,P.P. and Mylvaganam,S. TITLE Haloarcula marismortui Disparate ribosomal RNA operons in the halophilic archaeon JOURNAL Unpublished STANDARD No information REFERENCE 2 (bases 1 to 6178) AUTHORS Mylvaganam,S. and Dennis,P.P. TITLE Direct Submission JOURNAL Submitted (17-NOV-1997) Biochemsitry and Molecular Biology, University of British Columbia, 2146Health Sciences Mall, Vancouver, BC V6T 1Z3, Canada STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AF034619 BASE COUNT 754 a 731 c 890 g 542 t 675 others ORIGIN 1 |--------- -----uuggc u--acUAUGC CAGCUGGUGG AUU|GCUCGG |CUCAGG-|c 61 g|cUGAUGAA GGA|CGUG(C CAAGC)U|GC GAUAAGCUGU GGG|GAGCC( GCACGGA)GG 121 CGAA-GAACC ACAGAUU|UC -CGAAUGAG( aaucu)CUC| ---------- ----(uaac- 181 )--------- -------aau ugc----(-- -uuc-gc--- )-----gcaa ugagGAAC|C 241 CCGAGAACU| GA(AACA)UC UCAGUAUCGG GAGGAACAGA AAAC(GC-aa c)GUGA-U|G 301 UCGUUAGUAA C(CGCGA)GU GAACGCGAUA CAGCCCAaac cgaa------ -gccu(-gca 361 c)gggc---- ---aaugugg uguca--ggg c-uaccucu- -----(---- -)-----cau 421 cagcc--gac cgucu-ucac GAAgucucu- (uggaau)-a gagcGU|g(a ua)caGGG(U 481 GACAAC)CCC GUAcuga--a gaccaguacg cugu----gc gguagugcca GAGUAGCGGG 541 GGu(---ugg aua)uCCCUC GCGAAU-AAC GCAG(gcauc gA-)CUGCGA AGGCUAAACA 601 caa-CCUGAG ACCGAUAGUG AAC-AAGUAG U(GUGA)ACG AACGCUGCAA AGUACCCU(C 661 AGAA)GGGAG GCGAAAUAGA GCAUGAAAUC AGUUGGCGau c-gagcgaca gggcau--ac 721 aagg------ ucccuugacg aaugaccgag ac(gcga-)g ucuc---cag uaagac-uca 781 cgggaagccg --auguucug ucGUACGU(U UUGAAA|A)A CGA|GCCAGG GAGUGUGU-C 841 UGUAUGGCAA GUCUAACCgg (--aguau)c cgg-GAGGCA CAGG(GAAA) CCGacauggc 901 cgcagggc-- -(uuu-)--- gcccgagggc cgcCGUCU(u caa-)GGGCG GGGAGCCAUG 961 UGGACACGAC CCGAAUCCGG ACGAUCUACG CAUGGA|CAA GAUGAAGCGU GCC(GAAA)G 1021 GCACGUGGAA GUCUguua-g ag-uugg-ug (uccuac-aa )uacccu|cu cguGAUCUAU 1081 GUGUAGGGGU (GAAAG)GCC CAUCGAGUCC GG-CAACAGC UGGUUCCAAU CGAAACAUGU 1141 (CGAA)GCAU GAC|CUCCGC Cga-|ggu|- aGUCUGUG|A GGUAG|AGCG ACCGAUUGGU 1201 GUgucCG|CC UCC(ggga-) GGAGU-CGGC ACACCUGUCA AACUCCAAAC U|UACAGAcg 1261 cuguuuga|c GCGGGG-AUU CCGGUGCG-C -GGG(G-UAA G)CCUG-UGU ACCAGGAGGG 1321 (GAACAA)CC CAGAGAUA-G GUUAAGGU|C CCCAAGUGUG GAUUA-AGUG ua-auccucu 1381 GAAGGUGGUC UCGAGCCCUA GACAGCCGGG AGGUCAGC(U UAGAAGCA)G CUACCCU|C( 1441 UAA)G|AAAA GC(GUAACA) GC|UUACCGG CCGAGGUUUG AGGC|GCCCA AAAUGAU-CG 1501 GGACUCAAAU CCAC|C|ACC GAGACC|UGU C|cguaccgc --(ucau--) --augguaau 1561 cgAGUAGAUU GC|CGCUCUA AU-UGGAUGG AAGUAGGG-G C(GAGA)GCU CCUAUGG|AC 1621 CGAUUAGUGA CGAAAAUCCU GGC|CAUAGU AGCagcgaua GUCGGG(Uga g-a-A)CCCc 1681 GAC|GGCCUA AU-GGAUAAG GGUUCCUCAG CACUG-(CUG AU)CA|GCUG AGGGUUAGC| 1741 CGG|-uCCUA AGUCUCA-CC G(CAACU)CG A--cuaugac GAAAuGGGAA ACAGG(UUAA 1801 UAUU)CCUGU |GC-caucau gc-------- --ag(-ugaa )a-------- ------guug 1861 acacccuggg gucgauca|c gcccg(---- -gc-auucgc ------)ccg guc|g-aacc 1921 guc-caacuc CGUGGAA-G- CC(GUAAU)G G|CAGGAAGC G--gacga-- -acggcggca 1981 |ua-|---gg (--gaaa)cg -------uga uuc--accug -|gggcccau gaaaagac-- 2041 -----ga--- ----gcauag u--g|U|CCG UACCGAGAAC C(GACACA)G G|UGUCCAUG 2101 GC(ggcgaaa )GCCAA-GGC CU||GUCGGG -AGCAACCAA CGUUAGGGAA UUCGGCAAGU 2161 UAGUCCCGUA GG(UUCG)GA AGAAGGGAU| GCCUGC---u cc-------- -(--ggaac) 2221 --------gg a--GCAGGU| C(GCA)GUGA CUCGGAAGCU C|GGACUGU( CUAGUAAUA) 2281 ACAUAGGUGA CCG|CAAAUC C(GCA-A)GG ACUCGUACGG UCACUGAAUC CUGCCCAGU| 2341 GCAGGUAUCU GAA----cac cucg---(ua ca)----aga gga----cGA AGGA|CCUGU 2401 CAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAGUAC CUU|GCCGCA 2461 (UC-----AG U)AGCGGC|U UGCAUGAAUG GAUUAACCAG AGCUUCACUG UCCCAACGUU 2521 GG-GCCCG|G UGAACUG|-U ACAU-UCCAG (-UGCGGA-G U)CUGGAGAC ACCCAGGGGG 2581 AAGCGAAGAC CCUAUGG|AG CUUUACUGC| AGGCUGUCGC U|GAGACGUG GUCGCCGAUG 2641 UGCAGCAUAG GUAGGAGUCG uuacagaggu aCCCGC(GCU A-)GCGGGcc acccAGACAA 2701 CAGUGAAAUA CUACC|CGUC GGUGACUGCG ACUCucac-- ----ucc(-g gga-)gga-- 2761 ---ggacACC GAUAGCCGGG CAGUUUGACU G(GGG)CGGU AC|GCGCUC( GAAAAGAUAU 2821 C)GAGCGC|G |C|CCUAUGG UGAUC|UCAG CCGGG(ACAG AGA)CCCGGC GAAGA|GU(G 2881 |C-AAGA)GC AAAAGAUGAC UUGACAGUGU UCU(ucccaa cga-)GGAAC GCUGACGC(G 2941 AAA)G-GUGG |UCUAGCGAA CCAAUUAGC- CUG(--cuug aug)CGG|GC AAUUG-auga 3001 c|aga--aAA GCUACCC|UA GG|GAUAACA G|AGUCGUCA CUCGCAAGA( GCACAUA)UC 3061 GACCGAGUGG CUUGCUACCU CGAUGAC|GG UU|C|CCUCC |AUCC|U-CC CGU(GCAGAA 3121 )GCGGGCAAG GGU|GAGGU( UGUUC)GCCU AUUAAAGGAG GUCGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GCUGCUAUCU |ACUGGGUG- |UGuaAUGGU GUCU|GA|CA 3241 AGAA-CGACC GUA(UAGUAC (GAGA)GGAA C)UACGGUU| GGUGGCCACU GGUGUACCGG 3301 U|UGUUC-(G AGA)-GAGCA cguGCCGGGU A-GCCACGCC ACA-CGGGGU AAGAGCU(GA 3361 ACGCAUCUA) AGCUCGAAAC CCACUUG|GA A-AAGAGACA CCG|c-|--- ---------- 3421 -------(-- ---)------ ---------- ------cgag g|uc|CCGCG (UACAAGA)C 3481 GCGGUCGAUA G|ACUCG|GG GUGUGCGCGU Cgag----(g uaa--)---- cgaGACGU|U 3541 AAGCCC|A|C GAGCACUAac agaccaa--- |agccaucau -------||| || // LOCUS H.marism.3 3592 bp RNA RNA 01-DEC-1997 DEFINITION Haloarcula marismortui.; . ACCESSION AF0346 0 KEYWORDS . SOURCE Haloarcula marismortui. ORGANISM Haloarcula marismortui. REFERENCE 1 (bases 1 to 5946) AUTHORS Dennis,P.P. and Mylvaganam,S. TITLE Haloarcula marismortui Disparate ribosomal RNA operons in the halophilic archaeon JOURNAL Unpublished STANDARD No information REFERENCE 2 (bases 1 to 5946) AUTHORS Mylvaganam,S. and Dennis,P.P. TITLE Direct Submission JOURNAL Submitted (17-NOV-1997) Biochemsitry and Molecular Biology, University of British Columbia, 2146Health Sciences Mall, Vancouver, BC V6T 1Z3, Canada STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AF034620 BASE COUNT 755 a 736 c 886 g 540 t 675 others ORIGIN 1 |--------- -----uuggc u--acUAUGC CAGCUGGUGG AUU|GCUCGG |CUCAGG-|c 61 g|cUGAUGAA GGA|CGUG(C CAAGC)U|GC GAUAAGCCAU GGG|GAGCC( GCACGGA)GG 121 CGAA-GAACC AUGGAUU|UC -CGAAUGAG( aaucu)CUC| ---------- ----(uaac- 181 )--------- -------aau ugc----(-- -uuc-gc--- )-----gcaa ugagGAAC|C 241 CCGAGAACU| GA(AACA)UC UCAGUAUCGG GAGGAACAGA AAAC(GC-aa u)GUGA-U|G 301 UCGUCAGUAA C(CGCGA)GU GAACGCGAUA CAGCCCAaac cgaa------ -gccc(-uca 361 c)gggc---- ---aaugugg uguca--ggg c-uaccucu- -----(---- -)-----cau 421 cagcc--gac cgucu-cgac GAAgucucu- (uggaac)-a gagcGU|g(a ua)caGGG(U 481 GACAAC)CCC GUAcucg--a gaccaguacg acgu----gc gguagugcca GAGUAGCGGG 541 GGu(---ugg aua)uCCCUC GCGAAU-AAC GCAG(gcauc gA-)CUGCGA AGGCUAAACA 601 caa-CCUGAG ACCGAUAGUG AAC-AAGUAG U(GUGA)ACG AACGCUGCAA AGUACCCU(C 661 AGAA)GGGAG GCGAAAUAGA GCAUGAAAUC AGUUGGCGau c-gagcgaca gggcau--ac 721 aagg------ ucccuugacg aaugaccgag ac(gcga-)g ucuc---cag uaagac-uca 781 cgggaagccg --auguucug ucGUACGU(U UUGAAA|A)A CGA|GCCAGG GAGUGUGU-C 841 UGUAUGGCAA GUCUAACCgg (--aguau)c cgg-GAGGCA CAGG(GAAA) CCGacauggc 901 cgcagggc-- -(uuu-)--- gcccgagggc cgcCGUCU(u caa-)GGGCG GGGAGCCAUG 961 UGGACACGAC CCGAAUCCGG ACGAUCUACG CAUGGA|CAA GAUGAAGCGU GCC(GAAA)G 1021 GCACGUGGAA GUCUguua-g ag-uugg-ug (uccuac-aa )uacccu|cu cguGAUCUAU 1081 GUGUAGGGGU (GAAAG)GCC CAUCGAGUCC GG-CAACAGC UGGUUCCAAU CGAAACAUGU 1141 (CGAA)GCAU GAC|CUCCGC Cga-|ggu|- aGUCUGUG|A GGUAG|AGCG ACCGAUUGGU 1201 GUgucCG|CC UCC(ggga-) GGAGU-CGGC ACACCUGUCA AACUCCAAAC U|UACAGAcg 1261 cuguuuga|c GCGGGG-AUU CCGGUGCG-C -GGG(G-UAA G)CCUG-UGU ACCAGGAGGG 1321 (GAACAA)CC CAGAGAUA-G GUUAAGGU|C CCCAAGUGUG GAUUA-AGUG ua-auccucu 1381 GAAGGUGGUC UCGAGCCCUA GACAGCCGGG AGGUCAGC(U UAGAAGCA)G CUACCCU|C( 1441 UAA)G|AAAA GC(GUAACA) GC|UUACCGG CCGAGGUUUG AGGC|GCCCA AAAUGAU-CG 1501 GGACUCAAAU CCAC|C|ACC GAGACC|UGU C|cguaccgc --(ucau--) --augguaau 1561 cgAGUAGAUU GC|CGCUCUA AU-UGGAUGG AAGUAGGG-G U(GAAA)ACU CCUGUGG|AC 1621 CGAUUAGUGA CGAAAAUCCU GGC|CAUAGU AGCagcgaua GUCGGG(Uga g-a-A)CCCc 1681 GAC|GGCCUA AU-GGAUAAG GGUUCCUCAG CACUG-(CUG AU)CA|GCUG AGGGUUAGC| 1741 CGG|-uCCUA AGUCAUA-CC G(CAACU)CG A--cuauauc GAAAuUGGAA ACGGG(UUAA 1801 UAUU)CCCGU |GC-cacuau gc-------- --ag(-ugaa )a-------- ------guug 1861 acgcccuggg gucgauca|c gcugg(---- -gc-auucgc ------)cca guc|g-aacc 1921 guc-caacuc CGUGGAA-G- CC(GUAAU)G G|CAGGAAGC G--gacga-- -acggcggca 1981 |ua-|---gg (--gaaa)cg -------uga uuc--accug -|gggcccau gaaaagac-- 2041 -----ga--- ----gcauag u--g|U|CCG UACCGAGAAC C(GACACA)G G|UGUCCAUG 2101 GC(ggcgaaa )GCCAA-GGC CU||GUCGGG -AGCAACCAA CGUUAGGGAA UUCGGCAAGU 2161 UAGUCCCGUA CC(UUCG)GA AGAAGGGAU| GCCUGC---u cc-------- -(--ggaac) 2221 --------gg a--GCAGGU| C(GCA)GUGA CUCGGAAGCU C|GGACUGU( CUAGUAAUA) 2281 ACAUAGGUGA CCG|CAAAUC C(GCA-A)GG ACUCGUACGG UCACUGAAUC CUGCCCAGU| 2341 GCAGGUAUCU GAA----cac cucg---(ua ca)----aga gga----cGA AGGA|CCUGU 2401 CAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAGUAC CUU|GCCGCA 2461 (UC-----AG U)AGCGGC|U UGCAUGAAUG GAUUAACCAG AGCUUCACUG UCCCAACGUU 2521 GG-GCCCG|G UGAACUG|-U ACAU-UCCAG (-UGCGGA-G U)CUGGAGAC ACCCAGGGGG 2581 AAGCGAAGAC CCUAUGG|AG CUUUACUGC| AGGCUGUCGC U|GAGACGUG GUCGCCGAUG 2641 UGCAGCAUAG GUAGGAGUCG uuacagaggu aCCCGC(GCU A-)GCGGGcc acccAGACAA 2701 CAGUGAAAUA CUACC|CGUC GGUGACUGCG ACUCucac-- ----ucc(-g gga-)gga-- 2761 ---ggacACC GAUAGCCGGG CAGUUUGACU G(GGG)CGGU AC|GCGCUC( GAAAAGAUAU 2821 C)GAGCGC|G |C|CCUAUGG UCAUC|UCAG CCGGG(ACAG AGA)CCCGGC GAAGA|GU(G 2881 |C-AAGA)GC AAAAGAUGAC UUGACAGUGU UCU(ucccaa cga-)GGAAC GCUGACGC(G 2941 AAA)GC-CGG |UCUAGCGAA CCAAUUAGC- CUG(--cuug aug)CGG|GC AAUUG-auga 3001 c|aga--aAA GCUACCC|UA GG|GAUAACA G|AGUCGUCA CUCGCAAGA( GCACAUA)UC 3061 GACCGAGUGG CUUGCUACCU CGAUGAC|GG UU|C|CCUCC |AUCC|U-CC CGU(GCAGAA 3121 )GCGGGCAAG GGU|GAGGU( UGUUC)GCCU AUUAAAGGAG GUCGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GCUGCUAUCU |ACUGGGUG- |UGuaAUGGU GUCU|GA|CA 3241 AGAA-CGACC GUA(UAGUAC (GAGA)GGAA C)UACGGUU| GGUGGCCACU GGUGUACCGG 3301 U|UGUUC-(G AGA)-GAGCA cguGCCGGGC A-GCCACGCG ACA-CGGGGU AAGAGCU(GA 3361 ACGCAUCUA) AGCUCGAAAC CCACUUG|GA A-AAGAGACA CCG|c-|--- ---------- 3421 -------(-- ---)------ ---------- ------cgag g|uc|CCGCG (UACAAGA)C 3481 GCGGUCGAUA G|ACUCG|GG GUGUGCGCGU Cgag----(g uaa--)---- cgaGACGU|U 3541 AAGCCC|A|C GAGCACUAac agaccaa--- |agccaucau -------||| || // LOCUS T.acidophi 3592 bp RNA RNA 11-JAN-191 DEFINITION Thermoplasma acidophilum. ACCESSION No information KEYWORDS No information. SOURCE Thermoplasma acidophilum. ORGANISM Thermoplasma acidophilum. REFERENCE 1 (bases 1 to 5946) AUTHORS Ree,H.K. and Zimmerman,R.A. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: M32298 M20822 Phylo:Archaebacteria LOCUS THARGG 4154 bp ds-DNA BCT 15-SEP-1990 DEFINITION T.acidophilum 23S ribosomal RNA gene. ACCESSION M32298 M20822 KEYWORDS 23S ribosomal RNA. SOURCE T.acidophilum (strain 122-1B2) DNA, clones pTH1-1, pL8 and pTH3-7. ORGANISM Thermoplasma acidophilum Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Thermoplasmales. REFERENCE 1 (bases 1 to 4154) AUTHORS Ree,H.K. and Zimmerman,R.A. TITLE Organization and expression of the 16S, 23S and 5S ribosomal RNA genes from the archaebacterium Thermoplasma acidophilum JOURNAL Nucleic Acids Res. 18, 4471-4478 (1990) STANDARD full staff_review COMMENT Draft entry and computer-readable sequence for [Nucleic Acids Res. 18, 4471-4478 (1990)] kindly submitted by H.K.Ree 23-FEB-1990. FEATURES Location/Qualifiers misc_RNA 429..3473 /note="23S RNA gene (3' end +/- 5 bp)" rRNA 521..3426 /note="23S RNA" BASE COUNT 1097 a 903 c 1177 g 977 t BASE COUNT 761 a 643 c 899 g 605 t 684 others ORIGIN 1 |--------- ------gugc ucugcUAAUC UGCCUAGAGG AUG|GCUUGG |UUCGGG-|c 61 g|cCGAAGAA GGA|CGUG(C CAAGC)U|GC GAUAAGCCUU GGG|GAGGC( GCAUGGA)GC 121 CUUA-GAUCC AAGGAUC|UC -CGAAUGGG( ACUU-)CCU| gcc------- ----(guaa- 181 )--------- ---ggc-acu cc-----(-- -gaa--a--- )------gga gaggGAAC|C 241 CGGGGAAUU| GA(AACA)UC UUAGUACCCG GAGGAAAAGA AAUC(AA--U U)GAGA-U|A 301 CCGUUAGUAA A(GGCGA)UC GAAAGCGGUA GAAGGCAaac cgaa-uagc- ccuuc(-gaa 361 a)gaagg-ga aa-gaugugg aguuu---gg u-cuuccucu -----(---- -)------aa 421 u-gccuccug aagcgagau- GAAu-cuuc- (uggaaa)-g aagagc|c(u ua)gaAGG(U 481 GAUAGC)CCU GUA-aucgaa gcuucagaag cuac-aaggg gaaguaacca GAGUACCAUG 541 CGu(---cgu uuu)uCGCGU GGGAAU-UUG GGUG(Gcacu aA-)CAUCCA ACCUUAAAUA 601 cg-uCCCGAG UCCGAUAGCG AACaAAGUAC C(GUGA)GGG AAAGCUGAAA AGAAACCC(g 661 -gaa)GGGUG GUG-AAAAGA GCCUGAAACU AGGCAGAGau a-aacuuaua gggcag--uu 721 aaga------ ---------g g-ugaa-guc gu(uaacu)a cgau---g-- ---------- 781 -----gaucg --cuguccua uuGUCCGU(G UUGAAG|A)A CGG|GCCAGG GAGUUCUG-A 841 CGAGUGGCAA GGUUAAuccu (---gaa-)a ggaguAGCCG UAGC(GAAA) GCAacu-acc 901 cgcaca---- -(gcaa)--- --uggggggg uggCGUG-(g uaaa)-CGCG UUUAGUCACU 961 CGUGAGAGAC CCGAAGCCGG UCGAUCUACA CCUGAG|UAG GUUGAAGCUC AGU(GAAA)G 1021 CUGGGUGGAG GACCGaa-cc ua-uucu-ga (ugugca-aa )ucgu-u|ug gauGACUUGG 1081 GUGUAGGGGU (UAAAG)GCC AAUCUAGGCC GG-CAAUAGC GGGUUCCCCC CGAUACUACC 1141 (CGCA)GGUA GAC|CUCGAU Gga-|gau|- UCUCGGCG|A GGUAG|AGCG ACCGAUUGGU 1201 UGguaAG|CA GUC(gaaa-) GGCUG-CG-C CGACUUGUCA AACUCCGAAC U|UGUCGAGA 1261 UCgu-aga|A GUCGGG-UGC UAGGGGGC-A -GGG(A-UAA G)CUUU-GCU UCCGUGAUGG 1321 (GAACAA)CC AAGACGAG-G GUUAAGGU|C CCUAAGUUCU AGUUA-AGUG c------acu 1381 aaaua-GGUU UGUGGCCAAA GACAGUGGGG AGGUAGGC(U CAGAAGCA)G CCAUCCU|U( 1441 CAA)A|GAGU GC(GUAACA) GC|UCACUCA CCGAGGUCAC AUGC|CUAGA AGAUGGA-AG 1501 GGGCUAAAAC UAGA|C|ACC GAGACC|UUC G|agcacc-- --(gaaa--) ----ggugau 1561 cuGGUAGGGG GG|CGUGCCA UG-UGGAUAG AAGUCUCC-C C(GAGA)GGA GGGAUGG|AC 1621 CGCAUGGUAU CGCGGAUCCU GGU|GAAAGU AGCagag-aa GAACCG(UGA GAA-u)CGG- 1681 UUC|CGCCGA AA-GGGCUAG GGUUCCUUGG CAAUG-(UUC GU)CA|GCCG AGGGUUAGU| 1741 CGA|-UCCUA AGGCCAU-AC C(UAACA)GG A--uaugguc GAAG-GGGAA GCCGG(UUAA 1801 UAUU)CCGGC |AC-acugaa c--guuu--- ----(----- )----u-gcc cugu-augag 1861 aagguucagg guaggggc|g guacg(---- -gg-ugccaa ------)cgu auu|-uaugc 1921 uca-uaagcg GAUGGAGAG- UC(GUAAU)G A|CGAGAAGU U--cgcga-a agagcguau- 1981 |--g|uuccc (--guuu)gg gaauc--gcc ucgauccccg -|gaucccau gaaaaucaug 2041 caggg----- gucagguuca g--u|A|UCG UACCAAGAAC C(GACACU)G G|UGCCCCUA 2101 GG(ugagaag )CCUAA-GGC GU||UUUGGG -AU-AAUGGA CGCGAGGGAA AUCGGCAAAA 2161 UAGCUCCGUA UC(UUCG)GU AUAAGGAGU| GCCUAU---u cc-------- -(--gugag) 2221 --------gg a--AUAGGU| C(GCA)GUGA CGAAGGGACU C|CGACUGU( UUACCACAA) 2281 ACACAGAUCG CUG|CUAGUC C(GUA-A)GG AUGUGUAUAG CGGUUGAAAC CUGCCCAGU| 2341 GCUGGUACCU GAA----agc cccg---(ua ca)----agg gga----aGA AGGG|CCAGU 2401 AAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAAUAC CUA|GCCGCU 2461 (UA-----AU U)GGCGGC|U UGCAUGAA-G GUUCAACGUG GGUCCCACUG UCCCCGCGUU 2521 CA-GCCCA|G UGAAAUU|-G AUGU-ACUGG (-UGCACA-A U)CCAGUCUC UCCCACGUGA 2581 AAGCGAAGUC CCCGUGG|AG CUUUACUGC| AGCCUGUAGC U|GUGGUGCG AUCCUGACUG 2641 CGUAGUGUAG GAAGGAGCCG UCG---AAGC UCUGGU(UUC G-)GCCGGAG U-GGAGGCGC 2701 CAAUGAAACA CUUCC|CUCU CGGGAUUGCG UCACuaac-- ----cuc(-u ucg-)gag-- 2761 ---ggacAAC UAUUGGUGGG CAGUUUGGGU G(GGG)CGCC AC|GCCCCU( AACAACGUAA 2821 C)AGGGGC|C |C|CCAAAGG UCAGC|UCAG GAGGG(UCAG AAA)UCCUCC GUAGA|GU(G 2881 |U-AAAA)GC AAAAGCUGGC UUGACUGUGU UGC(agacaa cua-)GCAAC GCAGAUGC(G 2941 AAA)GCAGGG |UUUAGCGAA CCACCCAGU- UCC(uccuua gug)GGG|GC GGGUG-auaa 3001 g|aga--gAA GUUACCC|CA GG|GAUAACU G|AGUCGUCC UCGGCAAGA( GUACACA)UC 3061 GACCCGAGGG UUUGCUACUU CGAUGUC|GU CU|G|UUCCU |AUCC|UGGU GCU(GCAUAA 3121 )GGUGCCAAG GGU|GGGGC( UGUUC)GCCC AUUAAAAGGG AUCCUGAGAU GGGUUCACUA 3181 C|GUC(GCGA )GACAGUAGG GUUGCUUCUC |CGUGGGAG- |UG-CUCGAU GUCU|GA|GG 3241 GGAA-GGGGC CUU(UAGUAC (GAGA)GGAA C)AAGGGCU| CGUGACCUCU AGUUUACCGG 3301 U|UGUCU-(G GCA)-AGGCA -ucGCCGGGU A-GCCACGUC AUACGCGGAU AAGAGCU(GA 3361 AAGCAUCUA) AGCUCGAAGC CGCCCCC|GA A-AAUAGACA UCG|uc|--- ---------- 3421 -------(-- ---)------ ---------- -----aucag a|uc|GCCUC (UAGAAGA)G 3481 AGGUUUGAUA G|AGCCG|GG AUGUAAGGAU Cgagc---(u ucg--)---g cgaGAUUU|U 3541 AAGUCC|A|C GGCUACUAa- agaucg--aa |ggcac---a a------||| || // LOCUS A.fulgid.1 3592 bp RNA RNA ~?~???? DEFINITION Archaeoglobus fulgidus. ACCESSION No information KEYWORDS No information. SOURCE Archaeoglobus fulgidus. ORGANISM Archaeoglobus fulgidus. REFERENCE 1 AUTHORS No information JOURNAL No information TITLE No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: M64487 Phylo:Archaebacteria,? LOCUS ARFRRLSA 2933 bp ss-rRNA RNA 11-FEB-1992 DEFINITION A.fulgicus large subunit ribosomal RNA. ACCESSION M64487 KEYWORDS large subunit ribosomal RNA. SOURCE Archaeoglobus fulgicus (strain VC-16) rRNA. ORGANISM Archaeoglobus fulgicus Prokaryota; Bacteria; Gracilicutes; Scotobacteria; Sulfate- or sulfur-reducing dissimilatory bacteria. REFERENCE 1 (bases 1 to 2933) AUTHORS Woese,C.R., Achenbach,L., Rouviere,P. and Mandelco,L. TITLE Archaeal phylogeny: Reexamination of the phylogenetic position of Archaelglobus fulgidus in light of certain composition-induced artifacts JOURNAL Unpublished (1991) STANDARD full staff_review FEATURES Location/Qualifiers rRNA 1..2933 /product="large subunit ribosomal RNA" BASE COUNT 640 a 791 c 1045 g 456 t 1 others ORIGIN BASE COUNT 640 a 791 c 1045 g 456 t 660 others ORIGIN 1 |--------- ------cggc g--aaUAUCC CGGCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|CCGACGAA GGG|CGUG(C CAAGC)U|GC GAAAAGCCCG GGG|GAGGC( GCAAGGA)GC 121 CUUA-GAACC CGGGAUU|CC -CGAAUGGG( AUAU-)CCU| guccc----- ----(uucg- 181 )--------- gg-gac-gcu ccc----(-- -uua--c--- )-----ggga gcggGAAC|G 241 CGGGGAAAG| GA(AACA)UC UGUGUACCCG CAGGAAAAGA AAGC(AA--A C)GCGA-U|G 301 CCGUGAGUAG G(GGCGA)CC GAAAGCGGCA CAGCCCAaac cgaa-cauc- cgggg(-gua 361 a)cuccg-ga ug-naugugg uguugu-agg a-cccugccu -----(---- -)------aa 421 ugccg--ugg aacgaagcc- GAAgugguc- (uggaac)-g gcccgc|c(g ua)gaGGG(U 481 GAUAGC)CCC GUA-ggcgua aguuccacgg uuac-aaggc gggguauccu GAGUACCGUG 541 GGu(---ugg aau)uCUCGC GGGAAG-CUG GGGG(Ucauc aA-)CCCCCA AGGCUAAAUA 601 CGU-CCCGAG UCCGAUAGCG CAA-UAGUAG G(GUGA)CCG AAAGCUGAAA AGAACCCC(g 661 agaa)GGGGA GUG-AAAAGA GCCUGAAAUC GGCCGGGGau a-gugagcag uggcuc--ga 721 aagg-----u aaucccggca aaggaaagug cu(gaga-)g gcgc---gag uacgag-ccg 781 ggacaugccg --gagucacu gcGUACGU(U CUGAAG|A)A CGG|GCCAGG GAGUGUAC-G 841 GCAGGGGCGA GGCUAAGGag (--ugaaa)c uCCGAAGCCG GAGG(GAAA) CCGaca-gcc 901 cgcagc---- -(uuau)--- --gcgagggg cggGGUG-(u gcu-)-CGCC UGUAGUCCCU 961 GCCGUACGAC CCGAAGCCGG GCGAUCUAGG CGGGGG|CAG GGUGAAGCGG UGC(GAAA)G 1021 CGCCGUGGAG GCCCG-ca-g gg-g-ug-uu (gaugugcaa )aucgcu|cc ucuGACCCCC 1081 GUCUAGGGGU (GAAAG)UCC AAUCGAGCCC GG-GGAUAGC UGGUUCCCCC CGAGACAUAC 1141 (CGCA)GUAU GAC|CCGUCC Gga-|ggu|- ugGCGGUG|G GGUAG|AGC- ACUGAUAGGA 1201 GGaucCG|GG GGG(ugac-) ACCCU-CG-C CCUCCUGUCA AACUCCGAAU C|CACCGCcg 1261 ccgu-aga|U GGGCGG-AGU CAGGGGCA-G -GGG(G-UAA G)CUCC-UGU UCCGAGAGGG 1321 (AGACAA)CC CAGACCGG-G GUUAAGGC|C CCUAAGUGCC GGCUA-AGUG uaaacggauu 1381 AAAGGAGGUC CCGGGUCGAA GACAGCGGGG AGGUAGGC(U UAGAAGCA)G CCACCCU|U( 1441 UAA)A|GAGU GC(GUAACA) GC|UCACCCG CCGAGACUCG GGGC|GCCGA AAAUGGA-CG 1501 GGGCUCAAGC CGGC|C|GCC GAUACC|CCG G|ggcucc-- --(gaaa--) ----ggagau 1561 ccGGUAGGGG GG|CGUGCCG AU-GGCGCAG AAGCCUGA-G C(GUAA)GCU CGGGUGG|AG 1621 CCGUCGGUAA CGAAUAUCCU GGC|GGUAGU AGCagca-ua GUCGGG(UGA GAA-U)CCC- 1681 GAC|CGCCGG AA-GGGCAAG GGUUCCACGG CAAUG-(UUC GU)CA|GCCG UGGGUGAGU| 1741 CGG|-UCCUA AGGUGGC-CC G(UAACU)CG G--cgccgcc GAAAaGGGAA GCCGG(UUAA 1801 UAUU)CCGGC |AC-cgcugc cc----gc-- ugcc(uuuau )ggca----- ------cccg 1861 acgccucggg auaggccg|g gcggg(---- -agcugccag ------)ccc guc|-caagc 1921 guc-gaagcc CCCGGAGAU- CC(GUAAU)G G|AGAGAAGG G--gguga-a ggcguga-ug 1981 |---|---gc (--guaa)gc -------cgg ccgacuccug -|gggcccgu gaaaaggg-- 2041 ---------- ----gggcag uugg|A|CCG UACCGAGAAC C(GACACA)G G|UGCCC-UG 2101 GG(ugaguag )CCUAA-GGC GU||GGCGGA -AC-AUUCCG GCCGAGGGAA UUCGGCAAGU 2161 UGGCCCCGUA UC(UUUG)GU AGAAGGGGU| GCCUGC---g guuu------ -(---agc-) 2221 ------agac c--GCAGGU| C(GCA)GUGA CUAGGGGGGA G|CGACUGU( UUACUAAAA) 2281 ACACAGGGGG CUG|CAAUCC G(GCA-A)CG GUUUGUACAG CCCCUGAGUC CUGCCCAGU| 2341 GCGGGUACCU GAA----acc cggg---(ua ca)----acc ggg----CGA AGGG|CCCGU 2401 AAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAAUAC CUU|GCCGCU 2461 (UA-----AU U)GGCGGC|U UGCAUGAACG GAUUAACGAC UCCCCCACUG UCCCCGGCCG 2521 GA-AGCCG|G CGAACCC|-G ACAU-CCCAG (-UGAAGA-G U)CUGGGGAC CCCCGUUGGG 2581 AAGCGAAGAC CCUGUGG|AG CUUUACUGC| AGCCUGCCGU U|GCUCCACA GCUGGGGGUG 2641 CGCAGGGUAG GCGGGAGGCU UCG---AAGC CCCGUC(UCC G-)GGCGGGG U-GGAGCCGU 2701 CGAUGAGACA CCGCC|CACC UCCAGCUGUG GAGCuaac-- ----ucc(cg cug-)gga-- 2761 ---ggacAUC GGUAGGUGGG CAGUUUGGGU G(GGG)CGCC AC|UCCCCC( GAAAAGGUAU 2821 C)GGGGGA|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AAA)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACGUGAA CCC(gcaaaa uaa-)GGGUU CACGAGCC(G 2941 AAA)GGCGGG |CCUAGCGAA CCACCCUGG- CCU(--uuug gug)AGG|CC GGGUG-acga 3001 c|aga--aAA GCUACCC|CA GG|GAUAACA G|AGUUGUCC CCGGCGAGA( GUACAUA)UC 3061 GACCCGGGGG CUUGCUACCU CGAUGUC|GG CU|C|UCCCC |AUCC|UGGC CGU(GCAGCA 3121 )GCGGCCAAG GGU|GAGGU( UGUUC)GCCU AUUAAAGGGG AUCGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GUUGCUAUCU |AACGGGGG- |UGucAGGGC GGCU|GA|GG 3241 GGAA-GGAGG CUC(UAGUAC (GAGA)GGAA C)GAGCCUC| CGGCGCCACU GGUUUACCGG 3301 U|UGUCC-(G GCA)-GGGCA -ucGCCGGGC A-GCUACGCG CCACGCGGUU AAAGGCU(GA 3361 AAGCAUCUA) AGCCUGAAAC CGCCCCC|GA A-AAGAGCCG CCC|au|--- ---------- 3421 -------(-- ---)------ ---------- ------uaag g|gc|UCCCG (UAAAAGA)C 3481 GGGGUUGAUA G|GACCG|GG GUGUAAGCGG Cgagc---(u ucg--)---g cgaGCCGU|U 3541 CAGCCC|G|C GGUCACUAau cgcccg---- |cgccg---- -------||| || // LOCUS A.fulgid.2 3592 bp RNA RNA 24-NOV-1997 DEFINITION Archaeoglobus fulgidus.; . ACCESSION AE0009 6 AE00 782 KEYWORDS . SOURCE Archaeoglobus fulgidus. ORGANISM Archaeoglobus fulgidus. REFERENCE 1 (bases 1 to 10740) AUTHORS Fleischmann,R.D., Quackenbush,J., Lee,N.H., Sutton,G.G., Gill ,S., Kirkness,E.F., Dougherty,B.A., McKenney,K., Adams,M.D., Loftus,B., Peterson,S., Reich,C.I., McNeil,L.K., Badger,J.H., Glodek,A., Zhou,L., Overbeek,R., Gocayne,J.D., Weidman,J.F., McDonald,L., Utterback,T., Cotton,M.D., Spriggs,T., Artiach,P., Kaine,B.P., Sykes,S.M., Sadow,P.W., D'Andrea,K.P., Bowman,C., Fujii,C., Garland,S.A., Mason,T.M., Olsen,G.J., Fraser,C.M., Smith,H.O., Woese,C.R. and Venter,J.C. Klenk,H.P., Clayton,R.A., Tomb,J., White,O., Nelson,K.E., Ketchum,K.A., Dodson,R.J., Gwinn,M., Hickey,E.K., Peterson,J.D ., Richardson,D.L., Kerlavage,A.R., Graham,D.E., Kyrpides,N.C., TITLE sulphate-reducing archaeon Archaeoglobus fulgidus The complete genome sequence of the hyperthermopholic, JOURNAL Unpublished (1997) STANDARD No information REFERENCE 2 (bases 1 to 10740) AUTHORS Klenk,H.P., Clayton,R.A., Tomb,J., White,O., Nelson,K.E., Ketchum,K.A., Dodson,R.J., Gwinn,M., Hickey,E.K., Peterson, J.D., Richardson,D.L., Kerlavage,A.R., Graham,D.E., Kyrpides,N.C., Fleischmann,R.D., Quackenbush,J., Lee,N.H., Sutton,G.G., Gill,S., Kirkness,E.F., Dougherty,B.A., McKenney,K., Adams,M.D., Loftus,B., Peterson,S., Reich,C.I., McNeil,L.K., Badger,J.H., Glodek,A., Zhou,L., Overbeek,R., Gocayne,J.D., Weidman,J.F., McDonald,L., Utterback,T., Cotton,M.D., Spriggs,T., Artiach,P., Kaine,B.P., Sykes,S.M., Sadow,P.W., D'Andrea,K.P., Bowman,C., Fujii,C., Garland,S. A., Mason,T.M., Olsen,G.J., Fraser,C.M., Smith,H.O., Woese, C.R. and Venter,J.C. TITLE Direct Submission JOURNAL Submitted (24-NOV-1997) The Institute for Genomic Research, 9712 Medical Center Dr, Rockville, MD 20850, USA STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AE000966 (AE000782) BASE COUNT 641 a 790 c 1046 g 455 t 660 others ORIGIN 1 |--------- ------cggc g--aaUAUCC CGGCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|CCGACGAA GGG|CGUG(C CAAGC)U|GC GAAAAGCCCG GGG|GAGGC( GCAAGGA)GC 121 CUUA-GAACC CGGGAUU|CC -CGAAUGGG( AUAU-)CCU| guccc----- ----(uucg- 181 )--------- gg-gac-gcu ccc----(-- -uua--c--- )-----ggga gcggGAAC|G 241 CGGGGAAAG| GA(AACA)UC UGUGUACCCG CAGGAAAAGA AAGC(AA--A C)GCGA-U|G 301 CCGUGAGUAG G(GGCGA)CC GAAAGCGGCA CAGCCCAaac cgaa-cauc- cgggg(-gua 361 a)cuccg-ga ug-gaugugg uguugu-agg a-cccugccu -----(---- -)------aa 421 ugccg--ugg aacgaagcc- GAAgugguc- (uggaac)-g gcccgc|c(g ua)gaGGG(U 481 GAUAGC)CCC GUA-ggcgua aguuccacgg uuac-aaggc gggguauccu GAGUACCGUG 541 GGu(---ugg aau)uCUCGC GGGAAG-CUG GGGG(Ucauc aA-)CCCCCA AGGCUAAAUA 601 CGU-CCCGAG UCCGAUAGCG CAA-UAGUAG G(GUGA)CCG AAAGCUGAAA AGAACCCC(g 661 agaa)GGGGA GUG-AAAAGA GCCUGAAAUC GGCCGGGGau a-gugagcag uggcuc--ga 721 aagg-----u aaucccggca aaggaaagug cu(gaga-)g gcgc---gag uacgag-ccg 781 ggacaugccg --gagucacu gcGUACGU(U CUGAAG|A)A CGG|GCCAGG GAGUGUAC-G 841 GCAGGGGCGA GGCUAAGGag (--ugaaa)c uCCGAAGCCG GAGG(GAAA) CCGaca-gcc 901 cgcagc---- -(uuau)--- --gcgagggg cggGGUG-(u gcu-)-CGCC UGUAGUCCCU 961 GCCGUACGAC CCGAAGCCGG GCGAUCUAGG CGGGGG|CAG GGUGAAGCGG UGC(GAAA)G 1021 CGCCGUGGAG GCCCG-ca-g gg-g-ug-uu (gaugugcaa )aucgcu|cc ucuGACCCCC 1081 GUCUAGGGGU (GAAAG)UCC AAUCGAGCCC GG-GGAUAGC UGGUUCCCCC CGAGACAUAC 1141 (CGCA)GUAU GAC|CCGUCC Gga-|ggu|- ugGCGGUG|G GGUAG|AGC- ACUGAUAGGA 1201 GGaucCG|GG GGG(ugac-) ACCCU-CG-C CCUCCUGUCA AACUCCGAAU C|CACCGCcg 1261 ccgu-aga|U GGGCGG-AGU CAGGGGCA-G -GGG(G-UAA G)CUCC-UGU UCCGAGAGGG 1321 (AGACAA)CC CAGACCGG-G GUUAAGGC|C CCUAAGUGCC GGCUA-AGUG uaaacg-a-u 1381 AAAGGAGGUC CCGGGUCGAA GACAGCGGGG AGGUAGGC(U UAGAAGCA)G CCACCCU|U( 1441 UAA)A|GAGU GC(GUAACA) GC|UCACCCG CCGAGACUCG GGGC|GCCGA AAAUGGA-CG 1501 GGGCUCAAGC CGGC|C|GCC GAUACC|CCG G|ggcucc-- --(gaaa--) ----ggagau 1561 ccGGUAGGGG GG|CGUGCCG AU-GGCGCAG AAGCCUGAGG C(GUAA)GCU CGGGUGG|AG 1621 CCGUCGGUAA CGAAUAUCCU GGC|GGUAGU AGCagca-ua GUCGGG(UGA GAA-U)CCC- 1681 GAC|CGCCGG AA-GGGCAAG GGUUCCACGG CAAUG-(UUC GU)CA|GCCG UGGGUGAGU| 1741 CGG|-UCCUA AGGUGGC-CC G(UAACU)CG G--cgccgcc GAAAaGGGAA GCCGG(UUAA 1801 UAUU)CCGGC |AC-cgcugc cc----gc-- ugcc(uuuau )ggca----- ------cccg 1861 acgccucggg auaggccg|g gcggg(---- -agcugccag ------)ccc guc|-caagc 1921 guc-gaagcc CCCGGAGAU- CC(GUAAU)G G|AGAGAAGG G--gguga-a ggcguga-ug 1981 |---|---gc (--guaa)gc -------cgg ccgacuccug -|gggcccgu gaaaaggg-- 2041 ---------- ----gggcag uugg|A|CCG UACCGAGAAC C(GACACA)G G|UGCCC-UG 2101 GG(ugaguag )CCUAA-GGC GU||GGCGGA -AC-AUUCCG GCCGAGGGAA UUCGGCAAGU 2161 UGGCCCCGUA UC(UUUG)GU AGAAGGGGU| GCCUGC---g guuu------ -(---agc-) 2221 ------agac c--GCAGGU| C(GCA)GUGA CUAGGGGGGA G|CGACUGU( UUACUAAAA) 2281 ACACAGGGGG CUG|CAAUCC G(GCA-A)CG GUUUGUACAG CCCCUGAGUC CUGCCCAGU| 2341 GCGGGUACCU GAA----acc aggg---(ua ca)----acc ggg----CGA AGGG|CCCGU 2401 AAAC|GGC|G GGGGU(AACU AUG)ACCCUC |U(UAA)GGU AGCGUAAUAC CUU|GCCGCU 2461 (UA-----AU U)GGCGGC|U UGCAUGAACG GAUUAACGAC UCCCCCACUG UCCCCGGCCG 2521 GA-AGCCG|G CGAACCC|-G ACAU-CCCAG (-UGAAGA-G U)CUGGGGAC CCCCGUUGGG 2581 AAGCGAAGAC CCUGUGG|AG CUUUACUGC| AGCCUGCCGU U|GCUCCACA GCUGGGGGUG 2641 CGCAGGGUAG GCGGGAGGCU UCG---AAGC CCCGUC(UCC G-)GGCGGGG U-GGAGCCGU 2701 CGAUGAGACA CCGCC|CACC UCCAGCUGUG GAGCuaac-- ----ucc(cg cug-)gga-- 2761 ---ggacAUC GGUAGGUGGG CAGUUUGGGU G(GGG)CGCC AC|UCCCCC( GAAAAGGUAU 2821 C)GGGGGA|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AAA)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACGUGAA CCC(gcaaaa uaa-)GGGUU CACGAGCC(G 2941 AAA)GGCGGG |CCUAGCGAA CCACCCUGG- CCU(--uuug gug)AGG|CC GGGUG-acga 3001 c|aga--aAA GCUACCC|CA GG|GAUAACA G|AGUUGUCC CCGGCGAGA( GUACAUA)UC 3061 GACCCGGGGG CUUGCUACCU CGAUGUC|GG CU|C|UCCCC |AUCC|UGGC CGU(GCAGCA 3121 )GCGGCCAAG GGU|GAGGU( UGUUC)GCCU AUUAAAGGGG AUCGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GUUGCUAUCU |AACGGGGG- |UGucAGGGC GGCU|GA|GG 3241 GGAA-GGAGG CUC(UAGUAC (GAGA)GGAA C)GAGCCUC| CGGCGCCACU GGUUUACCGG 3301 U|UGUCC-(G GCA)-GGGCA -ucGCCGGGC A-GCUACGCG CCACGCGGUU AAAGGCU(GA 3361 AAGCAUCUA) AGCCUGAAAC CGCCCCC|GA A-AAGAGCCG CCC|au|--- ---------- 3421 -------(-- ---)------ ---------- ------uaag g|gc|UCCCG (UAAAAGA)C 3481 GGGGUUGAUA G|GACCG|GG GUGUAAGCGG Cgagc---(u ucg--)---g cgaGCCGU|U 3541 CAGCCC|G|C GGUCACUAau cgcccg---- |cgccg---- -------||| || // LOCUS T.celer 3592 bp RNA RNA 11-JAN-191 DEFINITION Thermococcus celer. ACCESSION No information KEYWORDS No information. SOURCE Thermococcus celer. ORGANISM Thermococcus celer. REFERENCE 1 (bases 1 to 10740) AUTHORS Woese,C.R. and Achenbach,L. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: M67497 LOCUS THCLRRNA 3029 bp ss-rRNA RNA 14-APR-1992 DEFINITION T.celer 23S ribosomal RNA gene sequence. ACCESSION M67497 KEYWORDS 23S ribosomal RNA; large subunit ribosomal RNA. SOURCE Thermococcus celer (strain VU 13) rRNA. ORGANISM Thermococcus celer Prokaryota; Archaeobacteria; Methanobacteriales; Methanothermaceae. REFERENCE 1 (bases 1 to 3029) AUTHORS Woese,C.R. and Achenbach,L. TITLE The sequence of Thermococcus celer 23S rRNA JOURNAL Unpublished (1991) STANDARD full staff_entry FEATURES Location/Qualifiers rRNA 1..3029 /product="23S ribosomal RNA" /gene="23S rRNA" BASE COUNT 631 a 847 c 1096 g 455 t ORIGIN BASE COUNT 631 a 847 c 1096 g 455 t 563 others ORIGIN 1 |--------- -ggggcagag a-accUAAGC CGUCUGGUGG AUG|GCUCGG |CUCGGGg|c 61 g|cCGACGAA GGG|CGUG(G CAAGC)U|GC GAUAAGCCCC GGC|GAGGC( GCAGGCA)GC 121 CGUC-GAACC GGGGAUU|CC -CGAAUGGG( ACCU-)CCC| gcggc----- ----(uuuu- 181 )--------- gc-cgc-acu ccc----(-- -agu--c--- )-----ggga ggggGAAC|G 241 CGGGGAAUU| GA(AACA)UC UUAGUACCCG CAGGAAAAGA AAGC(AA--A A)GCGA-U|G 301 CCGUUAGUAG G(GGCGA)CC GAAAGCGGCA CAGGGCAaac ugaa-cccu- ccggg(-gaa 361 a)cccgg-ag gg-gaugugg uguagu-agg g-cccugca- -----(---- -)-----cug 421 gagccucgag ggugaagcc- GAAguccgc- (uggaac)-g cggcgc|c(g ua)gaGGG(U 481 GAAAGC)CCC GUA-ggcgua agcccucagg cucc---ugc agggu-uccu GAGUACCGUC 541 GGu(---cgg aua)uCCGGC GGGAAG-CUG GGAG(Gcauc gG-)CUCCCA ACCCUAAAUA 601 cg-uCCCGAG ACCGAUAGCG AAC-UAGUAC C(GUGA)GGG AAAGCUGAAA AGCACCCC(u 661 -ggc)AGGGG GUG-AAAAGA GCCUGAAACC AGACGGCGau a-ggugggug cggccc--ga 721 aaggguuga- -cccuccccg aaggaaacac gg(gcga-)c cgug---gag uacgag-ggg 781 aggc-gaccg --ggguugca ccGUCCGU(C UUGGAU|C)A CGG|GGCAGG GAGUUCAC-G 841 GCCGUGGCGA GGUUAAgggg (--guuaa)c cccGAAGCCA CAGG(GAAA) CCGacagguc 901 cgcagccc-- -(guaa)--- gggugaggga cggGGUGU(g aaa-)GCGCC CGGAGUCACG 961 GCCGUGAGAC CCGAAGCCGG UCGAUCUAGC CCGGGG|CAG GGUGAAGUCC CUC(AACA)G 1021 AGGGAUGGAG GCCCG-auag gg-g-ug-cu (gacgugcaa )uucgcu|cc cguGACCCCG 1081 GGCUAGGGGU (GAAAG)GCC AAUCGAGGCC GG-CGAUAGC UGGUUCCCGC CGAAUUAUCC 1141 (CUCA)GGAU AGC|CCGGCC Gga-|ggu|- AGGUGGUG|G GGUAG|AGC- ACUGAUUGGG 1201 GGuuuAG|GG GGA(gaaa-) UCCCC-CG-G CUCCCUGUCA AACUCCGAAC C|CACUGCCG 1261 CCgu-aga|a ggccgg--AG UAGGGUGA-C -GGU(G-UAA G)-CCG-UCA ACCGAGAGGG 1321 (GAACAA)CC CAGACCGG-G GUUAAGGC|C CCUAAAUGCC GGCUA-AGUG uu-a--cucc 1381 aaaGGGCGUC CCUGGCCUUA GACAGCGGGG AGGUAGGC(U UAGAAGCA)G CCAUCCU|U( 1441 UAA)A|GAGU GC(GUAACA) GC|UCACCCG UCGAGGUCAG GGGC|CCCGA AAAUGGA-CG 1501 GGGCUUAAGC CGGC|U|GCC GAGACC|CCG G|cgcacgga cc(gauu--) gguccgugau 1561 cgGGUAGGCG GG|CGUGCCG GU-GGGGUGG AAGCCGGG-C C(GUAA)GGU CCGGUGG|AC 1621 CCGUCGGUAU UGUGGAUCCU GCC|GGGAGU AGCagca-ua GCCGGG(UGA GAA-U)CCC- 1681 GGC|CGCCGA AG-GGGCCAG GGUUCCACGG CAAUG-(UUC AU)CA|GCCG UGGGUUAGU| 1741 CGG|-UCCUA AGCCAGU-CC G(UAACU)CG G--cgcuggc GAAA-GGGAA ACGGG(UUUA 1801 UAUU)CCCGU |AC-cgcggu gg-uagg--- ugcg(-gcaa )cgcaa-gcc c-gaggggug 1861 acgccucggg guaggcgg|a ccggc(---- --ccacaag- ------)gcc ggc|-uaagc 1921 gua-uaaguc CGGGGAGUG- CC(GUAAU)G G|CGAGAACC G--gauga-a agcgcgaau- 1981 |ggc|cuccc (--guaa)gg gggguu-ccg ccgaucccug -|gggcccgu gaaaagcccu 2041 c-gggaa--- cgauccaccg c--g|A|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagaag )CCUAA-GGC GU||GUCGGG -GGAAACCCA GCCGAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AUAAGGGGU| GCCUGC---g ggu------- -(--gcgua) 2221 -------acc c--GCAGGU| C(GCA)GUGA CUCGGGGGAC C|CGACUGU( UUAGUAAAA) 2281 ACACAGGUCC CAG|CUAGCC C(GAA-A)GG GUUUGUACUG GGGCCGACGC CUGCCCAGU| 2341 GCCGGUAUGU GAA----gcc cggg---(uc ca)----acc ggg----uGA AGCA|CCGGU 2401 AAAC|GGC|G GGGGU(AACU AUA)ACCCUC |U(UAA)GGU AGCGAAAUUC CUU|GUCGGU 2461 (UA-----AA U)GCCGAC|C UGCAUGAAUG GCGUAACGAG GUCCCCACUG UCCCCGGCUG 2521 GG-GCCCG|G CGAAACC|-A CUG--CCAGG (-CGCAUA-U G)CCUGGGAC CUCCGGUGGG 2581 AAGCGAAGAC CCCAUGG|AG CUUUACUGC| AGCCUGCCGU U|GCCGUACG GCGGGGGGUG 2641 CGCAGCGUAG GCGGGAGGCG UCG---AAGC CCGCCU(UCC G-)GGGCGGG U-GGAGCCGU 2701 CCAUGAGACA CCGCC|CACC CUCUGCCGUA CGGCuaac-- ---cccc(-- gac-)gggg- 2761 ---ggacAGC GGUAGGUGGG CAGUUUGGCU G(GGG)CGGC AC|ACCCUC( GAAAAGGUAU 2821 C)GAGGGU|G |C|CCUAAGG UCGGC|UCAG GCGGG(UCAG GAA)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACUGGAC CCG(uaacag agg-)CGGGU CCAGCCGC(G 2941 AAA)GCGUGG |CCUAGCGAA CCCCUGUGC- CUC(--cccg gug)GGG|GC CAGGG-auga 3001 c|aga--aAA GCUACCC|UG GG|GAUAACA G|AGUCGUCU CGGGCGAGA( GCCCAUA)UC 3061 GACCCCGAGG CUUGCUACCU CGCUGUC|GG CU|C|UUCCC |AUCC|UGGC CCU(GCAGCA 3121 )GGGGCCAAG GGU|GGGGG( UGUUC)ACCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GAUGCUAUCU |ACCGGAGG- |UG-UUGGCC GCCU|GA|GG 3241 GGAA-GGCUC CCC(CAGUAC (GAGA)GGAA C)AGGGAGC| CGCGGCCUCU GGUCUACCGG 3301 U|UGUCC-(U ACA)-GGGCA -caGCCGGGC A-GCUACGCC GUG-UCCGAU AAGGCCU(GA 3361 AAGCAUCUA) AGGCCGAAGC GGUCCCC|GA A-AAUAGGCG GCC|ac|ucc caggcgcagg 3421 ggguc--(gg gc-)gaccgg uccuuugccu ggga--cgag g|gc|UCGGG (AAGAAGA)C 3481 CCGUUUGAUG G|GGCGG|GG AUGUAAGCGG Gaagg---(g aaa--)---c cgaCCCGU|U 3541 CAGUCU|G|C CGCUCCCAau cgcccgagg- |uuucugccu c------||| || // LOCUS P.furiosus 3592 bp RNA RNA 11-JAN-191 DEFINITION Pyrococcus furiosus. ACCESSION No information KEYWORDS No information. SOURCE Pyrococcus furiosus. ORGANISM Pyrococcus furiosus. REFERENCE 1 (bases 1 to 10740) AUTHORS Kjems,J., Larsen,N., Dalgaard,J.Z., Garrett,R.A. and Stetter,K.O. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: M86627 LOCUS PYWRGGA 2818 bp ds-DNA BCT 06-FEB-1992 DEFINITION P.furiosus 23S ribosomal RNA, partial cds. ACCESSION M86627 KEYWORDS 23S ribosomal RNA. SOURCE Pyrococcus furiosus DNA. ORGANISM Pyrococcus furiosus Prokaryota; Archaeobacteria; Thermococcales; Thermococcaceae. REFERENCE 1 (bases 1 to 2818) AUTHORS Kjems,J., Larsen,N., Dalgaard,J.Z., Garrett,R.A. and Stetter,K.O. TITLE Phylogenetic relationships amongst the hyperthermophilic archaea determined from partial 23S rRNA gene sequences JOURNAL Syst. Appl. Microbiol. (1992) In press STANDARD full staff_review FEATURES Location/Qualifiers rRNA 1..1656 /partial /product="23S ribosomal RNA" /standard_name="23S rRNA" 3'UTR 1657..2818 /standard_name="23S rRNA" BASE COUNT 647 a 666 c 876 g 526 t 103 others ORIGIN BASE COUNT 318 a 489 c 597 g 251 t 1937 others ORIGIN 1 |~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~|~~~~~~ |~~~~~~~|~ 61 ~|~~~~~~~~ ~~~|~~~~(~ ~~~~~)~|~~ ~~~~~~~~~~ ~~~|~~~~~( ~~~~~~~)~~ 121 ~~~~~~~~~~ ~~~~~~~|~~ ~~~~~~~~~( ~~~~~)~~~| ~~~~~~~~~~ ~~~~(~~~~~ 181 )~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~(~~ ~~~~~~~~~~ )~~~~~~~~~ ~~~~~~~~|~ 241 ~~~~~~~~~| ~~(~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~)~~~~~~|~ 301 ~~~~~~~~~~ ~(~~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ 361 ~)~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ ~)~~~~~~~~ 421 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~)~~ ~~~~~~|~(~ ~~)~~~~~(~ 481 ~~~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 541 ~~~(~~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ 601 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~(~ 661 ~~~~)~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 721 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~(~~~~~)~ ~~~~~~~~~~ ~~~~~~~~~~ 781 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~|~)~ ~~~|~~~~~~ ~~~~~~~~~~ 841 ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~~)~ ~~~~~~~~~~ ~~~~(~~~~) ~~~~~~~~~~ 901 ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~)~~~~~ ~~~~~~~~~~ 961 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~|~~~ ~~~~~~~~~~ ~~~(~~~~)~ 1021 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~~~~ )~~~~~~|~~ ~~~~~~~~~~ 1081 ~~~~~~~~~~ (~~~~~)~~~ ~~~~~~~~~~ ~~-~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1141 (~~~~)~~~~ ~~~|~~~~~~ ~~~~|~~~|- ~~~~~~~~|~ ~~~~~|~~~~ ~~~~~~~~~~ 1201 ~~~~~~~|~~ ~~~(~~~~~) ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~|~~~~~~~~ 1261 ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~-~ ~~~~(~~~~~ ~)~~~~~~~~ ~~~~~~~~~~ 1321 (~~~~~~)~~ ~~~~~~~~~~ ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1381 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~~~)~ ~~~~~~~|~( 1441 ~~~)~|~~~~ ~~(~~~~~~) ~~|~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~~ ~~~~~~~~~~ 1501 ~~~~~~~~~~ ~~~~|~|~~~ ~~~~~~|~~~ ~|~~~~~~~~ ~~(~~~~~~) ~~~~~~~~~~ 1561 ~~~~~~~~~~ ~~|~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~|~~ 1621 ~~~~~~~~~~ ~~~GGAUCCU AGC|GGGAGU AGCagcg-ua GUCGGG(UGA GAA-U)CCC- 1681 GAC|CGCCGG AG-GGGCCAG GGUUCCACGG CAAUG-(GUC GU)CA|GCCG UGGGUUAGU| 1741 CGG|-UCCUA ACCCCGC-CC G(UAACU)CG G--CGCGGGG GAAA-GGGAA ACGGG(UUUA 1801 UAUU)CCCGU |AC-cgcggg gg-uagg--- ugcg(-gcaa )cgcaa-gcc cgga-gggug 1861 acgccucggg guaggcgg|a ccggc(---- --cgaugag- ------)gcc ggc|-uaagc 1921 gua-uaagcc CGGGGAGUG- CC(GUAAU)G G|CGAGAACC G--gguga-a agcgcgaau- 1981 |ggc|ccccc (--guua)gg gggguu-ccg ccgaucccug -|gggcccgu gaaaagccu- 2041 ccgggaauuc cgaucccccg c--G|A|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagaag )CCUAA-GGC GU||GUCGGG -GGAAACCCG GCCGAGGGAA CUCGGCAAAC 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g ggu------- -(--gcgua) 2221 -------acc c--GCAGGU| C(GCA)GUGA CUAGGGGGGC C|CGACUGU( UUAAUAAAA) 2281 ACACAGGUCC CAG|CUAGCC C(GAA-A)GG GUUUGUACUG GGGCCGACGC CUGCCCAGU| 2341 GCCGGUAUGU GAA----gcc cggg---(ua ca)----acc ggg----uGA AGCA|CCGGU 2401 AAAC|GGC|G GGGGU(AACU AUA)ACCCUC |U(UAA)GGU AGCGAAAUUC CUU|GUCGGU 2461 (UA----aAA U)GCCGAC|C UGCAUGAAUG GCGUAACGAG GUCCCCGCUG UCCCCGGCCG 2521 GG-GCCCG|G CGAAACC|-U CUG--CCUGG (-CGCGCA-U G)CCAGGGAC CCCCGGUGGG 2581 AAGCGAAGAC CCCAUGG|AG CUUUACUGC| AGCCUGCCGU U|GCCACACG GCGGGGGGUG 2641 CGCAGCGUAG GCGGGAGGCG UCG---AAGC CCGGCC(UCC G-)GGUCGGG U-GGAGCCGU 2701 CCAUGAGACA CCGCC|CACU CCCCGCUGUG UGGCuaac-- ---cccc(-g aaa-)gggg- 2761 ---ggacAGC GGUAGGUGGG CAGUUUGGCU G(GGG)CGGC AC|GCCUCC( GAAAAGGUAU 2821 C)GGAGGC|G |C|CCUAAGG UCGGC|UCAG GCGGG(UCAG GAA)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACUGGAC CCG(uaacag agg-)CGGGU CCAGCCCC(G 2941 AAA)GGGUGG |CCUAGCGAA CCCCUGUGC- CUC(--cccg gug)GGG|GC CAGGG-auga 3001 c|aga--aAA GCUACCC|UG GG|GAUAACA G|AGUCGUCU CGGGCGAGA( GCCCAUA)UC 3061 GACCCCGAGG CUUGCUACCU CGCUGUC|GG CU|C|UUCCC |AUCC|UGGC CCU(GCAGCA 3121 )GGGGCCAAG GGU|GGGGG( UGUUC)ACCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GAUGCUAUCU |ACCGGGGG- |UG-UUGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCN(UAGUAC (GAGA)GGAA C)AGGGUGC| CGCGGCCUCU GGUCUACCGG 3301 U|UGUCC-(U CCC)-GGGCA -ucGCCGGGC A-GCUACGCC GCA-GCCGAU AAGGCCU(GA 3361 AAGCAUCUA) AGGCC--AAC GGCCCCC|GA A-AAUAGGCG GCC|gu|ucc cuggcgcuug 3421 ggccugg(gc ga-)cc-ggc ccau-ugcca ggga--cgag g|gc|UCGGG (UAGAAGA)C 3481 CCGGUUGAUA G|GGCGG|GG AUGUAAGCGG Gaagg---(g ucaa-)---c cgaCCCGC|U 3541 UAGUCC|G|C CGCCCCUAau cgcccgaggu |ccugccccu cg-----||| || // LOCUS P.horikosh 3592 bp RNA RNA 26-JUN-1998 DEFINITION Pyrococcus horikoshii strain OT3.; . ACCESSION AP0000 1 KEYWORDS . SOURCE Pyrococcus horikoshii strain OT3. ORGANISM Pyrococcus horikoshii strain OT3. REFERENCE 1 (sites) AUTHORS Kawarabayasi,Y., Sawada,M., Horikawa,H., Haikawa,Y., Hino,Y., Yamamoto,S., Sekine,M., Baba,S., Kosugi,H., Hosoyama,A., Nagai ,Y., Sakai,M., Ogura,K., Otuka,R., Nakazawa,H., Takamiya,M., Ohfuku,Y., Funahashi,T., Tanaka,T., Kudoh,Y., Yamazaki,J., Kushida,N., Oguchi,A., Aoki,K., Nakamura,Y., Robb,T.F., Horikoshi,K., Masuchi,Y., Shizuya,H. and Kikuchi,H. TITLE Complete Sequence and Gene Organization of the Genome of a Hyper-thermophilic Archaebacterium, Pyrococcus horikoshii OT3 JOURNAL DNA Research 5, 55-76 (1998) STANDARD No information REFERENCE 2 (bases 1 to 287000) AUTHORS Tanaka,T., Kawarabayasi,Y. and Kikuchi,H. TITLE Direct Submission JOURNAL Submitted (11-JUN-1998) to the DDBJ/EMBL/GenBank databases. Yutaka Kawarabayasi, National Institute of Technology and Evaluation, Biotechnology Center; 2Chome 49-10 Nishihara, Shibuya-ku, Tokyo 151-0066, Japan (E-mail: genomeOT3@nite.go.jp, Tel:+81-3-3481-8951, Fax: +81-3-3481-8424) STANDARD No information COMMENTS Organism information Culture collection: ORIGINAL ACCESSION # AB009472 is no longer available from GB Sequence information (bases 1 to 3592) Corresponding GenBank entry: AP000001 Kawarabayasi, Y. is officially affiliated with the National Institute of Bioscience and Human-Technology, Tsukuba, Ibaraki 305-0046, Japan. Robb, T. F. is at the Center of Marine Biotechnology, University of Maryland, Baltimore, MD, USA. Horikoshi, K. is at the Japan Marine Science and Technology Center, Yokosuka, Kanagawa 237-0061, Japan. Masuchi, Y. is at the University of Tokyo, Meguro, Tokyo 153-0041, Japan. Shizuya, H. is at the California Institute of Technology, Pasadena, CA, USA. The other authors are at the National Institute of Technology and Evaluation, 2-49-10 Nishihara, Shibuya, Tokyo 151-0066, Japan. The sequence data of the fosmid clones for the entire genomic sequence are deposited in DDBJ/Genbank/EMBL with accession numbers AB009464-AB009531 All the sequence with length 100 codons or more between ATG or GTG and stop codon are defined as CDS Homology analysis is performed by Smith-Waterman algorithm against GenBank and GenPept release 103; EMBL release 52.0; SwissProt release 34.0; PIR-Protein release 54.0; and OWL release 29.5. E-mail address for comments and questions: genomeOT3@nite.go.jp Restriction map, ORF organization, sequence alignment and more information are available at W.W.W. site of Biotechnology Center, URL: http://www.bio.nite.go.jp/. ORIGINAL ACCESSION NUMBER: AB009472 (no longer available from GenBank) BASE COUNT 607 a 876 c 1138 g 433 t 538 others ORIGIN 1 |-------ag ggggucagga c--ucUAaGC CGCCCGGUGG AUG|GCUCGG |CUCGGGg|c 61 g|cCGACGAA GGG|CGUG(G CAAGC)U|GC GAUAAGCCCC GGC|GAGGC( GCAGGCA)GC 121 CGUC-GAACC GGGGAUU|CC -CGAAUGGG( accu-)CCC| gcggc----- ----(uuau- 181 )--------- gc-cgc-acu ccgggg-(-- -uuu-au--- )--ccccgga ggggGAAC|G 241 CGGGGAAUU| GA(AACA)UC UUAGUACCCG CAGGAAAAGA AAGC(aa--a a)GCGA-U|G 301 CCGUGAGUAG G(GGCGA)CC GAAAACGGCA GAGGGCAaac ugaa-cccc- ggacc(gacg 361 a)gguuc-gg gg-gaugugg gguugu-agg gcccccguau g----(---- -)-----aga 421 cccuc--gcg ggugaagcc- GAAguccgc- (uggaac)-g cggcgc|c(g ga)gaGGG(U 481 GAUAGC)CCC GUA-GGCgua agcccgcagg gucu-----c gggggacccu GAGUACCGUC 541 GGU(---ugg aua)UCCGGC GGGAAG-CUG GGAG(GCAUC GG-)CUCCCA ACCCUAAAUA 601 cg-uCCCGAG ACCGAUAGCG AAC-UAGUAC C(GUGA)GGG AAAGCUGAAA AGCACCCC(g 661 ggag)GGGGG -UG-AAAAGA GCCUGAAACC GGGCGGCGau a-ggagggug cggccc--ga 721 aaggaauga- -gccuccccg aaggaaaccg cg(gcga-)c gcgg---gag uacgag-ggg 781 aggg-gaccg --ggguugca ccGUCCGU(C UUGAAA|C)A CGG|GGCAGG GAGUUCGC-G 841 GCCGUGGCGA GGUUAagggg (-guuaag)c cccguagccg cAGG(GAAA) CCgacaugcc 901 cgcagccggg c(uuau)gcc cggugagggg cggGGUGC(g aaa-)GCGCC CGGAGUCACG 961 GCCGCGAGAC CCGAAACCGG UCGAUCUAGC CCGGGG|CAG GGUGAAGUCC CUC(AACA)G 1021 AGGGAUGGAG GCCCgcua-g gg-g-ug-cu (gaugugcag )uucgcu|cc cgugACCCCG 1081 GGCUAGGGGU (GAAAG)GCC AAUCGAGGCC GG-AGAUAGC UGGUUCCCGC CGAAUCAUCC 1141 (CGCA)GGAU GGC|CUCCCC gga-|ggu|- agGCGGUG|G GGUAG|AGC- ACUGAUUGGA 1201 GGugcag|gg ggc(gaaa-) gcccc-cg-g ccuccuguca aaCUCCGAAC C|CACCGCCG 1261 CCgu-aga|u gggggg-agu ag-ggugg-c -GGU(G-UAA G)-CCGUCCA CC-GAGAGGG 1321 (GAACAA)CC CAGACCGG-G GUUAAGGC|C CCAAAGUGCC GGCUA-AGUG u-uac--ucc 1381 aaaGGGUGUC CCGGGCCUUA GACAGCGGGG AGGUAGGC(U UAGAAGCA)G CCAUCCU|U( 1441 UAA)A|GAGU GC(GUAACA) GC|UCACCCG UCGAGGUCCG GGGC|CCCGA AAAUGGA-CG 1501 GGGCUCAAGC CGGC|C|GCC GAGACC|CCG G|cgcacgga cc(gauu--) gguccgugau 1561 cggGUAGGCG GG|CGUGCCG GU-GGCGUAG AAGCCGGG-C C(GUAA)GGU CCGGUGG|Ag 1621 ccgccgguau cGCGGAUCCU GCC|GGGAGU AGCagcg-ua GUCGGG(UGA GAA-U)CCC- 1681 GAC|CGCCGG AG-GGGCCAG GGUUCCACGG CAAUG-(GUC GU)CA|GCCG UGGGUUAGU| 1741 CGG|-UCCUA ACCCCGC-CC G(UAACU)CG G--CGCGGGG GAAA-GGGAA ACGGG(UUUA 1801 UAUU)CCCGU |AC-cgcggg gg-uagg--- ugcg(-gcaa )cgcaa-gcc cgga-gggug 1861 acgccucggg guaggcgg|a ccggc(---- --cgaugag- ------)gcc ggc|-uaagc 1921 gua-uaagcc CGGGGAGUG- CC(GUAAU)G G|CGAGAACC G--gguga-a agcgcgaau- 1981 |ggc|ccccc (--guua)gg gggguu-ccg ccgaucccug -|gggcccgu gaaaagcccu 2041 ccgggaauuc cgaucccccg c--g|a|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagaag )CCUAA-GGC GU||GUCGGG -GGAAACCCG GCCGAGGGAA CUCGGCAAAC 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g ggu------- -(--gcgua) 2221 -------acc c--GCAGGU| C(GCA)GUGA CUAGGGGGGC C|CGACUGU( UUAAUAAAA) 2281 ACACAGGUCC CAG|CUAGCC C(GAA-A)GG GUUUGUACUG GGGCCGACGC CUGCCCAGU| 2341 GCCGGUAUGU GAA----gcc cggg---(ua ca)----acc ggg----UGA AGCA|CCGGU 2401 AAAC|GGC|G GGGGU(AACU AUA)ACCCUC |U(UAA)GGU AGCGAAAUUC CUU|GUCGGU 2461 (UA-----AA U)GCCGAC|C UGCAUGAAUG GCGUAACGAG GUCCCCGCUG UCCCCGGCCG 2521 GG-GCCCG|G CGAAACC|-U CUG--CCUGG (-CGCGCA-U G)CCAGGGAC CCCCGGUGGG 2581 AAGCGAAGAC CCCAUGG|AG CUUUACUGC| AGCCUGCCGU U|GCCACGCG GCGAGGGGUG 2641 CGCAGCGUAG GCGGGAGGCG UCG---AAGC CCGGCC(UCC G-)GGUCGGG U-GGAGCCGU 2701 CCAUGAGACA CCGCC|CACU CCUCGCCGCG UGGCuaac-- ---cccc(-g aaa-)gggg- 2761 ---ggacAGC GGUAGGUGGG CAGUUUGGCU G(GGG)CGGC AC|GCCCCC( GAAAAGGUAU 2821 C)gggGGc|G |C|CCUAAGG UCGGC|UCAG GCGGG(UCAG GAA)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACUGGAC CCG(uaacag agg-)CGGGU CCAGCCCC(G 2941 AAA)GGGUGG |CCUAGCGAA CCCCUGUGC- CUC(--cccg gug)GGG|GC CAGGG-auga 3001 c|aga--aAA GCUACCC|UG GG|GAUAACA G|AGUCGUCU CGGGCGAGA( GCCCAUA)UC 3061 GACCCCGAGG CUUGCUACCU CGCUGUC|GG CU|C|UCCCC |AUCC|UGGC CCU(GCAGCA 3121 )GGGGCCAAG GGU|GGGGG( UGUUC)ACCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GAUGCUAUCU |ACCGGGGG- |UG-UUGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCU(UAGUAC (GAGA)GGAA C)AGGGCGC| CGCGGCCUCU GGUCUACCGG 3301 U|UGUCC-(U CCC)-GGGCA -uaGCCGGGC A-GCUACGCC GCA-GCCGAU AAGGCCU(GA 3361 AGGCAUCUA) AGGCCGAAGC GGCCCCC|GA A-AAUAGGCG GCC|gu|ucc cuggcauuug 3421 ggccugg(gu ga-)cc-ggc ccgu-agcca ggga--cgag g|gc|UCGGG (UAGAAGA)C 3481 CCGGUUGAUG G|GGCGG|GG AUGUAAGCGG Gaagg---(g ucaa-)---c cgaCCCGC|U 3541 UAGUCU|G|C CGCUCCCAau cgcccgag-- |guccugucc cccg---||| || // LOCUS CRENARCHAE 3592 bp RNA RNA ~?~???? DEFINITION . ACCESSION No information KEYWORDS No information. SOURCE No information. ORGANISM No information. REFERENCE 1 AUTHORS No information JOURNAL No information TITLE No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: DIVIDER BASE COUNT 0 a 0 c 0 g 0 t 3592 others ORIGIN 1 x~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~-~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 61 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 121 ~~~~~~~~~~ ~~~~~~~~~~ -~~~~~~~~~ ~~~~-~~~~~ ~~~~~~~~-- ----~~~~~- 181 ~~------~~ ~~-~~~~~~~ ~~~~~~~~~~ ~~~~-~~~~| ~~~~~~~~~~ ~~~~~~~~~~ 241 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~|~ ~~~~~~~--~ ~~~~~~-~~~ 301 ~~~~~~~~~~ ~~~~~~~~~~ -~~~~~~~~~ ~~~~~~~~~| ~(~~~-)~|~ ~~~~~~~~~~ 361 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~|~~~ ~~~~~~~~~~ -----(---- -)-----~~~ 421 ~~~~~--~~~ ~~~~~~~~~- ~~~~~~~~~- ~~~~~~~~-~ ~|~~~~~~~~ ~~~~~~~~~~ 481 ~~~~~-~~~~ ~~~-~~~~~~ ~~~~~~~~~~ ~~~~---~~~ ~~~~~~---- ~|~~~~~~~~ 541 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~|-~~~ ~~~~~~~~~~ ~--~~~~~~~ ~~~~~~~~~~ 601 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~-~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 661 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~|~~ ~-~~~~~~~~ ~~~~~~~~~~ 721 ~~~~~~~~~~ ~~-------- ---------- --~~~~~~~~ ~~~~~~~~~~ ~~~------- 781 ---------- ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~|~)~ ~~~~~~~~~~ ~~~~~~~~-~ 841 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~|~--- 901 ---------- ---------- ---------- ---~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 961 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1021 ~~~~~~~~~~ ~~~~~-~~-~ ~~~~-~~-~~ ~~~~~~~-|~ (~~~~-)~~~ ~~~~~~~~~~ 1081 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~~ ~~-~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1141 ~~~~-~~~~| ~~~~~~~-~~ ~~~~~~~~~~ --~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1201 |~~~-~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~-~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1261 ~~~~~~~~~~ ~~|~~~-~~~ ~~~~~~~~~~ -~~~~~~~~~ ~~~~-~-~~~ ~|~~~~~~~~ 1321 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~-~~~~ ~|~|~|--~~ 1381 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~|~~(~ ~-~~~~~~~~ ~~~~~~~~~~ 1441 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ ~~~~~~~(~~ 1501 ~~~~~~~|~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~| ~~~~~~|~~~ ~~~~~~~~-- 1561 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~~ 1621 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ -~~~--~~|~ |~~~~~~~~~ ~~~~~~~~~- 1681 ~~~~~~~~~~ ~~-~~~~~~~ ~|||||~~~~ ~~~~~~(~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1741 ~~~~~~~~~~ ~~~~~~~-~~ ~~~~~~~~~~ ~---~~~~~~ ~~~--~~~~~ ~~~~~~~~~~ 1801 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~-~~~~~ 1861 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~---- ~~~--~~~~~ ~----~~~~~ ~~~~~~~~~~ 1921 ~~~--~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~-- ~~~~~~~~~~ 1981 ~~~~~~~~~~ ~~-~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~-~~ ~~~~~~~~~~ 2041 ~~~~~~~~~~ ~~~~~~~~~~ ~--~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2101 ~~~~~~~~~- ~~~~~~~~~~ ~~~~~-~~~~ -~~-~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2161 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~-~~~~~~ 2221 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2281 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~-~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2341 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2401 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2461 ~~~-----~~ ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2521 ~~-~~~~~~~ ~~~~~~~~-~ ~~~~-~~~~~ ~-~~~~~~-~ ~~~~~~~~~~ ~~~~~~~~~~ 2581 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2641 ~~~~~~~~~~ ~~~~~~~~~~ ~~~---~~~~ ~~~~~~~~~~ ~-~~~~~~~~ ~-~~~~~~~~ 2701 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~-~~~~~~ 2761 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ -~~~~~~~~~ ~~~~~~~~~~ 2821 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 2881 ~~-~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~---~~~ ~~~-~~~~~~ ~~~~~~~~~~ 2941 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~-~~~ ~~~(---~~~ ~~~)~~~~~~ ~~~~~~~~~~ 3001 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3061 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3121 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3181 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~- ~~~~~~~~~~ ~~~~~~~~~~ 3241 ~~~~-~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3301 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~-~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3361 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~-~~~~~~~~ ~~~~~~~~~~ ~~~------- 3421 -------~~~ ~~~~------ --------~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 3481 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~-----~~ ~~~~-~~~~~ ---~~~~~~~ 3541 ~~~~~~|~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~|| || // LOCUS Pb.islandi 3592 bp RNA RNA 11-JAN-191 DEFINITION Pyrobaculum islandicum. ACCESSION No information KEYWORDS No information. SOURCE Pyrobaculum islandicum. ORGANISM Pyrobaculum islandicum. REFERENCE 1 (sites) AUTHORS Kjems,J., Larsen,N., Dalgaard,J.Z., Garrett,R.A. and Stetter,K.O. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: M86622 (JJC; 27 January 1999) The accession number M86622 is referenced in Dalgaard and Garrett, Gene, 121 (1992):103-110 for Pyrobaculum organotrophum. The flanking sequence for the two introns is identical. Is the GenBank entry misnamed? The most recent taxon file lists the two organisms as separate entries. --------------------------------------- LOCUS PYBRGGA 3358 bp ds-DNA BCT 06-FEB-1992 DEFINITION P.islandicum 23S ribosomal RNA, partial cds. ACCESSION M86622 KEYWORDS 23S ribosomal RNA. SOURCE Pyrobaculum islandicum DNA. ORGANISM Pyrobaculum islandicum Prokaryota; Archaeobacteria; Thermoproteales. REFERENCE 1 (bases 1 to 3358) AUTHORS Kjems,J., Larsen,N., Dalgaard,J.Z., Garrett,R.A. and Stetter,K.O. TITLE Phylogenetic relationships amongst the hyperthermophilic archaea determined from partial 23S rRNA gene sequences JOURNAL Syst. Appl. Microbiol. (1992) In press STANDARD full staff_review FEATURES Location/Qualifiers rRNA 1..2616 /partial /product="23S ribosomal RNA" /standard_name="23S rRNA" 3'UTR 2617..3358 BASE COUNT 578 a 922 c 1181 g 561 t 116 others ORIGIN BASE COUNT 484 a 759 c 933 g 324 t 1092 others ORIGIN 1 |~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~|~~~~~~ |~~~~~~~|~ 61 ~|~~~~~~~~ ~~~|~~~~(~ ~~~~~)~|~~ ~~~~~~~~~~ ~~~|~~~~~( ~~~~~~~)~~ 121 ~~~~~~~~~~ ~~~~~~~|~~ ~~~~~~~~~( ~~~~~)~~~| ~~~~~~~~~~ ~~~~(~~~~~ 181 )~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~(~~ ~~~~~~~~~~ )~~~~~~~~~ ~~~~~~~~|~ 241 ~~~~~~~~~| ~~(~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~)~~~~~~|~ 301 ~~~~~~~~~~ ~(~~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ 361 ~)~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ ~)~~~~~~~~ 421 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~)~~ ~~~~~~|~(~ ~~)~~~~~(~ 481 ~~~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~ggaucccg GAGUACCACG 541 GCu(---uag uuu)uGCCGU GGGAAU-ACG CCGG(Ccacu gG-)CCGGCA AGGCUAAACA 601 cgu-CCCGAG UCCGAUAGCG CAC-UAGUAC C(GUGA)GGG AAAGCUGAAA AGAACCCC(g 661 -gaa)GGGGG GUG-AAAAGA GCCUGAAACC GGGCGGCuac a-gug-gggc gggccc--ga 721 aagga---ug ccccuccccg aaggaaaccc cg(guaa-)c gggg---gag uacgag-ggg 781 aggg-gucca --ggguccgc ccUUACGU(C UAGAAA|C)A CGG|GCCGGG GAGUUCAC-G 841 GCCGUGGCGA GCCUAAGGGG (--uucaa)c cccggAGGCG UAGG(GAAA) CCGaca-gcc 901 cgcagcgggg -(gaaa)-cc ccgcgagggg cggGGUCU(g aaa-)GGGCC CGUAGUCACG 961 GCCGUGAGAC CAGAAACCGG GCGAUCUAGC CCUGGG|CAG GGUGAAGCGG GGC(GAAA)G 1021 CCCCGUGGAG GCCCGaag-g gguu-cu-ga (cgugca-aa )ucgu-u|cc cauGACCUGG 1081 GGCUAGGGGC (AAAAG)ACC AAUCAAGCCC GG-UGAUAGC UGGUUCCCCC CGAAGCUGGU 1141 (CGCA)GCCA GGC|CUCCCC Gga-|ggc|- ggCCGGCG|G GGUAG|AGC- ACUGAUCGGG 1201 GGugcgG|GA GCC(gaaa-) GGCUC-CG-G CCCCCGGUCA AACUCCGAAC C|ugccggcg 1261 ccnn-nnn|n nnnnnn-nnn nnnnnnnn-n nnnn(n-nnn n)nnnn-nnn nnnnnnnnnn 1321 (nnnnnn)nn nnnnnnnn-n nnnnnnnn|n nnnnnnnnnn nnnnn----- ---------- 1381 ---------- nnnnnnnnnn nnnnnnnnnn nnnnnGGC(C UAACACCA)G CCAUCGG|C( 1441 UAA)G|CAAC GC(GUAACA) GC|GGACCCG CCGAGGC--G GGGC|CCCGA AGAUGUAGAG 1501 GGACU-AAGC CCGC|C|GCC GAGACC|CCG G|gccuccgg cc(guu---) ggccggagau 1561 gcGGUAGGGG GG|CGCGGCC GU-GGGGCAG AAGCCGGG-C C(GUGA)GGU CCGGUGG|AC 1621 CCGCGGCCGA CGAAGAUCCC GGC|GGUAGU AGCagcg-aa GAGGGG(UGA GAA-G)CCC- 1681 CUC|CGCCGG AAaGGACCAG GGUUUCCUGG CAACU-(UCA AU)AG|GCCA GGAGUUAGC| 1741 CGG|-UCCUA AGGCGGG-GC C(UAAUA)GG U--ACCCGCC GAAA-GGGAA ACGGG(UUAA 1801 UAUU)CCCGU |GC-cgcggg gg-uagguuc ugcg(-guaa )cgcag-gcc ccgu-ccccg 1861 acgccucggg auagggcg|g gcggg(---- -ac-ugccgu ------)ccc gcu|-uaacc 1921 guc-gaaggc CGGGGAGUG- CC(GUAAU)G G|CGAGAACC G--gccgaac ggcgggaau- 1981 |agc|cgggg (--guuu)cc ccgguc-cgc ccgacuccug -|gggcccgu gaaaagggga 2041 cggggaa--- cgagcccccg c--g|C|CCG UACCGAGAAC C(GACGCA)G G|UGCUCCUG 2101 GG(ugagaag )CCCAA-GGC G-||GCUCGG -GUGACCCCG GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g gccuua-ggg -(--uacac) 2221 -cccuggggc c--GCAGGU| C(GCA)GUGA CAAGGGGGAC C|UGACUGU( UUAACAAAA) 2281 ACAUAGGUCC CCG|CGAGCC C(GUA-A)GG GUGUGUACGG GGGCUGAAUC CUGGCCACU| 2341 GGCGGUACGU GAA----ccc cggg---(ua ca)----acc ggg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G UGGAUGAAUG GAU-AACGAG GUCCCCACUG UCCCGGCCCG 2521 GG-CCC--|G CGAA-CC|-C ACCUCCCAGG (-UGCAC--A U)CCUGGGAC CCCCGACGGG 2581 GCGAGAAGUC CCUGUGG|AG CUUCACAGC| AGCCUGUCGC U|GCGGGGGG GCGGGGGGUG 2641 CAGAGCGUAG GUGGGAGCGA UGA----AAC GGGGUC(UCC G-)GGCUCCU G--GAU-CGG 2701 UCCUGGAACA CCACC|CACU CUCCGCCCCU CCGCuuac-- --ccgcc(-g gaa-)ggcgg 2761 ---ggacAGC GGCAGGCGGG CUGUUCGGCU G(GGG)CGGC AC|ACCCCU( GAAAAGAUAU 2821 C)GGGGGU|G |C|CCAAAGC UCGGC|UCAG GCGGG(UCAG AAA)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGG CUGACUGGGC CCU(ugaaca caa-)GGGGC CCAGGCGG(G 2941 AAA)CCGUGG |CCUAGAGAA CGCUCGUGC- CCC(-cacca gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CA GG|AAUAACC G|GCUCGUCG CGGGUGAGA( GUCCCCA)UC 3061 GACCCCGCGG UUUGGUACCC AGACGUC|GU CU|C|UUCCC |AUCC|UGGC GGU(GCAGCA 3121 )GCCGCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGAU GGGUUCAGAC 3181 C|GUC(GCGA )GACAGGUCG GUCUCUACCU |GUCGGGGG- |CG-UUGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCU(CAGUAC (GAGA)GGAA C)GGGGCGC| CGCGGCCUCU AGUGUACCGG 3301 U|UGUCC-(G GCA)GGGGCA -cUGCCGGGC A-GCCACGCC GCG-GGGGAU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAAGC CCUCCCC|GA G-ACGAGGCG GCC|gu|cgc ccuggggg-- 3421 -------(gc aa-)------ ---ccccugg ggcg--cgag g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGGGG|CG GUGUAACCCC Cgaggg--(u ucu--)--cc cgaGGGGG|- 3541 GAGCCG|G|C CCCUCCCAAu cgcccgagcg |ugcggggcg gc-----||| || // LOCUS T.tenax 3592 bp RNA RNA 11-JAN-191 DEFINITION Thermoproteus tenax. ACCESSION No information KEYWORDS No information. SOURCE Thermoproteus tenax. ORGANISM Thermoproteus tenax. REFERENCE 1 AUTHORS No information JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: published, no accession # Kjems, J., Leffers, H., Garrett R.A., Wich G., Leinfelder W., and Bock A. 1987, Nucleic Acids Research 15:4821-4835. Gene organization, transcription signals and processing of the single ribosomal RN RNA operon of the archaebacterium Thermoproteus tenax BASE COUNT 593 a 899 c 1135 g 404 t 561 others ORIGIN 1 |--------- -----gcacg ---guCAAGC CGCCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|cCGAGGAA GGG|CGUG(G CAAGC)U|GC GAUAAGCCCG GGG|UAGCC( GCAAGCG)GG 121 Cguu-GAACC CGGGAUU|CC -CGAAUGGG( GCUU-)CCU| accggggcc- ----(gaaca 181 )-----ggcu cc-ggu-gcc cc-----(-- -gua--a--- )------ggg gcGGGAAC|G 241 CGGGGAAAG| GA(AACA)UC UUAGUACCCG CAGGAAGAGA AACC(AA--C A)GGGA-A|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA GAGCCCAaac caaa-uccu- cacgg(gaug 361 a)ccgug-gg ga-gaugugg uguugu--gg g-cucggg-- -----(---- -)-----uac 421 cgccg-gcgg gcgguagcc- GAAgugggc- (uggaau)-g ccccgc|c(g ua)gaGGG(U 481 GAUAGC)CCC GUA-ggcuaa accgcccgug gcgg-agucc cgggaucccg GAGUACCCCG 541 CCu(---ugg uuu)uGGCGG GGGAAG-CUG GCGG(Ccacu gG-)CCGCCA AGGCUAAGCA 601 cgu-CCCGAG UCCGAUAGCG AACU-AGUAC C(GUGA)GGG AAAGCUGAAA AGCACCCC(g 661 -gaa)GGGGG GUG-AAAAGA GCCUGAAACC GGGCGGCuac a-gug-gggc gggccc--ca 721 aagga---ug cccucgcccg aaggaaaccc cg(guga-)c gggg---gag uacgag-ggc 781 gagg-gucca --ggguccgc ccUUACGU(C UAGAAA|C)A CGG|GCCGGG GAGUUCAC-G 841 GCCGUGGCGA GUCUAAgggg (--uuuaa)c cccguAGGCG CAGG(GAAA) CCGaca-gcc 901 cguagccggu -(uugc)-gc cggugagggg cggGGUCC(g aaa-)GGGCC UGUAGCCACG 961 GCCGUGAGAC CAGAAACCGG GCGAUCUAGC CCUGGG|CAG GGCGAAGCGG GGC(GAAA)G 1021 CCCCGUGGAG GCCCGaaa-g gg--uucuga (ugugca-aa )uc-guu|cc cauGACCUGG 1081 GGCUAGGGGC (AAAAG)ACC AACCAAGCCC GG-UGAUAGC UGGUUCCCCC CGAAGCGGGU 1141 (CCCA)GCCC GGC|CUCCCU Gga-|ggu|- CUCCGGCG|G GGUAG|AGC- ACUGAUCGAG 1201 ggcgcAG|GG CCC(gaaa-) GGGUC-CG-G CCCUCGGUCA AACUCCGAAC C|CGCCGGAA 1261 CCgu-gga|A GGGGGG-AGG CGGGGCCA-G UGGG(G-UAA G)CCUC-UGG CCCGAGACGG 1321 (GAACAA)CC GGGACCGG-G GUUAAGGC|C CCUAAGUGCG GGCUA-AGUG uca--acggg 1381 UAAGGGCGUC CCCUGCCCAA GACAGCGGGG CCGUGGGC(C UAACAGCA)G CCAUCGG|C( 1441 CAA)G|CAAC GC(GUAACA) GC|GGACCCG CCGAGGCAGG GGGC|CCCGU AGAUGUAGAG 1501 GGACUCAAGC CCGC|C|GCC GAGACC|CCG G|gccuccgg cc(guu---) ggccgga-gu 1561 gcGGUAGGGG GG|CGCGGCC GU-gGGGCAG AAGCCGGG-C C(GAGA)GGU CCGGUGG|AC 1621 CCGCGGUCGA CGAAGAUCCC GGC|GGUAGU AGCagcg-aa GAGGGG(UGA GAA-G)CCC- 1681 CUC|CGCCGG AAaGGACCAG GUUUUCCCGG CAACU-(ACA AU)AG|GCUG GGAGUUAGC| 1741 CGG|-UCCUA AGGCGGG-GC C(UAGCU)GG C--ACCCGCC GAAA-GGGAA ACGGG(UUAA 1801 UAUU)CCCGU |GC-cgcggg gg-uagguuc ugcg(-gcaa )cgcag-gcc ccgu-ccccg 1861 acgccucggg auagggcg|g gcggg(---- -ac-caccgu ------)ccc guu|-uaacc 1921 gcu-gaaggc CGGGGAGUG- CC(GUAAU)G G|CGAGAACC G--gccga-a ggcgggaau- 1981 |agc|cgggg (--guuu)cc ucgguc-cgc ccgacuccug -|gggcccau gaaaagggga 2041 cggggaa--- cgagcccucg c--g|U|CCG UACCGAGAAC C(GACGCA)G G|UGUUCCUG 2101 GG(ugaaaag )CCCAA-GGC GG||cuuggg -UUUACCCCA GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCUGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g gucuug-ggg -(--uucac) 2221 -cccugggac c--GCAGGU| C(GCA)GUGA CAAGGGGGAC C|UGACUGU( UUAACAAAA) 2281 ACAUAGGUCC CCG|CGAGCC C(GUA-A)GG GUUUGUACGG GGGCUGAAUC CUGGCCACU| 2341 GGCGGUACGU GAA----ccc cggg---(uc ca)----acc ggg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G UGCAUGAAUG GAUCAACGAG GUCCCCACUG UCCCGGCCUG 2521 GG-GCCCA|G CGAA-CC|-C ACCU-CCAGG (-UGCACA-G U)CCUGGGAC CCCCGACGGG 2581 GCGAGAAGUC CCUGUGG|AG CUUCACUGC| AGCCUGUCGU U|GCGGAGGG GCGGGGGGUG 2641 CAGAGCGUAG GCAGGAGCAA UGA----AAC GGGGUC(UCC G-)GGCCCCG U-GGAUGCGG 2701 UCCUGGAACA CUGCC|CACU CUCCGCCCCU CCGCuaac-- --ccggg(-g caa-)cccgg 2761 ---ggacAGC GGCAGGCGGG CAGUUCGGCU G(GGG)CGGC AC|ACCCCU( GAAAAGAUAU 2821 C)GGGGGU|G |C|CCAAAGC UCGGC|UCAG GCGGG(UCAG AAC)UCCGCC GUAGA|GU(G 2881 |U-AAGG)GC AAAAGCCGGG CUGACUGCGC CCU(ugaacg caa-)GGGGC GCAGGCGG(G 2941 AAA)CCGGGG |CCUAGAGAA CGCUCGUGC- CCC(-cacca gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CA GG|AAUAACC G|GCUCGUCG CGGGUGAGA( GUCCCCA)UC 3061 GACCCCGCGG UUUGGUACCC AGACGUC|GU CU|C|UUCCC |AUCC|UGGC GGU(GCAGCA 3121 )GCCGCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGAU GGGUUCAGAC 3181 C|GUC(GCGA )GACAGGUCG GUCUCUACCU |GUCGGGGG- |CG-UUGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCU(CAGUAC (GAGA)GGAA C)GGGGCGC| CGCGGCCUCU AGUCUACCGG 3301 U|UGUCC-(G GCA)-GGGCA -cuGCCGGGC A-GCCACGCC GUA-GGGGAU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAAGC UCCCCCC|GA G-ACGAGGCG GCC|gu|ugc ccuggggg-- 3421 -------(gc aa-)------ ---ccccugg ggca--cgag g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGGGG|CG GUGUAACCCC Cgaggg--(u cuc--)--cc cgaGGGGG|- 3541 GAGCCG|G|C CCCUCCCAau cgcccg---a |gcgu----- -------||| || // LOCUS P.occultum 3592 bp RNA RNA 11-JAN-191 DEFINITION Pyrodictium occultum. ACCESSION No information KEYWORDS No information. SOURCE Pyrodictium occultum. ORGANISM Pyrodictium occultum. REFERENCE 1 AUTHORS Kjems,J., Larsen,N., Dalgaard,J.Z., Garrett,R.A. and Stetter,K.O. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: M86626 LOCUS PYRRGGA 3865 bp ds-DNA BCT 06-FEB-1992 DEFINITION P.occultum 23S ribosomal RNA, partial cds. ACCESSION M86626 KEYWORDS 23S ribosomal RNA. SOURCE Pyrodictium occultum DNA. ORGANISM Pyrodictium occultum Prokaryota; Archaeobacteria; Pyrodictiales. REFERENCE 1 (bases 1 to 3865) AUTHORS Kjems,J., Larsen,N., Dalgaard,J.Z., Garrett,R.A. and Stetter,K.O. TITLE Phylogenetic relationships amongst the hyperthermophilic archaea determined from partial 23S rRNA gene sequences JOURNAL Syst. Appl. Microbiol. (1992) In press STANDARD full staff_review FEATURES Location/Qualifiers rRNA 1..1673 /partial /product="23S ribosomal RNA" /standard_name="23S rRNA" 3'UTR 1674..3865 /standard_name="23S rRNA" BASE COUNT 535 a 1165 c 1450 g 647 t 68 others ORIGIN BASE COUNT 303 a 523 c 632 g 215 t 1919 others ORIGIN 1 |~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~|~~~~~~ |~~~~~~~|~ 61 ~|~~~~~~~~ ~~~|~~~~(~ ~~~~~)~|~~ ~~~~~~~~~~ ~~~|~~~~~( ~~~~~~~)~~ 121 ~~~~~~~~~~ ~~~~~~~|~~ ~~~~~~~~~( ~~~~~)~~~| ~~~~~~~~~~ ~~~~(~~~~~ 181 )~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~(~~ ~~~~~~~~~~ )~~~~~~~~~ ~~~~~~~~|~ 241 ~~~~~~~~~| ~~(~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~)~~~~~~|~ 301 ~~~~~~~~~~ ~(~~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ 361 ~)~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ ~)~~~~~~~~ 421 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~)~~ ~~~~~~|~(~ ~~)~~~~~(~ 481 ~~~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 541 ~~~(~~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ 601 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~(~ 661 ~~~~)~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 721 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~(~~~~~)~ ~~~~~~~~~~ ~~~~~~~~~~ 781 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~|~)~ ~~~|~~~~~~ ~~~~~~~~~~ 841 ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~~)~ ~~~~~~~~~~ ~~~~(~~~~) ~~~~~~~~~~ 901 ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~)~~~~~ ~~~~~~~~~~ 961 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~|~~~ ~~~~~~~~~~ ~~~(~~~~)~ 1021 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~~~~ )~~~~~~|~~ ~~~~~~~~~~ 1081 ~~~~~~~~~~ (~~~~~)~~~ ~~~~~~~~~~ ~~-~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1141 (~~~~)~~~~ ~~~|~~~~~~ ~~~~|~~~|- ~~~~~~~~|~ ~~~~~|~~~~ ~~~~~~~~~~ 1201 ~~~~~~~|~~ ~~~(~~~~~) ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~|~~~~~~~~ 1261 ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~-~ ~~~~(~~~~~ ~)~~~~~~~~ ~~~~~~~~~~ 1321 (~~~~~~)~~ ~~~~~~~~~~ ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1381 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~~~)~ ~~~~~~~|~( 1441 ~~~)~|~~~~ ~~(~~~~~~) ~~|~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~~ ~~~~~~~~~~ 1501 ~~~~~~~~~~ ~~~~|~|~~~ ~~~~~~|~~~ ~|~~~~~~~~ ~~(~~~~~~) ~~~~~~~~~~ 1561 ~~~~~~~~~~ ~~|~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~|~~ 1621 ~~~~~~~~~~ ~~~GGAUCCC GGC|GGUAGU AACAGCG-AA GAGGGG(UGA GAA-U)CCC- 1681 CUC|CGCCGG AAaGGGCCAG GGUUCCUCGG CAACG-(GUC GU)CG|GCCG AGGGUGAGC| 1741 CGG|-CCCUA ACCCGGG-CC G(UAACA)CG G--ACCCGGG GAAA-GGGAA ACGGG(UCAA 1801 UAUU)CCCGU |GC-cgcggg gg-uagg--- ugcg(-gcaa )cgcaa-gcc ccgc-cuccg 1861 acgccuccgg guaggcgg|a ccggg(ggcg gcc-gaaggg ccgcga)ccc ggc|-uaacc 1921 gcu-gaaggc CGGGGAGAG- CC(GUCAU)G G|CGAGAACC G--gccga-a ggcgggaau- 1981 |ggg|ggccc (--guua)gg gcccuu-ccg ccgaucccgg -|gggcccuu gaaaaggggg 2041 cggggag--- cgaucccccg c--g|C|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagaag )CCCCA-GGC GU||CUGGGG -GUCAACCCG GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCCGC---g gucugg-agg -(--uaaac) 2221 -ccucuggac c--GCGGGU| C(GCA)GUGC CUAGGGGGGC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CAAGCC C(GAA-A)GG GUGUGUACGG GGGCCGAAUC CUGGCCACU| 2341 GGCGGUCCGU GAA----acc gggg---(uu ca)----acc cgg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGAGU(AACU CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAAUG GAUCAACGAG GUCCCCACUG UCCUGGCCCG 2521 GG-GCCCC|G UGAACCC|-A CGGA-GCCGG (-UGCACA-G G)CCGGCAUC CCCCCGCAGG 2581 GCGAGAAGAC CCCGUGG|AG CUUUACCGC| AGCCUGGGGU U|GCCCUUCG GGCCUGGGUG 2641 CGUAGCGUAG GUGGGAGCCG AUG---AGCC ACCCUC(UCC G-)GGGGGUG G-GGAGGCGC 2701 CAAUGAAACA CCACC|CACC CAGGCCCGGG GGGCuuaccc --ccggc(-g gga-)gccgg 2761 -ggggacAGC CCCAGGUGGG CGGUUUGGCU G(GGG)CGGC AC|GCCCGC( GAAAAGGUAA 2821 C)ACGGGC|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AGC)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACCGGAC CCC(gaacag caa-)GGGGU CCGGCCGG(G 2941 AAA)CCGCGG |CCUAGCGAA CGCUCGUGC- CCC(-ccucg gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|AGUCGUCG CGGGCGAGA( GCUCACA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |AUCC|UGGG GGU(GCAGCA 3121 )GCCCCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GACUCUACCC |GCGGGGGG- |UG-UGGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCC(UAGUAC (GAGA)GGAA C)GGGGCGC| CGCGGCCUCU AGUGUACCGG 3301 U|UGUCC-(G GCA)-GGGCA -gcGCCGGGC A-GCCACGCC GUA-GGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|GA A-AAGAGGCG GCC|gc|ccg gccucc---- 3421 -------(uc u--)------ -----ggggg ccggg-cccg g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGCGG|GG GUGUAAGCCC Cgagggcc(u uag--)ggcc cgaGGGGU|U 3541 GAGCCC|G|C CGCUCCCAau cgcccggaac |gccgugggc uu-----||| || // LOCUS A.pern.OH2 3592 bp RNA RNA 11-NOV-1998 DEFINITION Aeropyrum pernix.; 16S rRNA; 23S rRNA; 16S ribosomal RNA; 23S ribosomal RNA; internal transcribed spacer region. ACCESSION AB019552 KEYWORDS 16S rRNA; 23S rRNA; 16S ribosomal RNA; 23S ribosomal RNA; internal transcribed spacer region. SOURCE Aeropyrum pernix. ORGANISM Aeropyrum pernix. REFERENCE 1 (sites) AUTHORS Nomura,N., Sako,Y., Morinaga,Y., Kogishi,T. and Uchida,A. TITLE presence of multiple hotspots for intron homing Intraspecific genetic polymorphism in the rRNA gene locus of the hyperthermophilic archaeon Aeropyrum pernix, implying the JOURNAL Unpublished (1998) STANDARD No information REFERENCE 2 (bases 1 to 4883) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (05-NOV-1998) to the DDBJ/EMBL/GenBank databases. Norimichi Nomura, Kyoto University, Lab. of Marine Microbiology, Graduate School of Agriculture; Kitashirakawa-Oiwake-cho, Sakyo-ku, Kyoto 606-8502, Japan (E-mail:j54718@sakura.kudpc.kyoto-u.ac.jp, Tel: 81-75-753-6219, Fax:81-75-753-6226) STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AB019552 BASE COUNT 576 a 945 c 1196 g 374 t 501 others ORIGIN 1 |-------gc gggccuacgg ca-ccuaugc cgCCCGGGGG AUG|GCUCGG |CUCGGG-|c 61 g|CCGAGGAA GGG|CGUG(G CAAGC)U|GC GAUAAGCCCG GGG|UAGGC( GCAGGCA)GC 121 CGUC-GAACC CGGGAUC|CC -CGAAUGGG( ACUU-)CCC| gcugcgggc- ----(gaaga 181 )-----gccc gc-agc-gcc ccc----(-- -guuu----- )-----gggg gcGGGAAC|C 241 CCCCGAAAG| GA(AGCA)UC UGAGUAGGGG GAGGAGAAGA AACC(AA--C A)GGGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA GAGCCCAaac caau-cccc- cacgg(gagg 361 a)ccgug-gg gg-gaugagg gguugc-ggc ccugcggcu- ggggc(uucg a)gccccggc 421 gggccgaccc acgguagcc- GAAgugggc- (uggaau)-g ccccgc|c(g ua)gaGGG(U 481 GAUAGC)CCC GUA-ggcgaa accguggggg cccg---ugc cgcag-ggca GAGUACCACG 541 GCU(---ugg uau)UGCCGU GGGAAG-CUG GGAG(Gaacc gA-)CUUCCA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGAACCCC(g 661 -gga)GGGGG GUG-AAAAGA GCCUGAAACC GGGCGGCgac a-uac-ggug cggccc--ga 721 aaggg--gug aaa-cccccg aaggaaaccc gg(gcga-)c cggg---gag uacgag-ggg 781 gaug-gaccg --gggucgca ccGUCCGU(C UUGAAA|C)A CGG|GCCGGG GAGUUCAC-G 841 GCCGUGGCGA GCUUAAgggg (--gucaa)c cccguAGGCG CAGG(GAAA) CCGacaggcc 901 cgcagccggg g(gcga)ccc cggcgagggg cggGGUCC(u aaua)GGGCC CGGAGUCACG 961 GCCGUGAGAC CAGAAACCGG GCGAUCUAGG CGGGGG|CAG GGCGAAGCCG GGG(GAAA)C 1021 CCCGGUGGAG GCCCGcaa-g gg--ugcuga (cgugca-aa )uc-gcu|cc ccuGACCCCC 1081 GUCUAGGGGU (GAAAG)GCC AAUCUAGCCC GGaaGAUAGC UGGUUCCCGC CGAAGUGGCU 1141 (CGCA)GGCC AGC|CCCGCC gga-|ggc|c ggGCCGUG|G GGUAG|AGcu acUGAUUGGG 1201 CGUgcaG|GG GGC(guaaa) GCCCC-CG-G CGCCCAGUCA AACUCCGAAC C|UGCGGCCG 1261 CCgu-aga|U GGCGGG-AGU GGGGUUCC-C CGGG(G-UAA G)CCCG-GGG GCCGAGAGGG 1321 (GAACAA)CC CAGACCGG-G GUUAAGGC|C CCUAAGUGCC GGCUA-AGUG cca---agcc 1381 AAAGGGCGUC CCCCGCCUAA GACAGCGGGG AGGUGGGC(C UAACAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCGGG GGGC|CCCGA AGAUUGGUCG 1501 GGGCUUAAGC CGGC|C|GCC GAGACC|CCG G|gccgcggc uc(cgaug-) gagccgcg-u 1561 gcGGUAGGCG GG|CGUCGGG GU-GGCGUAG AAGCCGGG-C C(GCGA)GGU CCGGUGG|AG 1621 CCGCCCCGAG CGCGGAUCCC GGC|GGUAGU AACagcg-aa GCGGGG(UGA GAA-U)CCC- 1681 CGC|CGCCGG AAaGGGCAAG GGUUCCCCGG CAAUG-(GUC GU)CA|GCCG GGGGUCAGC| 1741 CGG|-UCCUA ACCGGGA-CC G(UAACA)CG G--AUCCCGG GAAG-GGGAA ACGGG(UUAA 1801 CAUU)CCCGU |GC-cgcggg gg-uacgcuu cgcg(-gcaa )cgcaaggcc ccac-ccccg 1861 acgccucggg guaggcgg|a ccggg(ggcc ccc-uuaggg gggcga)ccc ggc|-uaacc 1921 ugccgaaggc CGGGGAGUG- CC(GUAAC)G G|CGAGAACC G--gccga-a ggggggaau- 1981 |ggc|ccgcc (--uaug)-g cggguu-ccg ccgaccccug -|gggcccuu gaaaaggggg 2041 uggggaa--- ggaucccccg c--g|C|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagcag )CCCAA-GGC GU||CUGGGG -GCUAACCCG GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g guccug-ggg -(-cucgaa) 2221 -cccugggac c--GCAGGU| C(GCA)GUGC CUAGGGGGGC C|UGACUGU( UUAACAAAA) 2281 ACAUAGCUCC CCG|CUAGCC C(GAAAA)GG GUGUGUACGG GGGGUGAGCC CUGGCCACU| 2341 GGCGGUCCGU GAA----acc gggg---(ua ca)----acc cgg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGAGU(AACC CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAACG GGUCAACGAG GUCCCCACUG UCCUGGCCCG 2521 GG-GCCCC|G UGAACCC|-C CUGA-GCCGA (-UGCACA-G U)CCGGCAUC CCCCCGCAGG 2581 GAGAGAAGAC CCCGUGG|AG CUUUACCGC| AGCCCGGCGU U|GGcccccg ggcggggguG 2641 CGUAGCGUAG GUGGGAGCCG AUG---AGCC ACCCUC(UCC G-)GGGGGUG G-GGAGGCGU 2701 CCCUGAAACA CCACC|CCCG CCCGGGGCCU UACCCCGC-- ---cccg(gg uggg)cggg- 2761 ---ggacagC GUCGGGUGGG CGGUUUGGCU G(GGG)CGGC AC|GCCCGC( GAAAGGGUAA 2821 C)ACGGGC|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AAC)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACCGGAC CCC(ugaaag caa-)GGGGU CCGGCCGG(G 2941 AAA)CCGUGG |CCUAGCGCA CGCUCGUGC- CCC(--cucg gug)GGG|GC CGGGC-acga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|AGUCGUCG CGGGCGAGA( GCCCCCA)UC 3061 GACCCCGCGG UUUGCUACCU CGAUGUC|GG CU|C|UUCCC |AUCC|UGGG GGU(GCAGCA 3121 )GCCCCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUCAGAC 3181 C|GUC(GUGA )GACAGGUCG GUCUCUACCC |GCGGGGGG- |UG-uGGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCU(UAGUAC (GAGA)GGAA C)GGGGCGC| CGCGGCCUCU GGUGUACCGG 3301 U|UGUCC-(U GGU)-GGGCA -acGCCGGGC A-GCUAAGCC GUA-GGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|GA A-AAGAGGCG GCC|gu|ugg gcgccggc-- 3421 -------(cu ggu)------ ---gccggcg ucca--cgag g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGCGG|GG GUGUGAGCCC Cgaggggg(c uuag-)cccu cgaGGGGU|G 3541 UAGCCC|G|C CGCUCCCAau cgcccgag-- |gccg----- -------||| || // LOCUS A.pern.K1 3592 bp RNA RNA 07-AUG-1998 DEFINITION Aeropyrum pernix.; 16S rRNA; 16S ribosomal RNA; 23S rRNA; 23S ribosomal RNA; rRNA intron-encoded homing endonuclease. ACCESSION AB008745 KEYWORDS 16S rRNA; 16S ribosomal RNA; 23S rRNA; 23S ribosomal RNA; rRNA intron-encoded homing endonuclease. SOURCE Aeropyrum pernix. ORGANISM Aeropyrum pernix. REFERENCE 1 (sites) AUTHORS Nomura,N., Sako,Y. and Uchida,A. TITLE hyperthermophilic archaeon Aeropyrum pernix K1 Molecular characterization and postsplicing fate of three introns within the single rRNA operon of the JOURNAL J. Bacteriol. 180 (14), 3635-3643 (1998) STANDARD No information REFERENCE 2 (bases 1 to 10717) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (06-NOV-1997) to the DDBJ/EMBL/GenBank databases. Norimichi Nomura, Kyoto University, Lab. of Marine Microbiology, Graduate School of Agriculture; Kitashirakawa-Oiwake-cho, Sakyo-ku, Kyoto, Kyoto 606-01, Japan (E-mail:j54718@sakura.kudpc.kyoto-u.ac. jp, Tel: 075-753-6219, Fax:075-753-6226) STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AB008745 BASE COUNT 577 a 947 c 1199 g 377 t 492 others ORIGIN 1 |-------gc gggccuacgg ca-ccuaugc cgCCCGGGGG AUG|GCUCGG |CUCGGG-|c 61 g|CCGAGGAA GGG|CGUG(G CAAGC)U|GC GAUAAGCCCG GGG|UAGGC( GCAGGCA)GC 121 CGUC-GAACC CGGGAUC|CC -CGAAUGGG( ACUU-)CCC| gcugcgggc- ----(gaaga 181 )-----gccc gc-agc-gcc ccc----(-- -guuu----- )-----gggg gcGGGAAC|C 241 CCCCGAAAG| GA(AGCA)UC UGAGUAGGGG GAGGAGAAGA AACC(AA--C A)GGGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA GAGCCCAaac caau-cccc- cacgg(gagg 361 a)ccgug-gg gg-gaugagg gguugc-ggc ccugcggcu- ggggc(uucg a)gccccggc 421 gggccgaccc acgguagcc- GAAgugggc- (uggaau)-g ccccgc|c(g ua)gaGGG(U 481 GAUAGC)CCC GUA-ggcgaa accguggggg cccg---ugc cgcag-ggca GAGUACCACG 541 GCU(---ugg uau)UGCCGU GGGAAG-CUG GGAG(Gaacc gA-)CUUCCA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGAACCCC(g 661 -gga)GGGGG GUG-AAAAGA GCCUGAAACC GGGCGGCgac a-uac-ggug cggccc--ga 721 aaggg--gug aaa-cccccg aaggaaaccc gg(gcga-)c cggg---gag uacgag-ggg 781 gaug-gaccg --gggucgca ccGUCCGU(C UUGAAA|C)A CGG|GCCGGG GAGUUCAC-G 841 GCCGUGGCGA GCUUAAgggg (--gucaa)c cccguAGGCG CAGG(GAAA) CCGacaggcc 901 cgcagccggg g(gcga)ccc cggcgagggg cggGGUCC(u aaua)GGGCC CGGAGUCACG 961 GCCGUGAGAC CAGAAACCGG GCGAUCUAGG CGGGGG|CAG GGCGAAGCCG GGG(GAAA)C 1021 CCCGGUGGAG GCCCGcaa-g gg--ugcuga (cgugca-aa )uc-gcu|cc ccuGACCCCC 1081 GUCUAGGGGU (GAAAG)GCC AAUCUAGCCC GGaaGAUAGC UGGUUCCCGC CGAAGUGGCU 1141 (CGCA)GGCC AGC|CCCGCC gga-|ggc|c ggGCCGUG|G GGUAG|AGcu acUGAUUGGG 1201 CGUgcaG|GG GGC(guaaa) GCCCC-CG-G CGCCCAGUCA AACUCCGAAC C|UGCGGCCG 1261 CCgu-aga|U GGCGGG-AGU GGGGUUCC-C CGGG(G-UAA G)CCCG-GGG GCCGAGAGGG 1321 (GAACAA)CC CAGACCGG-G GUUAAGGC|C CCUAAGUGCC GGCUA-AGUG cca---agcc 1381 AAAGGGCGUC CCCCGCCUAA GACAGCGGGG AGGUGGGC(C UAACAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCGGG GGGC|CCCGA AGAUUGGUCG 1501 GGGCUUAAGC CGGC|C|GCC GAGACC|CCG G|gccgcggc uc(cgaug-) gagccgcg-u 1561 gcGGUAGGCG GG|CGUCGGG GU-GGCGUAG AAGCCGGG-C C(GCGA)GGU CCGGUGG|AG 1621 CCGCCCCGAG CGCGGAUCCC GGC|GGUAGU AACagcg-aa GCGGGG(UGA GAA-U)CCC- 1681 CGC|CGCCGG AAaGGGCAAG GGUUCCCCGG CAAUG-(GUC GU)CA|GCCG GGGGUCAGC| 1741 CGG|-UCCUA ACCGGGA-CC G(UAACA)CG G--AUCCCGG GAAG-GGGAA ACGGG(UUAA 1801 CAUU)CCCGU |GC-cgcggg gg-uacgcuu cgcg(-gcaa )cgcaaggcc ccac-ccccg 1861 acgccucggg guaggcgg|a ccggg(ggcc ccc-uuaggg gggcga)ccc ggc|-uaacc 1921 ugccgaaggc CGGGGAGUG- CC(GUAAC)G G|CGAGAACC G--gccga-a ggggggaau- 1981 |ggc|ccgcc (--uaug)-g cggguu-ccg ccgaccccug -|gggcccuu gaaaaggggg 2041 uggggaa--- ggaucccccg c--g|C|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagcag )CCCAA-GGC GU||CUGGGG -GCUAACCCG GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g guccug-ggg -(-cucgaa) 2221 -cccugggac c--GCAGGU| C(GCA)GUGC CUAGGGGGGC C|UGACUGU( UUAACAAAA) 2281 ACAUAGCUCC CCG|CUAGCC C(GAAAA)GG GUGUGUACGG GGGGUGAGCC CUGGCCACU| 2341 GGCGGUCCGU GAA----acc gggg---(ua ca)----acc cgg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGAGU(AACC CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAACG GGUCAACGAG GUCCCCACUG UCCUGGCCCG 2521 GG-GCCCC|G UGAACCC|-C CUGA-GCCGA (-UGCACA-G U)CCGGCAUC CCCCCGCAGG 2581 GAGAGAAGAC CCCGUGG|AG CUUUACCGC| AGCCCGGCGU U|GGcccccg ggcggggguG 2641 CGUAGCGUAG GUGGGAGCCG AUG---AGCC ACCCUC(UCC G-)GGGGGUG G-GGAGGCGU 2701 CCCUGAAACA CCACC|CCCG CCCGGGGCCU UACCCCGC-- ---cccg(gg uggg)cggg- 2761 ---ggacagC GUCGGGUGGG CGGUUUGGCU G(GGG)CGGC AC|GCCCGC( GAAAGGGUAA 2821 C)ACGGGC|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AAC)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACCGGAC CCC(ugaaag caa-)GGGGU CCGGCCGG(G 2941 AAA)CCGUGG |CCUAGCGCA CGCUCGUGC- CCC(--cucg gug)GGG|GC CGGGC-acga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|AGUCGUCG CGGGCGAGA( GCCCCCA)UC 3061 GACCCCGCGG UUUGCUACCU CGAUGUC|GG CU|C|UUCCC |AUCC|UGGG GGU(GCAGCA 3121 )GCCCCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUCAGAC 3181 C|GUC(GUGA )GACAGGUCG GUCUCUACCC |GCGGGGGG- |UG-uGGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCU(UAGUAC (GAGA)GGAA C)GGGGCGC| CGCGGCCUCU GGUGUACCGG 3301 U|UGUCC-(U GGU)-GGGCA -acGCCGGGC A-GCUAAGCC GUA-GGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|GA A-AAGAGGCG GCC|gu|ugg gcgccggc-- 3421 -------(cu ggu)------ ---gccggcg ucca--cgag g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGCGG|GG GUGUGAGCCC Cgaggggg(c uuag-)cccu cgaGGGGU|G 3541 UAGCCC|G|C CGCUCCCAau cgcccgag-- |gccg-uagg cugcu--||| || // LOCUS A.pern.TB1 3592 bp RNA RNA 11-NOV-1998 DEFINITION Aeropyrum pernix.; 16S rRNA; 16S ribosomal RNA; 23S ribosomal RNA; 23S rRNA; internal transcribed spacer region. ACCESSION AB019553 KEYWORDS 16S rRNA; 16S ribosomal RNA; 23S ribosomal RNA; 23S rRNA; internal transcribed spacer region. SOURCE Aeropyrum pernix. ORGANISM Aeropyrum pernix. REFERENCE 1 (sites) AUTHORS Nomura,N., Sako,Y., Morinaga,Y., Kogishi,T. and Uchida,A. TITLE the hyperthermophilic archaeon Aeropyrum pernix, implying the presence of multiple hotspots for intron homing Intraspecific genetic polymorphism in the rRNA gene locus of JOURNAL Unpublished (1998) STANDARD No information REFERENCE 2 (bases 1 to 5268) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (05-NOV-1998) to the DDBJ/EMBL/GenBank databases. Norimichi Nomura, Kyoto University, Lab. of Marine Microbiology, Graduate School of Agriculture; Kitashirakawa-Oiwake-cho, Sakyo-ku, Kyoto 606-8502, Japan (E-mail:j54718@sakura.kudpc.kyoto-u.ac.jp, Tel: 81-75-753-6219, Fax:81-75-753-6226) STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AB019553 BASE COUNT 576 a 945 c 1196 g 374 t 501 others ORIGIN 1 |-------gc gggccuacgg ca-ccuaugc cgCCCGGGGG AUG|GCUCGG |CUCGGG-|c 61 g|CCGAGGAA GGG|CGUG(G CAAGC)U|GC GAUAAGCCCG GGG|UAGGC( GCAGGCA)GC 121 CGUC-GAACC CGGGAUC|CC -CGAAUGGG( ACUU-)CCC| gcugcgggc- ----(gaaga 181 )-----gccc gc-agc-gcc ccc----(-- -guuu----- )-----gggg gcGGGAAC|C 241 CCCCGAAAG| GA(AGCA)UC UGAGUAGGGG GAGGAGAAGA AACC(AA--C A)GGGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA GAGCCCAaac caau-cccc- cacgg(gagg 361 a)ccgug-gg gg-gaugagg gguugc-ggc ccugcggcu- ggggc(uucg a)gccccggc 421 gggccgaccc acgguagcc- GAAgugggc- (uggaau)-g ccccgc|c(g ua)gaGGG(U 481 GAUAGC)CCC GUA-ggcgaa accguggggg cccg---ugc cgcag-ggca GAGUACCACG 541 GCU(---ugg uau)UGCCGU GGGAAG-CUG GGAG(Gaacc gA-)CUUCCA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGAACCCC(g 661 -gga)GGGGG GUG-AAAAGA GCCUGAAACC GGGCGGCgac a-uac-ggug cggccc--ga 721 aaggg--gug aaa-cccccg aaggaaaccc gg(gcga-)c cggg---gag uacgag-ggg 781 gaug-gaccg --gggucgca ccGUCCGU(C UUGAAA|C)A CGG|GCCGGG GAGUUCAC-G 841 GCCGUGGCGA GCUUAAgggg (--gucaa)c cccguAGGCG CAGG(GAAA) CCGacaggcc 901 cgcagccggg g(gcga)ccc cggcgagggg cggGGUCC(u aaua)GGGCC CGGAGUCACG 961 GCCGUGAGAC CAGAAACCGG GCGAUCUAGG CGGGGG|CAG GGCGAAGCCG GGG(GAAA)C 1021 CCCGGUGGAG GCCCGcaa-g gg--ugcuga (cgugca-aa )uc-gcu|cc ccuGACCCCC 1081 GUCUAGGGGU (GAAAG)GCC AAUCUAGCCC GGaaGAUAGC UGGUUCCCGC CGAAGUGGCU 1141 (CGCA)GGCC AGC|CCCGCC gga-|ggc|c ggGCCGUG|G GGUAG|AGcu acUGAUUGGG 1201 CGUgcaG|GG GGC(guaaa) GCCCC-CG-G CGCCCAGUCA AACUCCGAAC C|UGCGGCCG 1261 CCgu-aga|U GGCGGG-AGU GGGGUUCC-C CGGG(G-UAA G)CCCG-GGG GCCGAGAGGG 1321 (GAACAA)CC CAGACCGG-G GUUAAGGC|C CCUAAGUGCC GGCUA-AGUG cca---agcc 1381 AAAGGGCGUC CCCCGCCUAA GACAGCGGGG AGGUGGGC(C UAACAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCGGG GGGC|CCCGA AGAUUGGUCG 1501 GGGCUUAAGC CGGC|C|GCC GAGACC|CCG G|gccgcggc uc(cgaug-) gagccgcg-u 1561 gcGGUAGGCG GG|CGUCGGG GU-GGCGUAG AAGCCGGG-C C(GCGA)GGU CCGGUGG|AG 1621 CCGCCCCGAG CGCGGAUCCC GGC|GGUAGU AACagcg-aa GCGGGG(UGA GAA-U)CCC- 1681 CGC|CGCCGG AAaGGGCAAG GGUUCCCCGG CAAUG-(GUC GU)CA|GCCG GGGGUCAGC| 1741 CGG|-UCCUA ACCGGGA-CC G(UAACA)CG G--AUCCCGG GAAG-GGGAA ACGGG(UUAA 1801 CAUU)CCCGU |GC-cgcggg gg-uacgcuu cgcg(-gcaa )cgcaaggcc ccac-ccccg 1861 acgccucggg guaggcgg|a ccggg(ggcc ccc-uuaggg gggcga)ccc ggc|-uaacc 1921 ugccgaaggc CGGGGAGUG- CC(GUAAC)G G|CGAGAACC G--gccga-a ggggggaau- 1981 |ggc|ccgcc (--uaug)-g cggguu-ccg ccgaccccug -|gggcccuu gaaaaggggg 2041 uggggaa--- ggaucccccg c--g|C|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagcag )CCCAA-GGC GU||CUGGGG -GCUAACCCG GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g guccug-ggg -(-cucgaa) 2221 -cccugggac c--GCAGGU| C(GCA)GUGC CUAGGGGGGC C|UGACUGU( UUAACAAAA) 2281 ACAUAGCUCC CCG|CUAGCC C(GAAAA)GG GUGUGUACGG GGGGUGAGCC CUGGCCACU| 2341 GGCGGUCCGU GAA----acc gggg---(ua ca)----acc cgg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGAGU(AACC CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAACG GGUCAACGAG GUCCCCACUG UCCUGGCCCG 2521 GG-GCCCC|G UGAACCC|-C CUGA-GCCGA (-UGCACA-G U)CCGGCAUC CCCCCGCAGG 2581 GAGAGAAGAC CCCGUGG|AG CUUUACCGC| AGCCCGGCGU U|GGcccccg ggcggggguG 2641 CGUAGCGUAG GUGGGAGCCG AUG---AGCC ACCCUC(UCC G-)GGGGGUG G-GGAGGCGU 2701 CCCUGAAACA CCACC|CCCG CCCGGGGCCU UACCCCGC-- ---cccg(gg uggg)cggg- 2761 ---ggacagC GUCGGGUGGG CGGUUUGGCU G(GGG)CGGC AC|GCCCGC( GAAAGGGUAA 2821 C)ACGGGC|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AAC)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACCGGAC CCC(ugaaag caa-)GGGGU CCGGCCGG(G 2941 AAA)CCGUGG |CCUAGCGCA CGCUCGUGC- CCC(--cucg gug)GGG|GC CGGGC-acga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|AGUCGUCG CGGGCGAGA( GCCCCCA)UC 3061 GACCCCGCGG UUUGCUACCU CGAUGUC|GG CU|C|UUCCC |AUCC|UGGG GGU(GCAGCA 3121 )GCCCCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUCAGAC 3181 C|GUC(GUGA )GACAGGUCG GUCUCUACCC |GCGGGGGG- |UG-uGGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCU(UAGUAC (GAGA)GGAA C)GGGGCGC| CGCGGCCUCU GGUGUACCGG 3301 U|UGUCC-(U GGU)-GGGCA -acGCCGGGC A-GCUAAGCC GUA-GGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|GA A-AAGAGGCG GCC|gu|ugg gcgccggc-- 3421 -------(cu ggu)------ ---gccggcg ucca--cgag g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGCGG|GG GUGUGAGCCC Cgaggggg(c uuag-)cccu cgaGGGGU|G 3541 UAGCCC|G|C CGCUCCCAau cgcccgag-- |gccg----- -------||| || // LOCUS A.pern.TB7 3592 bp RNA RNA 11-NOV-1998 DEFINITION Aeropyrum pernix.; 16S rRNA; 16S ribosomal RNA; 23S ribosomal RNA; 23S rRNA; internal transcribed spacer region. ACCESSION AB019554 KEYWORDS 16S rRNA; 16S ribosomal RNA; 23S ribosomal RNA; 23S rRNA; internal transcribed spacer region. SOURCE Aeropyrum pernix. ORGANISM Aeropyrum pernix. REFERENCE 1 (sites) AUTHORS Nomura,N., Sako,Y., Morinaga,Y., Kogishi,T. and Uchida,A. TITLE the hyperthermophilic archaeon Aeropyrum pernix, implying the presence of multiple hotspots for intron homing Intraspecific genetic polymorphism in the rRNA gene locus of JOURNAL Unpublished (1998) STANDARD No information REFERENCE 2 (bases 1 to 5324) AUTHORS Nomura,N. TITLE Direct Submission JOURNAL Submitted (05-NOV-1998) to the DDBJ/EMBL/GenBank databases. Norimichi Nomura, Kyoto University, Lab. of Marine Microbiology, Graduate School of Agriculture; Kitashirakawa-Oiwake-cho, Sakyo-ku, Kyoto 606-8502, Japan (E-mail:j54718@sakura.kudpc.kyoto-u.ac.jp, Tel: 81-75-753-6219, Fax:81-75-753-6226) STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AB019554 BASE COUNT 576 a 945 c 1196 g 374 t 501 others ORIGIN 1 |-------gc gggccuacgg ca-ccuaugc cgCCCGGGGG AUG|GCUCGG |CUCGGG-|c 61 g|CCGAGGAA GGG|CGUG(G CAAGC)U|GC GAUAAGCCCG GGG|UAGGC( GCAGGCA)GC 121 CGUC-GAACC CGGGAUC|CC -CGAAUGGG( ACUU-)CCC| gcugcgggc- ----(gaaga 181 )-----gccc gc-agc-gcc ccc----(-- -guuu----- )-----gggg gcGGGAAC|C 241 CCCCGAAAG| GA(AGCA)UC UGAGUAGGGG GAGGAGAAGA AACC(AA--C A)GGGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA GAGCCCAaac caau-cccc- cacgg(gagg 361 a)ccgug-gg gg-gaugagg gguugc-ggc ccugcggcu- ggggc(uucg a)gccccggc 421 gggccgaccc acgguagcc- GAAgugggc- (uggaau)-g ccccgc|c(g ua)gaGGG(U 481 GAUAGC)CCC GUA-ggcgaa accguggggg cccg---ugc cgcag-ggca GAGUACCACG 541 GCU(---ugg uau)UGCCGU GGGAAG-CUG GGAG(Gaacc gA-)CUUCCA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGAACCCC(g 661 -gga)GGGGG GUG-AAAAGA GCCUGAAACC GGGCGGCgac a-uac-ggug cggccc--ga 721 aaggg--gug aaa-cccccg aaggaaaccc gg(gcga-)c cggg---gag uacgag-ggg 781 gaug-gaccg --gggucgca ccGUCCGU(C UUGAAA|C)A CGG|GCCGGG GAGUUCAC-G 841 GCCGUGGCGA GCUUAAgggg (--gucaa)c cccguAGGCG CAGG(GAAA) CCGacaggcc 901 cgcagccggg g(gcga)ccc cggcgagggg cggGGUCC(u aaua)GGGCC CGGAGUCACG 961 GCCGUGAGAC CAGAAACCGG GCGAUCUAGG CGGGGG|CAG GGCGAAGCCG GGG(GAAA)C 1021 CCCGGUGGAG GCCCGcaa-g gg--ugcuga (cgugca-aa )uc-gcu|cc ccuGACCCCC 1081 GUCUAGGGGU (GAAAG)GCC AAUCUAGCCC GGaaGAUAGC UGGUUCCCGC CGAAGUGGCU 1141 (CGCA)GGCC AGC|CCCGCC gga-|ggc|c ggGCCGUG|G GGUAG|AGcu acUGAUUGGG 1201 CGUgcaG|GG GGC(guaaa) GCCCC-CG-G CGCCCAGUCA AACUCCGAAC C|UGCGGCCG 1261 CCgu-aga|U GGCGGG-AGU GGGGUUCC-C CGGG(G-UAA G)CCCG-GGG GCCGAGAGGG 1321 (GAACAA)CC CAGACCGG-G GUUAAGGC|C CCUAAGUGCC GGCUA-AGUG cca---agcc 1381 AAAGGGCGUC CCCCGCCUAA GACAGCGGGG AGGUGGGC(C UAACAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCGGG GGGC|CCCGA AGAUUGGUCG 1501 GGGCUUAAGC CGGC|C|GCC GAGACC|CCG G|gccgcggc uc(cgaug-) gagccgcg-u 1561 gcGGUAGGCG GG|CGUCGGG GU-GGCGUAG AAGCCGGG-C C(GCGA)GGU CCGGUGG|AG 1621 CCGCCCCGAG CGCGGAUCCC GGC|GGUAGU AACagcg-aa GCGGGG(UGA GAA-U)CCC- 1681 CGC|CGCCGG AAaGGGCAAG GGUUCCCCGG CAAUG-(GUC GU)CA|GCCG GGGGUCAGC| 1741 CGG|-UCCUA ACCGGGA-CC G(UAACA)CG G--AUCCCGG GAAG-GGGAA ACGGG(UUAA 1801 CAUU)CCCGU |GC-cgcggg gg-uacgcuu cgcg(-gcaa )cgcaaggcc ccac-ccccg 1861 acgccucggg guaggcgg|a ccggg(ggcc ccc-uuaggg gggcga)ccc ggc|-uaacc 1921 ugccgaaggc CGGGGAGUG- CC(GUAAC)G G|CGAGAACC G--gccga-a ggggggaau- 1981 |ggc|ccgcc (--uaug)-g cggguu-ccg ccgaccccug -|gggcccuu gaaaaggggg 2041 uggggaa--- ggaucccccg c--g|C|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagcag )CCCAA-GGC GU||CUGGGG -GCUAACCCG GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g guccug-ggg -(-cucgaa) 2221 -cccugggac c--GCAGGU| C(GCA)GUGC CUAGGGGGGC C|UGACUGU( UUAACAAAA) 2281 ACAUAGCUCC CCG|CUAGCC C(GAAAA)GG GUGUGUACGG GGGGUGAGCC CUGGCCACU| 2341 GGCGGUCCGU GAA----acc gggg---(ua ca)----acc cgg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGAGU(AACC CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAACG GGUCAACGAG GUCCCCACUG UCCUGGCCCG 2521 GG-GCCCC|G UGAACCC|-C CUGA-GCCGA (-UGCACA-G U)CCGGCAUC CCCCCGCAGG 2581 GAGAGAAGAC CCCGUGG|AG CUUUACCGC| AGCCCGGCGU U|GGcccccg ggcggggguG 2641 CGUAGCGUAG GUGGGAGCCG AUG---AGCC ACCCUC(UCC G-)GGGGGUG G-GGAGGCGU 2701 CCCUGAAACA CCACC|CCCG CCCGGGGCCU UACCCCGC-- ---cccg(gg uggg)cggg- 2761 ---ggacagC GUCGGGUGGG CGGUUUGGCU G(GGG)CGGC AC|GCCCGC( GAAAGGGUAA 2821 C)ACGGGC|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AAC)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACCGGAC CCC(ugaaag caa-)GGGGU CCGGCCGG(G 2941 AAA)CCGUGG |CCUAGCGCA CGCUCGUGC- CCC(--cucg gug)GGG|GC CGGGC-acga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|AGUCGUCG CGGGCGAGA( GCCCCCA)UC 3061 GACCCCGCGG UUUGCUACCU CGAUGUC|GG CU|C|UUCCC |AUCC|UGGG GGU(GCAGCA 3121 )GCCCCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUCAGAC 3181 C|GUC(GUGA )GACAGGUCG GUCUCUACCC |GCGGGGGG- |UG-uGGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCU(UAGUAC (GAGA)GGAA C)GGGGCGC| CGCGGCCUCU GGUGUACCGG 3301 U|UGUCC-(U GGU)-GGGCA -acGCCGGGC A-GCUAAGCC GUA-GGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|GA A-AAGAGGCG GCC|gu|ugg gcgccggc-- 3421 -------(cu ggu)------ ---gccggcg ucca--cgag g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGCGG|GG GUGUGAGCCC Cgaggggg(c uuag-)cccu cgaGGGGU|G 3541 UAGCCC|G|C CGCUCCCAau cgcccgag-- |gccg----- -------||| || // LOCUS S.marinu.1 3592 bp RNA RNA 11-JAN-191 DEFINITION Staphylothermus marinus. ACCESSION No information KEYWORDS No information. SOURCE Staphylothermus marinus. ORGANISM Staphylothermus marinus. REFERENCE 1 (sites) AUTHORS Kjems,J. and Garrett,R.A. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: M38363 LOCUS SPYRRGQ 2158 bp ds-DNA BCT 25-JAN-1991 DEFINITION S.marinus 23S rRNA gene, introns. ACCESSION M38363 KEYWORDS 23S ribosomal RNA. SOURCE Staphylothermus marinus DNA. ORGANISM Staphylothermus marinus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Thermoproteales; Desulfurococcaceae. REFERENCE 1 (bases 1 to 2158) AUTHORS Kjems,J. and Garrett,R.A. TITLE New ribosomal RNA introns in archaea and evidence for RNA conformational changes associated with splicing JOURNAL Proc. Natl. Acad. Sci. U.S.A. (1990) In press STANDARD simple staff_entry COMMENT Draft entry and computer-readable sequence for [Proc. Natl. Acad. Sci. U.S.A. (1990) In press] kindly submitted by R.A.Garrett, 07-SEP-1990. FEATURES Location/Qualifiers rRNA join(<1..681,738..1411, 1466..>2158) /partial /product="23s rRNA" intron 682..737 intron 1412..1465 BASE COUNT 447 a 645 c 727 g 339 t ORIGIN SPYRGGA:LOCUS SPYRGGA 2828 bp ds-DNA BCT 06-FEB-1992 SPYRGGA:DEFINITION S.marinus 23S ribosomal RNA, partial cds. SPYRGGA:ACCESSION M86623 SPYRGGA:KEYWORDS 23S ribosomal RNA. SPYRGGA:SOURCE Staphylothermus marinus DNA. SPYRGGA: ORGANISM Staphylothermus marinus SPYRGGA: Prokaryota; Archaeobacteria; Thermoproteales. SPYRGGA:REFERENCE 1 (bases 1 to 2828) SPYRGGA: AUTHORS Kjems,J., Larsen,N., Dalgaard,J.Z., Garrett,R.A. and Stetter,K.O. SPYRGGA: TITLE Phylogenetic relationships amongst the hyperthermophilic archaea SPYRGGA: determined from partial 23S rRNA gene sequences SPYRGGA: JOURNAL Syst. Appl. Microbiol. (1992) In press SPYRGGA: STANDARD full staff_review SPYRGGA:FEATURES Location/Qualifiers SPYRGGA: rRNA join(<1..681,738..1411,1466..1789) SPYRGGA: /product="23S ribosomal RNA" SPYRGGA: /standard_name="23S rRNA" SPYRGGA: /partial SPYRGGA: intron 682..737 SPYRGGA: /number=1 SPYRGGA: /standard_name="23S rRNA" SPYRGGA: intron 1412..1465 SPYRGGA: /number=2 SPYRGGA: /standard_name="23S rRNA" SPYRGGA: 3'UTR 1790..2828 SPYRGGA: /standard_name="23S rRNA" SPYRGGA:BASE COUNT 649 a 789 c 809 g 572 t 9 others SPYRGGA:ORIGIN BASE COUNT 324 a 513 c 618 g 226 t 1911 others ORIGIN 1 |~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~|~~~~~~ |~~~~~~~|~ 61 ~|~~~~~~~~ ~~~|~~~~(~ ~~~~~)~|~~ ~~~~~~~~~~ ~~~|~~~~~( ~~~~~~~)~~ 121 ~~~~~~~~~~ ~~~~~~~|~~ ~~~~~~~~~( ~~~~~)~~~| ~~~~~~~~~~ ~~~~(~~~~~ 181 )~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~(~~ ~~~~~~~~~~ )~~~~~~~~~ ~~~~~~~~|~ 241 ~~~~~~~~~| ~~(~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~)~~~~~~|~ 301 ~~~~~~~~~~ ~(~~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ 361 ~)~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ ~)~~~~~~~~ 421 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~)~~ ~~~~~~|~(~ ~~)~~~~~(~ 481 ~~~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 541 ~~~(~~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ 601 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~(~ 661 ~~~~)~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 721 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~(~~~~~)~ ~~~~~~~~~~ ~~~~~~~~~~ 781 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~|~)~ ~~~|~~~~~~ ~~~~~~~~~~ 841 ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~~)~ ~~~~~~~~~~ ~~~~(~~~~) ~~~~~~~~~~ 901 ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~)~~~~~ ~~~~~~~~~~ 961 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~|~~~ ~~~~~~~~~~ ~~~(~~~~)~ 1021 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~~~~ )~~~~~~|~~ ~~~~~~~~~~ 1081 ~~~~~~~~~~ (~~~~~)~~~ ~~~~~~~~~~ ~~-~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1141 (~~~~)~~~~ ~~~|~~~~~~ ~~~~|~~~|- ~~~~~~~~|~ ~~~~~|~~~~ ~~~~~~~~~~ 1201 ~~~~~~~|~~ ~~~(~~~~~) ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~|~~~~~~~~ 1261 ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~-~ ~~~~(~~~~~ ~)~~~~~~~~ ~~~~~~~~~~ 1321 (~~~~~~)~~ ~~~~~~~~~~ ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1381 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~~~)~ ~~~~~~~|~( 1441 ~~~)~|~~~~ ~~(~~~~~~) ~~|~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~~ ~~~~~~~~~~ 1501 ~~~~~~~~~~ ~~~~|~|~~~ ~~~~~~|~~~ ~|~~~~~~~~ ~~(~~~~~~) ~~~~~~~~~~ 1561 ~~~~~~~~~~ ~~|~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~|~~ 1621 ~~~~~~~~~~ ~~~GGAUCCC GGC|GGCAGU AGCagcg-aa GAGGGG(UGG AAA-U)CCC- 1681 CUC|CGCCGG AAaGGGCCAG GGUUCCUCGG CAACG-(GUC GU)CG|GCCG AGGGUUAGC| 1741 CGG|-CCCUA ACCCGGG-CC G(UAACA)CG G--ACCCGGG GAAG-GGGAA ACGGG(UUAA 1801 UAUU)CCCGU |GC-cgcggg gg-uaga--- cgcg(-gcaa )cgcaa-gcc cggc-cuccg 1861 acgccuccgg auaggcgg|a ccggg(ggca ccc-gaaugg gugcga)ccc ggc|-uaacc 1921 ggu-gaaggc CCUGGAGAA- CC(GUAAU)G G|UGAGAAGG G--gccga-a gccgggaau- 1981 |ggg|acccc (--guau)gg ggucuu-ccg cugacuccgg -|gggcccau gaaaaggggg 2041 ccgggaa--- cgaucccccg c--g|U|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugaguag )CCCCA-GGC Gu||cuGGGG -GAUAACCCG GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCCGC---g gucugg-agg -(caaaaac) 2221 -ccucuggac c--GCGGGU| C(GCA)GUGC CUAGGGGGGC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CAAGCC C(GAA-A)GG GUGUGUACGG GGGCUGAAUC CUGGCCACU| 2341 GGCGGUCCGU GAA----acc gggg---(ua ca)----acc cgg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGAGU(AACU CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UG-----AG U)UCCGGC|G CGCAUGAAUG GAUCAACGAG GUCCCCACUG UCCUGGCCCG 2521 GG-GCCCC|G CGAAUAC|-A CGGA-GCCGG (-UGCACA-G G)CCGGCAAC CCCCCGCAGG 2581 GCGAUAAGAC CCCGUGG|AG CUUCACCGC| AGCCUGGCGU U|GGCCUCCG GGCUGCGGCG 2641 CGUAGCGUAG GUGGGAGCCA GUG---AACC CGCCCC(UCC G-)GGGGCGG G-GGAGGCGC 2701 CAAUGAAACA CCACC|CACC GCAGCCCGGA GGCCuuaccc -ccgggg(-- aga-)ucccg 2761 gggggacAGC GUCAGGUGGG CGGUUUGGCU G(GGG)CGGC AC|GCCCGC( GAAAAGGUAA 2821 C)ACGGGC|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AGC)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC UUGACCGGAC CCc(ggaaca caa-)cGGGU CCGGCCGG(G 2941 AAA)CCGCGG |CCUAGCGAA CGCUCGUGC- CCC(-ccucg gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|AGUCGUCG CGGGCGAGA( GCUCCCA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |AUCC|UGGG AGU(GCAGCA 3121 )GCUCCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GACUCUACCC |GCGGGGGG- |UG-UGGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCU(CAGUAC (GAGA)GGAA C)GGGGCGC| CGCGGCCAAU GGUGUACCGG 3301 U|UGUCC-(G GCA)-GGGCA -uaGCCGGGC ACGCCA-GCC GUA-GGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|GA A-AAGAGGCG GCC|gu|ccg ccgaagggg- 3421 -------(uu u--)------ --ccucuucg gcgg--guag g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGCGG|GG GUGUAAGCCC Cgagggag(c auugu)cucc caaGGGGU|U 3541 UAGCCC|G|C CGCUCCCAaa agcccga--c |gccgcaggc uu-----||| || // LOCUS S.marinu.2 3592 bp RNA RNA 14-JAN-1993 DEFINITION Staphylothermus marinus.. ACCESSION No information KEYWORDS No information. SOURCE Staphylothermus marinus. ORGANISM Staphylothermus marinus. REFERENCE 1 (sites) AUTHORS Dalgaard,J.Z. and Garrett,R.A. TITLE stable circles in the hyperthermophilic archaeon Pyrobaculum organotrophum Protein-coding introns from the 23S rRNA-encoding gene form JOURNAL Gene 121, 103-110 (1992) STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: M86623 LOCUS SPYRGGA 2828 bp DNA BCT 14-JAN-1993 DEFINITION S.marinus 23S ribosomal RNA, partial cds. ACCESSION M86623 NID g152908 KEYWORDS 23S ribosomal RNA. SOURCE Staphylothermus marinus DNA. ORGANISM Staphylothermus marinus Archaea; Crenarchaeota; Thermoproteales; Staphylothermus. REFERENCE 1 (bases 1 to 2828) AUTHORS Dalgaard,J.Z. and Garrett,R.A. TITLE Protein-coding introns from the 23S rRNA-encoding gene form stable circles in the hyperthermophilic archaeon Pyrobaculum organotrophum JOURNAL Gene 121, 103-110 (1992) MEDLINE 93051344 FEATURES Location/Qualifiers source 1..2828 /organism="Staphylothermus marinus" /db_xref="taxon:2280" rRNA join(<1..681,738..1411,1466..1789) /gene="23S rRNA" /product="23S ribosomal RNA" gene join(<1..681,738..1411,1466..1789) /gene="23S rRNA" intron 682..737 /gene="23S rRNA" /number=1 intron 1412..1465 /gene="23S rRNA" /number=2 3'UTR 1790..2828 /gene="23S rRNA" BASE COUNT 649 a 789 c 809 g 572 t 9 others BASE COUNT 324 a 513 c 618 g 224 t 1913 others ORIGIN 1 |~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~|~~~~~~ |~~~~~~~|~ 61 ~|~~~~~~~~ ~~~|~~~~(~ ~~~~~)~|~~ ~~~~~~~~~~ ~~~|~~~~~( ~~~~~~~)~~ 121 ~~~~~~~~~~ ~~~~~~~|~~ ~~~~~~~~~( ~~~~~)~~~| ~~~~~~~~~~ ~~~~(~~~~~ 181 )~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~(~~ ~~~~~~~~~~ )~~~~~~~~~ ~~~~~~~~|~ 241 ~~~~~~~~~| ~~(~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~)~~~~~~|~ 301 ~~~~~~~~~~ ~(~~~~~)~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ 361 ~)~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~(~~~~ ~)~~~~~~~~ 421 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~)~~ ~~~~~~|~(~ ~~)~~~~~(~ 481 ~~~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 541 ~~~(~~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~~~)~~~~~~ ~~~~~~~~~~ 601 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~(~ 661 ~~~~)~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 721 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~(~~~~~)~ ~~~~~~~~~~ ~~~~~~~~~~ 781 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~|~)~ ~~~|~~~~~~ ~~~~~~~~~~ 841 ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~~)~ ~~~~~~~~~~ ~~~~(~~~~) ~~~~~~~~~~ 901 ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~)~~~~~ ~~~~~~~~~~ 961 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~|~~~ ~~~~~~~~~~ ~~~(~~~~)~ 1021 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ (~~~~~~~~~ )~~~~~~|~~ ~~~~~~~~~~ 1081 ~~~~~~~~~~ (~~~~~)~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1141 (~~~~)~~~~ ~~~|~~~~~~ ~~~~|~~~|~ ~~~~~~~~|~ ~~~~~|~~~~ ~~~~~~~~~~ 1201 ~~~~~~~|~~ ~~~(~~~~~) ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~|~~~~~~~~ 1261 ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~(~~~~~ ~)~~~~~~~~ ~~~~~~~~~~ 1321 (~~~~~~)~~ ~~~~~~~~~~ ~~~~~~~~|~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ 1381 ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~~~~~~~~(~ ~~~~~~~~)~ ~~~~~~~|~( 1441 ~~~)~|~~~~ ~~(~~~~~~) ~~|~~~~~~~ ~~~~~~~~~~ ~~~~|~~~~~ ~~~~~~~~~~ 1501 ~~~~~~~~~~ ~~~~|~|~~~ ~~~~~~|~~~ ~|~~~~~~~~ ~~(~~~~~~) ~~~~~~~~~~ 1561 ~~~~~~~~~~ ~~|~~~~~~~ ~~~~~~~~~~ ~~~~~~~~~~ ~(~~~~)~~~ ~~~~~~~|~~ 1621 ~~~~~~~~~~ ~~~GGAUCCC GGC|GGCAGU AGCagcg-aa GAGGGG(UGG AAA-U)CCC- 1681 CUC|CGCCGG AAaGGGCCAG GGUUCCUCGG CAACG-(GUC GU)CG|GCCG AGGGUUAGC| 1741 CGG|-CCCUA ACCCGGG-CC G(UAACA)CG G--ACCCGGG GAAG-GGGAA ACGGG(UUAA 1801 UAUU)CCCGU |GC-cgcggg gg-uaga--- cgcg(-gcaa )cgcaa-gcc cggc-cuccg 1861 acgccuccgg auaggcgg|a ccggg(ggca ccc-gaaugg gugcga)ccc ggc|-uaacc 1921 ggu-gaaggc CCUGGAGAA- CC(GUAAU)G G|UGAGAAGG G--gccga-a gccgggaau- 1981 |ggg|acccc (--guau)gg ggucuu-ccg cugacuccgg -|gggcccau gaaaaggggg 2041 ccgggaa--- cgaucccccg c--g|U|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugaguag )CCCCA-GGC Gu||cuGGGG -GAUAACCCG GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCCGC---g gucugg-agg -(caaaaac) 2221 -ccucuggac c--GCGGGU| C(GCA)GUGC CUAGGGGGGC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CAAGCC C(GAA-A)GG GUGUGUACGG GGGCUGAAUC CUGGCCACU| 2341 GGCGGUCCGU GAA----acc gggg---(ua ca)----acc cgg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGAGU(AACU CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UG-----AG U)UCCGGC|G CGCAUGAAUG GAUCAACGAG GUCCCCACUG UCCUGGCCCG 2521 GG-GCCCC|G CGAAUAC|-A CGGA-GCCGG (-UGCACA-G G)CCGGCAAC CCCCCGCAGG 2581 GCGAUAAGAC CCCGUGG|AG CUUCACCGC| AGCCUGGCGU U|GGCCUCCG GGCUGCGGCG 2641 CGUAGCGUAG GUGGGAGCCA GUG---AACC CGCCCC(UCC G-)GGGGCGG G-GGAGGCGC 2701 CAAUGAAACA CCACC|CACC GCAGCCCGGA GGCCuuaccc -ccgggg(-- aga-)ucccg 2761 gggggacAGC GUCAGGUGGG CGGUUUGGCU G(GGG)CGGC AC|GCCCGC( GAAAAGGUAA 2821 C)ACGGGC|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AGC)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC UUGACCGGAC CCc(ggaaca caa-)cGGGU CCGGCCGG(G 2941 AAA)CCGCGG |CCUAGCGAA CGCUCGUGC- CCC(-ccucg gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|AGUCGUCG CGGGCGAGA( GCUCCCA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |AUCC|UGGG AGU(GCAGCA 3121 )GCUCCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GACUCUACCC |GCGGGGGG- |UG-UGGGCC GCCU|GA|GG 3241 GGAA-GGUGC CCU(CAGUAC (GAGA)GGAA C)GGGGCGC| CGCGGCCAAU GGUGUACCGG 3301 U|UGUCC-(G GCA)-GGGCA -uaGCCGGGC ACGCCA-GCC GUA-GGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|GA A-AAGAGGCG GCC|gu|ccg ccgaagggg- 3421 -------(uu u--)------ --ccucuucg gcgg--guag g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGCGG|GG GUGUAAGCCC Cgagggag(c auugu)cucc caaGGGGU|U 3541 UAGCCC|G|C CGCUCCCAaa agcccga--c |gccgcaggc ~~~~~~~||| || // LOCUS D.mobilis 3592 bp RNA RNA 16-JAN-1986 DEFINITION Desulfurococcus mobilis. ACCESSION No information KEYWORDS No information. SOURCE Desulfurococcus mobilis. ORGANISM Desulfurococcus mobilis. REFERENCE 1 (sites) AUTHORS No information JOURNAL Nov 16, 1986 STANDARD No information COMMENTS Organism information Culture collection: ? Sequence information (bases 1 to 3592) Corresponding GenBank entry: X05480 Phylo:Archaebacteria,Sulfur-dependent,Desulfurococcus BASE COUNT 622 a 915 c 1138 g 406 t 511 others ORIGIN 1 |--------- ---------c g--acGAcGC CGCCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|cCGAGGAA GGC|CGUG(G CAAGC)U|GC GAUAAGCCCG GGG|uAGGC( GCAGGCA)GC 121 CGUu-GAACC CGGGAUC|GC -CGAAUGGG( ACUU-)CCC| gccacgggu- ----(uaa-- 181 )-----accc gu-ggcugcc cgggaaa(cc cgcg--aggg )gaguaccgg gcggGAAC|C 241 CCCCGAACG| GA(AACA)UC UUAGUAGGGG GAGGAGAAGA AACC(AA--A C)GGGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA CAGCCCAaac caaa-ucuc- cacgg(gacg 361 a)ccgug-ga ga-gaugugg gguuacaggc uccacggcu- -gggg(cucg a)cccc-agc 421 aggccuaccc aucguagcc- GAAguggcc- (uggaaa)-g gcccgc|c(g ua)gaGGG(U 481 GACAGC)CCC GUA-ggcuaa acgauggggg ccug---cgc cgugg-agca GAGUACCACG 541 GCu(---ugg uuu)uGCCGU GGGAAG-UCG GGGG(Ucacc gA-)CCUCCA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG UACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGCACCCC(g 661 caag)GGGG- GUG-AAAAGA GCCUGAAACC GGGCGGCuac a-uac-ggca cggcccu-ca 721 aggag---ug aagcccgcua aaggaaaccg gg(guga-)c ccgg---gag uacgag-gcg 781 ggcu-gacca --gggucgug ccGUCCGU(C UUGAAA|C)A CGG|GCCGGG GAGUCCAC-G 841 GCAGUGGCGA GCCUAAgggg (gucaaag)c cccGGAGGCG UAGG(GAAA) CCGacaggcc 901 cgcaaccgcc -(gcaa)-gg cggcgagggg cggGGUCC(c aaa-)GGGCC CGGAGUCACU 961 GCCGUGGGAC CAGAAACCGG GCGAUCUAGG CGGGGG|CAG GGCGAAGCCG GGG(GAAA)C 1021 CCCGGUGGAG GCCCGaaa-g gguu-cu-ga (cgugca-au )ucgu-u|cc cauGACCCCC 1081 GUCUAGGGGC (AAAAG)ACC AAUCUAGCCC GG-UGAUAGC UGGUUCCCGC CGAAGUGGGU 1141 (CUCA)GCCC AGC|CCCGCC Gga-|ggu|- ggGCCACG|G GGUAG|AGCU ACGGAUCGGG 1201 gaugcGG|AG GCC(gaaa-) GGCCU-Cg-G UCCCCGGUCC AACUCCGAAC C|UGUGGCca 1261 ccgu-aga|U GGCGGG-AGU GGGGUUCC-C CGGC(G-UAA G)GUUG-GGG GCCGCGAGGG 1321 (GAACAA)CC CAGACCGG-G GUUAAGGC|C CCAAAGUGCC GGCUA-AGUG cua---acca 1381 AAAGGGCGUC CCGCGGCUUA GACAGCGGGG AGGUGGGC(C UAACAGCA)G CCAUCCU|U( 1441 UAA)A|GAGU GC(GUAACA) GC|UCACCCG CCGAGCCGCG GGGC|CCCGA AGAUUGGUCG 1501 GGGCUcAAGC CGGC|C|GCC GAGACC|CCG G|ggcacggc uc(cgaug-) gagccgugau 1561 ccGGUAGGCG GG|CGCCGGG UC-GGGGCAG AAGCCGGG-C C(GUGA)GGU CCGGUGG|AC 1621 CCGAUCCGGG UGAAGAUCCC GGC|GGCAGU AACagcg-aa GAGGGG(UGU GAA-U)CCC- 1681 CUC|CGCCGG AAaGGGCAAG GGUUCCUCGG CAACG-(GUC GU)CG|GCCG AGGGUUAGC| 1741 CGG|-UCCUA ACCCGGG-CC G(UAACA)CG G--ACCCGGG GAAU-GGGAA ACGGG(UUAA 1801 UAUU)CCCGU |GC-cacggg gg-uagg--- ugcg(-gcaa )cgcaa-gcc ccgc-cucug 1861 acgccucugg auaggcgg|a cuggg(ggca ccc-guaugg gugcga)ccc agc|-uaacc 1921 ggu-gaaggc CCCGGAGUA- CC(GUCAU)G G|UGAGAAGG G--gccga-a gccgggaau- 1981 |ggg|gcucc (--cuug)gg agccuu-ccg ccgauuccag -|gggcccuu gaaaaggggg 2041 cggggaa--- cgaucccccg u--g|A|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(uuagaag )CCCAA-GGC Gu||guGGGG -GCUAACCCA GGCCAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g gucuug-ggg -(--cacac) 2221 -cccugggac c--GCAGGC| C(GCA)GUGC CUAGGGGGGC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CAAGCC C(GAA-A)GG GUUUGUACGG GGGCUGAAUC CUGGCCACU| 2341 GGCGGUCCGU GAA----acc gggg---(uc ca)----acc cgg----cGA AGCG|CCGCU 2401 GAAG|GCC|G GGAGU(AACU CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UAa+gagAG U)UCCGGC|G CGCAUGAAUG GAUCAACGAG GUCCCCACUG UCCUGGCCUG 2521 GG-GCCCC|G UGAAGAC|-A CGGA-GCCGG (-UGCAUA-G G)CCGGCAAC CCCCCGCAGG 2581 GCGAUAAGUC CCCGUGG|AG CUUCACCGC| AGCCUGGCGC U|GGCAUCCG GGUCGCGGCG 2641 CGUAGCGUAG GUGGGAGCCU GUG---AUGC CGUGCC(UUC G-)GGCACGG U-GGAGGCGG 2701 CAAUGAAACA CCACC|CACC GUGACCUGGA UGCCuuac-- -ucccgg(-g uag-)ccggg 2761 a--ggacAAC GCCAGGUGGG CGGUUCGGCU G(GGG)CGGC AC|GCCCGC( GAAAAGAUAA 2821 C)ACGGGC|G |C|CCAAAGG UCGGC|UCAG GCGGG(UCAG AGC)UCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACCGGAC CCU(ugaaca caa-)GGGGU CCGGCCGG(G 2941 AAA)CCGCGG |CCUAGCGAA CGCUCGuGC- CCC(-ccucg gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|AGUCGUCG CGGGCGAGA( GCUCCCA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |AUCC|UGGG GGU(GCAGCA 3121 )GCCCCCAAG GGU|GGGGC( UGCCC)GCCC AUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GACUCUACCC |GCGGGGGG- |UG-CGGGCC GCCU|GA|GG 3241 GGAA-GCUGC CCU(CAGUAC (GAGA)GGAA C)GGGGCGG| CGCGGCCUCU GGUUUACCGG 3301 U|UGUCC-(U GGC)-GGGCA -acGCCGGGC A-GCCACGCC GCA-AGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|GA A-AAGAGGCG GCC|ug|ccg ucacgg---- 3421 -------(gu ua-)------ -----ccgug gcgg--cgcg g|gc|UCCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGCGG|GG GUGUAAGCCC -aaggg--(g uua--)--cc cgaGGGGU|U 3541 CAGCCC|G|C CGCUCCCAag agcccgaa-- |cg------c cg-----||| || // LOCUS S.shibatae 3592 bp RNA RNA 11-JAN-191 DEFINITION Sulfolobus shibatae. ACCESSION No information KEYWORDS No information. SOURCE Sulfolobus shibatae. ORGANISM Sulfolobus shibatae. REFERENCE 1 (sites) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: U32321 LOCUS SSU32321 3157 bp DNA BCT 15-AUG-1995 DEFINITION Sulfolobus shibatae 23S rRNA gene, partial sequence. ACCESSION U32321 KEYWORDS . SOURCE Sulfolobus shibatae. ORGANISM Sulfolobus shibatae Archaea; Crenarchaeota; Sulfolobales; Sulfolobus. REFERENCE 1 (bases 1 to 3157) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. TITLE Phylogenetic analysis of the Archaeal order of Sulfolobales based on sequences of 23S rRNA genes and 16S/23S rRNA spacers JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 3157) AUTHORS Trevisanato,S.I. TITLE Direct Submission JOURNAL Submitted (25-JUL-1995) Siro I. Trevisanato, Princess Margaret Hospital, Ontario Cancer Institute, 500 Sherbourne Street, Toronto, Ontario M4X 1K9, Canada COMMENT NCBI gi: 942630 FEATURES Location/Qualifiers source 1..3157 /organism="Sulfolobus shibatae" /isolate="DSM 5389" misc_feature <1..185 /note="16S/23S rRNA spacer region" rRNA 186..>3157 /product="23S rRNA" BASE COUNT 696 a 845 c 1089 g 512 t 15 others ORIGIN BASE COUNT 646 a 800 c 1043 g 468 t 635 others ORIGIN 1 |--------- -cgacuaggg gc-ACCAAGC CAGCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|cUGACGAA GGG|CGCG(G CAAGU)G|GC GAAAUGCCCG GGG|UAGGC( ACACGCU)GC 121 CUUU-GAACC CGGGAUN|CC CCGAAUGGg- -a----CCU| -cccnc---- ----(-uuu- 181 )--------g -g-ggn-gca cccga-c(uc -gnnna---g )aguucgggu guGGGAAC|U 241 CCCCGAACG| GA(AACA)UC UUAGUAGGGG AAGGAGGAGA AAUC(AA--C C)GAGA-C|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA NAGCCCAaac caaa-cccg- ccggc(gaca 361 a)gucgg-ug gg-aaugugg uguuauuggc cuccagucu- -gguu(-uaa -)agcc-gac 421 cucccuggcc uaccuagcc- GAAcucucc- (uggaau)-g gagggc|c(a ua)gaGGG(U 481 GACAGC)CCC GUA-ggcuaa agguaggugg gagg---ugg cugga-ggca GAGUACCAUC 541 CCC(---ugg uuc)GGGGGU GGGAAG-UUG GGGG(Ccacg ug-)CCUCCA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC C(GUGA)GGG AAAGCUGCAA AGUACCCC(G 661 GAAN)NGGGG GGUGAAAAGA GCCUGAAACC GGCUGGUCau aguag-ggca aggcuc--ga 721 aagga--gug aagucucccg aaggaaagag gc(gcga-)g ccnnnucgag uacgag-gga 781 gaug-gaccg --gggucuug ccUUUCGU(C CUGAAA|C)A CGG|GCCGGG GAGUUCAC-G 841 UCAGUGGCGA GCCUAAgggg (--guuaa)c cccGAAGGCA UAGG(GAAA) CCGag-ugcc 901 cgcaacccgg -(gaaa)-cc gggugagggg cagGGUCU(g uca-)GGGCC UAAAGUCACU 961 GGCGUGAGGC UAGAAACCGG GCGAUCUAGG CCGGGG|CAG GCCGAAGCCG GGG(GAAA)C 1021 CUCGGUGGAG GGCCGaauag gg-guucuga (cgugca-au )uc-guu|cc ccuGACCCCG 1081 GUCUAGGGGC (GAAAG)ACC AAUCUAGCUC GG-UGAUAGC UAGUUCCCCC CGAAAUGCGU 1141 (CCUA)GCGC AGC|CUCCCU gga-|ggu|- ugCCUACG|G GGUAG|AGua acUGAUUGGG 1201 GGU--UC|GG AGC(gaaa-) GCUCC-GG-G CUUCCAGUCA AACUCCGAAC U|CGUAGGcg 1261 ccga-aga|a GGGGAG-AGU GGGCCACU-C -GGC(G-UAA G)GUUG-GGU GGCGAAAGGG 1321 (AAACAG)CC CAGACUUG-G GUUAAGGC|C CCAAAAUGCC GGCUA-AGUG cc----aguu 1381 GAAGGGCGUC UCUAGCCUUA GACAGCGGGA AGGUGGGC(C CAGCAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCUAG AGGC|CCCGA AGAUUGGUCG 1501 GGCUUAAGCC CGGC|U|GCC GAGACC|CAA G|cguggaga cu(caaug-) agucucca-- 1561 cgGGUAGGGG GG|CGCUGUG GU-GGGGUAG AAGGUGGG-U C(GUGA)GAU CCACUGG|AC 1621 CCGCCACAGG UGCAGAUCCC GGC|GGCAGU AACagcg-ag GAGGGG(UGA GAU-U)CCC- 1681 CUC|CGCCGG AA-GGGCCAG GGUUUCCCGG CAAUG-(GUC GU)CA|GCCG GGAGUGAGC| 1741 CGG|-UCCUA AGGCGAG-GC C(UAAUA)GG U--ACUCGCC GAAA-GGGAA GUGGG(UUAA 1801 UAUU)CCCAC |GC-ccuagg gg-uagg--- ugcg(-guaa )cgcaa-gcu ggac-ucccg 1861 acggaucugg guagggug|a guggg(---- -gc-caccgc ------)ccc auc|-caagc 1921 gcu-uaaguc CCUGGAGUG- CC(GUAAU)G G|UGAGAAGG G--gacga-a ggcgugaug- 1981 |ggn|guucc (--cuuu)gg ggacuu-cac ccaaucccag -|guccccuu gaaaagggag 2041 uccagaa--- cgauccccua g--g|A|CCG UACCGAGAAC C(GACACU)G G|UGCCCCUG 2101 GG(ugagaag )CCCAA-GGC GU||CUGGGG -GUUAACUCA GGCUAGGGAA CUCGGCAAAA 2161 UAGCCCCGUA CC(UUCG)GU AGAAGGGGU| GCCUAC---u aggguc-g-- -(--uaaac) 2221 ---cggccuc a--GUAGGU| C(GCA)GUGA CAAGAGGGAC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CUAGCC C(GAA-A)GG GUGUGAACGG GGGCUAAAUC CUGGCCACU| 2341 GGUGGUUGGU GAA----agc cggg---(uc ca)----acc ggc----cGA AGCC|CCACU 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAAUG GAUCAACGUG GUCCCUACUG UCCCAGCCUG 2521 GG-GCCCC|G UGAACGC|-C CAGA-GUGGG (-UUCACA-G U)CCCGCAAC UCCCCACACC 2581 GAGAGAAGAC CCCGUGG|AG CUUCACCGC| AGCCUGGCGC U|GCUCCUCG GGCGCCUAUG 2641 CGUAGAGUAG GUGGGAGGGG UCG---AACC CAUCCU(UUC G-)GGGAUGG G-GNACCCGA 2701 AGGUGAAACA CCACC|CAUG GGUGCUCGAG GAGCuuac-- --cugcc(-c gna-)ggugg 2761 ---ggacAGC GUCAGGUGGG CGGUUCGGCU G(GGG)CGGC AC|UCCCGC( GAAAAGAUAA 2821 C)ACGGGA|G |C|CCAAAGG UCGGC|UCAG GCGGU(ACAG AAC)GCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACGAGGC CCU(ccaaag uac-)GGGGU CUCGACGC(G 2941 AAG)GCGCGG |CCUAGCGAA CGCUCGUGU- CCC(-cccac gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|GGUCGUCG CGGGCAAGA( GCUCCCA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |ACCC|UGGG GGU(GCAGCU 3121 )GCCCCCAAG GGU|AGGGC( UGCCC)GCCC GUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GCGA )GACAGGUCG GACUCUAAGG |GUGGGGAG- |UG-UGGGCU GCCU|GA|GG 3241 GGAA-GGUAC CCC(AAGUAC (GAGA)GGAA C)AGGGUAC| CGGGGCCUCU AGUUUACCGG 3301 U|UGUCC-(G GUA)-GGGCA -GUGCCGGGC A-GCCACGCC CUG-UGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|UA A-AAGAGGCG GUC|a-|ggc ---------- 3421 ------g(gg uu-)a----- ---------- -ccg--gggg c|cc|GCCCC (UAGAAGA)G 3481 CGGGUUGAUG G|nnnnn|nn nnnnnnnnnn nnnnnnn-(n nnn--)-nnn nnnnnnnn|n 3541 nnnnnn|n|n nnnnnnnnnn nnnnnnnn-- |nnnnnnnnn nn-----||| || // LOCUS S.solfatar 3592 bp RNA RNA 11-JAN-191 DEFINITION Sulfolobus solfataricus. ACCESSION No information KEYWORDS No information. SOURCE Sulfolobus solfataricus. ORGANISM Sulfolobus solfataricus. REFERENCE 1 (sites) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: U32322 LOCUS SSU32322 3229 bp DNA BCT 15-AUG-1995 DEFINITION Sulfolobus solfataricus 23S rRNA gene, partial sequence. ACCESSION U32322 KEYWORDS . SOURCE Sulfolobus solfataricus. ORGANISM Sulfolobus solfataricus Archaea; Crenarchaeota; Sulfolobales; Sulfolobus. REFERENCE 1 (bases 1 to 3229) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. TITLE Phylogenetic analysis of the Archaeal order of Sulfolobales based on sequences of 23S rRNA genes and 16S/23S rRNA spacers JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 3229) AUTHORS Trevisanato,S.I. TITLE Direct Submission JOURNAL Submitted (25-JUL-1995) Siro I. Trevisanato, Princess Margaret Hospital, Ontario Cancer Institute, 500 Sherbourne Street, Toronto, Ontario M4X 1K9, Canada COMMENT NCBI gi: 942631 FEATURES Location/Qualifiers source 1..3229 /organism="Sulfolobus solfataricus" /isolate="DSM 1616" misc_feature <1..186 /note="16S/23S rRNA spacer region" rRNA 187..>3229 /product="23S rRNA" BASE COUNT 704 a 867 c 1117 g 527 t 14 others ORIGIN BASE COUNT 655 a 822 c 1069 g 484 t 562 others ORIGIN 1 |--------- acgacuaggg gc-ACCAAGC CAGCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|cUGACGAA GGG|CGCG(G CAAGU)G|GC GAAAUGCCCG GGG|UAGGC( ACACGCU)GC 121 CUUU-GAACC CGGGAUU|CC CCGAAUGGg- -a----CCU| gcccc----- ----(-uuu- 181 )--------- gg-ggc-gca cccgacu(c- -gnnna---- )gagucgggu guGGGAAC|U 241 CCCCGAACG| GA(AACA)UC UUAGUAGGGG AAGGAGGAGA AAUC(AA--C C)GAGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA UAGCCCAaac caaa-cccg- ccgcc(ggca 361 a)gucgg-ug gg-aaugugg uguuauuggc cuccagucu- -gguu(-uaa -)agcc-ggc 421 cucccuggcc uaccuagcc- GAAcucucc- (uggaau)-g gagggc|c(a ua)gaGGG(U 481 GACAGC)CCC GUA-ggcuaa agguaggugg gagg---uga cugga-ggca GCGUACCAUC 541 CCC(---ugg uuc)GGGGGU GGGAAG-UUG GGGG(Ccacg ug-)CCUCCA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC C(GUGA)GGG AAAGCUGCAA AG-ACCCC(G 661 GAAG)GGGG- GUG-AAAAGA GCCUGAAACC GGCUGGUCau aguag-ggca aggcuc--ga 721 aagga--gug aagucucccg aaggaaagag gc(gcna-)g ccu-----ag uacgag-gga 781 gaug-gaccg --gggucuug ccUUUCGU(C CUGAAA|C)A CGG|GCCGGG GAGUUCAC-G 841 CCAGUGGCGA GCCUAAgggg (--guuaa)c cccGAAGGCA UAGG(GAAA) CCGag-ugcc 901 cgcagcccgg -(gaaa)-cc nggugagggg cagGGUCU(g cca-)GGGCC UAAAGUCACU 961 GGCGUGAGGC UAGAAACCGG GCGAUCUAGG CCGGGG|CAG GCCGAAGCCG GGG(GAAA)C 1021 CUCGGUGGAG GGCCGnnuag gg-guucuga (cgugca-au )uc-guu|cc ccuGACCCCG 1081 GUCUAGGGGC (GAAAG)ACC AAUCUAGCUC GG-UGAUAGC UAGUUCCCCC CGAAAUGCGU 1141 (CCUA)GCGC AGC|CUUCCU gga-|ggu|- ugCCUACG|G GGUAG|AGua acUGAUUGGG 1201 GGU--UC|GG AGC(gaaa-) GCUCC-GG-G CUUCCAGUCA AACUCCGAAC U|CGUAGGcg 1261 ccga-aga|a GGGGAG-AGU GGGCCACU-C -GGC(G-UAA G)GUUG-GGU GGCGAAAGGG 1321 (AAACAG)CC CAGACUUG-G GUUAAGGC|C CCAAAAUGCC GGCUA-AGUG cc----aguu 1381 AAGGGGCGUC UCUAGCCUUA GACAGCGGGA AGGUGGGC(C CAGCAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCUAG AGGC|CCCGA AGAUUGGUCG 1501 GGGCUUAAGC CGGC|U|GCC GAGACC|CAA G|cguggagg cu(cauug-) agucucca-- 1561 ugGGUAGGGG GG|CGCUGUG GU-GGGGUAG AAGGUGGG-U C(GUGA)GAU CCACUGG|AC 1621 CCGCCACAGG UGCAGAUCCC GGC|GGCAGU AACagcg-ag GAGGGG(UAA GAC-U)CCC- 1681 CUC|CGCCGG AA-GGGCUAG GGUUUCCCGG CAAUG-(GUC GU)CA|GCCG GGAGUGAGC| 1741 CGG|-UCCUA AGGCGAG-GC C(UAAUA)GG U--ACUCGCC GAAA-GGGAA GUGGG(UUAA 1801 UAUU)CCCAC |GC-ccuagg gg-uagg--- cgcg(-guaa )cgcaa-gcu ggac-uccug 1861 acggaucugg guagggug|a guggg(---- -gc-caccgc ------)ccc auc|-caagc 1921 gcu-uaaguc CCUGGAGUG- CC(GUAAU)G G|UGAGAAGG G--gacga-a ggcgugaug- 1981 |ggn|guucc (--cuuu)gg ggacuu-cac ccaaucccag -|guucccuu gaaaagggag 2041 uccagaa--- cgauccccua g--g|A|CCG UACCGAGAAC C(GACACU)G G|UGCCCCUG 2101 GG(ugagaag )CCCAA-GGC GU||CUGGGG -GUUAACUCA GGCUAGGGAA CUCGGCAAAA 2161 UAGCCCCGUA CC(UUCG)GU AGAAGGGGU| GCCUAC---u gggguc-g-- -(--uaaac) 2221 ---cggccuc a--GUAGGU| C(GCA)GUGA CAAGGGGGAC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CUAGCC C(GAA-A)GG GUGUGUACGG GGGCUAAAUC CUGGCCACU| 2341 GGUGGAUGGU GAA----agc cggg---(uc ca)----acc ggu----cGA AGCC|CCACU 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAAUG GAUCAACGUG GUCCCUACUG UCCCAGCCUG 2521 GG-GCCCC|G UGAACGC|-C CAGA-GUGGG (-UUCACA-G U)CCCGCAAC UCCCCACACC 2581 GAGAGAAGAC CCCGUGG|AG CUUCACCGC| AGCCUGGCGC U|GCUCCUCG GGCGCCUAUG 2641 CGUAGAGUAG GUGGGAGGGG UCG---AACC CAUCCU(UUC G-)GGGAUGG G-GGACCCGA 2701 AGGUGAAACA CCACC|CAUG GGUGCUCGAG GAGCuuac-- -cugccc(-n nnn-)gggug 2761 g--ggacAGC GUCAGGUGGG CGGUUCGGCU G(GGG)CGGC AC|UCCCGC( GAAAAGAUAA 2821 C)ACGGGA|G |C|CCAAAGG UCGGC|UCAG GCGGU(ACAG AAC)GCCGCC GUGGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC UUGACGAGAC CCU(ccaaag uac-)GGGGU CUCGACGC(G 2941 AAA)GCGCGG |CCUAGCGAA CGCUCGUGC- CCC(-cccac gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|GGUCGUCG CGGGCGAGA( GCUCCCA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |ACCC|UGGG GGU(GCAGCU 3121 )GCCCCCAAG GGU|AGGGC( UGCCC)GCCC GUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GCGA )GACGGGUCG GACUCUAAGG |GUGGGGAG- |UG-UGGGCU GCCU|GA|GG 3241 GGAA-GGUAC CCC(AAGUAC (GAGA)GGAA C)AGGGUAC| CGGGGCCUCU AGUUUACCGG 3301 U|UGUCC-(G GUA)-GGGCA -CUGCCGGGC A-GCCACGCC CUG-UGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|GA A-AAGAGGCA GCC|a-|ugc ggg------- 3421 -------(gu uaa)------ --------cc cgcg--agag c|cc|ACCUC (UAGAAGA)G 3481 GGGGUUGAUG G|GNNCG|GG GUGUAAGUCC Cgagggc-(g aaa--)-guc cgaGGGAU|U 3541 UAGCCC|G|C GCCUCCCAau cgggcaag-- |ccc-ucuu- -------||| || // LOCUS S.acidoc.1 3592 bp RNA RNA 09-JAN-1985 DEFINITION Sulfolobus acidocaldarius subsp. Now this strain is known to be Sulfolobus acidocaldarius str. previously called Sulfolobus solfataricus. ACCESSION No information KEYWORDS No information. SOURCE Sulfolobus acidocaldarius subsp. Now this strain is known to be Sulfolobus acidocaldarius str. previously called Sulfolobus solfataricus. ORGANISM Sulfolobus acidocaldarius. REFERENCE 1 (sites) AUTHORS No information JOURNAL S.solfataricus:who:published STANDARD No information COMMENTS Organism information Culture collection: ? Sequence information (bases 1 to 3592) Corresponding GenBank entry: M67495 Phylo:Archaebacteria,Sulfur-dependent,Sulfolobus LOCUS SSORGLSA 3033 bp ds-DNA BCT 13-NOV-1991 DEFINITION S.solfataricus large subunit ribosomal RNA gene, complete cds. ACCESSION M67495 KEYWORDS large subunit ribosomal RNA. SOURCE Sulfolobus solfataricus (strain DSM 1616) DNA. ORGANISM Sulfolobus solfataricus Prokaryota; Bacteria; Mendosicutes; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 3033) AUTHORS Woese,C.R. JOURNAL Unpublished (1991) STANDARD full staff_entry FEATURES Location/Qualifiers rRNA <1..>3033 /product="large subunit ribosomal RNA" BASE COUNT 699 a 803 c 1029 g 502 t ORIGIN LOCUS SSORGLSA 3033 bp ds-DNA BCT 13-NOV-1991 DEFINITION S.solfataricus large subunit ribosomal RNA gene, complete cds. ACCESSION M67495 KEYWORDS large subunit ribosomal RNA. SOURCE Sulfolobus solfataricus (strain DSM 1616) DNA. ORGANISM Sulfolobus solfataricus Prokaryota; Archaeobacteria; Sulfolobales. REFERENCE 1 (bases 1 to 3033) AUTHORS Woese,C.R. JOURNAL Unpublished (1991) STANDARD full staff_entry FEATURES Location/Qualifiers rRNA <1..>3033 /product="large subunit ribosomal RNA" BASE COUNT 699 a 803 c 1029 g 502 t BASE COUNT 699 a 803 c 1029 g 502 t 559 others ORIGIN 1 |--------- -uacccaggg g--ccGAAGC CUCCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 a|cCGAAGAA GGG|CGCG(G CAAGC)A|GC GAAAUGCUCG GGU|GAGGC( GCAAGCA)GC 121 CGUU-GACCC CGAGGUC|CC -CUAAUGGG( AUAU-)CCU| gccgg----- ----(guuu- 181 )--------- cc-ggc-gcu cccgguu(-- -uau------ )-aacuggga guGGGAAC|C 241 CCCCGAACG| GA(AACA)UC UUAGUAGGGG GAGGAAAAGA AAUC(AA--U U)GAGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA CAGCCCAaac uaaa-ccug- ccgau(gaua 361 a)gucgg-ug gg-gaugugg uguuac-gac cucuagccu- -gagg(uucg a)ccuc-ggc 421 uuuccuaacc uaucuagcc- GAAcucccc- (uggaac)-g ggggcc|c(a ua)gaGGG(U 481 GAAAGC)CCC GUA-ggcuaa agauaggugg aaag---ugg cuaga-ggua GAGUACCAUC 541 CCC(---ugg uuu)GGGGGU GGGAAG-UUA GGGG(Acacg uG-)CCUCUA AGGCUAAAUA 601 ugu-CCCGAG ACCGAUAGCA AACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGAACCCC(g 661 gaag)GGGGA GUGCCAAAGA GCCUGAAACC GGGUGGUUau a-cag-ggug uggcuc--ga 721 aagaa--gug aacccuuccg aaggaa-agg gc(gcaa-)g cccu---uag uacgag-gaa 781 gggc-gaucg --gggucacg ccuUUCGU(C UUGAAA|C)A CGG|GCCGGG GAGUUCAC-A 841 UCAGUGGCGA GCUUAAggag (--auuau)c uccGAAGGCA UAGG(GAAA) CCAag-ugcc 901 cgcagccuag -(uuu-)-cu aggcgagggg cagGGUCU(g uca-)GGGCC UGAAGCCACU 961 GAUGUGAGGC UAGAAACCGG GCGAUCUAGU CCGGGG|CAG GCUGAAGGUG GGG(GAAA)C 1021 CCCACUGGAG GGCCGaauag gg-guucuga (cgugca-au )uc-guu|cc cuuGACCUCG 1081 GACUAGGGGC (AAAAG)ACC AAUCUAGCCC GG-UGAUAGC UAGUUCCCCC CGAAAUGCGU 1141 (CCUA)GCGC AGC|CUCCCU aaa-|ggc|- agCUCGCG|G GGUAG|AGUG ACAGAUCGGG 1201 ggc--UC|CA GGC(gaaa-) GCCUG-GG-G CUUCCGGUCU AACUCCGAAC C|CACGAGcg 1261 ccga-aga|a GGGGGG-AGU GGGUCACU-C -GGC(G-UAA G)GUUG-GGU GGCAAAAGGG 1321 (GAACAA)CC CAGACCUG-G GUUAAGGC|C CCAAAGUCCC GGCUA-AGUG cc----aacg 1381 AAAAGGCGUC UCCAGCCUUA GACAGCGGGA AGGUGGGC(C CAGCAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCUGG AGGC|CCUAA AGAUUGGUCG 1501 GGGCUCAAGC CGGG|C|GCC GAGACC|CAG G|aggggu-c uc(uacua-) gag-aucc-u 1561 cuGGUAGGGG GG|CGCUGUG AU-GGGGUAG AAGGUGGG-U C(GUGA)GAU CCACUGG|AC 1621 CCGUCACAGG UGCAGAUCCC GGC|GGUAGU AACagcg-aa GGGGGG(UGA GAA-U)CCC- 1681 CCU|CGCCGG AA-GGGCAAG GGAUUCCCGG CAACG-(UUC GU)CA|GCCG GGAGUUAGC| 1741 CGG|-uCCUA AGGUAGG-GC C(UAAUA)GG U--ACCUACC GAAA-GGGAA AGGGG(UUAA 1801 CAUU)CCCCU |GC-cucccg gg-uagg--- ugcg(-guaa )cgcaa-gcc agac-uccug 1861 acggauuggg guagggag|a guagg(---- -ac-caccgu ------)ccu acc|-caagc 1921 acu-caagcc CUUGGAGAG- CC(GUAAC)G G|UGAGAAGA G--ggcga-a ggugugaug- 1981 |ggc|cuucc (--guua)gg aggguu-cuc cugaucccua -|guccccau gaaaagggag 2041 ucuggaa--- cgaucccggg a--g|A|CCG UACCUAGAAC C(GACACU)G G|UGCCCCUG 2101 GG(ugagaag )CCCAA-GGC Gu||cuGAGG -GGUAACCCA GGCUAGGGAA CUCGGCAAAU 2161 UAGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUAU---c gugguu-u-- -(--aaca-) 2221 ---aagccac g--AUAGGU| C(GCA)GUGA CCAGAGGGAC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CUAGCC C(GAA-A)GG GUGUGUACGG GGGCUAAAUC CUGGCCACU| 2341 GGUGGUUGGU UAA----auc cggg---(uu ca)----acc ggg----cGA AGCC|CCACC 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAAUG GAUCAACGCG GUCCCUACUG UCCCAGCCUG 2521 GG-GCCUC|G UGAACGC|-C CUGA-GCCGG (-UGCACA-G U)CCGGCAUC UCCCUACACC 2581 GAGAGAAGAC CCCGUGG|AG CUUCACCGC| AGCCUGGCGU U|GUCCCUCG GGCGUUUAUG 2641 CGUAGAGUAG GUGGGAGGGG UCG---AACC UUGCCU(UUC Gg)GGGCAGG G--GACCCGA 2701 AAGUGAAACA CCACC|CAUG GACGCUCGAG GGACuaac-- --cucuc(-g aaa-)gagag 2761 ---gaacAAC GUCAGGUGGG CGGUUCGGCU G(GGG)CGGC AC|UCCCGC( GAAAAGAUAA 2821 C)ACGGGA|G |C|CCAAAGG UCGGC|UCAG GCGGU(ACAG AAC)GCCGCC GUAGA|GC(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACGUGAC CCU(uccaag uac-)GGGGU CACGACGC(G 2941 AAA)GCGGGG |CCUAGCGAA CGCUCGuGC- CCC(-cacac gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|GGUCGUCG CGGGCGAGA( GCUCACA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |ACCC|UGGA GGU(GCAGCU 3121 )GCCUCCAAG GGU|AGGGC( UGCCC)GCCC GUUAAAGGGG AGCGUGAGCU GGGUUUAGAC 3181 C|GUC(GCGA )GACAGGUCG GACUCUAAGG |GUAGGGAG- |UG-CGGACC GCUU|GA|GG 3241 GGAA-GGAAC CCC(UAGUAC (GAGA)GGAA C)AGGGUUC| CGGGGCCUCC AGUUUACCGG 3301 U|UGUCC-(G GUA)-GGGCA -auGCCGGGC A-GCCGCGCC CUG-AGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|UA A-AAGAGGCG GUC|a-|ggc ---------- 3421 -------(gu ua-)------ ---------- -gcc--gggg c|cu|UCCCC (UAGAAGA)G 3481 GGGGUUGAUG G|GGUGG|GG AUGUAAGCUC Caagguu-(g uaa--)-gac cgaGGAGU|U 3541 UAGUCC|G|C CACUCCCAau caggcucc-- |ccccugg-- -------||| || // LOCUS S.acidoc.2 3592 bp RNA RNA 11-JAN-191 DEFINITION Sulfolobus acidocaldarius. ACCESSION No information KEYWORDS No information. SOURCE Sulfolobus acidocaldarius. ORGANISM Sulfolobus acidocaldarius. REFERENCE 1 (sites) AUTHORS Durovic,P.V. and Dennis,P.P. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: U05018 LOCUS SAU05018 7151 bp DNA BCT 16-MAR-1994 DEFINITION Sulfolobus acidocaldarius 16S and 23S rRNA genes, complete sequence. ACCESSION U05018 KEYWORDS . SOURCE Sulfolobus acidocaldarius ORGANISM Sulfolobus acidocaldarius Archaea; Crenarchaeota; Sulfolobales; Sulfolobus. REFERENCE 1 (bases 1 to 7151) AUTHORS Durovic,P.V. and Dennis,P.P. TITLE Separate pathways for excision and processing of 16S and 23S rRNA from the primary rRNA operon transcript from the hyperthermophilic archaebacterium Sulfolobus acidocaldarius: similarities to eukaryotic rRNA processing JOURNAL Unpublished STANDARD full automatic REFERENCE 2 (bases 1 to 7151) AUTHORS Durovic,P.V. TITLE Direct Submission JOURNAL Submitted (12-JAN-1994) Peter V. Durovic, Biochemistry and Molecular Biology, University of British Columbia, Faculty of Medicine, 2146 Health Sciences Mall, Vancouver, British Columbia V6T 1Z3, Canada STANDARD full automatic COMMENT NCBI gi: 460149 FEATURES Location/Qualifiers rRNA 850..2403 /product="16S ribosomal RNA" /note="The 16S rRNA in this organism is highly unusual in that it retains an extra 61 nucleotides at its 3' end of intergenic spacer sequence that in all other characterized organisms is removed by processing and assembly into small ribosome subunits" rRNA 2539..5580 /product="23S ribosomal RNA" /note="The predominant 5' end is at nucleotide 2539, shorter forms isolated starting between 2539 and 2550" source 1..7151 /organism="Sulfolobus acidocaldarius" BASE COUNT 1872 a 1593 c 2152 g 1534 t ORIGIN BASE COUNT 700 a 803 c 1033 g 506 t 550 others ORIGIN 1 |----gaguu cuacccaggg g--ccGAAGC CUCCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 a|cCGAAGAA GGG|CGCG(G CAAGC)A|GC GAAAUGCUCG GGU|GAGGC( GCAAGCA)GC 121 CGUU-GACCC CGAGGUC|CC -CUAAUGGG( AUAU-)CCU| gccgg----- ----(guuu- 181 )--------- cc-ggc-gcu cccgguu(-- -uau------ )-aacuggga guGGGAAC|C 241 CCCCGAACG| GA(AACA)UC UUAGUAGGGG GAGGAAAAGA AAUC(AA--U U)GAGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA CAGCCCAaac uaaa-ccug- ccgau(gaua 361 a)gucgg-ug gg-gaugugg uguuac-gac cucuagccu- -gagg(uucg a)ccuc-ggc 421 uuuccuaacc uaucuagcc- GAAcucccc- (uggaac)-g ggggcc|c(a ua)gaGGG(U 481 GAAAGC)CCC GUA-ggcuaa agauaggugg aaag---ugg cuaga-ggua GAGUACCAUC 541 CCC(---ugg uuu)GGGGGU GGGAAG-UUA GGGG(Acacg ug-)CCUCUA AGGCUAAAUA 601 ugu-CCCGAG ACCGAUAGCA AACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGAACCCC(g 661 gaag)GGGGA GUGCCAAAGA GCCUGAAACC GGGUGGUUau a-cag-ggug uggcuc--ga 721 aagaa--gug aacccuuccg aaggaa-agg gc(gcaa-)g cccu---uag uacgag-gaa 781 gggc-gaucg --gggucacg ccuUUCGU(C UUGAAA|C)A CGG|GCCGGG GAGUUCAC-A 841 UCAGUGGCGA GCUUAAggag (--auuau)c uccGAAGGCA UAGG(GAAA) CCAag-ugcc 901 cgcagccuag -(uuu-)-cu aggcgagggg cagGGUCU(g uca-)GGGCC UGAAGCCACU 961 GAUGUGAGGC UAGAAACCGG GCGAUCUAGU CCGGGG|CAG GCUGAAGGUG GGG(GAAA)C 1021 CCCACUGGAG GGCCGaauag gg-guucuga (cgugca-au )uc-guu|cc cuuGACCUCG 1081 GACUAGGGGC (AAAAG)ACC AAUCUAGCCC GG-UGAUAGC UAGUUCCCCC CGAAAUGCGU 1141 (CCUA)GCGC AGC|CUCCCU aaa-|ggc|- agCUCGCG|G GGUAG|AGUG ACAGAUCGGG 1201 GGC--UC|CA GGC(gaaa-) GCCUG-GG-G CUUCCGGUCU AACUCCGAAC C|CACGAGcg 1261 ccga-aga|a GGGGGG-AGU GGGUCACU-C -GGC(G-UAA G)GUUG-GGU GGCAAAAGGG 1321 (GAACAA)CC CAGACCUG-G GUUAAGGC|C CCAAAGUCCC GGCUA-AGUG cc----aacg 1381 AAAAGGCGUC UCCAGCCUUA GACAGCGGGA AGGUGGGC(C CAGCAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCUGG AGGC|CCUAA AGAUUGGUCG 1501 GGGCUCAAGC CGGG|C|GCC GAGACC|CAG G|agggguuc uc(uacua-) gag-aucc-u 1561 cuGGUAGGGG GG|CGCUGUG AU-GGGGUAG AAGGUGGG-U C(GUGA)GAU CCACUGG|AC 1621 CCGUCACAGG UGCAGAUCCC GGC|GGUAGU AACagcg-aa GGGGGG(UGA GAA-U)CCC- 1681 CCU|CGCCGG AA-GGGCAAG GGAUUCCCGG CAACG-(UUC GU)CA|GCCG GGAGUUAGC| 1741 CGG|-UCCUA AGGUAGG-GC C(UAAUA)GG U--ACCUACC GAAA-GGGAA AGGGG(UUAA 1801 CAUU)CCCCU |GC-cucccg gg-uagg--- ugcg(-guaa )cgcaa-gcc agac-uccug 1861 acggauuggg guagggag|a guagg(---- -ac-caccgu ------)ccu acc|-caagc 1921 acu-caagcc CUUGGAGAG- CC(GUAAC)G G|UGAGAAGA G--GGCga-a ggugugaug- 1981 |ggc|cuucc (--guua)gg aggguu-cuc cugaucccua -|guccccau gaaaagggag 2041 ucuggaa--- cgaucccggg a--g|A|CCG UACCUAGAAC C(GACACU)G G|UGCCCCUG 2101 GG(ugagaag )CCCAA-GGC Gu||cuGAGG -GGUAACCCA GGCUAGGGAA CUCGGCAAAU 2161 UAGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUAU---c gugguu-u-- -(--aaca-) 2221 ---aagccac g--AUAGGU| C(GCA)GUGA CCAGAGGGAC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CUAGCC C(GAA-A)GG GUGUGUACGG GGGCUAAAUC CUGGCCACU| 2341 GGUGGUUGGU UAA----auc cggg---(uu ca)----acc ggg----cGA AGCC|CCACC 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAAUG GAUCAACGCG GUCCCUACUG UCCCAGCCUG 2521 GG-GCCUC|G UGAACGC|-C CUGA-GCCGG (-UGCACA-G U)CCGGCAUC UCCCUACACC 2581 GAGAGAAGAC CCCGUGG|AG CUUCACCGC| AGCCUGGCGU U|GUCCCUCG GGCGUUUAUG 2641 CGUAGAGUAG GUGGGAGGGG UCG---AACC UUGGUU(UCG G-)GGGCAGG G--GACCCGA 2701 AAGUGAAACA CCACC|CAUG GACGCUCGAG GGACuaac-- --cucuc(-g aaa-)gagag 2761 ---gaacAAC GUCAGGUGGG CGGUUCGGCU G(GGG)CGGC AC|UCCCGC( GAAAAGAUAA 2821 C)ACGGGA|G |C|CCAAAGG UCGGC|UCAG GCGGU(ACAG AAC)GCCGCC GUAGA|GC(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACGUGAC CCU(uccaag uac-)GGGGU CACGACGC(G 2941 AAA)GCGGGG |CCUAGCGAA CGCUCGUGC- CCC(-cacac gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|GGUCGUCG CGGGCGAGA( GCUCACA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |ACCC|UGGA GGU(GCAGCU 3121 )GCCUCCAAG GGU|AGGGC( UGCCC)GCCC GUUAAAGGGG AGCGUGAGCU GGGUUUAGAC 3181 C|GUC(GCGA )GACAGGUCG GACUCUAAGG |GUAGGGAG- |CUCGGGACC GCUU|GA|GG 3241 GGAAGGGAAC CCC(UAGUAC (GAGA)GGAA C)AGGGUUC| CGGGGCCUCC AGUUUACCGG 3301 U|UGUCC-(G GUA)-GGGCA -auGCCGGGC A-GCCGCGCC CUG-AGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|UA A-AAGAGGCG GUC|a-|ggc ---------- 3421 -------(gu ua-)------ ---------- -gcc--gggg c|cu|UCCCC (UAGAAGA)G 3481 GGGGUUGAUG G|GGUGG|GG AUGUAAGCUC Caagguu-(g uaa--)-gac cgaGGAGU|U 3541 UAGUCC|G|C CACUCCCAau caggcucc-- |ccccuggu- -------||| || // LOCUS S.acidoc.3 3592 bp RNA RNA 11-JAN-191 DEFINITION Sulfolobus acidocaldarius. ACCESSION No information KEYWORDS No information. SOURCE Sulfolobus acidocaldarius. ORGANISM Sulfolobus acidocaldarius. REFERENCE 1 (sites) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: U32320 LOCUS SAU32320 3238 bp DNA BCT 15-AUG-1995 DEFINITION Sulfolobus acidocaldarius 23S rRNA gene, partial sequence. ACCESSION U32320 KEYWORDS . SOURCE Sulfolobus acidocaldarius. ORGANISM Sulfolobus acidocaldarius Archaea; Crenarchaeota; Sulfolobales; Sulfolobus. REFERENCE 1 (bases 1 to 3238) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. TITLE Phylogenetic analysis of the Archaeal order of Sulfolobales based on sequences of 23S rRNA genes and 16S/23S rRNA spacers JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 3238) AUTHORS Trevisanato,S.I. TITLE Direct Submission JOURNAL Submitted (25-JUL-1995) Siro I. Trevisanato, Princess Margaret Hospital, Ontario Cancer Institute, 500 Sherbourne Street, Toronto, Ontario M4X 1K9, Canada COMMENT NCBI gi: 942629 FEATURES Location/Qualifiers source 1..3238 /organism="Sulfolobus acidocaldarius" /isolate="DSM 639" misc_feature <1..203 /note="16S/23S rRNA spacer region" rRNA 204..>3238 /product="23S rRNA" BASE COUNT 750 a 849 c 1080 g 559 t ORIGIN BASE COUNT 700 a 803 c 1030 g 502 t 557 others ORIGIN 1 |--------- -uacccaggg g--CCGAAGC CUCCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 a|cCGAAGAA GGG|CGCG(G CAAGC)A|GC GAAAUGCUCG GGU|GAGGC( GCAAGCA)GC 121 CGUU-GACCC CGAGGUC|CC -CUAAUGGG( AUAU-)CCU| gccgg----- ----(guuu- 181 )--------- cc-ggc-gcu cccgguu(-- -uau------ )-aacuggga guGGGAAC|C 241 CCCCGAACG| GA(AACA)UC UUAGUAGGGG GAGGAAAAGA AAUC(AA--U U)GAGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA CAGCCCAaac uaaa-ccug- ccgau(gaua 361 a)gucgg-ug gg-gaugugg uguuac-gac cucuagccu- -gagg(uucg a)ccuc-ggc 421 uuuccuaacc uaucuagcc- GAAcucccc- (uggaac)-g ggggcc|c(a ua)gaGGG(U 481 GAAAGC)CCC GUA-ggcuaa agauaggugg aaag---ugg cuaga-ggua GAGUACCAUC 541 CCC(---ugg uuu)GGGGGU GGGAAG-UUA GGGG(Acacg ug-)CCUCUA AGGCUAAAUA 601 ugu-CCCGAG ACCGAUAGCA AACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGAACCCC(G 661 GAAG)GGGGA GUG-CAAAGA GCCUGAAACC GGGUGGUUau a-cag-ggug uggcuc--ga 721 aagaa--gug aacccuuccg aaggaaaggg gc(gcaa-)g cccu---uag uacgag-gaa 781 gggc-gaucg --gggucacg ccUUUCGU(C UUGAAA|C)A CGG|GCCGGG GAGUUCAC-A 841 UCAGUGGCGA GCUUAAggag (--auuau)c uccGAAGGCA UAGG(GAAA) CCAag-ugcc 901 cgcagccuag -(uuu-)-cu aggcgagggg cagGGUCU(g uca-)GGGCC UGAAGCCACU 961 GAUGUGAGGC UAGAAACCGG GCGAUCUAGU CCGGGG|CAG GCUGAAGGUG GGG(GAAA)C 1021 CCCACUGGAG GGCCGaauag gg-guucuga (cgugca-au )uc-guu|cc cuuGACCUCG 1081 GACUAGGGGC (AAAAG)ACC AAUCUAGCCC GG-UGAUAGC UAGUUCCCCC CGAAAUGCGU 1141 (CCUA)GCGC AGC|CUCCCU aaa-|ggc|- agCUCGCG|G GGUAG|AGug acAGAUCGGG 1201 GGC--UC|CA GGC(gaaa-) GCCUG-GG-G CUUCCGGUCU AACUCCGAAC C|CACGAGcg 1261 ccga-aga|a GGGGGG-AGU GGGUCACU-C -GGC(G-UAA G)GUUG-GGU GGCAAAAGGG 1321 (GAACAA)CC CAGACCUG-G GUUAAGGC|C CCAAAGUCCC GGCUA-AGUG cc----aacg 1381 AAAAGGCGUC UCCAGCCUUA GACAGCGGGA AGGUGGGC(C CAGCAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCUGG AGGC|CCUAA AGAUUGGUCG 1501 GGGCUCAAGC CGGG|C|GCC GAGACC|CAG G|aggggguc uc(uacua-) gagauccu-- 1561 cuGGUAGGGG GG|CGCUGUG AU-GGGGUAG AAGGUGGG-U C(GUGA)GAU CCACUGG|AC 1621 CCGUCACAGG UGCAGAUCCC GGC|GGUAGU AACagcg-aa GGGGGG(UGA GAA-U)CCC- 1681 CCU|CGCCGG AA-GGGCAAG GGAUUCCCGG CAACG-(UUC GU)CA|GCCG GGAGUUAGC| 1741 CGG|-UCCUA AGGUAGG-GC C(UAAUA)GG U--ACCUACC GAAA-GGGAA AGGGG(UUAA 1801 CAUU)CCCCU |GC-cucccg gg-uagg--- ugcg(-guaa )cgcaa-gcc agac-uccug 1861 acggauuggg guagggag|a guagg(---- -ac-caccgu ------)ccu acc|-caagc 1921 acu-caagcc CUUGGAGAG- CC(GUAAC)G G|UGAGAAGA G--ggcga-a ggugugaug- 1981 |ggc|cuucc (--guua)gg aggguu-cuc cugaucccua -|guccccau gaaaagggag 2041 ucuggaa--- cgaucccggg a--g|A|CCG UACCUAGAAC C(GACACU)G G|UGCCCCUG 2101 GG(ugagaag )CCCAA-GGC GU||CUGAGG -GGUAACCCA GGCUAGGGAA CUCGGCAAAU 2161 UAACCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUAU---c gugguu-u-- -(--aaca-) 2221 ---aagccac g--AUAGGU| C(GCA)GUGA CCAGAGGGAC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CUAGCC C(GAA-A)GG GUGUGUACGG GGGCUAAAUC CUGGCCACU| 2341 GGUGGUUGGU UAA----auc cggg---(uu ca)----acc ggg----cGA AGCC|CCACC 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAAUG GAUCAACGCG GUCCCUACUG UCCCAGCCUG 2521 GG-GCCUC|G UGAACGC|-C CUGA-GCCGG (-UGCACA-G U)CCGGCAUC UCCCUACACC 2581 GAGAGAAGAC CCCGUGG|AG CUUCACCGC| AGCCUGGCGU U|GUCCCUCG GGCGUUUAUG 2641 CGUAGAGUAG GUGGGAGGGG UCG---AACC UGUCCU(UUC G-)GGGGCAG G-GGACCCGA 2701 AAGUGAAACA CCACC|CAUG GACGCUCGAG GGACuaac-- --cucuc(-g aaa-)gagag 2761 ---gaacAAC GUCAGGUGGG CGGUUCGGCU G(GGG)CGGC AC|UCCCGC( GAAAAGAUAA 2821 C)ACGGGA|G |C|CCAAAGG UCGGC|UCAG GCGGU(ACAG AAC)GCCGCC GUAGA|GC(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACGUGAC CCU(uccaag uac-)GGGGU CACGACGC(G 2941 AAA)GCGGGG |CCUAGCGAA CGCUCGUGC- CCC(-cacac gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|GGUCGUCG CGGGCGAGA( GCUCACA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |ACCC|UGGA GGU(GCAGCU 3121 )GCCUCCAAG GGU|AGGGC( UGCCC)GCCC GUUAAAGGGG AGCGUGAGCU GGGUUUAGAC 3181 C|GUC(GCGA )GACAGGUCG GACUCUAAGG |GUAGGGAGc |UG-CGGACC GCUU|GA|GG 3241 GGAA-GGAAC CCC(UAGUAC (GAGA)GGAA C)AGGGUUC| CGGGGCCUCC AGUUUACCGG 3301 U|UGUCC-(G GUA)-GGGCA -AUGCCGGGC A-GCCGCGCC CUG-AGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|UA A-AAGAGGCG GUC|a-|ggc ---------- 3421 -------(gu ua-)------ ---------- -gcc--gggg c|cu|UCCCC (UAGAAGA)G 3481 GGGGUUGAUG G|GGUGG|GG AUGUAAGCUC Caagguu-(g uaa--)-gac cgaGGAGU|U 3541 UAGUCC|G|C CACUCCCAau caggcucc-- |ccccugg-- -------||| || // LOCUS S.azoricus 3592 bp RNA RNA 11-JAN-191 DEFINITION Stygiolobus azoricus. ACCESSION No information KEYWORDS No information. SOURCE Stygiolobus azoricus. ORGANISM Stygiolobus azoricus. REFERENCE 1 (sites) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: U32319 LOCUS SAU32319 3113 bp DNA BCT 15-AUG-1995 DEFINITION Stygiolobus azoricus 23S rRNA gene, partial sequence. ACCESSION U32319 KEYWORDS . SOURCE Stygiolobus azoricus. ORGANISM Stygiolobus azoricus Archaea; Crenarchaeota; Sulfolobales; Stygiolobus. REFERENCE 1 (bases 1 to 3113) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. TITLE Phylogenetic analysis of the Archaeal order of Sulfolobales based on sequences of 23S rRNA genes and 16S/23S rRNA spacers JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 3113) AUTHORS Trevisanato,S.I. TITLE Direct Submission JOURNAL Submitted (25-JUL-1995) Siro I. Trevisanato, Princess Margaret Hospital, Ontario Cancer Institute, 500 Sherbourne Street, Toronto, Ontario M4X 1K9, Canada COMMENT NCBI gi: 942628 FEATURES Location/Qualifiers source 1..3113 /organism="Stygiolobus azoricus" /strain="FC6" /isolate="DSM 6296" misc_feature <1..204 /note="16S/23S rRNA spacer region" rRNA 205..>3113 /product="23S rRNA" BASE COUNT 700 a 862 c 1078 g 472 t 1 others ORIGIN BASE COUNT 651 a 809 c 1022 g 426 t 684 others ORIGIN 1 |--------- --aacccagg gc-ACCAUGC CGCCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|cCGAAGAA GGG|CGUG(G CAAGC)G|GC GAAAUGCCCG GGU|GAGGC( GCAACGA)GC 121 CGUA-GACCC CGGGAUU|CC -CGAAUGGG( AUAU-)CCU| gccgg----- ----(gucu- 181 )--------- cc-ggc-gcu ccc----(-- -ucau----- )-----ggga gcGGGAAC|C 241 CCCCGAACG| GA(AACA)UC UUAGUAGGGG GAGGAAAAGA AAUC(AA--U C)GAGA-U|C 301 CCCUGAGUAG G(GGCGA)CC GAAAGGGGGA CAGCCCAaac caaa-uccg- ucggc(gaaa 361 a)gucgg-cg ga-gaugugg uguuaucggc ccuagguc-- -aggg(gcaa -)ccc--aac 421 cucccuagcc caccuagcc- GAAcucucc- (uggaac)-g gagggc|c(a aa)gaGGG(U 481 GAUAGC)CCC GUA-ggcuaa agguggguga gagg---uga ccuag-ggca GAGUACCAUC 541 CCC(---ugg uuu)GGGGGU GGGAAG-UUG GGGG(Acaug cg-)CCUCCA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGAACCCC(G 661 GGAG)GGGGA GUG-AAAAGA GCCUGAAACC GGGUGGUUau a-cag-ggca uggccc--ga 721 aagag--aug aguccucccg aaggaaaggg ac(gcga-)g uccu---gag uacgag-gga 781 ggag-gaucg --gggucaug ccUUUCGU(C UUGAAA|C)A CGG|GCCGGG GAGUUCAU-G 841 CCAGUGGCGA GCCUAAgggg (--gucaa)c cccGAAGGCG UAGG(GAAA) CCGaa-cgcc 901 cgcagccggg -(naaa)-cc cggcgagggg cggGGUCU(g uca-)GGGCC UGAAGUCACU 961 GGCAUGAGGC UAGAAACCGG GCGAUCUAGU CCGGGG|CAG GCCGAAGGCG GGG(GAAA)C 1021 CCCGCUGGAG GGCCGaauag gg-guucuga (cgugca-au )uc-guu|cc ccuGACCCCG 1081 GACUAGGGGC (AAAAG)ACC AAUCUAGCCC GG-UGAUAGC UAGUUCCCCC CGAAAUGCGU 1141 (CCUA)GCGC AGC|CUCCCU gga-|ggu|- cgCCCGCG|G GGUAG|AGug acAGAUCGGG 1201 GGC--UC|CA AGC(gaaa-) GCUUG-GG-G CUCUCGGUCU AACUCCGAAC C|UACGGGca 1261 ccga-aga|a GGGGGG-AGU GGGUCACC-C -GGC(G-UAA G)GUUG-GGU GGCAAAAGGG 1321 (GAACAA)CC CAGACCCG-G GUUAAGGC|C CCAAAAUGCC GGCUA-AGUG cc----aauc 1381 GAAGGGCGUC CCCAGCCCCA GACAGCGGGA AGGUGGGC(C CAGCAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCUGG GGGC|CCCGA AGAUUGGUCG 1501 GGGCUCAAGC CGGC|U|GCC GAGACC|CGG G|uagugggg cu(cacug-) agccucac-- 1561 uaGGUAGGGG GG|CGCUGCG GU-GGGGUAG AAGGUGGG-C C(GUGA)GGU CCGCUGG|AC 1621 UCGCCGCAGG UGCAGAUCCC GGC|GGCAGU AACagcg-aa GAGGGG(UGA GAA-U)CCC- 1681 CUC|CGCCGG AA-GGGCAAG GGUUUCCUGA CAAUG-(GUC GU)CA|GUCA GGAGUUAGC| 1741 CGG|-UCCUA AGGCAGG-GC C(CAAUA)GG U--ACCUGCC GAAA-GGGAA AGGGG(UUAA 1801 UAUU)CCCCU |GC-cucccc ug-uagg--- ugcg(-guaa )cgcaa-gcc gggc-uccug 1861 acggaucggg auaggggg|a guagg(---- -gc-aaccgc ------)ccu auc|-caagc 1921 acu-uaagcc CCUGGAGAA- CC(GUAAC)G G|UGAGAAGG G--ggcga-a ggugugaug- 1981 |ggc|ccucc (--guua)gg aggguu-ccc cugaauccug -|guccccau gaaaagggag 2041 cccggaa--- cgaucggggg a--g|A|CCG UACCUAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagaag )CCCAA-GGC GU||CUGGGG -GGUAACCCA GGCUAGGGAA CUCGGCAAAU 2161 UAGCCCUGUA CC(UUCG)GA AGAAGGGGU| GCCUAU---c aggguc-g-- -(--caaaa) 2221 ---cggcccu g--AUAGGU| C(GCA)GUGA CAAGGGGGAC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CUAGCC C(GAA-A)GG GUGUGUACGG GGGCUAAAUC CUGGCCACU| 2341 GGCGGUCGGU GAA----acc cggg---(uc ca)----acc ggg----cGA AGCC|CCGCU 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAAUG GAUCAAAGCG GUCCCCACUG UCCCAGCCUG 2521 GG-GCCCC|G UGAACGC|-C CUGA-GCCGG (-UGCACA-G U)CCGGCAAC CCCCUACACC 2581 GAGAGAAGAC CCCGUAG|AG CUUCACCGC| AGCCUGGCAU U|GUCCUCCG GGCAUUCAUG 2641 CGUAGCGUAG GUGGGAGGGG UCG---AACC CACUCC(UUC G-)GGGGUGG G-GGACCCGA 2701 AAGUGAAACA CCACC|CAUG AGUGCUCGGA GGACuaac-- ---cccu(-- uau-)gggg- 2761 ---gaacAGU GUCAGGUGGG CGGUUCGGCU G(GGG)CGGC AC|UCCCGC( GAAAAGAUAA 2821 C)ACGGGA|G |C|CCAAAGG UCGGC|UCAG GCGGU(ACAG AAC)GCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACGAGGC CCU(cccaag uac-)GGGGC CUCGACGC(G 2941 AAA)GCGCGG |CCUAGCGAA CGCUCGUGC- CCC(-cacgu aug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCU|CG GG|GAUAACA G|GGUCGUCG CGGGCGAGA( GCUCACA)UC 3061 GACCCCGCGG UUUGCUACAU CGAUGUC|GG CU|C|UUCCC |ACCC|UGGG GGU(GCAGCU 3121 )GCCCCCAAG GGU|AGGGC( UGCCC)GCCC GUUAAAGGGG AGCGUGACGU GGGUUUAGAC 3181 C|GUC(GCGA )GACAGGUCG GACUCUAAGG |GUAGGGGG- |UG-CGGACC GCCU|GC|GG 3241 GAAA-GGAAC CCC(UAGUAC (GAGA)GGAA C)AGGGUUC| CGGGGCCUCC AGUUUACCGG 3301 U|UGUCC-(G GUA)-GGGCA -CUGCCGGGC A-GCCGCGCC CUG-AGGGGU AACCGCU(GA 3361 AAGCAUCUA) AGCGGGAACC CCUCCCC|UA A-AAGAGGCG GUC|nn|nnn nnnnnnnnn- 3421 -------(nn n--)------ --nnnnnnnn nnnn--nnnn n|nn|nnnnn (nnnnnnn)n 3481 nnnnnnnnnn n|nnnnn|nn nnnnnnnnnn nnnnnnnn(n nnnn-)nnnn nnnnnnnn|n 3541 nnnnnn|n|n nnnnnnnnnn nnnnnnnnnn |nnnnnnnnn nn-----||| || // LOCUS A.infernus 3592 bp RNA RNA 11-JAN-191 DEFINITION Acidianus infernus. ACCESSION No information KEYWORDS No information. SOURCE Acidianus infernus. ORGANISM Acidianus infernus. REFERENCE 1 (sites) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: U32318 LOCUS AIU32318 3140 bp DNA BCT 15-AUG-1995 DEFINITION Acidianus infernus 23S rRNA gene, partial sequence. ACCESSION U32318 KEYWORDS . SOURCE Acidianus infernus. ORGANISM Acidianus infernus Archaea; Crenarchaeota; Sulfolobales; Acidianus. REFERENCE 1 (bases 1 to 3140) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. TITLE Phylogenetic analysis of the Archaeal order of Sulfolobales based on sequences of 23S rRNA genes and 16S/23S rRNA spacers JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 3140) AUTHORS Trevisanato,S.I. TITLE Direct Submission JOURNAL Submitted (25-JUL-1995) Siro I. Trevisanato, Princess Margaret Hospital, Ontario Cancer Institute, 500 Sherbourne Street, Toronto, Ontario M4X 1K9, Canada COMMENT NCBI gi: 942627 FEATURES Location/Qualifiers source 1..3140 /organism="Acidianus infernus" /isolate="So4a" misc_feature <1..185 /note="16S/23S rRNA spacer region" rRNA 186..>3140 /product="23S rRNA" BASE COUNT 687 a 876 c 1101 g 475 t 1 others ORIGIN BASE COUNT 645 a 822 c 1052 g 435 t 638 others ORIGIN 1 |--------- -gcgcaaggg gc-ACCAAGC CAGCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|cCGANGAA GGG|CGCG(G CAAGC)-|GC GAAAUGCCCG GGG|UAGGC( GCAAGAG)CC 121 CGUU-GAUCC CGGGAUC|CC CCGAAUGGG( ACCU-)CCU| gcccc----- ----(auuu- 181 )--------- gg-ggc-gca cccgu--(-- guaaaa-a-- )---gcgggu gcGGGAAC|C 241 CCCCGAACG| GA(AGCA)UC UUAGUAGGGG GAGGAGGAGA AAUC(AAcuC U)GAGA-U|C 301 CCCCGAGUAG G(CGCGA)CC GAAAGGGGGA CAGCCCAaac caaa-ccug- ccggc(gaua 361 a)gccgg-ug gg-gaugugg uguuauaggc ccuaggucu- -gggg(gcaa -)cccc-agc 421 cucccuagcu ugccuagcc- GAAcucccu- (uggaau)-a gggggc|c(a aa)gaGGG(U 481 GACAGC)CCC GUA-ggcgaa aggcaagugg gagg---cga ccuag-ggca GAGUACCAUC 541 CCC(---ugg uuu)GGGGGU GGGAAG-UUG GGGG(Acaug ug-)CCUCCA UGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC C(GUGA)GGG AA-GCUGAAA AGCACCCC(G 661 GAAG)GGGG- GUG-AAAAGU GCCUGAAACC GGCUGGCUac a-cag-ggua gggcuc--ga 721 aagga--gug aagcccuccg aaggaaggag -c(gcaa-)g cccu---uag uacgag-gag 781 ggug-gaccg --ggguccua ccUUUCGU(C UUGAAA|C)A CGG|GCCGGG GAGUUCAU-G 841 CCAGUGGCGA GCCUAAgggg (--gucaa)c cccGAAGGCG UAGG(GAAA) CCGag-ugcc 901 cgcaacccgg -(gaaa)-cc gggugagggg cagGGUCC(g uca-)GGGCC UGGAGUCACU 961 GGCAUGAGGC UAGAAACCGG GCGAUCUAGA CCGGGG|CAG GCCGAAGGCG GGG(GAAA)C 1021 CCCGCUGGAG GGCCGaauag gg-guucuga (cgugca-au )uc-guu|cc cuuGACCCCG 1081 GUCUAGGGGU (AAAAG)GCC AAUCUAGCCC GG-UGAUAGC UAGUUCCCCC CGAAAUGCGU 1141 (CUUA)GCGC AGC|CUCCCU gga-|ggu|- ggCCUGCG|G GGUAG|AGuc acUGAUUGGG 1201 GGC--UC|GA GGC(gaaa-) GCCUC-GG-G CUCCCAGUCA AACUCCGAAC C|UGCAGGcg 1261 ccug-aga|a GGGGGG-AGU GGGUCACC-C -GGC(G-UAA G)GUUG-GGU GGCAAGAGGG 1321 (GAACAA)CC CAGACCUG-G GUUAAGGU|C CCUAAGUGCU GGCUA-AGUG cc----augg 1381 GAAGAGCGUC CCCAGCCUUA GACAGCGGGG AGGUGGGC(C CAGCAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCUGG GGGC|CCUGA AGACUGGUCG 1501 GGGCUCAAGC CAGC|C|ACC GAGACC|CAG G|gggugggg cu(cauug-) agccc-ac-- 1561 ucGGUAGGGG GG|CGUCGUG GU-GGGUCAG AAGGUGGG-C C(GUGA)GGU CCACUGG|AC 1621 CCACCACGAG UGCCGAUCCC GGC|GGCAGU AACagcg-aa GGAGGG(UGA GAA-U)CCC- 1681 UCC|CGCCGA AA-GGGCAAG GGUUUCCCGG CAAUG-(GUC GU)CA|GCCG GGAGUUAGC| 1741 CGG|-UCCUA AGGUAGG-GC U(UAACU)GG U--ACCUACC GAAA-GGGAA AGGGG(UUAA 1801 UAUU)CCCCU |GC-cacggg gg-uagg--- ugcg(-gcaa )cgcaa-gcc gggc-uccug 1861 acggaucggg cuagggag|a c----(---- ----guaa-- ------)--- -gc|-caagc 1921 gcu-uaagcc CCUGGAGAG- CC(GUAAU)G G|UGAGAAGG G--gguga-a ggcgugaug- 1981 |ggc|ccucc (--guua)gg aggguu-cuc ccgaugccug -|guccccau gaaaagggag 2041 cccggaa--- cgaucccccg u--g|A|CCG UACCUAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugagaag )CCCAA-GGC GC||AUGGGG -GCUAACCCA GGCUAGGGAA CUCGGCAAAU 2161 UGGCCCCGUA CC(UUCG)GA AGAAGGGGU| GCCUAC---c acagua-g-- -(--uaacc) 2221 ---cugcugu g--GUAGGU| U(GCA)GUGA CCAGGGGGGC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CUAGCC C(GUA-A)GG GUGUGAACGG GGGCUGAAUC CUGGCCACU| 2341 GGCGGUCGGU GAA----acc cggg---(ua ca)----acc ggg----cGA AGCC|CCGCU 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(CAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (CA-----AG U)UCCGGC|G UGCAUGAAUG GAUCAACUGG GCCCCCACUG UCCCAGCCUG 2521 GG-GCCCC|G UGAACGC|-C CAGA-GUGGG (-UUCACA-G U)CCCGCAAC CCCCUACACC 2581 GAGAGAAGAC CCCGUGG|AG CUUCACUGC| AGCCUGGCGU U|GGUUCUCG GGCGCUCAUG 2641 CGUAGAGUAG GUGGGAGGCG UCG---AAGC CGCCUC(UUC G-)GGGGCGG U-GGAGCCGA 2701 AAGUGAAACA CCACC|CAUG AGCGCUCGAG AACCuaac-- ----ccc(-g aga-)ggg-- 2761 ---ggacAGC GUCAGGUGGG CAGUUCGGCU G(GGG)CGGC AC|UCCCGC( GAAAAGAUAA 2821 C)ACGGGA|G |C|CCAAAGG UCGGC|UCAG GCGGU(ACAG AAC)GCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACGAGUC CCU(ucaaag cac-)GGGGA CUCGACGC(G 2941 AAA)GCGCGG |CCUAGCGAA CGCUCAUGC- CCC(-cgcac aug)GGG|GC UGGGC-augu 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|GGUCGUCG CGGGCGAGA( GCUCCCA)UC 3061 GACCCCGCGG UUUGCUUCAU CGAUGUC|GG CU|C|UUCCC |ACCC|AGGG GGU(GCAGCA 3121 )GCCCCCAAG GGU|AGGGC( UGCCC)GCCC GUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GCGA )GACAGGUCG GACUCUAAGG |GUAGGGGG- |UG-CAGGCC GCCU|GA|GG 3241 GGAA-GGUAC CCC(UAGUAC (GAGA)GGAA C)AGGGUAC| CGGGGCCUCU AGUUUACCGG 3301 U|UGUCC-(G GCC)aGGGCA -GUGCCGGGC A-GCCACGCC UCG-AGGGGU AAUCGCU(GA 3361 AAGCAUCUA) AGCGAGAACC CCUCCCC|AA A-AAGAGGCG GCC|g-|ugc agguggcc-- 3421 -------(ga aa-)------ ---ggcugcc ugcg--ggag c|cc|ACUCC (UAGAAGA)G 3481 GGGnnnnnnn n|nnnnn|nn nnnnnnnnnn nnnnnnnn(n nnnn-)nnnn nnnnnnnn|n 3541 nnnnnn|n|n nnnnnnnnnn nnnnnnnnnn |nnnnnnnnn nn-----||| || // LOCUS A.brierley 3592 bp RNA RNA 11-JAN-191 DEFINITION Acidianus brierleyi. ACCESSION No information KEYWORDS No information. SOURCE Acidianus brierleyi. ORGANISM Acidianus brierleyi. REFERENCE 1 (sites) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. JOURNAL No information STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: U32317 LOCUS ABU32317 3246 bp DNA BCT 15-AUG-1995 DEFINITION Acidianus brierleyi 23S rRNA gene, partial sequence. ACCESSION U32317 KEYWORDS . SOURCE Acidianus brierleyi. ORGANISM Acidianus brierleyi Archaea; Crenarchaeota; Sulfolobales; Acidianus. REFERENCE 1 (bases 1 to 3246) AUTHORS Trevisanato,S.I., Segerer,A.H., Stetter,K.O. and Garrett,R.A. TITLE Phylogenetic analysis of the Archaeal order of Sulfolobales based on sequences of 23S rRNA genes and 16S/23S rRNA spacers JOURNAL Unpublished (1995) REFERENCE 2 (bases 1 to 3246) AUTHORS Trevisanato,S.I. TITLE Direct Submission JOURNAL Submitted (25-JUL-1995) Siro I. Trevisanato, Princess Margaret Hospital, Ontario Cancer Institute, 500 Sherbourne Street, Toronto, Ontario M4X 1K9, Canada COMMENT NCBI gi: 942626 FEATURES Location/Qualifiers source 1..3246 /organism="Acidianus brierleyi" misc_feature <1..200 /note="16S/23S rRNA spacer region" rRNA 201..>3246 /product="23S rRNA" BASE COUNT 778 a 823 c 1071 g 573 t 1 others ORIGIN BASE COUNT 726 a 777 c 1022 g 520 t 547 others ORIGIN 1 |--------- -ugcgcaagg gc-ACCAAGC CAUUCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|cCGAAGAA GGG|CGCG(G CAAGC)G|GC GAUAUGCCUG GGG|UAGGC( GCAAGCA)GC 121 CUUU-GAUCC CAGGAUU|CC -CGAAUGGG( ACUU-)CCU| gccc------ ----(auua- 181 )--------- -g-ggc-guu cccuu--(-- ccaaaa-a-- )---aaggga acGGGAAC|U 241 CUCCGAACG| GA(AGCA)UC UUAGUAGGAG AAGGAAGAGA AAUC(AA--A A)GAGA-U|U 301 CCCUGAGUAG G(GGCGA)UC GAAAGGGGAA UAGCCCAaac caaa-ucug- ccggu(gaua 361 a)gccgg-ug ga-gaugugg uguuau--gc uucuugccu- gcggg(uucg a)cucgcaac 421 cuuucuagcu caucuagcc- GAAcuccuc- (uggaau)-g agggac|c(a ua)gaGGG(U 481 GAUAGU)CCC GUA-gguuaa aggugagugg aagg---ugg caaga-agca GAGUACCAUC 541 CCC(---ugg uuu)GGGGGU GGGAAG-UUA GGGG(Acacg ug-)CCUCUA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC U(GUGA)AGG AAAGCUGAAA AGAACCCG(G 661 GAA-)-GGGA GUG-AAAAGA GCCUGAAACC GAAUGGUUac a-cag-ggca gagcuc--ga 721 aagag--aug agcccuucua aaggagggag ac(gcaa-)g ucucu---ag uacaag-gaa 781 gggu-gaucg --gaguucug cuUUUCGU(C UUGAAA|C)A CGG|GCCGGG GAGCUCAU-A 841 UCAGUGGCAA GCCUAAgagg (--guaaa)c cucGAAGGCG UAGG(GAAA) CCGag-ugcu 901 cgcaaccuag -(gcaa)-cu aggugaggag cagGGUCU(g uca-)GGGCC UGGAGUCACU 961 GGUAUGAGGC UAGAAACCGG GCGAUCUAGA CCAGGG|CAG GCCGAAGCGG GGG(GAAA)C 1021 UCCGGUGGAG GGCCGaaugg gg-guucuga (cgugca-au )uc-guu|cc cuuGACCUUG 1081 GUCUAGGGGC (AAAAG)ACC AAUCUAGCCC GG-UGAUAUC UAGUUCCUCC CGAAAUGCGC 1141 (CCUA)GCGC AGC|CUCUCU gga-|ggu|- ggCUUGCG|G GGUAG|AGag acUGAUUGGG 1201 GGU--UG|GU AGC(gaaa-) GCUAC-CG-A UUUCCAGUCA AACUCCGAAC C|UGCAGGca 1261 ccug-aga|a GGAGGG-AGU GGAUCACU-C -AGC(G-UAA G)GUCG-AGU GGUAAGAGGG 1321 (GAACAA)CC CCAACUCG-G GUUAAGGU|C CCUAAGUACU GGCUA-AGUG ca----aauu 1381 GAAAAGCGUC UCAAGCCCUA GACAGCGGGU AGGUGGGC(C CAGCAGCA)G CCAUCCU|C( 1441 UAA)G|GAGU GC(GUAACA) GC|UCACCCG CCGAGGCUUG AGGC|CUUUA AGACUGGUCN 1501 GGGCUAAAGC CAGC|C|ACC GAGACC|CGA G|gggugaga cu(cacug-) agucuc-c-- 1561 caGGUAGGGA GG|CGCCGUG AU-AGGGUAG AAGGUGGG-C C(GAGA)GGU CCACUGG|AC 1621 CUUUCACGGG UGCAGAUCCC GGC|GGCAGU AAUagcg-aa AGGGAG(UGA GAA-U)CUC- 1681 CCU|CGCCGA AA-GGGCAAG GGUUUCCCGG CAAUG-(GUC GU)CA|GCCG GGAGUGAGC| 1741 CGG|-UCCUA AGGUGAG-GC C(UAACU)GG U--ACUCACC GAAA-GGGAA AGGAG(UUAA 1801 UAUU)CUCCU |GC-cucgga gg-uagg--- ugcg(-guaa )cgcaa-gcu agac-uccug 1861 acgaaucggg guaggggg|g gugag(---- -gu-caccgc ------)cuc acc|-caagu 1921 acu-uaaguc CCUGGAGAG- CC(GUAAU)G G|UGAGAAGG G--gacga-a ggugcgaug- 1981 |ggc|uuucc (--guau)gg agaguu-cuc cugaucccug -|guucccau gaaaagggag 2041 ucuagag--- cgauccuccg a--g|U|CCG UACCCAGAAC C(GACACA)G G|UGCCCCUA 2101 GG(ugagaag )CCUAA-GGC GU||CUGGGG -GUUAACUCA GUCUAGGGAA CUCGGCAAAA 2161 UGGUCCUGUA CC(UUCG)GA AGAAGGGAC| GCCUAC---c acugua-g-- -(--uaacc) 2221 ---cugcaau g--GUAGGU| C(GCA)GUGA CCAGGGGGAC C|UGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC UCG|CUAGCC C(GUA-A)GU GUGUGAACGG GGGCUGAAUC CUGGCCACU| 2341 GGCGGUCGGU GAA----acc cggg---(ua ca)----acc ggg----cGA AGCC|CCGCU 2401 GAAG|GCC|G GGGGU(AACU CUG)ACCCUC |U(CAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (CA-----AG U)UCCGGC|G UGCAUGAAUG GAUCAACUGG GCCCCCACUG UCCCAGCCUG 2521 GG-GCCCC|G UGAACGC|-C CAGA-GUGAG (-UUCACA-G U)CCCGCAAC CCCCUACACC 2581 GAGAGAAGAC CCCGUGG|AG CUUCACUGC| AGCCUGGCGU U|GGUUCUCG GGCGCUCAUG 2641 CGUAGAGUAG GUGGGAGGCG UCG---AAGC CGCCUC(UUC G-)GGGGCGG U-GGAGCCGA 2701 AAGUGAAACA CCACC|CAUG AGCGCUCGAG AACCuaac-- ----ccc(-g aga-)ggg-- 2761 ---ggacAGC GUCAGGUGGG CAGUUCGGCU G(GGG)CGGC AC|UCCCGC( GAAAAGAUAA 2821 C)ACGGGA|G |C|CCAAAGG CCGGC|UCAG GCGGU(ACAG AAC)GCCGCC GUAGA|GU(G 2881 |C-AAGG)GC AAAAGCCGGC CUGACGAGUC CCU(ucaaag cac-)GGGGA CUCGACGC(G 2941 AAA)GCGGGG |CCUAGCGAA CGCUCAUGC- CCC(-cgcac aug)GGG|GC UGGGC-augu 3001 c|aga--aAA GUUACCC|CG GG|GAUAACA G|GGUCGUCG CGGGCGAGA( GCUCCCA)UC 3061 GACCCCGCGG UUUGCUUCAU CGAUGUC|GG CU|C|UUCCC |ACCC|UGGG GGU(GCAGCA 3121 )GCCCCCAAG GGU|AGGGC( UGCCC)GCCC GUUAAAGGGG AACGUGAGCU GGGUUUAGAC 3181 C|GUC(GCGA )GACAGGUCG GACUCUAAGG |GUAGGGGG- |UG-CAGGCC GCCU|GA|GG 3241 GGAA-GGUAC CCC(UAGUAC (GAGA)GGAA C)AGGGUAC| CGGGGCCUCU AGUUUACCGG 3301 U|UGUCC-(G GCA)-GGGCA -GUGCCGGGC A-GCCACGCC UCG-AGGGGU AAUCGCU(GA 3361 AAGCAUCUA) AGCGAGAACC CCUCCCC|AA A-AAGAGGCG GCC|G-|ugc agguggcc-- 3421 -------(ga aa-)------ ---ggcugcc ugcg--ggag c|cc|ACUCC (UAGAAGA)G 3481 GGGGUUGAUG G|GGUGG|GG GUGUAAGUCU Cgagggc-(g aaa--)-guc cgaGGGAU|U 3541 CAGCCC|G|C CACUCCCAac --ggcaag-- |cccaugcag u------||| || // LOCUS Tf.pendens 3592 bp RNA RNA 04-OCT-1988 DEFINITION Thermofilum pendens. ACCESSION No information KEYWORDS No information. SOURCE Thermofilum pendens. ORGANISM Thermofilum pendens. REFERENCE 1 (sites) AUTHORS Kjems,J. JOURNAL Tf.pendens:date:10.4.88 STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: X14835 LOCUS X14835 8935 bp UNA 15-SEP-1990 DEFINITION Thermofilum pendens DNA for 16S and 23S ribosomal RNA, tRNA-Met, and tRNA-Gly. ACCESSION X14835 KEYWORDS . SOURCE ORGANISM Unknown Unclassified. REFERENCE 1 (bases 1 to 8935) AUTHORS Kjems,J. JOURNAL Unpublished (1989) see COMMENT for author address STANDARD unannotated staff_entry REFERENCE 2 (bases 1 to 8935) AUTHORS Kjems,J., Leffers,H., Olesen,T., Ingelore,H. and Garrett,R.A. TITLE Sequence, organisation and transcription of the ribosomal RNA operon and the downstream tRNA and protein genes in the archaebacterium Thermofilum pendens JOURNAL Unpublished (1990) see COMMENT for author address STANDARD unannotated staff_entry BASE COUNT 1894 a 2462 c 2941 g 1638 t BASE COUNT 627 a 889 c 1124 g 429 t 523 others ORIGIN 1 |--------- ------acgg c--gcUAAGC CACCCGGUGG AUG|GCUCGG |CUCGGG-|c 61 g|cCGAGGAA GGC|CGUG(G CAAGC)G|GC GAUACGCCCC GGG|GAGCC( GCAUGCA)GG 121 CUUC-GAUCC GGGGAUC|GC -CGAAUGGG( ACCU-)CCU| gccguggg-- ----(ucaa- 181 )------ccc ac-ggc-gcu cgggaaa(cc cgca--aggg )gaguaccga gcGGGAAC|C 241 CCCCCAACG| GA(AACA)UC UUAGUAGGGG GAGGAGAAGA AACC(AA--U U)GGGA-U|C 301 CCCUGAGUAG G(G-CGA)CC GAAAGGGGGA GAGCCCAaac cgaa-cguc- caugc(gaag 361 a)gcaug-ga cg-gaugugg gguugc-agg g-ccccgcg- -----(---- -)------uc 421 cccccuagcc cggauagcc- GAAgucggc- (uggaaa)-g ccgcgc|c(g ua)gaGGG(U 481 GACAGC)CCC GUA-ggcuaa auccgggugg ggga--uggc gggugucccu GAGUACCACG 541 GCu(---ugg uuu)uGCCGU GGGAAG-CUG GGGG(Gcacc gA-)CCUCCA AGGCUAAAUA 601 cgu-CCCGAG ACCGAUAGCG AACUAAGUAC C(GUGA)GGG AAAGCUGAAA AGCACCCC(g 661 gaag)GGGG- GUG-AAAAGA GCCUGAAACC GGGUGGCGac a-ggu-ggcg cggccc--ga 721 aaggg--uag auccuccccg aaggaaaccc gg(gcga-)c cggg---gag uacgag-ggg 781 aggg-gaccg --gggucgcg ccuUACGU(C UAGAAA|C)A CGG|GCCGGG GAGUUCAC-G 841 GCUGUGGCGA GCCUAAgggg (-uucaag)c cccGAAGGCG UAGG(GAAA) CCGacaagcc 901 cguagccagc c(gcau)ggc uggugugggg cggGGUCU(g aaa-)GGGCC CGUAGUCACA 961 GCCGUGAGAC CAGAAACCGA GCGAUCUAGG CCGGGG|CAG GGUGAAGCCC GGC(GAAA)G 1021 CCGGGUGGAG GCCCG-aa-g gg-guucuga (cgugca-au )uc-guu|cc cauGACCUCG 1081 GCCUAGGGGC (AAAAG)ACC AAUCAAGCUC GG-UGAUAGC UGGUUCCCCC CGAAGCGGAU 1141 (CCCA)GUCC GGC|CCGCCC uga-|ggu|- ugCCGGUU|G GGUAG|GGC- ACUGAUUGGG 1201 AGuccgG|GG ACC(gaaa-) GGUCC-CA-G CUCCCUUUCA AACCCCGAAC C|GACCGGca 1261 ccgu-aga|u GGGCGG-AGA CGGGUUCU-G GCGG(A-UAA G)CCGCCAG- GCCGAGAGGG 1321 (GAACAA)CC CAGACCGG-G GUUAAGGC|C CCCAAGUGCC GGCUAaAGUG uca--agcca 1381 gAAGGGUGUC CCCCGCCUUA GACAGCGGGG CCGUAGGC(U UAGAAGCA)G CCAUCGG|C( 1441 UAA)G|AAGU GC(GUAACA) GC|UUACCCG CCGAGGCGGG GGGC|CCCGA AGAUU-ACCG 1501 GGACU-AAGC CGGC|C|GCC GAGACC|CCG G|ggcacccc gc(auu---) gcggggugau 1561 ccGGUAGGGG GG|CGUCCAG GU-GGACUGG AAGCCGGG-C C(GUGA)GGU CCGGUGG|AU 1621 CCGCCUGGAA UGAAAAUCCC GGC|GGUAGU AAGagcgcaa GAGGGG(UGA GAA-U)CCC- 1681 CUC|CGCCGA AA-GGGCAAG GGUUCCUCAG CAACG-(GUC GU)CG|GCUG AGGGUUAGC| 1741 CGG|-UCCUA AGGCGGC-CC U(UAACC)GG U---gccgcc GAAA-GGGAA AGGGG(UUAA 1801 UAUU)CCCCU |GC-cguggg g--uacgcuu ugcg(-gcaa )cgcga-gcc ccgc-uccug 1861 acgccucggg auaggggg|a guggg(---- -ac-ugccgu ------)ccc auc|-caagc 1921 gcu-gaagcc CGCUGAGUG- CC(GUCAU)G G|CGAGAAGC G--gguga-a ggcgcgaug- 1981 |ggc|ccgcc (--guua)gg cggguu-ccc ccgacuccug -|gggcccgu gaaaagggag 2041 cggggaa--- ggag-cccca c--g|C|CCG UACCGAGAAC C(GACACA)G G|UGCCCCUG 2101 GG(ugaaaag )CCCAA-GGC GU||GGCGGG -G-UAAcCCA GGCUAGGGAA CUCGGCAAAU 2161 UAGCCCCGUA AC(UUCG)GG AGAAGGGGU| GCCUGC---g gucuug-ggg -(--cauac) 2221 -cccugggac c--GCAGGU| C(GCA)GUGA CAAGGGGGAC C|CGACUGU( UUAAUAAAA) 2281 ACAUAGGUCC CCG|CGAGCC C(GAA-A)GG GUGAGUACGG GGGCUGAAUC CUGGCCACU| 2341 GGCGGUACGU GAA----acc cggg---(ua ca)----acc ggg----cGA AGCG|CCGCC 2401 GAAG|GCC|G GGAGU(AACU CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GCCGGG 2461 (UA-----AG U)UCCGGC|G CGCAUGAAUG GAUCAACGAG GUCCCCACUG UCCCAGCCUG 2521 GG-ACCCG|C UGAACCC|-G CAA--CCAGG (-UGCAGA-G U)CCUGGGAG UCCCGGUGGG 2581 GCGAGAAGAC CCCGUGG|AG CUUCACAGC| AGCCUGGCGU U|GAGACACG GCUGCGGGUG 2641 CGUAGCGUAG GCGGGAGCAA UGA----ACC UGGCCC(UCC G-)GGGUCAG G-GGAUGCGG 2701 CCAUGAAACA CCGCU|CACC UGCGGCUGUG UCCCuaac-- -cccggg(-g uuu-)cccgg 2761 g--ggacAGC GCCAGGUGGG CUGUUCGGCU G(GGG)CGGC AC|ACCCCU( GAGAAGGUAU 2821 C)AGGGGU|G |C|CCAAAGC UCGGC|UCAG GCGGG(UCAG AAC)UCCGCC GUAGA|GG(G 2881 |C-AAGG)CC AAAAGCCGGG CUGACUGCGC CCU(uaaacg cgu-)GGGGC GCAGGCGG(G 2941 AAA)CCGGGG |CCUAGCGAA CGCUCGUGC- CCC(-cuucg gug)GGG|GC CGGGC-auga 3001 c|aga--aAA GUUACCC|CG GG|AGUAACC G|GCUCGUCG CGGGCGAGA( GUUCACA)UC 3061 GACCCCGCGG UUUGGUACCC AGACGUC|GU CU|C|UUCCU |AGCU|UGGC CCU(GCAGCA 3121 )GGGGCCAAG AGU|GGGGC( UGCUC)GCCC AUUAAAAGGG AACGUGAGAU GGGUUUAGAC 3181 C|GUC(GCGA )GACAGGUCG GACUCUACCU |GCCGGGAC- |CG-UUGGCC GCCU|GA|GG 3241 GGAU-GGUCC GCA(CAGUAC (GAGA)GGAA C)UGCGGGC| CGUGGCCUCU AGUGUACCGG 3301 U|UGUCC-(G GCA)-GGGCA -cuGCCGGGC A-GCCACGCC GCA-AGGGGU AACCGCU(GA 3361 AGGCAUCUA) AGCGGGAAAC CCACCCC|GA G-ACGAGGCG GCC|ac|ucc cgauccguug 3421 cucccg-(gc ga-)cgggg- gcuuuggguc ggga--cgag g|gc|ACCCG (UAGAAGA)C 3481 GGGGUUGAUG G|GGUGG|CG GUGUAUGCGC Cgagg---(g uuu--)---c cgaGGCGC|- 3541 GAGCCG|G|C CACUCCCAau agcccgagg- |cugu----c uac----||| || // LOCUS C.symbio.A 3592 bp RNA RNA 14-SEP-1998 DEFINITION Cenarchaeum symbiosum strain A. str. A hypothetical protein 01 gene, complete cds; . ACCESSION AF0830 1 KEYWORDS . SOURCE Cenarchaeum symbiosum strain A. str. A hypothetical protein 01 gene, complete cds. ORGANISM Cenarchaeum symbiosum strain A. REFERENCE 1 (bases 1 to 32998) AUTHORS Schleper,C., DeLong,E.F., Preston,C.M., Feldman,R.A., Wu,K.Y. and Swanson,R.V. TITLE Genomic analysis reveals chromosomal variation in natural populations of the uncultured psychrophilic archaeon Cenarchaeum symbiosum JOURNAL J. Bacteriol. (1998) In press STANDARD No information REFERENCE 2 (bases 1 to 32998) AUTHORS Feldman,R.A. TITLE Direct Submission JOURNAL Submitted (07-AUG-1998) Genomics, Diversa Corp., 10665 Sorrento Valley Rd., San Diego, CA 92121, USA STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AF083071 BASE COUNT 770 a 674 c 927 g 624 t 597 others ORIGIN 1 |--ggcugaa uaugccggug ua-aagacGG CAAUAGGAGG AUU|GCUAGG |CUUGAG-|g 61 a|GAGAAGAA GGA|CGUG(G CAAGC)U|GC GAUAAGCUCG GGG|UAGGU( GCACGCA)CC 121 CGUU-GAUCC CGAGAUU|UC -CGAAUAGG( a-au-)CCU| ---------- -----guaua 181 ---------- --------cu ccgc---(-- -gcaa----- )----gcgga ggUCGAAC|C 241 GUGGGAAUU| GA(AGCA)UC UUAGUACCAC GAGGAAGAGA AAUC(AA--U A)GAGA-U|U 301 UCCCAAGUAG A(GGCGA)UC GAAAAGGAAA GAGCCCAaac ugaa-ucug- ccgcg(g-ua 361 a)cgcgg-cc ga-gaugugg uguuuu---g gugcggcg-- -----(---- -)-----caa 421 uggcc-ccag gaaauaguu- GAAguguuc- (ugaaau)-g uuccgg|c(u ua)gaGGG(U 481 GAUACC)CCC cua-gacgaa guggaauggg gcag---ggc cagca-ccc- GAGUAGUUGU 541 CCU(---ugg cag)UGGGCA GCGAAG-AUU GGUG(aaagu ag-)CAUCAA AGGCUAAAUA 601 ucu-CUCAAG ACCGAUAGUG UACU-AGUAC C(GUGA)GGG AAAGCUGAAA AGUACCCC(g 661 -gaa)GGGGG GUG-AAAAGU GCAUGAAACC UAUUGCUuac agacg-ugca uggcau--ga 721 aagac--gau uuuuucggac cgga-gguuc ca(gcaa-)u ggagcu--ga aaguccggag 781 cucaugucgu cagugucaug cgUUCCGU(C UCGAAA|C)A CGG|GCCAGG GAGCUUGC-U 841 GUCAUGGCGA GGUUAAgccu (---agaa)a ggcguAGCCG AAGG(GAAA) CCGaa-uucu 901 cgcagc---- -(auu-)--- --gugaggag aagCGUGU(g aaa-)GUGCG UUAAGUCAUG 961 GCGGUAAGGC UAGAAGCCAG UCGAUCUAUU CCUGAG|CAA GAUGAAGGUG GGC(GAAA)G 1021 CCCACUGGAG GUCUGca--g ca-guucuga (cgugca-ac )uc-guu|ug uguGACUUGG 1081 GAAUAGGGGU (CAAAA)ACC AAUCUAGACU GG-CGCUUGC UAGUUCCAAC CGAAGCGUCU 1141 (CGCA)GGGC GUG|CUAUGU Gga-|gau|- GGUGUUUC|C GGUAG|AGC- ACUGAUAGGG 1201 UGGcgcg|gg gaa(gaga-) uuccu-ca-C CAUCCAUUCA AACUCCGAAC G|GAUACAca 1261 ucgc-aga|a GCAUGG-ACA CGGGUUCGUG UGAC(G-UAA G)GUUC-ACG ACCGAGAGGG 1321 (GUUUAA)CC CAGACUGG-A GUUAAGGU|C CCCAAAUUUC UGCUA-AGUG uca---agcc 1381 AAAGUGCGUG UCAUCGCCAA GACAGCAGGG AGGUAAGC(U CAGAAGUA)G CUAUCCU|U( 1441 CAA)A|AAGU GU(GUAACA) AC|UUACCUG CCGAGCGAGG GCGC|CACAA AAACGGACGG 1501 GG-CUAAAGC AGAG|U|ACC GAUACU|CCA G|acgugcac c-(gugaua) -ggugcac-c 1561 guGGUAGGUU GG|CGUAGUG AU-UGGGACG AAGCAGGG-C G(GUGA)CGU CCUGUGG|AC 1621 CGAUUUCUAU UGCAGAUCCU GGU|GGCAGU AACagca-ua GUGCGG(UGA GAA-U)CCG- 1681 CAC|CACCGU AA-GCGCAAG GGUUUCCAGG CAAUGC(GUC GU)CA|GCCU GGAGUUAGU| 1741 CGA|-UCCUA AGGACUGcCU C(-aaca)GA G---UAGUCC GAAU-GGGAA ACCAG(UUAA 1801 UAUU)CUGGU |AC-cuugca gggagcg--- ---u(-uuga )a------cc gcau-gguu- 1861 -cgcuuccgg auaggggg|a acaug(---- -ac-cgucgu ------)cgu guc|-gaacc 1921 guucgagggc CUGGGAGUG- UC(GUAAU)G A|CGAGAACA G--gcccg-a ggcgggagu- 1981 |ggc|uuugc (--guua)gc agaguu-uuc ccgauuccag -|gagaccau gaaa-gaccg 2041 ugcggcua-- gcaaacugca a--g|U|UCG UACCCAGAUC C(GACACA)G G|UGCGC-UG 2101 GU(uuaguaa )ACUAA-GGU GC||UACGGG uauaccGUAU GGCGAGGGAA UUCGGCAAAU 2161 UAGUCCUGUA GC(UUAG)GU AUAAGGGAU| GCCUGC---a gcuguc-au- -(-gagga-) 2221 --auggcggc a--GCAGGU| U(GCA)GUGA CAAGGGGGUU U|CGACUGU( UUAAUAAAA) 2281 ACACAGGCGA CUG|CUAGCC C(GAA-A)GG GUAUGUAUAG UCGCUGAAUC CUGGCCGGU| 2341 GCCGGUAUCU AAA----acu uggg---(uu ca)----acc gag----cUA AGGA|CCGGU 2401 UAAC|GCC|G GGAGU(AACU CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GUCGGG 2461 (UA-----AG U)UCCGAC|G UGCAUGAAUG GAACAACGAG AGCCCCACUG UCCCCGCC-U 2521 acaacccg|G UGAAGCC|ac auaa-GGUGG (acgaACA-G U)CCAUCAUC UUCCAUCGGG 2581 GAGAGAAGAC CCCGUGG|AG UUUUACUGC| AGCUUGUGCU U|GUGGUAUA GCAGGGAGUG 2641 CAUAGCGUAU CUGGGAGUCA UUU---CCUG GCGUUC(UUU G-)GGACGUC A-GGGGACGC 2701 CGAUGUAACA CCAGA|CAUU UCCUGCAGUG CCACuuac-- --ccggu(-g aag-)accgg 2761 ---ggacaUG UGCAGGUGGG CAGUUCGGCU G(GGG)CGGC AC|CCCUUU( GAAAAGAUAU 2821 C)AAAGGG|G |C|CCAAAGU UUAGU|UCAA GCGGG(UCAG AAA)UCCGCU GAAGA|GG(G 2881 |C-AAAG)CC AAAAACUAGA CUGAAUGGAU UUC(caaacg cac-)GGAAU UCAUAGAC(G 2941 AAA)GUCUGG |CUUAGCGAU CCUUCGUGU- GCC(-cacua uug)GGG|CC CGGAG-guga 3001 c|aga--aAA AUUACCC|CA GG|GAUAACA G|GCUCGUCG CGGGCGAGA( GCUCCUA)UC 3061 GACCCCGCGG UUUGGUACCU CGAUGUC|GG CU|U|UUCCU |AUCC|UGGU CCU(GCAGCA 3121 )GGGGCCAAG GGU|GAGGC( UGCUC)GCCU AUUAAAAGGG AACAUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GCCUCUGCCU |GAUGGAAG- |CG--CGAAU AUCU|GA|GG 3241 GGAA-GUCGC UCU(UAGUAC (GAGA)GGAA C)AGAGAGG| CGUGGCCACU GGUCUACCGG 3301 U|UGUCC-(G ACA)-GGGCA --GGCCGGGC A-GCUAAGCC AUC-UAGGCU AAGUGCU(GA 3361 AAGCAUCUA) AGCACGAAUC CUAUCCC|GA G-ACAAGAUA UUC|c-|uuc ---------- 3421 -------(uu cg-)------ ---------- -gau--gaag g|uu|CGCAG (AAGAAGA)C 3481 UGCGUUGAUA G|GAAGG|UG GUGUAuccCa caagc---(u ucg--)---g cgagu-gg|u 3541 gAGCCA|G|C CUUUACUAau cgaccaaa-- |acaccgguu -------||| || // LOCUS C.symbio.B 3592 bp RNA RNA 14-SEP-1998 DEFINITION Cenarchaeum symbiosum strain B. str. genes, complete cds B histone H1 DNA binding protein (hc2), hypothetical protein, lysyl tRNA synthetase (syk), and hypothetical protein 01; . ACCESSION AF083072 KEYWORDS . SOURCE Cenarchaeum symbiosum strain B. str. genes, complete cds B histone H1 DNA binding protein (hc2), hypothetical protein, lysyl tRNA synthetase (syk), and hypothetical protein 01. ORGANISM Cenarchaeum symbiosum strain B. REFERENCE 1 (bases 1 to 42432) AUTHORS and Swanson,R.V. Schleper,C., DeLong,E.F., Preston,C.M., Feldman,R.A., Wu,K.Y. TITLE Cenarchaeum symbiosum Genomic analysis reveals chromosomal variation in natural populations of the uncultured psychrophilic archaeon JOURNAL J. Bacteriol. (1998) In press STANDARD No information REFERENCE 2 (bases 1 to 42432) AUTHORS Feldman,R.A. TITLE Direct Submission JOURNAL Submitted (07-AUG-1998) Genomics, Diversa Corp., 10665 Sorrento Valley Rd., San Diego, CA 92121, USA STANDARD No information COMMENTS Sequence information (bases 1 to 3592) Corresponding GenBank entry: AF083072 BASE COUNT 767 a 685 c 930 g 613 t 597 others ORIGIN 1 |---gcugaa uaugccggug ua-aagacGG CAAUAGGAGG AUU|GCUAGG |CUUGAG-|g 61 a|GAGAGGAA GGA|CGUG(G CAAGC)U|GC GAUAAGCUCG GGG|UAGGU( GCACGCA)CC 121 CGUU-GAUCC CGAGAUU|UC -CGAAUAGG( a-au-)CCU| ---------- -----guaua 181 ---------- --------cu ccgc---(-- -gcaa----- )----gcgga ggUCGAAC|C 241 GUGGGAAUU| GA(AGCA)UC UUAGUACCAC GAGGAAGAGA AAUC(AA--U A)GAGA-U|U 301 UCCCAAGUAG A(GGCGA)UC GAAAAGGAAA GAGCCCAaac ugaa-ucug- ccgcg(g-ua 361 a)cgcgg-cc ga-gaugugg uguuuu---g gugcggcg-- -----(---- -)-----caa 421 uggcc-ccac gaaauaguc- AAAguguuc- (ugaaau)-g uuccgg|c(u ua)gaGGG(U 481 GAUACC)CCC cua-gacgaa guggaauggg gcag---ggc cagca-ccc- GAGUAGUUGU 541 CCU(---ugg cag)UGGGCA GCGAAG-AUU GGUG(aaagu ag-)CAUCAA AGGCUAAAUA 601 ucu-CUCAAG ACCGAUAGUG UACU-AGUAC C(GUGA)GGG AAAGCUGAAA AGUACCCC(g 661 -gaa)GGGGG GUG-AAAAGU GCAUGAAACC UAUUGCUuac agacg-ugca cggcau--ga 721 aagac--gau uuuuucggac cgga-gguuc ca(gcaa-)u ggagcu--aa aaguccggag 781 cucaugucgu cagugucgug cgUUCCGU(C UCGAAA|C)A CGG|GCCAGG GAGCUUGC-U 841 GUCAUGGCGA GGUUAAgccu (---agaa)a ggcguAGCCG AAGG(GAAA) CCGaa-uucu 901 cgcagc---- -(auu-)--- --gugaggag aagCGUGU(g aaa-)GUGCG UUGAGUCAUG 961 GCGGUAAGGC UAGAAGCCAG UCGAUCUAUU CCUGAG|CAA GAUGAAGGUG GGC(GAAA)G 1021 CCCACUGGAG GUCUGca--g ca-guucuga (cgugca-ac )uc-guu|ug uguGACUUGG 1081 GAAUAGGGGU (CAAAA)ACC AAUCUAGACU GG-CGCUUGC UAGUUCCAAC CGAAGCGUCU 1141 (CGCA)GGGC GUG|CUAUGU Gga-|gau|- GGUGUUUC|C GGUAG|AGC- ACUGAUAGGG 1201 UGGcgcg|gg gaa(gaga-) uuccu-ca-C CAUCCAUUCA AACUCCGAAC G|GAUACAca 1261 ucgc-aga|a GCAUGG-ACA CGGGUUCGUG UGAC(G-UAA G)GUUC-ACG ACCGAGAGGG 1321 (GUUUAA)CC CAGACUGG-A GUUAAGGU|C CCCAAAUUUC UGCUA-AGUG uca---agcc 1381 AAAGUGCGUG UCGUCGCCAA GACAGCAGGG AGGUAAGC(U CAGAAGUA)G CUAUCCU|U( 1441 CAA)A|AAGU GU(GUAACA) AC|UUACCUG CCGAGCGAGG GCGC|CACAA AAACGGACGG 1501 GG-CUAAAGC AGAG|U|ACC GAUACU|CCG G|acgugcac c-(guggca) -ggugcac-c 1561 guGGUAGGUU GG|CGUAGUG AU-UGGGACG AAGCAGGG-C G(GUGA)CGU CCUGUGG|AC 1621 CGAUUUCUAU UGCAGAUCCU GGC|GGCAGU AACagca-ua GUGCGG(UGA GAA-U)CCG- 1681 CAC|CACCGA AA-GCGCAAG GGUUUCCAGG CAAUGC(GUC GU)CA|GCCU GGAGUUAGU| 1741 CGA|-UCCUA AGGGCUGcCU C(-aaca)GA G---UAGUCC GAAU-GGGAA ACCAG(UUAA 1801 UAUU)CUGGU |AC-cuugca gggagca--- ---u(-uuga )a------cc gcau-gguu- 1861 -cgcuuccgg auaggggg|a gcaug(---- -ac-cgucgu ------)cgu guc|-gaacc 1921 guucgagggc CUGGGAGUG- UC(GUAAU)G A|CGAGAACA G--gcccg-a ggcgggagu- 1981 |ggc|uuugc (--guua)gc agaguu-ccc ccgacuccag -|gagaccau gaaa-gaccg 2041 ugcggcua-- gcaaacugca a--g|U|UCG UACCCAGAUC C(GACACA)G G|UGCGC-UG 2101 GU(uuaguaa )ACUAA-GGU GC||UACGGG uauaccGUAU GGCGAGGGAA UUCGGCAAAU 2161 UAGUCCUGUA GC(UUAG)GU AUAAGGGAU| GCCUGC---a gcuguc-au- -(-gagga-) 2221 --auggcggc a--GCAGGU| U(GCA)GUGA CAAGGGGGUU U|CGACUGU( UUAAUAAAA) 2281 ACACAGGCGA CUG|CUAGCC C(GAA-A)GG GUAUGUAUAG UCGCUGAAUC CUGGCCGGU| 2341 GCCGGUAUCU AAA----acu uggg---(uu ca)----acc gag----cUA AGGA|CCGGU 2401 UAAC|GCC|G GGAGU(AACU CUG)ACUCUC |U(UAA)GGU AGCCAAAUGC CUU|GUCGGG 2461 (UA-----AG U)UCCGAC|G UGCAUGAAUG GAACAACGAG AGCCCCACUG UCCCCGCC-U 2521 acaacccg|G UGAAGCC|ac auaa-GGUGG (acgaACA-G U)CCAUCAUC UUCCAUCGGG 2581 GAGAGAAGAC CCCGUGG|AG UUUUACUGC| AGCUUGUGCU U|GUGGUAUA GCAGGGAGUG 2641 CAUAGCGUAU CUGGGAGUCU UUU---CCUG GCGUUC(UUU G-)GGACGUC A-GGGGACGC 2701 CGAUGUAACA CCAGA|CAUU UCCUGCAGUG CCACuuac-- --ccggu(-g aag-)accgg 2761 ---ggacaCG UGCAGGUGGG CAGUUCGGCU G(GGG)CGGC AC|CCCUUU( GAAAAGAUAU 2821 C)AAAGGG|G |C|CCAAAGU UUAGU|UCAA GCGGG(UCAG AAA)UCCGCU GAAGA|GG(G 2881 |C-AAAG)CC AAAAACUAGA CUGAAUGGAU UUC(caaacg cac-)GGAAU UCAUAGAC(G 2941 AAA)GUCUGG |CUUAGCGAU CCUUCGUGU- GCC(-cacua uug)GGG|CC CGGAG-guga 3001 c|aga--aAA AUUACCC|CA GG|GAUAACA G|GCUCGUCG CGGGCGAGA( GCUCCUA)UC 3061 GACCCCGCGG UUUGGUACCU CGAUGUC|GG CU|U|UUCCU |AUCC|UGGU CCU(GCAGCA 3121 )GGGGCCAAG GGU|GAGGC( UGCUC)GCCU AUUAAAAGGG AACAUGAGCU GGGUUUAGAC 3181 C|GUC(GUGA )GACAGGUCG GCCUCUGCCU |GAUGGAAG- |CG--CGAAU AUCU|GA|GG 3241 GGAA-GUCGC UCC(UAGUAC (GAGA)GGAA C)AGAGAGG| CGUGGCCACU GGUCUACCGG 3301 U|UGUCC-(G ACA)-GGGCA --GGCCGGGC A-GCUAAGCC AUC-UAGGCU AAGUGCU(GA 3361 AAGCAUCUA) AGCACGAAUC CUAUCCC|GA G-ACAAGAUA UUC|c-|uuc ---------- 3421 -------(uu cg-)------ ---------- -gau--gaag g|uu|CGCAG (AAGAAGA)C 3481 UGCGUUGAUA G|GAAGG|UG GUGUAaccCa caagc---(u ucg--)---g cgagu-gg|a 3541 gAGCCA|G|C CUUUACUAau cgaccaaa-- |acaccgguu c------||| || //